llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	b1d12619c9	Add a few (as yet unused) query methods to determine if the attribute that's stored here is of a certain kind. This is in preparation for when an Attribute object represents a single attribute, instead of a bitmask of attributes. llvm-svn: 171247	2012-12-30 01:38:39 +00:00
Dmitri Gribenko	b137c9e551	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. llvm-svn: 171246	2012-12-30 01:28:40 +00:00
Bill Wendling	5e8ff877f4	Uniquify the AttributeImpl based on the Constant pointer, since those are already uniquified. Note: This will be expanded in the future to add more than just one pointer value. llvm-svn: 171245	2012-12-30 01:23:08 +00:00
Bill Wendling	3e4c4c9607	s/Raw/getBitMask/g to be more in line with current naming conventions. This method won't be sticking around. llvm-svn: 171244	2012-12-30 01:05:42 +00:00
NAKAMURA Takumi	5a495a5c96	llvm/test/Transforms/GVN/null-aliases-nothing.ll: Fix a RUN line not to emit ModuleID. Larry Evans reported it fails if source tree contains "load", like "download". llvm-svn: 171243	2012-12-30 00:33:26 +00:00
Craig Topper	fe82eb6bcd	Remove intrinsic specific instructions for (V)SQRTPS/PD. Instead lower to target-independent ISD nodes and use the existing patterns for those. llvm-svn: 171237	2012-12-29 18:18:20 +00:00
Craig Topper	f4a9c6e21b	Merge similar functionality using a nested switch. llvm-svn: 171229	2012-12-29 17:19:06 +00:00
Craig Topper	6b27251a76	Remove intrinsic specific instructions for SSE/SSE2/AVX floating point max/min instructions. Lower them to target specific nodes and use those patterns instead. This also allows them to be commuted if UnsafeFPMath is enabled. llvm-svn: 171227	2012-12-29 16:44:25 +00:00
Jakub Staszak	215f94143c	Simplify code, no functionality change. llvm-svn: 171226	2012-12-29 15:57:26 +00:00
Jakub Staszak	afe8109fce	Delete executable bit on ./lib/Target/Hexagon/HexagonAsmPrinter.h. llvm-svn: 171225	2012-12-29 15:23:06 +00:00
Bill Wendling	0cd0f7f832	Use a 'Constant' object instead of a bit field to store the attribute data. llvm-svn: 171221	2012-12-29 12:29:38 +00:00
Bill Wendling	4fdde84613	Use the accessor method instead of the raw ivar to get the bits. llvm-svn: 171220	2012-12-29 12:10:46 +00:00
Chandler Carruth	405d681340	Nuke some dead code that snuck in somehow. I thought I had already deleted this, but apparently not. Charmingly, Clang didn't warn on it but GCC did. llvm-svn: 171197	2012-12-28 14:50:51 +00:00
Chandler Carruth	86ed53089f	Fix a stunning oversight in the inline cost analysis. It was never propagating one of the values it simplified to a constant across a myriad of instructions. Notably, ptrtoint instructions when we had a constant pointer (say, 0) didn't propagate that, blocking a massive number of down-stream optimizations. This was uncovered when investigating why we fail to inline and delete the boilerplate in: void f() { std::vector<int> v; v.push_back(1); } It turns out most of the efforts I've made thus far to improve the analysis weren't making it far purely because of this. After this is fixed, the store-to-load forwarding patch enables LLVM to optimize the above to an empty function. We still can't nuke a second push_back, but for different reasons. There is a very real chance this will cause somewhat noticeable changes in inlining behavior, so please let me know if you see regressions (or improvements!) because of this patch. llvm-svn: 171196	2012-12-28 14:43:42 +00:00
Chandler Carruth	753e21d057	Teach the inline cost analysis about calls that can be simplified and how to propagate constants through insert and extract value instructions. With the recent improvements to instsimplify, this allows inline cost analysis to constant fold through intrinsic functions, including notably the with.overflow intrinsic math routines which often show up inside of STL abstractions. This is yet another piece in the puzzle of breaking down the code for: void f() { std::vector<int> v; v.push_back(1); } But it still isn't enough. There are a pile of bugs in inline cost still blocking this. llvm-svn: 171195	2012-12-28 14:23:32 +00:00
Chandler Carruth	f6182155f6	Teach instsimplify to use the constant folder where appropriate for constant folding calls. Add the initial tests for this which show that now instsimplify can simplify blindingly obvious code patterns expressed with both intrinsics and library calls. llvm-svn: 171194	2012-12-28 14:23:29 +00:00
Chandler Carruth	9dc3558920	Add entry points to instsimplify for simplifying calls. The entry points are nice and decomposed so that we can simplify synthesized calls as easily as actually call instructions. The internal utility still has the same behavior, it just now operates on a more generic interface so that I can extend the set of call simplifications that instsimplify knows about. llvm-svn: 171189	2012-12-28 11:30:55 +00:00
Alexey Samsonov	3efc87e92d	Add proper support for -fsanitize-blacklist= flag for TSan and MSan. LLVM part. llvm-svn: 171183	2012-12-28 09:30:44 +00:00
Nadav Rotem	9785f519b4	CostModel: initial checkin for code that estimates the cost of special shuffles. llvm-svn: 171180	2012-12-28 08:19:03 +00:00
Nadav Rotem	c982a2dc25	wrap 80-col lines. llvm-svn: 171179	2012-12-28 07:28:43 +00:00
Nadav Rotem	3da9ac72fa	AVX: Move the ZEXT/ANYEXT DAGCo optimizations to the lowering of these optimizations. The old test cases still cover all of these lowering/optimizations. The single change that we have is that now anyext does not need to zero a register, because it does not use the exact code path as the zero_extend. llvm-svn: 171178	2012-12-28 05:45:24 +00:00
Nadav Rotem	68441914a5	Reverse the 'if' condition and reduce the indentation. llvm-svn: 171172	2012-12-27 23:08:05 +00:00
Craig Topper	ab2e6842cc	Merge basic_sse12_fp_binop_p_int and basic_sse12_fp_binop_p_y_int multiclasses. llvm-svn: 171171	2012-12-27 22:53:47 +00:00
Nadav Rotem	3b34190100	AVX/AVX2: Move the SEXT lowering code from a target specific DAGco to a lowering function. llvm-svn: 171170	2012-12-27 22:47:16 +00:00
Craig Topper	e2eec3c52b	Merge basic_sse12_fp_binop_p and basic_sse12_fp_binop_p_y multiclasses. llvm-svn: 171166	2012-12-27 18:51:50 +00:00
Chandler Carruth	3edd52c1d0	Add support to BasicBlocks for iterating backwards over the instructions. This just exposes the already present reverse iterators of the instruction ilist. llvm-svn: 171159	2012-12-27 12:00:56 +00:00
Chandler Carruth	a3c0d67d5b	Provide a common half-open interval map info implementation, and just re-use that for SlotIndexes. This way other users who want half-open semantics can share the implementation. llvm-svn: 171158	2012-12-27 11:29:17 +00:00
Chandler Carruth	e40e60eed5	Make this parameter be named consistently with most other getAnalysisUsage implementations. llvm-svn: 171157	2012-12-27 11:17:15 +00:00
Sean Silva	0f2eabce10	docs: Add FAQ about "storing to a virtual register". This came up for the N+1'st time today in IRC. llvm-svn: 171155	2012-12-27 10:23:04 +00:00
Sean Silva	33fc6cff4b	docs: Move link to the new "external tutorials" area. llvm-svn: 171154	2012-12-27 08:57:08 +00:00
Alexey Samsonov	29dd7f2090	[ASan] Fix lifetime intrinsics handling. Now for each intrinsic we check if it describes one of 'interesting' allocas. Assume that allocas can go through casts and phi-nodes before appearing as llvm.lifetime arguments. llvm-svn: 171153	2012-12-27 08:50:58 +00:00
Nadav Rotem	9aa00f0363	DAGCombinerInformation: add a getter that exposes the dagcombine level. llvm-svn: 171152	2012-12-27 08:44:35 +00:00
Alexey Samsonov	75ceb5b56b	Fix new[]/delete mismatch in FullDependence spotted by AddressSanitizer llvm-svn: 171150	2012-12-27 08:40:37 +00:00
Nadav Rotem	f85d3ee072	docs: Update the benchmark with updated perf numbers. llvm-svn: 171149	2012-12-27 08:32:44 +00:00
Nadav Rotem	2a054b4475	On AVX/AVX2 the type v8i1 is legalized to v8i16, which is an XMM sized register. In most cases we actually compare or select YMM-sized registers and mixing the two types creates horrible code. This commit optimizes some of the transition sequences. PR14657. llvm-svn: 171148	2012-12-27 08:15:45 +00:00
Nadav Rotem	8e5d80eba3	AVX/AVX2: Move the code that lowers vector-trunc from a DAGCo-hook to custom lowering hook. The vector truncs were scalarized during LegalizeVectorOps, later vectorized again by some DAGCombine optimization and finally, lowered by a DAGCombine optimization. Now, they are properly lowered during LegalizeVectorOps. No new testcase because the original testcases still work. llvm-svn: 171146	2012-12-27 07:45:10 +00:00
Craig Topper	757f3fc394	Add hasSideEffects=0 to some forms of ROUND, RCP, and RSQRT. llvm-svn: 171143	2012-12-27 07:16:08 +00:00
Nadav Rotem	b1dd52450e	Refactor DAGCombinerInfo. Change the different booleans that indicate if we are before or after different runs of DAGCo, with the CombineLevel enum. Also, added a new API for checking if we are running before or after the LegalizeVectorOps phase. llvm-svn: 171142	2012-12-27 06:47:41 +00:00
Craig Topper	09ce4b9efe	Move single letter 'P' prefix out of multiclass now that tablegen allows defm to start with #NAME. This makes instruction names more searchable again. llvm-svn: 171141	2012-12-27 06:34:54 +00:00
Craig Topper	8f0b73942e	Update tablegen parser to allow defm names to start with #NAME. llvm-svn: 171140	2012-12-27 06:32:52 +00:00
Craig Topper	396cb795bc	Add hasSideEffects=0 to some shift and rotate instructions. None of which are currently used by code generation. llvm-svn: 171137	2012-12-27 03:35:44 +00:00
Craig Topper	c7910828e4	Mark the divide instructions as hasSideEffects=0. llvm-svn: 171136	2012-12-27 03:01:18 +00:00
Eric Christopher	3bf29fda91	For the dwarf5 split debug info code split out the string section per compile unit/skeleton compile unit. Update tests accordingly. llvm-svn: 171133	2012-12-27 02:14:01 +00:00
Eric Christopher	c8a88ee691	FileCheck-ize. llvm-svn: 171132	2012-12-27 02:13:58 +00:00
Eric Christopher	d6152aabbb	FileCheck-ize. llvm-svn: 171131	2012-12-27 02:13:55 +00:00
Craig Topper	5b807aaa38	Add hasSideEffects=0 to CMP*rr_REV. llvm-svn: 171130	2012-12-27 02:08:46 +00:00
Nadav Rotem	b3f6751df5	whitespace llvm-svn: 171129	2012-12-27 02:04:12 +00:00
Craig Topper	89e8607755	Add mayLoad, mayStore, and hasSideEffects tags to BT/BTS/BTR/BTC instructions. Shouldn't change any functionality since they don't have patterns to select them. llvm-svn: 171128	2012-12-27 02:01:33 +00:00
Eric Christopher	5a6acfa4c8	Right now all of the relocations are 32-bit dwarf, and the relocation information doesn't return an addend for Rel relocations. Go ahead and use this information to fix relocation handling inside dwarfdump for 32-bit ELF REL. llvm-svn: 171126	2012-12-27 01:07:07 +00:00
Nadav Rotem	5350cd314b	If all of the write objects are identified then we can vectorize the loop even if the read objects are unidentified. PR14719. llvm-svn: 171124	2012-12-26 23:30:53 +00:00
Craig Topper	c557343956	Fix operands and encoding form for ARPL instruction. Register form had and reversed. Memory form writes memory, but was marked as MRMSrcMem. llvm-svn: 171123	2012-12-26 23:27:57 +00:00
Craig Topper	d47a70de9f	Add hasSideEffects=0 to some atomic instructions. llvm-svn: 171122	2012-12-26 23:08:12 +00:00
Craig Topper	af2372087b	Mark the AL/AX/EAX forms of the basic arithmetic operations as never having side effects. llvm-svn: 171121	2012-12-26 22:19:23 +00:00
Nick Lewycky	fca2acb618	80 columns. No functionality change. llvm-svn: 171120	2012-12-26 22:00:49 +00:00
Nick Lewycky	90053a1214	Remove mid-optimizer warning. This situation should be handled differently, such as by a compiler warning, a check in clang -fsanitize=undefined, being optimized to unreachable, or a combination of the above. PR14722. llvm-svn: 171119	2012-12-26 22:00:35 +00:00
Craig Topper	1b8c0750ee	Mark all the _REV instructions as not having side effects. They aren't really emitted by the backend, but it reduces the number of instructions in the output files with unmodelled side effects to make auditing easier. llvm-svn: 171118	2012-12-26 21:30:22 +00:00
Craig Topper	18f2675e9b	Remove a special conditional setting of neverHasSideEffects if the instruction didn't have a pattern. This was leftover from when tablegen used to complain if things were already inferred from patterns. llvm-svn: 171117	2012-12-26 21:04:30 +00:00
Nadav Rotem	0bbf81e311	Update the docs with the new workload that was added. llvm-svn: 171115	2012-12-26 19:45:00 +00:00
Nadav Rotem	3f7c4f36ba	LoopVectorizer: Optimize the vectorization of consecutive memory access when the iteration step is -1 llvm-svn: 171114	2012-12-26 19:08:17 +00:00
Eli Bendersky	8d5f8dc485	Fix comment typo llvm-svn: 171113	2012-12-26 18:15:42 +00:00
Evgeniy Stepanov	5eb5bf8b46	[msan] Raise alignment of origin stores/loads when possible. Origin alignment is as high as the alignment of the corresponding application location, but never less than 4. llvm-svn: 171110	2012-12-26 11:55:09 +00:00
Evgeniy Stepanov	d8be0c510c	[msan] Expand the file comment with track-origins info. llvm-svn: 171109	2012-12-26 10:59:00 +00:00
Benjamin Kramer	d14720dced	Fix quoting in configure. Patch by Krzysztof Parzyszek! llvm-svn: 171108	2012-12-26 10:48:49 +00:00
Craig Topper	24f316e4db	Merge still more SSE/AVX instruction definitions. llvm-svn: 171103	2012-12-26 07:54:43 +00:00
Craig Topper	af629e2700	Merge more SSE/AVX instruction definitions. llvm-svn: 171102	2012-12-26 07:20:35 +00:00
NAKAMURA Takumi	bf99a426cb	TableGen/FixedLenDecoderEmitter.cpp: Fix a potential mask overflow in fieldFromInstruction(). Reported by Yang Yongyong, thanks! llvm-svn: 171101	2012-12-26 06:43:14 +00:00
Nadav Rotem	a1d2436b5f	revert an accidental commit. llvm-svn: 171098	2012-12-26 06:16:03 +00:00
Craig Topper	65fe30450d	Fix 80 column violation. llvm-svn: 171097	2012-12-26 06:15:53 +00:00
Craig Topper	f4d0fe8fcd	Fix class name in comment. llvm-svn: 171096	2012-12-26 06:15:09 +00:00
Craig Topper	59747c4dbd	Merge SSE/AVX PCMPEQ/PCMPGT instruction definitions. llvm-svn: 171095	2012-12-26 06:14:15 +00:00
Nadav Rotem	7375d35711	Doc: add fmuladd to the list of vectorizeable functions. Thanks hfinkel. llvm-svn: 171094	2012-12-26 06:03:35 +00:00
Craig Topper	8a48677586	Remove 'v' from mnemonic to fix asm matching failures. llvm-svn: 171093	2012-12-26 06:02:15 +00:00
Craig Topper	b4ef0fa3a1	Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for a bunch of SSE2 integer arithmetic instructions. llvm-svn: 171092	2012-12-26 05:49:15 +00:00
Nadav Rotem	5267bb71b8	Reformat the docs. llvm-svn: 171091	2012-12-26 04:59:20 +00:00
Nadav Rotem	0e1d662d56	white space llvm-svn: 171090	2012-12-26 04:58:12 +00:00
Craig Topper	a2594dd5f0	Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for PAND/POR/PXOR/PANDN llvm-svn: 171087	2012-12-26 04:36:03 +00:00
Craig Topper	97730a0d6a	Merge an AVX/SSE 256-bit and 128-bit multiclass. llvm-svn: 171086	2012-12-26 03:56:47 +00:00
Craig Topper	8b59746390	Mark VANDNPD/VANDNPDS as not commutable. llvm-svn: 171085	2012-12-26 03:48:10 +00:00
NAKAMURA Takumi	40aa3285f4	llvm/test/CodeGen/X86: FileCheck-ize two tests in r171083. llvm-svn: 171084	2012-12-26 03:19:30 +00:00
NAKAMURA Takumi	334f685328	llvm/test/CodeGen/X86: Disable avx in two tests corresponding to r171082. llvm-svn: 171083	2012-12-26 03:08:55 +00:00
Craig Topper	81d1e596bb	Remove alignment from a bunch more VEX encoded operations in the folding tables. llvm-svn: 171082	2012-12-26 02:44:47 +00:00
Craig Topper	b2922164f0	Remove alignment from folding table for VMOVUPD as an unaligned instruction it shouldn't require alignment... llvm-svn: 171081	2012-12-26 02:14:19 +00:00
Craig Topper	d09a9af9b6	Remove alignment requirements from (V)EXTRACTPS. This instruction does 32-bit stores which aren't required to be aligned on SSE or AVX. llvm-svn: 171080	2012-12-26 01:47:12 +00:00
Hal Finkel	30e95a8ebb	BBVectorize: Use VTTI to compute costs for intrinsics vectorization For the time being this includes only some dummy test cases. Once the generic implementation of the intrinsics cost function does something other than assuming scalarization in all cases, or some target specializes the interface, some real test cases can be added. Also, for consistency, I changed the type of IID from unsigned to Intrinsic::ID in a few other places. llvm-svn: 171079	2012-12-26 01:36:57 +00:00
Craig Topper	caef1c5d86	Remove alignment requirement from VCVTSS2SD in folding tables. Reverting r171049. This instruction doesn't require alignment. llvm-svn: 171078	2012-12-26 00:35:47 +00:00
Hal Finkel	b44f890133	LoopVectorize: Enable vectorization of the fmuladd intrinsic llvm-svn: 171076	2012-12-25 23:21:29 +00:00
Hal Finkel	2a456112ec	BBVectorize: Enable vectorization of the fmuladd intrinsic llvm-svn: 171075	2012-12-25 22:36:08 +00:00
Hal Finkel	2ebe6d08cd	Loosen scheduling restrictions on the PPC dcbt intrinsic As with the prefetch intrinsic to which it maps, simply have dcbt marked as reading from and writing to its arguments instead of having unmodeled side effects. While this might cause unwanted code motion (because aliasing checks don't really capture cache-line sharing), it is more important that prefetches in unrolled loops don't block the scheduler from rearranging the unrolled loop body. llvm-svn: 171073	2012-12-25 18:51:18 +00:00
Hal Finkel	1b5ff08d43	Expand PPC64 atomic load and store Use of store or load with the atomic specifier on 64-bit types would cause instruction-selection failures. As with the 32-bit case, these can use the default expansion in terms of cmp-and-swap. llvm-svn: 171072	2012-12-25 17:22:53 +00:00
Evgeniy Stepanov	f19c086d1e	[msan] Fix handling of vectors of pointers. VectorType::getInteger() can not be used with them, because pointer size depends on the target. llvm-svn: 171070	2012-12-25 16:04:38 +00:00
Evgeniy Stepanov	ec8371283b	[msan] Fix handling of select with vector condition. llvm-svn: 171069	2012-12-25 14:56:21 +00:00
Benjamin Kramer	a9f265ee98	Harden test so it's not affected by changes to compare lowering. This only failed on hosts that don't have SSE41. llvm-svn: 171066	2012-12-25 13:23:23 +00:00
Benjamin Kramer	81b5a8fd2e	X86: Shave off one shuffle from the pcmpeqq sequence for SSE2 by making use of and commutativity. llvm-svn: 171064	2012-12-25 13:09:08 +00:00
Benjamin Kramer	df4af41b9b	X86: Custom lower <2 x i64> eq and ne when SSE41 is not available. pcmpeqd, pshufd, pshufd, pand is cheaper than unpack + cmpq, sbbq, cmpq, sbbq + pack. Small speedup on loop-vectorized viterbi (-march=core2). llvm-svn: 171063	2012-12-25 12:54:19 +00:00
Alexey Samsonov	788381b8ac	ASan: initialize callbacks from ASan module pass in a separate function for consistency llvm-svn: 171061	2012-12-25 12:28:20 +00:00
Alexey Samsonov	1e3f7ba8f7	ASan: move stack poisoning logic into FunctionStackPoisoner struct llvm-svn: 171060	2012-12-25 12:04:36 +00:00
Nick Lewycky	d192517cf3	Fix whitespace. No functionality change. llvm-svn: 171051	2012-12-25 06:13:25 +00:00
Nadav Rotem	00410ae625	VCVTSS2SD requires a strict alignment. Thanks Elena. llvm-svn: 171049	2012-12-25 03:29:18 +00:00
Bob Wilson	fe73ac34c5	Rename LLVMContext diagnostic handler types and functions. These are now generally used for all diagnostics from the backend, not just for inline assembly, so this drops the "InlineAsm" from the names. No functional change. (I've left aliases for the old names but only for long enough to let me switch over clang to use the new ones.) llvm-svn: 171047	2012-12-25 00:07:12 +00:00
NAKAMURA Takumi	04a664e92e	[CMake] AddLLVM.cmake: Tweak the corner case that "check-all" doesn't have any tests. "check-all" can be executed with 0 status, "check-all does nothing, no tools built." LLVM_EXTERNAL_CLANG_BUILD=OFF LLVM_BUILD_TOOLS=OFF can reproduce this. Oscar Fuentes reported this. Thank you. llvm-svn: 171046	2012-12-24 22:43:59 +00:00
Nick Lewycky	521e0d59f3	Quiet gcc's -Wparentheses warning. No functionality change. llvm-svn: 171044	2012-12-24 19:58:45 +00:00
Nick Lewycky	fb43258080	Fix typo "Makre" -> "Make". llvm-svn: 171043	2012-12-24 19:55:47 +00:00
Benjamin Kramer	9d46110ff1	Use a std::string rather than a dynamically allocated char* buffer. This affords us to use std::string's allocation routines and use the destructor for the memory management. Switching to that also means that we can use operator==(const std::string&, const char *) to perform the string comparison rather than resorting to libc functionality (i.e. strcmp). Patch by Saleem Abdulrasool! Differential Revision: http://llvm-reviews.chandlerc.com/D230 llvm-svn: 171042	2012-12-24 19:23:30 +00:00
Bob Wilson	4ed23578da	Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368> When the backend is used from clang, it should produce proper diagnostics instead of just printing messages to errs(). Other clients may also want to register their own error handlers with the LLVMContext, and the same handler should work for warnings in the same way as the existing emitError methods. llvm-svn: 171041	2012-12-24 18:15:21 +00:00
Dmitri Gribenko	06b84eb414	Fix a typo introduced in r168577: FlAGS -> FLAGS (note the lowercase ell) Now we really pass -Wcovered-switch-default if the compiler supports it. llvm-svn: 171040	2012-12-24 17:52:48 +00:00
Dmitri Gribenko	a6c2729ccc	AutoRegen.sh: update reference to documentation llvm-svn: 171037	2012-12-24 15:01:59 +00:00
NAKAMURA Takumi	1b18db7ea3	llvm/test/CodeGen/X86/fold-vex.ll: Add explicit triple. llvm-svn: 171029	2012-12-24 11:14:06 +00:00
Nadav Rotem	3ee6b10dd4	CostModel: We have API for checking the costs of known shuffles. This patch adds support for the insert-subvector and extract-subvector kinds. llvm-svn: 171027	2012-12-24 10:04:03 +00:00
Elena Demikhovsky	517afbff01	Added 6 more value types: v32i1, v64i1, v32i16, v32i8, v64i8, v8f64 llvm-svn: 171026	2012-12-24 10:03:57 +00:00
Elena Demikhovsky	2fdeb6da8d	Removed "static" from "__jit_debug_descriptor" because "static" adds C++ mangling prefix to this symbol. llvm-svn: 171025	2012-12-24 09:42:27 +00:00
Nadav Rotem	dc0ad92b64	Some x86 instructions can load/store one of the operands to memory. On SSE, this memory needs to be aligned. When these instructions are encoded in VEX (on AVX) there is no such requirement. This changes the folding tables and removes the alignment restrictions from VEX-encoded instructions. llvm-svn: 171024	2012-12-24 09:40:33 +00:00
Nadav Rotem	5f7c12cfbd	LoopVectorizer: When checking for vectorizable types, also check the StoreInst operands. PR14705. llvm-svn: 171023	2012-12-24 09:14:18 +00:00
Nadav Rotem	7e1599e100	Change the codegen Cost Model API for shuffles. This patch removes the API for broadcast and adds a more general API that accepts an enum of known shuffles. llvm-svn: 171022	2012-12-24 08:57:47 +00:00
Alexey Samsonov	098842b401	Fix typo in comments llvm-svn: 171021	2012-12-24 08:52:53 +00:00
Nadav Rotem	99868e4f9d	Update the docs of the cost model. llvm-svn: 171016	2012-12-24 05:51:12 +00:00
NAKAMURA Takumi	fec2ea1b3d	llvm/MC/MCMachObjectWriter.h: ComputeSymbolTable(): Prune one description in the comment. [-Wdocumentation] /// \param StringIndexMap [out] - Map from symbol names to offsets in the string table. llvm-svn: 171010	2012-12-24 01:24:04 +00:00
Nadav Rotem	bd5d1d832a	LoopVectorizer: Fix an endless loop in the code that looks for reductions. The bug was in the code that detects PHIs in if-then-else block sequence. PR14701. llvm-svn: 171008	2012-12-24 01:22:06 +00:00
Dmitri Gribenko	32e0aa3a50	Documentation: fix typos reported in PR13866 llvm-svn: 171006	2012-12-23 18:46:11 +00:00
Nadav Rotem	cf9999d9d5	CostModel: Change the default target-independent implementation for finding the cost of arithmetic functions. We now assume that the cost of arithmetic operations that are marked as Legal or Promote is low, but ops that are marked as custom are higher. llvm-svn: 171002	2012-12-23 17:31:23 +00:00
Benjamin Kramer	28691400dd	LoopVectorize: Fix accidentally inverted condition. llvm-svn: 171001	2012-12-23 13:21:41 +00:00
Benjamin Kramer	855ba03408	LoopVectorize: For scalars and void types there is no need to compute vector insert/extract costs. Fixes an assert during the build of oggenc in the test suite. llvm-svn: 171000	2012-12-23 13:19:18 +00:00
Nadav Rotem	aa92ea4f12	We are not ready to estimate the cost of integer expansions based on the number of parts. This test is too noisy. llvm-svn: 170999	2012-12-23 09:11:07 +00:00
Sean Silva	ff120c7fc5	docs: Add link to external LLVM backend tutorial. llvm-svn: 170998	2012-12-23 07:34:51 +00:00
Nadav Rotem	b15c69a725	whitespace llvm-svn: 170997	2012-12-23 07:33:44 +00:00
Nadav Rotem	1bef5a0509	Rename a function. llvm-svn: 170996	2012-12-23 07:30:09 +00:00
Nadav Rotem	2cade68025	Loop Vectorizer: Update the cost model of scatter/gather operations and make them more expensive. llvm-svn: 170995	2012-12-23 07:23:55 +00:00
Craig Topper	1bef2c859f	Remove trailing whitespace. llvm-svn: 170991	2012-12-22 19:15:35 +00:00
Craig Topper	4c94775198	Remove trailing whitespace llvm-svn: 170990	2012-12-22 18:09:02 +00:00
Jakob Stoklund Olesen	7bca670a8b	Remove a special case that doesn't seem necessary any longer. Back when this exception was added, it was skipping a lot more code, but now it just looks like a premature optimization. llvm-svn: 170989	2012-12-22 17:33:22 +00:00
Jakob Stoklund Olesen	b089483993	Use getNumOperands() instead of Operands.size(). The representation of the Operands array is going to change soon so it can be allocated from a BumpPtrAllocator. llvm-svn: 170988	2012-12-22 17:13:06 +00:00
Benjamin Kramer	76268ac682	X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available. pmuludq is slow, but it turns out that all the unpacking and packing of the scalarized mul is even slower. 10% speedup on loop-vectorized paq8p. llvm-svn: 170985	2012-12-22 16:07:56 +00:00
Benjamin Kramer	b2f0a2bd4b	X86: Emit vector sext as shuffle + sra if vpmovsx is not available. Also loosen the SSSE3 dependency a bit, expanded pshufb + psra is still better than scalarized loads. Fixes PR14590. llvm-svn: 170984	2012-12-22 11:34:28 +00:00
Craig Topper	fc5ee3516c	Add a comma to fix the build. llvm-svn: 170982	2012-12-22 08:22:01 +00:00
Craig Topper	c9dcbe6987	Use a negative value to represent INVALID_SIMPLE_VALUE_TYPE instead of 256. It's much cheaper for the isSimple() checks to look for values less than 0 rather than a value greater than 255. This shaves ~8k off the size of the llc binary on x86-64. llvm-svn: 170981	2012-12-22 08:16:17 +00:00
Craig Topper	8289b327ac	Add vAny and Metadata to the switch in getSizeInBits for consistency since every other enum was listed. llvm-svn: 170977	2012-12-22 03:08:37 +00:00
Daniel Dunbar	fa40268fb7	[utils] Tweak utils/clang-parse-diagnostics-file to ignore autoconf diagnostics. - Also, don't print headers if we aren't going to print any diagnostics. llvm-svn: 170973	2012-12-22 00:47:06 +00:00
Bill Wendling	c79e42c5ce	Change 'AttrVal' to 'AttrKind' to better reflect that it's a kind of attribute instead of the value of the attribute. llvm-svn: 170972	2012-12-22 00:37:52 +00:00
Richard Smith	2450f1c2c5	Fix some undefined behavior when parsing YAML input: don't try to compare an uninitialized value against a default value. Found by -fsanitize=enum. llvm-svn: 170970	2012-12-22 00:31:54 +00:00
Richard Smith	045e4f1365	Don't call back() on an empty SmallVector. Found by -fsanitize=enum! llvm-svn: 170968	2012-12-22 00:15:13 +00:00
Nadav Rotem	d5aae980cb	In some cases, due to scheduling constraints we copy the EFLAGS. The only way to read the eflags is using push and pop. If we don't adjust the stack then we run over the first frame index. This is not something that we want to do, so we have to make sure that our machine function does not copy the flags. If it does then we have to emit the prolog that adjusts the stack. rdar://12896831 llvm-svn: 170961	2012-12-21 23:48:49 +00:00
Akira Hatanaka	6ac2fc4976	[mips] Refactor subword-swap, EXT/INS, load-effective-address and read-hardware instructions. llvm-svn: 170956	2012-12-21 23:21:32 +00:00
Akira Hatanaka	beea8a34c3	[mips] Refactor SYNC and multiply/divide instructions. llvm-svn: 170955	2012-12-21 23:17:36 +00:00
Akira Hatanaka	31ddec5887	[mips] Refactor BAL instructions. llvm-svn: 170954	2012-12-21 23:15:59 +00:00
Akira Hatanaka	d6b694f036	[mips] Fix encoding of BAL instruction. Also, fix assembler test case which was not catching the error. llvm-svn: 170953	2012-12-21 23:13:59 +00:00
Akira Hatanaka	a158042a56	[mips] Refactor jump, jump register, jump-and-link and nop instructions. llvm-svn: 170952	2012-12-21 23:03:50 +00:00
Akira Hatanaka	e1826d7464	[mips] Refactor load/store left/right and load-link and store-conditional instructions. llvm-svn: 170950	2012-12-21 23:01:24 +00:00
Akira Hatanaka	d9bf8424e5	[mips] Refactor load/store instructions. llvm-svn: 170948	2012-12-21 22:58:55 +00:00
Akira Hatanaka	b59b047fbe	[mips] Remove unnecessary isPseudo parameter. llvm-svn: 170947	2012-12-21 22:57:26 +00:00
Akira Hatanaka	e738efc95b	[mips] Refactor LUI instruction. llvm-svn: 170944	2012-12-21 22:46:07 +00:00
Akira Hatanaka	895e1cb2aa	[mips] Refactor count leading zero or one instructions. llvm-svn: 170942	2012-12-21 22:43:58 +00:00
Akira Hatanaka	4f4c4aa05e	[mips] Refactor sign-extension-in-register instructions. llvm-svn: 170940	2012-12-21 22:41:52 +00:00
Akira Hatanaka	b14c6e4e5f	[mips] Refactor instructions which copy from and to HI/LO registers. llvm-svn: 170939	2012-12-21 22:39:17 +00:00
Akira Hatanaka	9e89195dce	[mips] Refactor logical NOR instructions. llvm-svn: 170937	2012-12-21 22:35:47 +00:00
Akira Hatanaka	ac10697207	[mips] Move instruction definitions in MipsInstrInfo.td. llvm-svn: 170936	2012-12-21 22:33:43 +00:00
Tom Stellard	09ef8425e9	R600: Coding style - remove empty spaces from the beginning of functions No functionality change. llvm-svn: 170923	2012-12-21 20:12:02 +00:00
Tom Stellard	41398026e7	R600: Fix MAX_UINT definition Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170922	2012-12-21 20:12:01 +00:00
Tom Stellard	4fa7ac29f1	R600: Add SHADOWCUBE to TEX_SHADOW pattern Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170921	2012-12-21 20:11:59 +00:00
Benjamin Kramer	5521b94b07	Cleanup compiler warnings on discarding type qualifiers in casts. Switch to C++ style casts. Patch by Saleem Abdulrasool! Differential Revision: http://llvm-reviews.chandlerc.com/D204 llvm-svn: 170917	2012-12-21 19:09:53 +00:00
Jakob Stoklund Olesen	0edb164723	Add a missing assertion, the null register has no register units. llvm-svn: 170916	2012-12-21 18:38:09 +00:00
Benjamin Kramer	b4688f84bd	try to unbreak ppc buildbots. llvm-svn: 170913	2012-12-21 18:11:45 +00:00
Benjamin Kramer	d0eb39232c	Teach sort_includes.py to drop duplicated includes. llvm-svn: 170911	2012-12-21 18:00:08 +00:00
Benjamin Kramer	82d1c371e2	X86: Match pmin/pmax as a target specific dag combine. This occurs during vectorization. Part of PR14667. llvm-svn: 170908	2012-12-21 17:46:58 +00:00
Roman Divacky	a229186a82	Remove duplicate includes. llvm-svn: 170902	2012-12-21 17:06:44 +00:00
Tom Stellard	a8b0351720	R600: Expand vec4 INT <-> FP conversions llvm-svn: 170901	2012-12-21 16:33:24 +00:00
Benjamin Kramer	4669d18893	X86: Match the SSE/AVX min/max vector ops using a custom node instead of intrinsics This is very mechanical, no functionality change. Preparation for PR14667. llvm-svn: 170898	2012-12-21 14:04:55 +00:00
Duncan Sands	0ac8473db0	Test that a landingpad gets the name provided when it was created (see commit 170318). llvm-svn: 170886	2012-12-21 12:03:03 +00:00
Evgeniy Stepanov	4fbc0d08bf	[msan] Remove unreachable blocks before instrumenting a function. llvm-svn: 170883	2012-12-21 11:18:49 +00:00
Nadav Rotem	eacbb731d3	Add a missing "virtual" keyword. llvm-svn: 170842	2012-12-21 05:02:12 +00:00
Nadav Rotem	3b850b70b3	Enable if-conversion. llvm-svn: 170841	2012-12-21 04:47:54 +00:00
Quentin Colombet	b1b66e7a25	Add ARM cortex-r5 subtarget. llvm-svn: 170840	2012-12-21 04:35:05 +00:00
Rafael Espindola	73bf9fa7ba	Don't skip __DWARF, Now that we don't merge section and segment names, we don't need to skip the segment name to get to the section name. llvm-svn: 170839	2012-12-21 04:08:03 +00:00
Rafael Espindola	a9f810b6b5	Add a function to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be inform the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. The main difference from the previous patch is that it doesn't use InMemoryStruct. It is extremely dangerous: if the endians match it returns a pointer to the file buffer, if not, it returns a pointer to an internal buffer that is overwritten in the next API call. We should change all of this code to use support::detail::packed_endian_specific_integral like ELF, but since these functions only handle strings, they work with big and little endian machines as is. I have tested this by installing ubuntu 12.10 ppc on qemu, that is why it took so long :-) llvm-svn: 170838	2012-12-21 03:47:03 +00:00
Evan Cheng	59421aee3d	Add targets to skip running the GC passes. llvm-svn: 170836	2012-12-21 02:57:04 +00:00
Evan Cheng	99cafb1db2	Every pass deserves a name, even codegenprep. llvm-svn: 170831	2012-12-21 01:48:14 +00:00
Nadav Rotem	6d4fdd6d2c	Improve the X86 cost model for loads and stores. llvm-svn: 170830	2012-12-21 01:33:59 +00:00
Nadav Rotem	a4b53f20a3	BB-Vectorizer: Check the cost of the store pointer type and not the return type, which is void. A number of test cases fail after adding the assertion in TTImpl. llvm-svn: 170828	2012-12-21 01:24:36 +00:00
Reed Kotler	93f778d2bd	Add test case for r170674 llvm-svn: 170823	2012-12-21 00:55:10 +00:00
Reed Kotler	9bff1ead0e	Call llvm_unreachable instead of assert. llvm-svn: 170822	2012-12-21 00:44:59 +00:00
Sean Silva	850861df62	docs: More robust image scaling fix. Hopefully these benchmarks will be updated in the future, so avoid hardcoding image dimensions. llvm-svn: 170819	2012-12-21 00:28:42 +00:00
Sean Silva	35915c6459	docs: Prevent image scaling. Tell the image to be its natural size. llvm-svn: 170816	2012-12-21 00:20:25 +00:00
Nadav Rotem	e7785686a5	Fix a bug in the code that checks if we can vectorize loops while using dynamic memory bound checks. Before the fix we were able to vectorize this loop from the Livermore Loops benchmark: for ( k=1 ; k<n ; k++ ) x[k] = x[k-1] + y[k]; llvm-svn: 170811	2012-12-21 00:07:35 +00:00
Eric Christopher	6e47b725ff	Move these files over to the debug info directory. llvm-svn: 170810	2012-12-21 00:03:42 +00:00
Sean Silva	e9ba463632	docs: Try out nosidebar. Please squawk if you find this appalling or otherwise don't like it. llvm-svn: 170803	2012-12-20 23:35:22 +00:00
Sean Silva	287e7d275c	docs: Cleanup trailing whitespace. llvm-svn: 170799	2012-12-20 22:59:36 +00:00
Jakob Stoklund Olesen	2455b58551	Require the two-argument MI::addOperand(MF, MO) for dangling instructions. Instructions that are inserted in a basic block can still be decorated with addOperand(MO). Make the two-argument addOperand() function contain the actual implementation. This function will now always have a valid MF reference that it can use for memory allocation. llvm-svn: 170798	2012-12-20 22:54:05 +00:00
Jakob Stoklund Olesen	33f5d1492d	Add an MF argument to MI::copyImplicitOps(). This function is often used to decorate dangling instructions, so a context reference is required to allocate memory for the operands. Also add a corresponding MachineInstrBuilder method. llvm-svn: 170797	2012-12-20 22:54:02 +00:00
Jakob Stoklund Olesen	ac4210eacb	Use two-arg addOperand(MF, MO) internally in MachineInstr when possible. llvm-svn: 170796	2012-12-20 22:53:58 +00:00
Jakob Stoklund Olesen	2ea203694d	MachineInstrBuilderize ARM. llvm-svn: 170795	2012-12-20 22:53:55 +00:00
Jakob Stoklund Olesen	4255c96aed	MachineInstrBuilderize NVPTX. llvm-svn: 170794	2012-12-20 22:53:53 +00:00
Eli Bendersky	75a7a338fc	Fix an uninitialized member variable that may have caused sporadic failures for code that wasn't even in bundling mode. llvm-svn: 170793	2012-12-20 22:51:52 +00:00
Sean Silva	e140b2ee67	docs: actually indent these consistently llvm-svn: 170792	2012-12-20 22:49:13 +00:00
Sean Silva	8c44a4733c	docs: Indent consistently in code examples. llvm-svn: 170791	2012-12-20 22:47:41 +00:00
Sean Silva	99e12f91a6	docs: Improve navigation for Vectorizers.rst Add links in the intro paragraph. Add table of contents. llvm-svn: 170790	2012-12-20 22:42:20 +00:00
Sean Silva	fd706f7da9	docs: bring back link for reddit. llvm-svn: 170776	2012-12-20 22:24:37 +00:00
Eric Christopher	48fef599a4	Whitespace and 80-column cleanup. llvm-svn: 170771	2012-12-20 21:58:40 +00:00
Eric Christopher	e698f53740	Start splitting out the debug string section handling by moving it into the DwarfUnits class. llvm-svn: 170770	2012-12-20 21:58:36 +00:00
Sean Silva	eae2d90508	docs: Make document name congruent with title. Hopefully nobody has linked to it yet... OK'd by Nadav. llvm-svn: 170768	2012-12-20 21:50:41 +00:00
Bill Wendling	66e978f904	Some random comment, naming, and format changes. Rename the AttributeImpl* from Attrs to pImpl to be consistent with other code. Add comments where none were before. Or doxygen-ify other comments. llvm-svn: 170767	2012-12-20 21:28:43 +00:00
Jakob Stoklund Olesen	00b28ecfae	Remove two dead functions. llvm-svn: 170766	2012-12-20 21:12:42 +00:00
Bob Wilson	7bba4f8957	Revert "Adding support for llvm.arm.neon.vaddl[su].* and" This reverts r170694. The operations can be represented in IR without adding any new intrinsics. llvm-svn: 170765	2012-12-20 21:09:38 +00:00
Nadav Rotem	2ababf68d7	LoopVectorize: Fix a bug in the scalarization of instructions. Before if-conversion we could check if a value is loop invariant if it was declared inside the basic block. Now that loops have multiple blocks this check is incorrect. This fixes External/SPEC/CINT95/099_go/099_go llvm-svn: 170756	2012-12-20 20:24:40 +00:00
Evan Cheng	ddc0cb6dc5	On some ARM cpus, flags setting movs with shifter operand, i.e. lsl, lsr, asr, are more expensive than the non-flag setting variant. Teach thumb2 size reduction pass to avoid generating them unless we are optimizing for size. rdar://12892707 llvm-svn: 170728	2012-12-20 19:59:30 +00:00
Eli Bendersky	4cfb5b9e64	Change Lit error redirection to FileCheck to a more common syntax since it can potentially cause some bots to fail. llvm-svn: 170726	2012-12-20 19:54:02 +00:00
Eli Bendersky	f658e92724	Add a largish auto-generated test for the aligned bundling feature, along with the script generating it. The test should never be modified manually. If anyone needs to change it, please change the script and re-run it. The script is placed into utils/testgen - I couldn't think of a better place, and after some discussion on IRC this looked like a logical location. llvm-svn: 170720	2012-12-20 19:16:57 +00:00
Eli Bendersky	4c4f11eb0d	Tests for the aligned bundling support added in r170718 llvm-svn: 170719	2012-12-20 19:07:30 +00:00
Eli Bendersky	f483ff9204	Aligned bundling support. Following the discussion here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-December/056754.html The proposal and implementation are fully documented here: https://sites.google.com/a/chromium.org/dev/nativeclient/pnacl/aligned-bundling-support-in-llvm Tests will follow shortly. llvm-svn: 170718	2012-12-20 19:05:53 +00:00
Jakob Stoklund Olesen	2705333253	Use MachineInstrBuilder for PHI nodes in SelectionDAGISel. llvm-svn: 170716	2012-12-20 18:46:29 +00:00
Jim Grosbach	759292c93f	Fix inadvertent delete of 'has'. llvm-svn: 170713	2012-12-20 18:09:48 +00:00
Jakob Stoklund Olesen	b109a7b430	Use MachineInstrBuilder in InstrEmitter. This is supposed to be a mechanical change with no functional effects. InstrEmitter can generate all types of MachineOperands which revealed that MachineInstrBuilder was missing a few methods, added by this patch. Besides providing a context pointer to MI::addOperand(), MachineInstrBuilder seems like a better fit for this code. llvm-svn: 170712	2012-12-20 18:08:09 +00:00
Jakob Stoklund Olesen	f623e9870d	Use MachineInstrBuilder in a few CodeGen passes. This automatically passes a context pointer to MI->addOperand(). llvm-svn: 170711	2012-12-20 18:08:06 +00:00
Rafael Espindola	642c7cd56e	Simplify the testcase a bit. I checked that it would still crash llc before the corresponding fix. llvm-svn: 170709	2012-12-20 17:47:27 +00:00
Nadav Rotem	8b20c0a814	Loop Vectorizer: turn-off if-conversion. llvm-svn: 170708	2012-12-20 17:42:53 +00:00
James Molloy	4f6fb953a7	Add a new attribute, 'noduplicate'. If a function contains a noduplicate call, the call cannot be duplicated - Jump threading, loop unrolling, loop unswitching, and loop rotation are inhibited if they would duplicate the call. Similarly inlining of the function is inhibited, if that would duplicate the call (in particular inlining is still allowed when there is only one callsite and the function has internal linkage). llvm-svn: 170704	2012-12-20 16:04:27 +00:00
Roman Divacky	ff95a1dc12	Remove MCTargetAsmLexer and its derived classes now that edis, its only user, is gone. llvm-svn: 170699	2012-12-20 14:43:30 +00:00
Renato Golin	6b2ea4a48f	Adding support for llvm.arm.neon.vaddl[su].* and llvm.arm.neon.vsub[su].* intrinsics. Patch by Pete Couperus <pjcoup@gmail.com> llvm-svn: 170694	2012-12-20 13:52:11 +00:00
NAKAMURA Takumi	a1d528baa5	llvmbuild/main.py: Let LibraryDependencies.inc deterministic. FYI, llvm and clang can be built deterministically between stage 2 and stage3, among iterative clean rebuilds, with GNU ar; configure --disable-timestamps make AR.Flags=crsD RANLIB=echo llvm-svn: 170682	2012-12-20 10:35:18 +00:00
Craig Topper	ae48cb2e5a	Formatting fixes. Remove some unnecessary 'else' after 'return'. No functional change. llvm-svn: 170676	2012-12-20 07:15:54 +00:00
Craig Topper	9d4171afed	Removing trailing whitespace llvm-svn: 170675	2012-12-20 07:09:41 +00:00
Reed Kotler	d11acc7dc0	Implement cfi_def_cfa_offset. "Make check" test case for this coming in the next few days but it's already tested a lot from test-suite and works fine. This patch completes almost 100% pass of test-suite for mips 16. llvm-svn: 170674	2012-12-20 06:59:37 +00:00
Reed Kotler	8965d24a2a	There is one more patch to finish large frames. Make sure we assert on code that has large frames which will not yet compile correctly. llvm-svn: 170673	2012-12-20 06:57:00 +00:00
Jyotsna Verma	56605448f2	Add constant extender support to GP-relative load/store instructions. llvm-svn: 170672	2012-12-20 06:52:46 +00:00
Jyotsna Verma	bf75aaf53e	Add TSFlags to ALU32 type instructions for constant-extender/Relationship maps. llvm-svn: 170671	2012-12-20 06:45:39 +00:00
Reed Kotler	7bff8f1d7a	set register class properly for mips16 here llvm-svn: 170669	2012-12-20 06:06:35 +00:00
Rafael Espindola	fb8ac2df09	Undefine PPC harder. This was causing a build failure while trying to build on ppc ubuntu 12.10 with cmake. llvm-svn: 170668	2012-12-20 05:13:09 +00:00
Reed Kotler	92fc33bc97	This assert is overly restrictive and does not work for mips16. llvm-svn: 170667	2012-12-20 05:09:15 +00:00
Reed Kotler	fd633229f7	Turn on register scavenger for Mips 16 We use an unused Mips 32 register for the emergency slot instead of using the stack. llvm-svn: 170665	2012-12-20 04:44:58 +00:00
Akira Hatanaka	e7f1acc7c0	[mips] Refactor SLT (set on less than) instructions. Separate encoding information from the rest. llvm-svn: 170664	2012-12-20 04:27:52 +00:00
Akira Hatanaka	bbd197e9c4	[mips] Refactor unconditional branch instruction. Separate encoding information from the rest. llvm-svn: 170663	2012-12-20 04:22:39 +00:00
Akira Hatanaka	b1527b7505	[mips] Remove asm string parameter from pseudo instructions. Add InstrItinClass parameter. llvm-svn: 170661	2012-12-20 04:20:09 +00:00
Akira Hatanaka	14f9ce0f83	[mips] Delete definition of CPRESTORE instruction. llvm-svn: 170660	2012-12-20 04:15:30 +00:00
Akira Hatanaka	c0ea0bb99b	[mips] Refactor conditional branch instructions with one register operand. Separate encoding information from the rest. llvm-svn: 170659	2012-12-20 04:13:23 +00:00
Richard Smith	4a8e454ab2	Don't use isa<CallInst>(this) in the constructor for CallInst's base class. This has undefined behavior, because the classof implementation attempts to access parts of the not-yet-constructed derived class. Found by clang -fsanitize=vptr. llvm-svn: 170658	2012-12-20 04:11:02 +00:00
Akira Hatanaka	f71ffd29d9	[mips] Refactor conditional branch instructions with two register operands. Separate encoding information from the rest. llvm-svn: 170657	2012-12-20 04:10:13 +00:00
Reed Kotler	d019dbf75e	fix most of remaining issues with large frames. these patches are tested a lot by test-suite but make check tests are forthcoming once the next few patches that complete this are committed. with the next few patches the pass rate for mips16 is near 100% llvm-svn: 170656	2012-12-20 04:07:42 +00:00
Akira Hatanaka	f423672117	[mips] Use "or $r0, $r1, $zero" instead of "addu $r0, $zero, $r1" to copy physical register $r1 to $r0. GNU disassembler recognizes an "or" instruction as a "move", and this change makes the disassembled code easier to read. Original patch by Reed Kotler. llvm-svn: 170655	2012-12-20 04:06:06 +00:00
Richard Smith	15b1e3727b	Fix use-before-construction of X86TargetLowering. llvm-svn: 170654	2012-12-20 04:04:17 +00:00
Richard Smith	e7701ebfec	Don't use -1 as a value of an unsigned 7-bit enumeration; that has undefined behavior and violates the !range constraints we put on loads of this enum. Found by clang -fsanitize=enum. llvm-svn: 170653	2012-12-20 04:02:58 +00:00
Richard Smith	3287fac591	Don't leave IsUnsigned uninitialized in a default-constructed APSInt. Copying such a structure has undefined behavior. Caught by -fsanitize=bool. llvm-svn: 170652	2012-12-20 03:59:24 +00:00
Akira Hatanaka	7d75f9e3d3	[mips] Change the order of template parameters. Move the default parameters to the end. llvm-svn: 170651	2012-12-20 03:52:08 +00:00
Akira Hatanaka	244f9e874c	[mips] Refactor shift instructions with register operands. Separate encoding information from the rest. llvm-svn: 170650	2012-12-20 03:48:24 +00:00
Akira Hatanaka	7f96ad325f	[mips] Refactor shift immediate instructions. Separate encoding information from the rest. llvm-svn: 170649	2012-12-20 03:44:41 +00:00
Akira Hatanaka	ab1b715bf2	[mips] Refactor arithmetic and logic instructions with immediate operands. Separate encoding information from the rest. llvm-svn: 170648	2012-12-20 03:40:03 +00:00
Akira Hatanaka	1b37c4af01	[mips] Refactor arithmetic and logic instructions. Separate encoding information from the rest. llvm-svn: 170647	2012-12-20 03:34:05 +00:00
Sean Silva	fe15616449	docs: Show TOC for GettingStarted.rst. This is a pretty lengthy document, so put the table of contents in your face so that it's easier to scope out the content. This document is a mess currently and needs to be refactored/revised/split-up. llvm-svn: 170646	2012-12-20 03:32:39 +00:00
Akira Hatanaka	73495897b1	[mips] Delete ArithOverflowR and ArithOverflow and use ArithLogicR and ArithLogicI as the instruction base classes. llvm-svn: 170642	2012-12-20 03:00:16 +00:00
Sean Silva	08fd0888cb	docs: Clean up adornments. For whatever reason the usage of '^^^' and '---' adornments were reversed compared to the "canonical" style of the LLVM docs (which is currently "the style used in SphinxQuickstartTemplate.rst"). This change doesn't affect the document structure at all, I'm just doing it for trivial stylistic consistency (the document content is much more important---thanks Nadav for writing this up!). Also, trim the adornments to be the same length as the section names. llvm-svn: 170638	2012-12-20 02:40:45 +00:00
Sean Silva	13ed79c66b	docs: ASCII-fy llvm-svn: 170637	2012-12-20 02:23:25 +00:00
Nadav Rotem	7bdc45b570	Loop Vectorizer: Enable if-conversion. llvm-svn: 170632	2012-12-20 02:00:02 +00:00
Bill Wendling	4607f4bdad	s/AttributesImpl/AttributeImpl/g This is going to apply to Attribute, not Attributes. llvm-svn: 170631	2012-12-20 01:36:59 +00:00
Bob Wilson	3365b80290	Do not introduce vector operations in functions marked with noimplicitfloat. <rdar://problem/12879313> llvm-svn: 170630	2012-12-20 01:36:20 +00:00
Jim Grosbach	f9c2e5e450	Clean up some DOxygen comments. llvm-svn: 170629	2012-12-20 01:14:48 +00:00
Jim Grosbach	23f1f957d5	Clean up some DOxygen comments. llvm-svn: 170628	2012-12-20 01:14:45 +00:00
Richard Smith	a7bb16ad86	Fix an uninitialized member variable, found by -fsanitize=bool. llvm-svn: 170627	2012-12-20 01:05:39 +00:00
Nadav Rotem	28408a20c9	whitespace llvm-svn: 170626	2012-12-20 00:49:56 +00:00
Nadav Rotem	17d745618e	doc: resize the image. llvm-svn: 170622	2012-12-20 00:29:18 +00:00
NAKAMURA Takumi	2a0b40f584	Target/R600: Update MIB according to r170588. llvm-svn: 170620	2012-12-20 00:22:11 +00:00
Nadav Rotem	12da396abc	Doc: update the chart. llvm-svn: 170618	2012-12-20 00:03:36 +00:00
Bill Wendling	6ad6c3b1c2	Add a context so that once we uniquify strings we can access them easily. llvm-svn: 170615	2012-12-19 23:55:43 +00:00
Jim Grosbach	6df94846ec	MC: Add MCInstrDesc::mayAffectControlFlow() method. MC disassembler clients (LLDB) are interested in querying if an instruction may affect control flow other than by virtue of being an explicit branch instruction. For example, instructions which write directly to the PC on some architectures. llvm-svn: 170610	2012-12-19 23:38:53 +00:00
Jim Grosbach	01ab714758	Add isSubRegisterEq() and isSuperRegisterEq(). isSub and isSuper return false if RegA == RegB. Add variants which also include the identity function. llvm-svn: 170609	2012-12-19 23:38:49 +00:00
Jim Grosbach	74c6944a31	Move isSubRegister() and isSuperRegister to MCRegisterInfo. These were defined on TargetRegisterInfo, but they don't use any information that's not available in MCRegisterInfo, so sink them down to be available at the MC layer. llvm-svn: 170608	2012-12-19 23:38:46 +00:00
Jim Grosbach	98e0b8e273	Fix doc comment. '///' not '//'. llvm-svn: 170607	2012-12-19 23:38:44 +00:00
Michael Ilseman	b99f80dea7	Refactor isIntrinsic() to be quicker, and change classof() (and thus, isa<IntrinsicInst>()) to use it. This decreases the number of occurrences of the slow-path string matching performed by getIntrinsicID(). llvm-svn: 170602	2012-12-19 23:17:20 +00:00
Bill Wendling	6848e38daf	s/AttributeListImpl/AttributeSetImpl/g to match the namechange of AttributeList. llvm-svn: 170600	2012-12-19 22:42:22 +00:00
Jakob Stoklund Olesen	8fb0c99a12	Always use addOperand(MF, MO) from MachineInstrBuilder. The single-argument MachineInstr::addOperand(MO) will be removed soon. llvm-svn: 170599	2012-12-19 22:35:46 +00:00
Dmitri Gribenko	349d1a35ff	Add a missing 'else'. Found by grep '} if' No testcase because it is apparently not so trivial to construct. llvm-svn: 170595	2012-12-19 22:13:01 +00:00
Tom Stellard	abdff2ba2d	R600: Add entry in CODE_OWNERS.TXT llvm-svn: 170594	2012-12-19 22:10:35 +00:00
Tom Stellard	1c315d5411	R600: Remove unnecessary VREG alignment. Unlike SGPRs VGPRs don't need to be aligned. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170593	2012-12-19 22:10:34 +00:00
Tom Stellard	e7b907d85c	R600: control flow optimization Branch if we have enough instructions so that it makes sense. Also remove branches if they don't make sense. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170592	2012-12-19 22:10:33 +00:00
Tom Stellard	f8794354b2	R600: New control flow for SI v2 This patch replaces the control flow handling with a new pass which structurizes the graph before transforming it to machine instruction. This has a couple of different advantages and currently fixes 20 piglit tests without a single regression. It is now a general purpose transformation that could not only be used for SI/R6xx, but also for other hardware implementations that use a form of structurized control flow. v2: further cleanup, fixes and documentation Patch by: Christian König Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170591	2012-12-19 22:10:31 +00:00
Eric Christopher	3c5a1914b6	Split out abbreviations for the skeleton info from the rest of the abbreviations. Part of implementing split dwarf. llvm-svn: 170589	2012-12-19 22:02:53 +00:00
Jakob Stoklund Olesen	b159b5ff0d	Remove the explicit MachineInstrBuilder(MI) constructor. Use the version that also takes an MF reference instead. It would technically be possible to extract an MF reference from the MI as MI->getParent()->getParent(), but that would not work for MIs that are not inserted into any basic block. Given the reasonably small number of places this constructor was used at all, I preferred the compile time check to a run time assertion. llvm-svn: 170588	2012-12-19 21:31:56 +00:00
Nadav Rotem	11350aafb4	Fix a bug that was found by building clang with -fsanitize. I introduced it in r166785. PR14291. If TD is unavailable use getScalarSizeInBits, but don't optimize pointers or vectors of pointers. llvm-svn: 170586	2012-12-19 20:47:04 +00:00
Meador Inge	0fbf321af2	docs: Fix title underline warnings Building Vectorizers.rst produces a few warnings of the form: WARNING: Title underline too short. Fixed by adding the extra needed dashes under the title. llvm-svn: 170582	2012-12-19 20:16:40 +00:00
Evan Cheng	eae6d2ccea	LLVM sdisel normalize bit extraction of the form: ((x & 0xff00) >> 8) << 2 to (x >> 6) & 0x3fc This is general goodness since it folds a left shift into the mask. However, the trailing zeros in the mask prevents the ARM backend from using the bit extraction instructions. And worse since the mask materialization may require an addition instruction. This comes up fairly frequently when the result of the bit twiddling is used as memory address. e.g. = ptr[(x & 0xFF0000) >> 16] We want to generate: ubfx r3, r1, #16, #8 ldr.w r3, [r0, r3, lsl #2] vs. mov.w r9, #1020 and.w r2, r9, r1, lsr #14 ldr r2, [r0, r2] Add a late ARM specific isel optimization to ARMDAGToDAGISel::PreprocessISelDAG(). It folds the left shift to the 'base + offset' address computation; change the mask to one which doesn't have trailing zeros and enable the use of ubfx. Note the optimization has to be done late since it's target specific and we don't want to change the DAG normalization. It's also fairly restrictive as shifter operands are not always free. It's only done for lsh 1 / 2. It's known to be free on some cpus and they are most common for address computation. This is a slight win for blowfish, rijndael, etc. rdar://12870177 llvm-svn: 170581	2012-12-19 20:16:09 +00:00
Benjamin Kramer	870f4fe261	Remove edis remnant. llvm-svn: 170580	2012-12-19 20:11:17 +00:00
Roman Divacky	e3d323052f	Remove edis - the enhanced disassembler. Fixes PR14654. llvm-svn: 170578	2012-12-19 19:55:47 +00:00
Paul Redmond	5917f4c715	Transform (x&C)>V into (x&C)!=0 where possible When the least bit of C is greater than V, (x&C) must be greater than V if it is not zero, so the comparison can be simplified. Although this was suggested in Target/X86/README.txt, it benefits any architecture with a directly testable form of AND. Patch by Kevin Schoedel llvm-svn: 170576	2012-12-19 19:47:13 +00:00
Jakob Stoklund Olesen	35641e41eb	Add an MF argument to MachineInstr::addOperand(). Just like for addMemOperand(), the function pointer provides a context for allocating memory. This will make it possible to use a better memory allocation strategy for the MI operand list, which is currently a slow std::vector. Most calls to addOperand() come from MachineInstrBuilder, so give that class an MF reference as well. Code using BuildMI() won't need changing at all since the MF reference is already required to allocate a MachineInstr. Future patches will fix code that calls MI::addOperand(Op) directly, as well as code that uses the now deprecated MachineInstrBuilder(MI) constructor. llvm-svn: 170574	2012-12-19 19:19:01 +00:00
Chad Rosier	5f69df3f03	Remove superfluous brief command from getAsString. llvm-svn: 170569	2012-12-19 18:06:44 +00:00
Nadav Rotem	0328f5e57d	doc: add subsections. llvm-svn: 170568	2012-12-19 18:04:44 +00:00
Nadav Rotem	8f4a6cced2	DOC: document the use of O2, O3 and Os with -fvectorize. llvm-svn: 170567	2012-12-19 18:02:36 +00:00
Benjamin Kramer	c5071466d4	PowerPC: Expand VSELECT nodes. There's probably a better expansion for those nodes than the default for altivec, but this is better than crashing. VSELECTs occur in loop vectorizer output. llvm-svn: 170551	2012-12-19 15:49:14 +00:00
Patrik Hagglund	f9934613e8	Change AsmOperandInfo::ConstraintVT to MVT, instead of EVT. Accordingly, add MVT::getVT. llvm-svn: 170550	2012-12-19 15:19:11 +00:00
Rafael Espindola	0f00de40dd	Revert 170545 while I debug the ppc failures. llvm-svn: 170547	2012-12-19 14:48:05 +00:00
Benjamin Kramer	ae0bb61053	Make TargetLowering::getTypeConversion more resilient against odd illegal MVTs. - An MVT can become an EVT when being split (e.g. v2i8 -> v1i8, the latter doesn't exist) - Return the scalar value when an MVT is scalarized (v1i64 -> i64) Fixes PR14639ff. llvm-svn: 170546	2012-12-19 14:34:28 +00:00
Rafael Espindola	aa7b27801c	Add r170095 back. I cannot reproduce the failures locally, so I will keep an eye on the ppc bots. This patch does add the change to the "Disassembly of section" message, but that is not what was failing on the bots. Original message: Add a function to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be inform the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170545	2012-12-19 14:15:04 +00:00
Evgeniy Stepanov	abeae5c7d5	[msan] Add track-origins argument to the pass constructor. llvm-svn: 170544	2012-12-19 13:55:51 +00:00
Dmitri Gribenko	d3be5d9bf6	Documentation: add a missing space llvm-svn: 170542	2012-12-19 12:51:48 +00:00
Patrik Hagglund	00e7a11904	Split the usage of 'EVT PartVT' into 'MVT PartVT' and 'EVT PartEVT'. llvm-svn: 170540	2012-12-19 12:33:30 +00:00
Alexey Samsonov	e6ddb98565	CMake: factor out a function that returns the expected directory for unit test llvm-svn: 170539	2012-12-19 12:30:33 +00:00
Patrik Hagglund	4e0f828686	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 170538	2012-12-19 12:23:01 +00:00
Patrik Hagglund	e09cac9a67	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. llvm-svn: 170537	2012-12-19 12:02:25 +00:00
Patrik Hagglund	3f1905199b	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 170536	2012-12-19 11:53:21 +00:00
Patrik Hagglund	bad545ccba	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 170535	2012-12-19 11:48:16 +00:00
Patrik Hagglund	93060569ba	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 170534	2012-12-19 11:42:00 +00:00
Patrik Hagglund	2fc3c59a45	Change TargetLowering::getRepRegClassCostFor, getIndexedLoadAction, getIndexedStoreAction, and addRegisterClass to take an MVT, instead of EVT. llvm-svn: 170533	2012-12-19 11:37:12 +00:00
Patrik Hagglund	f9eb168ef4	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 170532	2012-12-19 11:30:36 +00:00
Evgeniy Stepanov	d7571cd4bc	[msan] Heuristically instrument unknown intrinsics. This change adds shadow and origin propagation for unknown intrinsics by examining the arguments and ModRef behaviour. For now, only 3 classes of intrinsics are handled: - those that look like simple SIMD store - those that look like simple SIMD load - those that don't have memory effects and look like arithmetic/logic/whatever operation on simple types. llvm-svn: 170530	2012-12-19 11:22:04 +00:00
Patrik Hagglund	fd41b5b969	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 170529	2012-12-19 11:21:04 +00:00
Benjamin Kramer	e300004bd5	LoopVectorize: Make iteration over induction variables not depend on pointer values. MapVector is a bit heavyweight, but I don't see a simpler way. Also the InductionList is unlikely to be large. This should help 3-stage selfhost compares (PR14647). llvm-svn: 170528	2012-12-19 11:09:15 +00:00
Benjamin Kramer	44ba3753ad	MapVector: Add lookup(). llvm-svn: 170527	2012-12-19 11:08:33 +00:00
Patrik Hagglund	ffd057a3e1	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 170524	2012-12-19 10:19:55 +00:00
NAKAMURA Takumi	89209462fe	X86ISelLowering.cpp: Fix warnings. [-Wlogical-op-parentheses] llvm-svn: 170523	2012-12-19 10:12:48 +00:00
Patrik Hagglund	deee9003ed	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 170522	2012-12-19 10:09:26 +00:00
Bill Wendling	a87cdc27d9	Inline hasFunctionOnlyAttrs into its only use. llvm-svn: 170518	2012-12-19 09:15:11 +00:00
Bill Wendling	e9506a211f	Inline the only use of the hasParameterOnlyAttrs method. llvm-svn: 170517	2012-12-19 09:04:58 +00:00
Bill Wendling	d97b75d816	Inline the 'hasIncompatibleWithVarArgsAttrs' method into its only uses. And some minor comment reformatting. llvm-svn: 170516	2012-12-19 08:57:40 +00:00
Nadav Rotem	90c8b4bfa5	DOC: fix the url format. llvm-svn: 170513	2012-12-19 08:43:05 +00:00
Patrik Hagglund	d7cdcf8cb5	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 170510	2012-12-19 08:28:51 +00:00
Nadav Rotem	15bdbbe309	DOC: add a benchmark that compares us to gcc and icc. llvm-svn: 170509	2012-12-19 08:28:24 +00:00
Elena Demikhovsky	14a4af0e66	Optimized load + SIGN_EXTEND patterns in the X86 backend. llvm-svn: 170506	2012-12-19 07:50:20 +00:00
Nadav Rotem	33360d8ae9	After reducing the size of an operation in the DAG we zero-extend the reduced bitwidth op back to the original size. If we reduce ANDs then this can cause an endless loop. This patch changes the ZEXT to ANY_EXTEND if the demanded bits are equal or smaller than the size of the reduced operation. llvm-svn: 170505	2012-12-19 07:39:08 +00:00
Nadav Rotem	af14a3f20b	docs: fix typos. llvm-svn: 170504	2012-12-19 07:36:35 +00:00
Nadav Rotem	c4efbb8b4e	DOC: Add a webpage that describes the loop and bb vectorizers. llvm-svn: 170503	2012-12-19 07:22:24 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Craig Topper	3f194c8f4f	Remove more of 'else's after 'returns'. No functional change. llvm-svn: 170497	2012-12-19 06:43:58 +00:00
Craig Topper	5dd8291cbe	Remove a bunch of 'else's after 'returns' llvm-svn: 170496	2012-12-19 06:39:17 +00:00
Craig Topper	63f5921776	Teach SimplifySetCC that comparing AssertZext i1 against a constant 1 can be rewritten as a compare against a constant 0 with the opposite condition. llvm-svn: 170495	2012-12-19 06:12:28 +00:00
Reed Kotler	3aad762d1d	Add some missing Defs and Uses. llvm-svn: 170493	2012-12-19 04:06:15 +00:00
Shuxin Yang	5b841c4a64	Make sure the buffer, which contains an instance of APFloat, has proper alignment. llvm-svn: 170486	2012-12-19 01:10:17 +00:00
Kevin Enderby	85cf531593	Add to the disassembler C API an option to print the disassembled instructions in the assembly code variant if one exists. The intended use for this is so tools like lldb and darwin's otool(1) can be switched to print Intel-flavored disassembly. I discussed extensively this API with Jim Grosbach and we feel while it may not be fully general, in reality there is only one syntax for each assembly with the exception of X86 which has exactly two for historical reasons. rdar://10989182 llvm-svn: 170477	2012-12-18 23:47:28 +00:00
Jakob Stoklund Olesen	7aafc4bcdd	Remove MachineInstr::setIsInsideBundle(). The bundle flags are now maintained by the slightly higher-level functions bundleWithPred() / bundleWithSucc() which enforce consistent bundle flags between neighboring instructions. See also MIBundleBuilder for an even higher-level approach to building bundles. llvm-svn: 170475	2012-12-18 23:40:14 +00:00
Jakob Stoklund Olesen	d742533dbc	Use bidirectional bundle flags to simplify important functions. The bundle_iterator::operator++ function now doesn't need to dig out the basic block and check against end(). It can use the isBundledWithSucc() flag to find the last bundled instruction safely. Similarly, MachineInstr::isBundled() no longer needs to look at iterators etc. It only has to look at flags. llvm-svn: 170473	2012-12-18 23:21:49 +00:00
Shuxin Yang	37a1efe1c6	rdar://12801297 InstCombine for unsafe floating-point add/sub. llvm-svn: 170471	2012-12-18 23:10:12 +00:00
Nadav Rotem	9aee065e3c	Enable the loop vectorizer in clang and not in the pass manager, so that we can disable it in clang. llvm-svn: 170470	2012-12-18 23:09:44 +00:00
Jakob Stoklund Olesen	00f6c7754b	Verify bundle flag consistency when setting them. Now that the bundle flag aware APIs are all in place, it is possible to continuously verify the flag consistency. llvm-svn: 170465	2012-12-18 23:00:28 +00:00
Jakub Staszak	338863a546	Reverse order of checking SSE level when calculating compare cost, so we check AVX2 before AVX. llvm-svn: 170464	2012-12-18 22:57:56 +00:00
Jakob Stoklund Olesen	29c277197e	Verify bundle flags for consistency in MachineVerifier. The new bidirectional bundle flags are redundant, so inadvertent bundle tearing can be detected in the machine code verifier. llvm-svn: 170463	2012-12-18 22:55:07 +00:00
Quentin Colombet	23b404d5ad	Disable ARM partial flag dependency optimization at -Oz To not over constrain the scheduler for ARM in thumb mode, some optimizations for code size reduction, specific to ARM thumb, are blocked when they add a dependency (like write after read dependency). Disables this check when code size is the priority, i.e., code is compiled with -Oz. llvm-svn: 170462	2012-12-18 22:47:16 +00:00
Jakob Stoklund Olesen	a33f504b3e	Don't allow the automatically updated MI flags to be set directly. The bundle-related MI flags need to be kept in sync with the neighboring instructions. Don't allow the bulk flag-setting setFlags() function to change them. Also don't copy MI flags when cloning an instruction. The clone's bundle flags will be set when it is explicitly inserted into a bundle. llvm-svn: 170459	2012-12-18 21:36:05 +00:00
Jakob Stoklund Olesen	78eaf05fa7	Tighten up the splice() API for bundled instructions. Remove the instr_iterator versions of the splice() functions. It doesn't seem useful to be able to splice sequences of instructions that don't consist of full bundles. The normal splice functions that take MBB::iterator arguments are not changed, and they can move whole bundles around without any problems. llvm-svn: 170456	2012-12-18 20:59:41 +00:00
Andrew Trick	ec2564818c	MISched: add dependence to ExitSU to model live-out latency. llvm-svn: 170454	2012-12-18 20:53:01 +00:00
Andrew Trick	ef23569858	MISched: Cleanup, redundant statement. llvm-svn: 170453	2012-12-18 20:52:58 +00:00
Andrew Trick	d6d5ad3d7b	MISched: Heuristics, compare latency more precisely. It matters more for some targets. llvm-svn: 170452	2012-12-18 20:52:56 +00:00
Andrew Trick	44f54d97a4	MISched: Remove SchedRemainder::IsResourceLimited. I don't know how to compute it. llvm-svn: 170451	2012-12-18 20:52:54 +00:00
Andrew Trick	493b867b5d	MISched: cleanup, use the proper iterator type. llvm-svn: 170450	2012-12-18 20:52:52 +00:00
Andrew Trick	ffb6168e85	MISched: minor improvement, initialize remaining resources before the first scheduling decision. llvm-svn: 170449	2012-12-18 20:52:49 +00:00
Jakob Stoklund Olesen	b8d29bf2e4	Add an assertion for a likely ilist::splice() contract violation. The single-element ilist::splice() function supports a noop move: List.splice(I, List, I); The corresponding std::list function doesn't allow that, so add a unit test to document that behavior. This also means that List.splice(I, List, F); is somewhat surprisingly not equivalent to List.splice(I, List, F, next(F)); This patch adds an assertion to catch the illegal case I == F above. Alternatively, we could make I == F a legal noop, but that would make ilist differ even more from std::list. llvm-svn: 170443	2012-12-18 19:28:37 +00:00
Benjamin Kramer	f0e5d2f032	LoopVectorize: Emit reductions as log2(vectorsize) shuffles + vector ops instead of scalar operations. For example on x86 with SSE4.2 a <8 x i8> add reduction becomes movdqa %xmm0, %xmm1 movhlps %xmm1, %xmm1 ## xmm1 = xmm1[1,1] paddw %xmm0, %xmm1 pshufd $1, %xmm1, %xmm0 ## xmm0 = xmm1[1,0,0,0] paddw %xmm1, %xmm0 phaddw %xmm0, %xmm0 pextrb $0, %xmm0, %edx instead of pextrb $2, %xmm0, %esi pextrb $0, %xmm0, %edx addb %sil, %dl pextrb $4, %xmm0, %esi addb %dl, %sil pextrb $6, %xmm0, %edx addb %sil, %dl pextrb $8, %xmm0, %esi addb %dl, %sil pextrb $10, %xmm0, %edi pextrb $14, %xmm0, %edx addb %sil, %dil pextrb $12, %xmm0, %esi addb %dil, %sil addb %sil, %dl llvm-svn: 170439	2012-12-18 18:40:20 +00:00
Eli Bendersky	39e7c6e370	Get rid of the pesky -Woverloaded-virtual warning. No change in functionality. llvm-svn: 170438	2012-12-18 18:21:29 +00:00
Jakob Stoklund Olesen	422e07b091	Tighten the insert() API for bundled instructions. The normal insert() function takes an MBB::iterator position, and inserts a stand-alone MachineInstr as before. The insert() function that takes an MBB::instr_iterator position can insert instructions inside a bundle, and will now update the bundle flags correctly when that happens. When the insert position is between two bundles, it is unclear whether the instruction should be appended to the previous bundle, prepended to the next bundle, or stand on its own. The MBB::insert() function doesn't bundle the instruction in that case, use the MIBundleBuilder class for that. llvm-svn: 170437	2012-12-18 17:54:53 +00:00
Hal Finkel	943f76d1b3	Check multiple register classes for inline asm tied registers A register can be associated with several distinct register classes. For example, on PPC, the floating point registers are each associated with both F4RC (which holds f32) and F8RC (which holds f64). As a result, this code would fail when provided with a floating point register and an f64 operand because it would happen to find the register in the F4RC class first and return that. From the F4RC class, SDAG would extract f32 as the register type and then assert because of the invalid implied conversion between the f64 value and the f32 register. Instead, search all register classes. If a register class containing the requested register has the requested type, then return that register class. Otherwise, as before, return the first register class found that contains the requested register. llvm-svn: 170436	2012-12-18 17:50:58 +00:00
Nadav Rotem	c0699854dd	Enable the loop vectorizer. llvm-svn: 170416	2012-12-18 06:37:12 +00:00
Nadav Rotem	cb23342876	Rename the test so that we can add additional vectors-of-pointers tests into the same file in the future. llvm-svn: 170414	2012-12-18 05:50:54 +00:00
Nadav Rotem	a5024fc3e1	SROA: Replace calls to getScalarSizeInBits to DataLayout's API because getScalarSizeInBits could not handle vectors of pointers. llvm-svn: 170412	2012-12-18 05:23:31 +00:00
NAKAMURA Takumi	ad0c80b8e6	llvm/test/MC/ELF/comp-dir.s: Appease MSYS Bash. llvm-svn: 170410	2012-12-18 05:08:12 +00:00
Rafael Espindola	46b9c8a2cd	Initialize NoRedZone and remove unused default values. llvm-svn: 170404	2012-12-18 03:35:05 +00:00
Eli Bendersky	fede6b1d62	Cleanup comment and formatting llvm-svn: 170398	2012-12-18 00:53:36 +00:00
Jakob Stoklund Olesen	41bbf9c256	Repair bundles that were broken by removing and reinserting the first instruction. This isn't strictly necessary at the moment because Thumb2SizeReduction also copies all MI flags from the old instruction to the new. However, a future patch will make that kind of direct flag tampering illegal. llvm-svn: 170395	2012-12-18 00:46:39 +00:00
Eric Christopher	79f165699d	Formatting. llvm-svn: 170394	2012-12-18 00:42:26 +00:00
Eric Christopher	906da23229	Add support for passing -main-file-name all the way through to the assembler. Part of PR14624 llvm-svn: 170390	2012-12-18 00:31:01 +00:00
Eric Christopher	a7c3273e85	Cleanup formatting and whitespace. llvm-svn: 170389	2012-12-18 00:30:54 +00:00
Jakob Stoklund Olesen	43b1e13386	Extract a method, no functional change intended. Sadly, this costs us a perfectly good opportunity to use 'goto'. llvm-svn: 170385	2012-12-18 00:13:11 +00:00
Jakob Stoklund Olesen	ccfb5fb472	Tighten up the erase/remove API for bundled instructions. Most code is oblivious to bundles and uses the MBB::iterator which only visits whole bundles. MBB::erase() operates on whole bundles at a time as before. MBB::remove() now refuses to remove bundled instructions. It is not safe to remove all instructions in a bundle without deleting them since there is no way of returning pointers to all the removed instructions. MBB::remove_instr() and MBB::erase_instr() will now update bundle flags correctly, lifting individual instructions out of bundles while leaving the remaining bundle intact. The MachineInstr convenience functions are updated so eraseFromParent() erases a whole bundle as before eraseFromBundle() erases a single instruction, leaving the rest of its bundle. removeFromParent() refuses to operate on bundled instructions, and removeFromBundle() lifts a single instruction out of its bundle. These functions will no longer accidentally split or coalesce bundles - bundle flags are updated to preserve the existing bundling, and explicit bundleWith* / unbundleFrom* functions should be used to change the instruction bundling. This API update is still a work in progress. I am going to update APIs first so they maintain bundle flags automatically when possible. Then I'll add stricter verification of the bundle flags. llvm-svn: 170384	2012-12-17 23:55:38 +00:00
Reed Kotler	0c1745e56a	EmitDebugLabel should by default be the same as EmitLabel everywhere. It must be explicitly set in MCPureStreamer because otherwise it will inherit incorrectly from the parent. llvm-svn: 170383	2012-12-17 23:41:45 +00:00
Eli Bendersky	d371eb3060	fix indentation llvm-svn: 170381	2012-12-17 22:50:56 +00:00
Chad Rosier	150d35bc1d	[arm fast-isel] Minor cleanup. No functional change intended. llvm-svn: 170379	2012-12-17 22:35:29 +00:00
Nick Kledzik	bed953d699	Fix some integer constant warnings by using a suffix llvm-svn: 170376	2012-12-17 22:11:17 +00:00
Chandler Carruth	d75be9b4fb	Add a triple to this test -- it has to be an ELF platform... llvm-svn: 170374	2012-12-17 21:44:50 +00:00
Chandler Carruth	10700aad85	Prepare LLVM to fix PR14625, exposing a hook in MCContext to manage the compilation directory. This defaults to the current working directory, just as it always has, but now an assembler can choose to override it with a custom directory. I've taught llvm-mc about this option and added a test case. llvm-svn: 170371	2012-12-17 21:32:42 +00:00
Nick Kledzik	52bfd38ee0	re-enable test cases now that traits work with g++. Fix some g++ warnings llvm-svn: 170369	2012-12-17 20:43:53 +00:00
Michael Ilseman	5feb4e17d0	Remove trailing whitespace llvm-svn: 170368	2012-12-17 20:40:14 +00:00
Michael Ilseman	acdb76d339	Removed trailing whitespace llvm-svn: 170367	2012-12-17 20:37:55 +00:00
Chad Rosier	62a144f099	[arm fast-isel] Fast-isel only handles simple VTs, so make sure the necessary checks are in place. Some minor cleanup as well. llvm-svn: 170360	2012-12-17 19:59:43 +00:00
Nick Kledzik	95850c24a4	Use different trait techniques to be compatible with g++ llvm-svn: 170355	2012-12-17 19:02:05 +00:00
Chandler Carruth	e3f4119b06	Fix another SROA crasher, PR14601. This was a silly oversight, we weren't pruning allocas which were used by variable-length memory intrinsics from the set that could be widened and promoted as integers. Fix that. llvm-svn: 170353	2012-12-17 18:48:07 +00:00
Tim Northover	d05e6b5817	Query section for whether it should be executable. llvm-svn: 170350	2012-12-17 17:59:35 +00:00
Tim Northover	5edabc131a	Teach MachO which sections contain code llvm-svn: 170349	2012-12-17 17:59:32 +00:00
Evgeniy Stepanov	88b8dceddf	[msan] Fix lint warning. llvm-svn: 170347	2012-12-17 16:30:05 +00:00
Richard Osborne	459e35c261	Add instruction encodings / disassembly support for l2r instructions. llvm-svn: 170345	2012-12-17 16:28:02 +00:00
Tom Stellard	5a6879466a	R600: enable S_N2_ instructions They seem to work fine. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170343	2012-12-17 15:14:56 +00:00
Tom Stellard	9e90b5895d	R600: BB operand support for SI Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170342	2012-12-17 15:14:54 +00:00
Tom Stellard	16a17c6d3e	R600: remove nonsense setPrefLoopAlignment The Align parameter is a power of two, so 16 results in 64K alignment. Additional to that even 16 byte alignment doesn't make any sense, so just remove it. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170341	2012-12-17 15:14:53 +00:00
Chandler Carruth	21eb4e96c2	Teach the rewriting of memcpy calls to support subvector copies. This also cleans up a bit of the memcpy call rewriting by sinking some irrelevant code further down and making the call-emitting code a bit more concrete. Previously, memcpy of a subvector would actually miscompile (!!!) the copy into a single vector element copy. I have no idea how this ever worked. =/ This is the memcpy half of PR14478 which we probably weren't noticing previously because it didn't actually assert. The rewrite relies on the newly refactored insert- and extractVector functions to do the heavy lifting, and those are the same as used for loads and stores which makes the test coverage a bit more meaningful here. llvm-svn: 170338	2012-12-17 14:51:24 +00:00
Patrik Hagglund	c494d24a68	Revert/correct some FastISel changes in r170104 (EVT->MVT for TargetLowering::getRegClassFor). Some isSimple() guards were missing, or getSimpleVT() were hoisted too far, resulting in asserts on valid LLVM assembly input. llvm-svn: 170336	2012-12-17 14:30:06 +00:00
Evgeniy Stepanov	95a80abead	Optimize tree walking in markAliveBlocks. Check whether a BB is known as reachable before adding it to the worklist. This way BB's with multiple predecessors are added to the list no more than once. llvm-svn: 170335	2012-12-17 14:28:00 +00:00
Richard Osborne	51bf1b269a	Add instruction encodings for PEEK and ENDIN. Previously these were marked with the wrong format. llvm-svn: 170334	2012-12-17 14:23:54 +00:00
Chandler Carruth	cacda256a1	Fix a secondary bug I introduced while fixing the first part of PR14478. The first half of fixing this bug was actually in r170328, but was entirely coincidental. It did however get me to realize the nature of the bug, and adapt the test case to test more interesting behavior. In turn, that uncovered the rest of the bug which I've fixed here. This should fix two new asserts that showed up in the vectorize nightly tester. llvm-svn: 170333	2012-12-17 14:03:01 +00:00
Richard Osborne	c104bf2769	Fix parameter name in prototypes in XCoreDisassembler. llvm-svn: 170332	2012-12-17 13:55:49 +00:00
Chandler Carruth	95e1fb8a42	Hoist a convertValue call to the two paths where it is needed. I noticed this while looking at r170328. We only ever do a vector rewrite when the alloca is the vector type, so it's good to not paper over bugs here by doing a convertValue that isn't needed. llvm-svn: 170331	2012-12-17 13:51:03 +00:00
Richard Osborne	041071c558	Add instruction encodings / disassembly support for rus instructions. llvm-svn: 170330	2012-12-17 13:50:04 +00:00
Chandler Carruth	ce4562bdcb	Hoist the insertVector helper to be a static helper. This will allow its use inside of memcpy rewriting as well. This routine is more complex than extractVector, and some of its uses are not 100% where I want them to be so there is still some work to do here. While this can technically change the output in some cases, it shouldn't be a change that matters -- IE, it can leave some dead code lying around that prior versions did not, etc. Yet another step in the refactorings leading up to the solution to the last component of PR14478. llvm-svn: 170328	2012-12-17 13:41:21 +00:00
Richard Osborne	e405e58639	Add instruction encodings for ZEXT and SEXT. Previously these were marked with the wrong format. llvm-svn: 170327	2012-12-17 13:20:37 +00:00
Chandler Carruth	b6bc8749e8	Lift the extractVector helper all the way out to a static helper function. The method helpers all implicitly act upon the alloca, and what we really want is a fully generic helper. Doing memcpy rewrites is more special than all other rewrites because we are at times rewriting instructions which touch pointers other than the alloca. As a consequence all of the helpers needed by memcpy rewriting of sub-vector copies will need to be generalized fully. Note that all of these helpers ({insert,extract}{Integer,Vector}) are woefully uncommented. I'm going to go back through and document them once I get the factoring correct. No functionality changed. llvm-svn: 170325	2012-12-17 13:07:30 +00:00
Chandler Carruth	769445ef03	Factor the vector load rewriting into a more generic form. This makes it suitable for use in rewriting memcpy in the presence of subvector memcpy intrinsics. No functionality changed. llvm-svn: 170324	2012-12-17 12:50:21 +00:00
Richard Osborne	3a0d5cc314	Add instruction encodings / disassembly support for 2r instructions. llvm-svn: 170323	2012-12-17 12:29:31 +00:00
Richard Osborne	016967e4ff	Add instruction encodings / disassembly support for 0r instructions. llvm-svn: 170322	2012-12-17 12:26:29 +00:00
Richard Osborne	1cc2b68ad6	Simplify assertion in XCoreInstPrinter. llvm-svn: 170321	2012-12-17 12:13:46 +00:00
Richard Osborne	4e1e14bccd	Update comments to match recommended doxygen style. llvm-svn: 170320	2012-12-17 12:13:41 +00:00
Richard Osborne	eb31fa483e	Remove unnecessary include. llvm-svn: 170319	2012-12-17 12:13:32 +00:00
Duncan Sands	7cb52522fe	Fix typo that results in new landing pads not getting a name, fixing PR14617. Patch by Chris Toshok. llvm-svn: 170318	2012-12-17 12:02:36 +00:00
Duncan Sands	66c2cd3d88	Fix comment typo. llvm-svn: 170317	2012-12-17 11:43:15 +00:00
Craig Topper	354ed773b8	Remove EFLAGS from the BLSI/BLSMSK/BLSR patterns. The nodes created by DAG combine don't contain an EFLAGS def. llvm-svn: 170308	2012-12-17 06:13:48 +00:00
Craig Topper	f3ff6ae066	Simplify BMI ANDN matching to use patterns instead of a DAG combine. Also add ANDN to isDefConvertible. llvm-svn: 170305	2012-12-17 05:12:30 +00:00
Craig Topper	f924a58af1	Add rest of BMI/BMI2 instructions to the folding tables as well as popcnt and lzcnt. llvm-svn: 170304	2012-12-17 05:02:29 +00:00
Craig Topper	5b08cf7736	Remove store forms of DEC/INC from isDefConvertible. Since they are stores they don't have a register def. llvm-svn: 170303	2012-12-17 04:55:07 +00:00
Chandler Carruth	ccca504f3a	Fix the first part of PR14478: memset now works. PR14478 highlights a serious problem in SROA that simply wasn't being exercised due to a lack of vector input code mixed with C-library function calls. Part of SROA was written carefully to handle subvector accesses via memset and memcpy, but the rewriter never grew support for this. Fixing it required refactoring the subvector access code in other parts of SROA so it could be shared, and then fixing the splat formation logic and using subvector insertion (this patch). The PR isn't quite fixed yet, as memcpy is still broken in the same way. I'm starting on that series of patches now. Hopefully this will be enough to bring the bullet benchmark back to life with the bb-vectorizer enabled, but that may require fixing memcpy as well. llvm-svn: 170301	2012-12-17 04:07:37 +00:00
Chandler Carruth	eae65a5629	Extract the logic for inserting a subvector into a vector alloca. No functionality changed. Another step of refactoring toward solving PR14487. llvm-svn: 170300	2012-12-17 04:07:35 +00:00
Chandler Carruth	514f34f9c4	Lift the integer splat computation into a helper function. No functionality changed. Refactoring leading up to the fix for PR14478 which requires some significant changes to the memset and memcpy rewriting. llvm-svn: 170299	2012-12-17 04:07:30 +00:00


88204 Commits