llvm-project

Commit Graph

Author	SHA1	Message	Date
Hans Wennborg	15823d49b6	Switch lowering: extract jump tables and bit tests before building binary tree (PR22262) This is a re-commit of r235101, which also fixes the problems with the previous patch: - Switches with only a default case and non-fallthrough were handled incorrectly - The previous patch tickled a bug in PowerPC Early-Return Creation which is fixed here. > This is a major rewrite of the SelectionDAG switch lowering. The previous code > would lower switches as a binary tre, discovering clusters of cases > suitable for lowering by jump tables or bit tests as it went along. To increase > the likelihood of finding jump tables, the binary tree pivot was selected to > maximize case density on both sides of the pivot. > > By not selecting the pivot in the middle, the binary trees would not always > be balanced, leading to performance problems in the generated code. > > This patch rewrites the lowering to search for clusters of cases > suitable for jump tables or bit tests first, and then builds the binary > tree around those clusters. This way, the binary tree will always be balanced. > > This has the added benefit of decoupling the different aspects of the lowering: > tree building and jump table or bit tests finding are now easier to tweak > separately. > > For example, this will enable us to balance the tree based on profile info > in the future. > > The algorithm for finding jump tables is quadratic, whereas the previous algorithm > was O(n log n) for common cases, and quadratic only in the worst-case. This > doesn't seem to be major problem in practice, e.g. compiling a file consisting > of a 10k-case switch was only 30% slower, and such large switches should be rare > in practice. Compiling e.g. gcc.c showed no compile-time difference. If this > does turn out to be a problem, we could limit the search space of the algorithm. > > This commit also disables all optimizations during switch lowering in -O0. > > Differential Revision: http://reviews.llvm.org/D8649 llvm-svn: 235560	2015-04-22 23:14:56 +00:00
David Majnemer	7d0e99c601	[InstCombine] Use a more targeted fix instead of r235544 Only clear out the NSW/NUW flags if we are optimizing 'add'/'sub' while taking advantage that the sign bit is not set. We do this optimization to further shrink the mask but shrinking the mask isn't NSW/NUW preserving in this case. llvm-svn: 235558	2015-04-22 22:42:05 +00:00
Reid Kleckner	64a2a6a473	[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works This removes the -sehprepare flag and makes __C_specific_handler functions always to use WinEHPrepare. This was tested by building all of chromium_builder_tests and running a few tests that use SEH, but if something breaks, we can revert this. llvm-svn: 235557	2015-04-22 22:13:09 +00:00
Lang Hames	34cfa49b66	[RuntimeDyld][COFF] Add external symbol resolution support to RuntimeDyldCOFF. Patch by Andy Ayers. Thanks Andy! llvm-svn: 235554	2015-04-22 21:38:37 +00:00
Krzysztof Parzyszek	952d951418	[Hexagon] Some cleanup of instruction selection code llvm-svn: 235552	2015-04-22 21:17:00 +00:00
Reid Kleckner	fd7df284b8	[WinEH] Demote values and phis live across exception handlers up front In particular, this handles SSA values that are live out of a handler. The existing code only handles values that are live in to a handler. It also handles phi nodes in the block where normal control should resume after the end of a catch handler. When EH return points have phi nodes, we need to split the return edge. It is impossible for phi elimination to emit copies in the previous block if that block gets outlined. The indirectbr that we leave in the function is only notional, and is eliminated from the MachineFunction CFG early on. Reviewers: majnemer, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D9158 llvm-svn: 235545	2015-04-22 21:05:21 +00:00
David Majnemer	fe58d13a17	[InstCombine] Clear out nsw/nuw if we modify computation in the chain An nsw/nuw operation relies on the values feeding into it to not overflow if 'poison' is not to be produced. This means that optimizations which make modifications to the bottom of a chain (like SimplifyDemandedBits) must strip out nsw/nuw if they cannot ensure that they will be preserved. This fixes PR23309. llvm-svn: 235544	2015-04-22 20:59:28 +00:00
Krzysztof Parzyszek	cd97c985c7	[Hexagon] Use A2_tfrsi for constant pool and jump table addresses llvm-svn: 235535	2015-04-22 18:25:53 +00:00
David Blaikie	d2db881e85	Revert "[opaque pointer type] Avoid using PointerType::getElementType for a few cases of CallInst" This reverts commit r235458. It looks like this might be breaking something LTO-ish. Looking into it & will recommit with a fix/test case/etc once I've got more to go on. llvm-svn: 235533	2015-04-22 18:16:49 +00:00
Pete Cooper	037b700b7f	[AArch64] Use MachineRegisterInfo instead of LiveIntervals to calculate liveness. NFC. The CondOpt pass currently uses LiveIntervals to set the dead flag on a def. This patch uses MachineRegisterInfo::use_empty instead as that is equivalent to the def being dead. This removes an instance of LiveIntervals in the pass manager pipeline and saves 3.8% of compile time on llc conpiled for AArch64. Reviewed by Chad Rosier and Zhaoshi. llvm-svn: 235532	2015-04-22 18:05:13 +00:00
Sanjay Patel	c96ee08016	don't repeat function names in comments; NFC llvm-svn: 235531	2015-04-22 18:04:46 +00:00
Krzysztof Parzyszek	05902163b6	[Hexagon] Consider constant-extended offsets to be valid llvm-svn: 235529	2015-04-22 17:51:26 +00:00
Luqman Aden	c76f470c2d	Test commit: fix typo in comment. llvm-svn: 235526	2015-04-22 17:42:37 +00:00
Krzysztof Parzyszek	9ee04e401a	Fix Windows build break: use LLVM_FUNCTION_NAME instead of __func__. llvm-svn: 235525	2015-04-22 17:19:44 +00:00
Matt Arsenault	deaef8e24b	R600: Fix always inline pass breaking noinline functions No test since calls are not actually supported yet. llvm-svn: 235524	2015-04-22 17:10:44 +00:00
Krzysztof Parzyszek	4fa2a9f7fd	[Hexagon] Overhaul of stack object allocation - Use static allocation for aligned stack objects. - Simplify dynamic stack object allocation. - Simplify elimination of frame-indices. llvm-svn: 235521	2015-04-22 16:43:53 +00:00
David Blaikie	e169e8206b	[opaque pointer type] Use pointee type retrieved from asm, rather than accessing it via the pointer type llvm-svn: 235520	2015-04-22 16:37:35 +00:00
Sanjay Patel	cab567873f	[x86] Add store-folded memop patterns for vcvtps2ph Differential Revision: http://reviews.llvm.org/D7296 llvm-svn: 235517	2015-04-22 16:11:19 +00:00
Krzysztof Parzyszek	6bbcb31fda	[Hexagon] Treat CFI as solo instructions llvm-svn: 235516	2015-04-22 15:47:35 +00:00
Krzysztof Parzyszek	badf3a6356	[Hexagon] Implement HexagonInstPrinter::printRegName llvm-svn: 235514	2015-04-22 15:38:17 +00:00
Brendon Cahoon	f9751ad1b0	Fix a type mismatch assert in SCEV division An assert was triggered when attempting to create a new SCEV with operands of different types in the visitAddRecExpr. In this test case, the operand types of the numerator and denominator are different. The SCEV division code should generate a conservative answer when this happens. Differential Revision: http://reviews.llvm.org/D9021 llvm-svn: 235511	2015-04-22 15:06:40 +00:00
Andrea Di Biagio	6cd2f42fac	[X86][AVX] Fix failure due to a missing ISel pattern to select VBROADCAST nodes (PR23259). This fixes a regression introduced at revision 218263. On AVX, if we optimize for size, a splat build_vector of a load is lowered into a VBROADCAST node. This is done even if the value type of the splat build_vector node is v2i64. Since AVX doesn't support v2f64/v2i64 broadcasts, revision 218263 added two extra tablegen patterns to allow selecting a VMOVDDUPrm from an X86VBroadcast where the scalar element comes from a loadi64/loadf64. However, revision 218263 forgot to add an extra fallback pattern for the case where we have a X86VBroadcast of a loadi64 with multiple uses. This patch adds the missing tablegen pattern in X86InstrSSE.td. This patch also adds an extra test to 'splat-for-size.ll' to verify that ISel doesn't crash with a 'fatal error in the backend' due to a missing AVX pattern to select v2i64 X86ISD::BROADCAST nodes. llvm-svn: 235509	2015-04-22 14:53:39 +00:00
Olivier Sallenave	c587bee405	Fixed logic to enable complex FMA formation. llvm-svn: 235508	2015-04-22 14:07:26 +00:00
Zoran Jovanovic	b59a541926	[mips][microMIPSr6] Implement mips32 to microMIPSr6 mapping support Differential Revision: http://reviews.llvm.org/D8661 llvm-svn: 235505	2015-04-22 13:27:34 +00:00
Hal Finkel	0d49cf2645	[DAGCombine] Disable select(c, load,load) for indexed loads This turned up after r235333, but was a pre-existing bug. The optimization which transforms select(c, load, load) into a load of a select of the addresses does not handle indexed loads (pre/post inc/dec). However, it did not check for them either, leading to a crash if it tried to transform one of them. llvm-svn: 235497	2015-04-22 11:32:25 +00:00
Vasileios Kalintiris	e7508c9fc7	Revert "[mips][FastISel] Implement shift ops for Mips fast-isel." This reverts commit r235194. It was causing a failure in FastISel buildbots due to sign-extension issues. llvm-svn: 235495	2015-04-22 10:08:46 +00:00
James Molloy	cd2334e86e	[AArch64] Disable complex GEP optimization by default. Enough concerns were raised that this optimization is pessimising some code patterns. The obvious fix, to add a Reassociate run afterwards, causes even more pessimisation in some cases due to fewer complex addressing modes being matched. As there isn't a trivial fix for this, backing this out by default until someone gets a chance to fix the addressing mode matcher. llvm-svn: 235491	2015-04-22 09:11:38 +00:00
Filipe Cabecinhas	ea79c5b4f7	Have more strict type checks when creating BinOp nodes in BitcodeReader Summary: Bug found with AFL. Reviewers: rafael, bkramer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9015 llvm-svn: 235489	2015-04-22 09:06:21 +00:00
Lang Hames	65613a634a	[patchpoint] Add support for symbolic patchpoint targets to SelectionDAG and the X86 backend. The code generated for symbolic targets is identical to the code generated for constant targets, except that a relocation is emitted to fix up the actual target address at link-time. This allows IR and object files containing patchpoints to be cached across JIT-invocations where the target address may change. llvm-svn: 235483	2015-04-22 06:02:31 +00:00
Craig Topper	1f429e4926	[TableGen] Use range based for loops. llvm-svn: 235482	2015-04-22 05:27:47 +00:00
Craig Topper	d05991304b	[TableGen] Remove some deletes that violate ownership semantics. These don't seem to execute in our codebase today and date back to a time when there was an allocation in this function. llvm-svn: 235481	2015-04-22 05:27:11 +00:00
Craig Topper	fe0cdf9899	[TableGen] Make BitRecTy::baseClassOf return true when RHS is an IntRecTy. Previously the code was accidentally checking if 'this' was an IntRecTy which it can't be since 'this' is a BitRecTy. Looking back at the history it appears it was intended to check RHS. llvm-svn: 235477	2015-04-22 04:18:32 +00:00
Craig Topper	e8005f90f5	Don't use 'nullptr' in comment. Just use 'null'. llvm-svn: 235476	2015-04-22 04:18:27 +00:00
David Blaikie	50a0615264	[opaque pointer types] Serialize the value type for atomic store instructions Without pointee types the space optimization of storing only the pointer type and not the value type won't be viable - so add the extra type information that would be missing. llvm-svn: 235475	2015-04-22 04:14:46 +00:00
David Blaikie	612ddbfde0	[opaque pointer types] Serialize the value type for store instructions Without pointee types the space optimization of storing only the pointer type and not the value type won't be viable - so add the extra type information that would be missing. Storeatomic coming soon. llvm-svn: 235474	2015-04-22 04:14:42 +00:00
Duncan P. N. Exon Smith	e868123d8f	Linker: Add flag to override linkage rules Add a flag to lib/Linker (and `llvm-link`) to override linkage rules. When set, the functions in the source module always replace those in the destination module. The `llvm-link` option is `-override=abc.ll`. All the "regular" modules are loaded and linked first, followed by the `-override` modules. This is useful for debugging workflows where some subset of the module (e.g., a single function) is extracted into a separate file where it's optimized differently, before being merged back in. Patch by Luqman Aden! llvm-svn: 235473	2015-04-22 04:11:00 +00:00
Craig Topper	b534eabf9e	Revert "[TableGen] Use cast instead of dyn_cast where result isn't checked before being dereferenced." Turns out I misread the parentheses. Though I'm pretty sure its always a RecordRecTy and non of the callers really seem to expect null. But until I'm completely sure I'm going to revert this. llvm-svn: 235469	2015-04-22 02:59:06 +00:00
Craig Topper	c7a9cfb7f6	Fix stale comment that mentioned 0 instead of nullptr. NFC. llvm-svn: 235468	2015-04-22 02:59:03 +00:00
Craig Topper	0e04bee8df	[TableGen] Remove Pool helper class and just use unique_ptr in the maps. llvm-svn: 235467	2015-04-22 02:20:44 +00:00
Craig Topper	b15012bc6b	[TableGen] Use StringRecTy::get() instead of allocating (and leaking) a StringRecTy object. llvm-svn: 235466	2015-04-22 02:09:47 +00:00
Craig Topper	1bf3d1f5dd	[TableGen] Use 'isa' to identify UnsetInits rather than comparing with the singleton object created by UnsetInit::get(). Makes it more consistent with the other types. llvm-svn: 235465	2015-04-22 02:09:45 +00:00
Craig Topper	f8344c60a6	[TableGen] Use cast instead of dyn_cast where result isn't checked before being dereferenced. llvm-svn: 235463	2015-04-22 02:09:42 +00:00
Sanjay Patel	fe1365ac50	[x86] allow 64-bit extracted vector element integer stores on a 32-bit system With SSE2, we can generate a 'movq' or other 64-bit store op on a 32-bit system even though 64-bit integers are not legal types. So instead of producing this: pshufd $229, %xmm0, %xmm1 ## xmm1 = xmm0[1,1,2,3] movd %xmm0, (%eax) movd %xmm1, 4(%eax) We can do: movq %xmm0, (%eax) This is a fix for the problem noted in D7296. Differential Revision: http://reviews.llvm.org/D9134 llvm-svn: 235460	2015-04-22 00:24:30 +00:00
Reid Kleckner	f14787dad8	[WinEH] Correctly handle inlined __finally blocks with captures We should also teach the inliner to collapse framerecover of frameaddress of the current frame down to an alloca, but that can happen later. llvm-svn: 235459	2015-04-22 00:07:52 +00:00
David Blaikie	506993636e	[opaque pointer type] Avoid using PointerType::getElementType for a few cases of CallInst Calls to llvm::Value::mutateType are becoming extra-sensitive now that instructions have extra type information that will not be derived from operands or result type (alloca, gep, load, call/invoke, etc... ). The special-handling for mutateType will get more complicated as this work continues - it might be worth making mutateType virtual & pushing the complexity down into the classes that need special handling. But with only two significant uses of mutateType (vectorization and linking) this seems OK for now. Totally open to ideas/suggestions/improvements, of course. With this, and a bunch of exceptions, we can roundtrip an indirect call site through bitcode and IR. (a direct call site is actually trickier... I haven't figured out how to deal with the IR deserializer's lazy construction of Function/GlobalVariable decl's based on the type of the entity which means looking through the "pointer to T" type referring to the global) llvm-svn: 235458	2015-04-21 23:26:57 +00:00
Wei Mi	a0adf9fd41	Limiting gep merging to fix the performance problem described in https://llvm.org/bugs/show_bug.cgi?id=23163. Gep merging sometimes behaves like a reverse CSE/LICM optimization, which has negative impact on performance. In this patch we restrict gep merging to happen only when the indexes to be merged are both consts, which ensures such merge is always beneficial. The patch makes gep merging only happen in very restrictive cases. It is possible that some analysis/optimization passes rely on the merged geps to get better result, and we havn't notice them yet. We will be ready to further improve it once we see the cases. Differential Revision: http://reviews.llvm.org/D8911 llvm-svn: 235455	2015-04-21 23:02:15 +00:00
Wei Mi	2940bc82ac	Revert r235451 since it is attached to a wrong Differential Revision. Sorry. llvm-svn: 235453	2015-04-21 22:56:09 +00:00
Wei Mi	6e3344ed98	Limiting gep merging to fix the performance problem described in https://llvm.org/bugs/show_bug.cgi?id=23163. Gep merging sometimes behaves like a reverse CSE/LICM optimizations, which has negative impact on performance. In this patch we restrict gep merging to happen only when the indexes to be merged are both consts, which ensures such merge is always beneficial. The patch makes gep merging only happen in very restrictive cases. It is possible that some analysis/optimization passes rely on the merged geps to get better result, and we havn't notice them yet. We will be ready to further improve it once we see the cases. Differential Revision: http://reviews.llvm.org/D9007 llvm-svn: 235451	2015-04-21 22:37:09 +00:00
Ahmed Bougacha	9692e30e8b	[MemCpyOpt] Use the raw i8* dest when optimizing memset+memcpy. MemIntrinsic::getDest() looks through pointer casts, and using it directly when building the new GEP+memset results in stuff like: %0 = getelementptr i64* %p, i32 16 %1 = bitcast i64* %0 to i8* call ..memset(i8* %1, ...) instead of the correct: %0 = bitcast i64* %p to i8* %1 = getelementptr i8* %0, i32 16 call ..memset(i8* %1, ...) Instead, use getRawDest, which just gives you the i8* value. While there, use the memcpy's dest, as it's live anyway. In most cases, when the optimization triggers, the memset and memcpy sizes are the same, so the built memset is 0-sized and eliminated. The problem occurs when they're different. Fixes a regression caused by r235232: PR23300. llvm-svn: 235419	2015-04-21 21:28:33 +00:00
Krzysztof Parzyszek	499bc5faa1	[Hexagon] Patterns for frame index with offset for isel llvm-svn: 235418	2015-04-21 21:28:03 +00:00
Daniel Berlin	b4e7a4a40c	Revamp PredIteratorCache interface to be cleaner. Summary: This lets us use range based for loops. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9169 llvm-svn: 235416	2015-04-21 21:11:50 +00:00
Jingyue Wu	66a161f05e	[NVPTX] do not run DCE after SLSR and SeparateConstOffsetFromGEP Summary: With D9096 and D9101, there's no need to run DCE after SLSR and SeparateConstOffsetFromGEP. Test Plan: no regression Reviewers: jholewinski, meheff Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D9172 llvm-svn: 235415	2015-04-21 20:47:15 +00:00
Sanjoy Das	7be03d69e5	[LSR][NFC] Remove a stale comment. The comment was made stale in r171735. llvm-svn: 235414	2015-04-21 20:42:50 +00:00
Duncan P. N. Exon Smith	aa861aa483	DebugInfo: Remove DIArray and DITypeArray typedefs Remove the `DIArray` and `DITypeArray` typedefs, preferring the underlying types (`DebugNodeArray` and `MDTypeRefArray`, respectively). llvm-svn: 235413	2015-04-21 20:07:38 +00:00
Jingyue Wu	f1edf3e88f	[SLSR] garbage-collect unused instructions Summary: After we rewrite a candidate, the instructions used by the old form may become unused. This patch cleans up these unused instructions so that we needn't run DCE after SLSR. Test Plan: removed -dce in all the SLSR tests Reviewers: broune, meheff Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9101 llvm-svn: 235410	2015-04-21 19:56:18 +00:00
Jingyue Wu	f763c3fd45	[SeparateConstOffsetFromGEP] garbage-collect intermediate instructions Summary: so that we needn't run DCE after this pass. Test Plan: removed -dce from the commandline in split-gep.ll and split-gep-and-gvn.ll Reviewers: meheff Subscribers: llvm-commits, HaoLiu, hfinkel, jholewinski Differential Revision: http://reviews.llvm.org/D9096 llvm-svn: 235409	2015-04-21 19:53:18 +00:00
Yaron Keren	1b8332aa6d	Remove FilesToRemove->push_back(Filename) from sys::DontRemoveFileOnSignal. llvm-svn: 235408	2015-04-21 19:25:11 +00:00
Daniel Berlin	2372a193ba	Move IDF Calculation to a separate file, expose an interface to it. Summary: MemorySSA uses this algorithm as well, and this enables us to reuse the code in both places. There are no actual algorithm or datastructure changes in here, just code movement. Reviewers: qcolombet, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9118 llvm-svn: 235406	2015-04-21 19:13:02 +00:00
Duncan P. N. Exon Smith	60635e39b6	DebugInfo: Drop rest of DIDescriptor subclasses Delete the remaining subclasses of (the already deleted) `DIDescriptor`. Part of PR23080. llvm-svn: 235404	2015-04-21 18:44:06 +00:00
Duncan P. N. Exon Smith	d4a19a396d	DebugInfo: Assert dbg.declare/value insts are valid Remove early returns for when `getVariable()` is null, and just assert that it never happens. The Verifier already confirms that there's a valid variable on these intrinsics, so we should assume the debug info isn't broken. I also updated a check for a `!dbg` attachment, which the Verifier similarly guarantees. llvm-svn: 235400	2015-04-21 18:24:23 +00:00
Reid Kleckner	d2a1a51996	Re-land r235154-r235156 under the existing -sehprepare flag Keep the old SEH fan-in lowering on by default for now, since projects rely on it. This will make it easy to test this change with a simple flag flip. llvm-svn: 235399	2015-04-21 18:23:57 +00:00
Matthias Braun	9e9e8b3230	X86: Match for X86ISD nodes in LowerBUILD_VECTOR instead of BUILD_VECTORCombine There doesn't seem to be a reason to perform this target ISD node matching in an DAGCombine, moving it to lowering fixes PR23296. Differential Revision: http://reviews.llvm.org/D9137 llvm-svn: 235394	2015-04-21 17:21:36 +00:00
Elena Demikhovsky	0e6d6d54ce	AVX-512: Added VPMOVx2M instructions for SKX, fixed encoding of VPMOVM2x. llvm-svn: 235385	2015-04-21 14:38:31 +00:00
Elena Demikhovsky	431b81e41f	AVX-512: Added VPTESTM and VPTESTNM instructions for SKX llvm-svn: 235383	2015-04-21 13:13:46 +00:00
Toma Tabacu	11e14a9467	[mips] [IAS] Implement the .asciiz directive. Summary: This directive is exactly the same as .asciz, except it's only used by MIPS. It is used to store null terminated strings in object files. Reviewers: rafael, dsanders, echristo Reviewed By: dsanders, echristo Subscribers: echristo, llvm-commits Differential Revision: http://reviews.llvm.org/D7530 llvm-svn: 235382	2015-04-21 11:50:52 +00:00
Jozef Kolek	8e086cedfa	[mips][microMIPSr6] Implement CACHE and PREF instructions Implement CACHE and PREF instructions using mapping. Differential Revision: http://reviews.llvm.org/D8893 llvm-svn: 235379	2015-04-21 11:17:25 +00:00
Vasileios Kalintiris	41b0100dea	[mips] Cleanup old floating-point flag conditions definitions. NFC. Reviewers: dsanders Differential Revision: http://reviews.llvm.org/D7947 llvm-svn: 235377	2015-04-21 10:53:57 +00:00
Vasileios Kalintiris	32177d6bec	[mips] Optimize code generation for 64-bit variable shift instructions. Summary: The 64-bit version of the variable shift instructions uses the shift_rotate_reg class which uses a GPR32Opnd to specify the variable shift amount. With this patch we avoid the generation of a redundant SLL instruction for the variable shift instructions in 64-bit targets. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7413 llvm-svn: 235376	2015-04-21 10:49:03 +00:00
Elena Demikhovsky	50b88ddb87	AVX-512: Added logical and arithmetic instructions for SKX by Asaf Badouh (asaf.badouh@intel.com) llvm-svn: 235375	2015-04-21 10:27:40 +00:00
Simon Pilgrim	398ce22b86	[X86][SSE] Provide execution domains for scalar floating point operations This is an updated version of Chandler's patch D7402 that got accepted but never committed, and has bit-rotted a bit since. I've updated the execution domain declarations to match the approach of the packed templates and also added some extra scalar unary tests. Differential Revision: http://reviews.llvm.org/D9095 llvm-svn: 235372	2015-04-21 08:40:22 +00:00
Simon Pilgrim	860f08779c	CONCAT_VECTOR of BUILD_VECTOR - minor fix Fixed issue with the combine of CONCAT_VECTOR of 2 BUILD_VECTOR nodes - the optimisation wasn't ensuring that the scalar operands of both nodes were the same type/size for implicit truncation. Test case spotted by Patrik Hagglund llvm-svn: 235371	2015-04-21 08:05:43 +00:00
Pawel Bylica	57c2f7c756	Fix generic shift expansion when shift amount is 0 Summary: This fixes http://llvm.org/bugs/show_bug.cgi?id=16439. This is one possible way to approach this. The other would be to split InL>>(nbits-Amt) into (InL>>(nbits-1-Amt))>>1, which is also valid since since we only need to care about Amt up nbits-1. It's hard to tell which one is better since the shift might be expensive if this stage of expansion is not yet a legal machine integer, whereas comparisons with zero are relatively cheap at all sizes, but more expensive than a shift if the shift is on a legal machine type. Patch by Keno Fischer! Test Plan: regression test from http://reviews.llvm.org/D7752 Reviewers: chfast, resistor Reviewed By: chfast, resistor Subscribers: sanjoy, resistor, chfast, llvm-commits Differential Revision: http://reviews.llvm.org/D4978 llvm-svn: 235370	2015-04-21 06:28:36 +00:00
Matthias Braun	b6b5aaad98	X86: Do not select X86 custom vector nodes if operand types don't match X86ISD::ADDSUB, X86ISD::(F)HADD, X86ISD::(F)HSUB should not be selected if the operand types do not match the result type because vector type legalization cannot deal with this for custom nodes. Testcase X86ISD::ADDSUB is attached. I could not create a testcase for the FHADD/FHSUB cases because of: https://llvm.org/bugs/show_bug.cgi?id=23296 Differential Revision: http://reviews.llvm.org/D9120 llvm-svn: 235367	2015-04-21 01:13:41 +00:00
Derek Schuff	2a1678a789	[MC] When using bundle aligment, align sections to bundle size Summary: Bundle aligment requires that the functions always start at an aligned address. Usually this is ensured by the compiler, but assembly code does not always begin with a .align directive. This change ensures that sections get the correct alignment if they contain any instructions and bundling is enabled. (It also makes LLVM match the behavior of GNU as). Differential Revision: http://reviews.llvm.org/D9131 llvm-svn: 235365	2015-04-21 00:14:25 +00:00
Fiona Glaser	0d41db11a2	InstCombine: fold (sitofp (zext x)) to (uitofp x) This is okay because the zext guarantees the high bit is zero, and so the value is unsigned. llvm-svn: 235364	2015-04-21 00:05:41 +00:00
Andrew Kaylor	00e5d9ee5f	[WinEH] Fix problem with landing pad return values used in PHI nodes during outlining. llvm-svn: 235358	2015-04-20 22:53:42 +00:00
Duncan P. N. Exon Smith	2fbe13540a	DebugInfo: Delete subclasses of DIScope Delete subclasses of (the already defunct) `DIScope`, updating users to use the raw pointers from the `Metadata` hierarchy directly. llvm-svn: 235356	2015-04-20 22:10:08 +00:00
Andrew Kaylor	41758517bf	[WinEH] Fix problem with mapping shared empty handler blocks. Differential Revision: http://reviews.llvm.org/D9125 llvm-svn: 235354	2015-04-20 22:04:09 +00:00
Duncan P. N. Exon Smith	c62468859a	DebugInfo: Delete old subclasses of DIType Delete subclasses of (the already deleted) `DIType` in favour of directly using pointers from the `Metadata` hierarchy. While `DICompositeType` wraps `MDCompositeTypeBase` and `DIDerivedType` wraps `MDDerivedTypeBase`, most uses of each really meant the more specific `MDCompositeType` and `MDDerivedType`. llvm-svn: 235351	2015-04-20 21:17:32 +00:00
Duncan P. N. Exon Smith	698df36ab7	DwarfUnit: Split MDSubroutineType version of constructTypeDIE() The version of `constructTypeDIE()` for `MDSubroutineType` is unrelated to (and has different callers than) the `MDCompositeType`. Split the two in half. This simplifies an upcoming patch to delete `DICompositeType`. There shouldn't be any real functionality change here. `createTypeDIE()` is `cast<>`'ing where it didn't need to before, but that function in turn is only called for true `MDCompositeType`s. llvm-svn: 235349	2015-04-20 21:04:33 +00:00
Lang Hames	dc4260db2a	[Orc] Make the makeStub function propagate argument attributes onto the call to the function body. This is necessary for correctness when lazily compiling. Also, flesh out the Orc unit test infrastructure slightly, and add a unit test for this. llvm-svn: 235347	2015-04-20 20:41:45 +00:00
Duncan P. N. Exon Smith	d89ef16aa9	DwarfUnit: Cleanup comments Update comment style in `DwarfUnit`. - Drop duplicated comments at definition, and update the comments at the declaration where the definition comments looked newer or more complete. - Drop the `functionName -` prefix. - Add `\brief` in a few places. - Remove a few comments entirely that weren't adding value (just turned the function name and arguments into a sentence). llvm-svn: 235345	2015-04-20 20:29:51 +00:00
Olivier Sallenave	b99c2eb0f0	Refactoring and enhancement to FMA combine. llvm-svn: 235344	2015-04-20 20:29:40 +00:00
Pirama Arumuga Nainar	34056dea1b	[MIPS] OperationAction for FP_TO_FP16, FP16_TO_FP Summary: Set operation action for FP16 conversion opcodes, so the Op legalizer can choose the gnu_* libcalls for Mips. Set LoadExtAction and TruncStoreAction for f16 scalars and vectors to prevent (fpext (load )) and (store (fptrunc)) from getting combined into unsupported operations. Added test cases to test that these operations are handled correctly for f16 scalars and vectors. This patch depends on http://reviews.llvm.org/D8755. Reviewers: srhines Subscribers: llvm-commits, ab Differential Revision: http://reviews.llvm.org/D8804 llvm-svn: 235341	2015-04-20 20:15:36 +00:00
Tom Stellard	69a7b91e95	DAGCombine: Remove redundant NaN checks around ISD::FSQRT This folds: (select (setcc x, -0.0, *lt), NaN, (fsqrt x)) -> ( fsqrt x) llvm-svn: 235333	2015-04-20 19:38:27 +00:00
Tom Stellard	67246d137d	IR: Add ConstantFP::getNaN() This is a wrapper around APFloat::getNaN(). llvm-svn: 235332	2015-04-20 19:38:24 +00:00
Duncan P. N. Exon Smith	9928a909c6	DebugInfo: Remove DIType This is the last major parent class, so I'll probably start deleting classes in batches now. Looks like many of the references to the DI* hierarchy were updated organically along the way. llvm-svn: 235331	2015-04-20 18:52:06 +00:00
Andrew Kaylor	f18771bdfd	[WinEH] Fix memory leak with catch-all mapping. llvm-svn: 235328	2015-04-20 18:48:45 +00:00
Duncan P. N. Exon Smith	be9e4fe768	DebugInfo: Remove DIScope Replace uses of `DIScope` with `MDScope`. There was one spot where I've left an `MDScope` uninitialized (where `DIScope` would have been default-initialized to `nullptr`) -- this is intentional, since the if/else that follows should unconditional assign it to a value. llvm-svn: 235327	2015-04-20 18:32:29 +00:00
Lang Hames	67e6e04a1e	[Orc] Use the 64-bit versions of FXSAVE/FXRSTOR for JIT reentry. llvm-svn: 235325	2015-04-20 18:25:44 +00:00
Duncan P. N. Exon Smith	848af387d8	DebugInfo: Remove typedefs for DITypeRef, etc. Remove typedefs for type refs: - DITypeRef => MDTypeRef - DIScopeRef => MDScopeRef - DIDescriptorRef => DebugNodeRef llvm-svn: 235323	2015-04-20 18:20:03 +00:00
Jozef Kolek	207d248eba	[mips][microMIPSr6] Implement BITSWAP instruction Implement BITSWAP instruction using mapping. Differential Revision: http://reviews.llvm.org/D8857 llvm-svn: 235321	2015-04-20 18:14:59 +00:00
Vladimir Sukharev	bad1d1dc02	[AArch64] LORID_EL1 register must be treated as read-only Patch by: John Brawn Reviewers: jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9105 llvm-svn: 235314	2015-04-20 16:54:37 +00:00
Akira Hatanaka	2cc2b63f53	[InlineFunction] Don't add lifetime markers for zero-sized allocas. This commit fixes the code which adds lifetime markers in InlineFunction to skip zero-sized allocas instead of asserting on them. rdar://problem/20531155 llvm-svn: 235312	2015-04-20 16:11:05 +00:00
Brendon Cahoon	a57cc8bc81	Recognize n/1 in the SCEV divide function n/1 generates a quotient equal to n and a remainder of 0. If this case is not recognized, then the SCEV divide() function can return a remainder that is greater than or equal to the denominator, which means the delinearized subscripts for the test case will be incorrect. Differential Revision: http://reviews.llvm.org/D9003 llvm-svn: 235311	2015-04-20 16:03:28 +00:00
Bill Schmidt	6779075c44	[PowerPC] Flow oversized lines for r235309 llvm-svn: 235310	2015-04-20 15:58:46 +00:00
Bill Schmidt	1962f709c7	[PowerPC] Add future work for vector insert/extract to README_ALTIVEC.txt llvm-svn: 235309	2015-04-20 15:54:26 +00:00
Jozef Kolek	676d60125c	[mips][microMIPSr6] Implement disassembler support Implement disassembler support for microMIPS32r6. Differential Revision: http://reviews.llvm.org/D8490 llvm-svn: 235307	2015-04-20 14:40:38 +00:00
Rafael Espindola	4ba9af1141	Don't allow pwrite to resize a stream. The current implementations could exhibit some behavior differences: raw_fd_ostream: Whatever the underlying fd does with seek+write. In a normal file, the write position would be back to the old offset. raw_svector_ostream: The write position is always the end of the stream, so after pwrite the write position would be the new end. This matches what OS_X (all BSD?) do with a pwrite in a O_APPEND fd. Given that we don't need that feature and don't use O_APPEND a lot in LLVM, just disallow it. I am open to suggestions on renaming pwrite to something else, but this fixes the issue for now. Thanks to Yaron Keren for reporting it. llvm-svn: 235303	2015-04-20 13:04:30 +00:00
Jozef Kolek	5de4a6c0af	[mips][microMIPSr6] Implement BALC and BC instructions This patch implements BALC and BC instructions using mapping. Differential Revision: http://reviews.llvm.org/D8388 llvm-svn: 235302	2015-04-20 13:04:14 +00:00
Rafael Espindola	29c8270916	Look past locals in comdats. We have to avoid converting a reference to a global into a reference to a local, but it is fine to look past a local. Patch by Vasileios Kalintiris. I just moved the comment and added thet test. llvm-svn: 235300	2015-04-20 12:44:06 +00:00
Jozef Kolek	6ca13eaf82	[mips][microMIPSr6] Implement initial mapping support Differential Revision: http://reviews.llvm.org/D8387 llvm-svn: 235298	2015-04-20 12:42:08 +00:00
Jozef Kolek	c22555d977	[mips][microMIPSr6] Implement initial subtarget support Differential Revision: http://reviews.llvm.org/D8386 llvm-svn: 235296	2015-04-20 12:23:06 +00:00
Andrea Di Biagio	98c367093d	[X86][FastIsel] Fix assertion failure when selecting int-to-double conversion (PR23273). This fixes a regression introduced at revision 231243. The target-independent selection algorithm in FastISel knows how to select a SINT_TO_FP if the target is SSE but not AVX. That is because on X86, the tablegen'd 'fastEmit' functions know how to select CVTSI2SSrr and CVTSI2SDrr. Method X86FastISel::X86SelectSIToFP was therefore working under the wrong assumption that the target was AVX. That assumption was incorrect since we can have a target that is neither AVX nor SSE. So, rather than asserting for the presence of AVX, we should have had an early exit from 'X86SelectSIToFP' if the target was not AVX. This patch fixes the issue replacing the invalid assertion with an early exit. Thanks to Dimitry Andric for reporting this problem and for providing a small reproducible testcase. Added test pr23273.ll. llvm-svn: 235295	2015-04-20 11:56:59 +00:00
Karthik Bhat	76aa662cf0	[NFC] Refactor identification of reductions as common utility function. This patch refactors reduction identification code out of LoopVectorizer and exposes them as common utilities. No functional change. Review: http://reviews.llvm.org/D9046 llvm-svn: 235284	2015-04-20 04:38:33 +00:00
Hal Finkel	1e5733bbed	[InlineAsm] Remove EarlyClobber on registers that are also inputs When an inline asm call has an output register marked as early-clobber, but that same register is also an input operand, what should we do? GCC accepts this, and is documented to accept this for read/write operands saying, "Furthermore, if the earlyclobber operand is also a read/write operand, then that operand is written only after it's used." For write-only operands, the situation seems less clear, but I have at least one existing codebase that assumes this will work, in part because it has syscall macros like this: ({ \ register uint64_t r0 __asm__ ("r0") = (__NR_ ## name); \ register uint64_t r3 __asm__ ("r3") = ((uint64_t) (arg0)); \ register uint64_t r4 __asm__ ("r4") = ((uint64_t) (arg1)); \ register uint64_t r5 __asm__ ("r5") = ((uint64_t) (arg2)); \ __asm__ __volatile__ \ ("sc" \ : "=&r"(r0),"=&r"(r3),"=&r"(r4),"=&r"(r5) \ : "0"(r0), "1"(r3), "2"(r4), "3"(r5) \ : "r6","r7","r8","r9","r10","r11","r12","cr0","memory"); \ r3; \ }) Furthermore, with register aliases and subregister relationships that only the backend knows about, rejecting this in the frontend seems like a difficult proposition (if we wanted to do so). However, keeping the early-clobber flag on the INLINEASM MI does not work for us, because it will cause the register's live interval to end to soon (so it will not appear defined to be used as an input). Fortunately, fixing this does not seem hard: When forming the INLINEASM MI, check to see if any of the early-clobber outputs are also inputs, and if so, remove the early-clobber flag. llvm-svn: 235283	2015-04-20 00:01:30 +00:00
Simon Pilgrim	749953eebb	[X86][SSE] Fix for getScalarValueForVectorElement to detect scalar sources requiring truncation. The fix ensures that scalar sources inserted into a vector are the correct bit size. Integer scalar sources from BUILD_VECTOR and SCALAR_TO_VECTOR nodes may require truncation that this function doesn't currently support. llvm-svn: 235281	2015-04-19 22:16:49 +00:00
Eric Christopher	d2e3ddad14	Remove CFIFuncName from TargetOptions as it is currently unused. llvm-svn: 235268	2015-04-19 03:21:04 +00:00
Eric Christopher	78804ab2df	Remove the CFIEnforcing flag from TargetOptions as it is unused. llvm-svn: 235267	2015-04-19 03:20:59 +00:00
Craig Topper	43d413b698	Remove unnecessary include and probably a layering violation. llvm-svn: 235262	2015-04-19 00:57:33 +00:00
Ahmed Bougacha	05b72c1fd8	[MemCpyOpt] Don't force i64 when promoting memset/memcpy sizes. Harden r235258 to support any integer bitwidth. The quick glance at the reference made me think only i32 and i64 were valid types, but they're not special, so any overload is legal. Thanks to David Majnemer for noticing! llvm-svn: 235261	2015-04-18 23:06:04 +00:00
Ahmed Bougacha	7216ccc3f3	[MemCpyOpt] Promote both memset/memcpy sizes if differently typed. Followup to r235232, which caused PR23278. We can't assume the memset and memcpy sizes have the same type, as nothing in the language reference prevents that. Instead, zext both to i64 if they disagree. While there, robustify tests by using i8 %c rather than i8 0 for the memset character. llvm-svn: 235258	2015-04-18 17:57:41 +00:00
Benjamin Kramer	2a7404a907	[InstCombine] Create zero constants on demand. No functional change intended. llvm-svn: 235257	2015-04-18 16:52:08 +00:00
David Majnemer	45951a6626	[InstCombine] (mul nsw 1, INT_MIN) != (shl nsw 1, 31) Multiplying INT_MIN by 1 doesn't trigger nsw. However, shifting 1 into the sign bit does trigger nsw. llvm-svn: 235250	2015-04-18 04:41:30 +00:00
Ahmed Bougacha	279e3ee954	[GlobalMerge] Look at uses to create smaller global sets. Instead of merging everything together, look at the users of GlobalVariables, and try to group them by function, to create sets of globals used "together". Using that information, a less-aggressive alternative is to keep merging everything together except globals that are only ever used alone, that is, those for which it's clearly non-profitable to merge with others. In my testing, grouping by Function is too aggressive, but grouping by BasicBlock is too conservative. Anything in-between isn't trivially available, so stick with Function grouping for now. cl::opts are added for testing; both enabled by default. A few of the testcases aren't testing the merging proper, but just various edge cases when merging does occur. Update them to use the previous grouping behavior. Also, one of the tests is unrelated to GlobalMerge; change it accordingly. While there, switch to r234666' flags rather than the brutal -O3. Differential Revision: http://reviews.llvm.org/D8070 llvm-svn: 235249	2015-04-18 01:21:58 +00:00
Duncan P. N. Exon Smith	7c60f20e49	DebugInfo: Delete DIDescriptor (but not its subclasses) Delete `DIDescriptor` and update the remaining users. I'll follow-up by deleting subclasses in manageable groups (top-down). llvm-svn: 235248	2015-04-18 00:35:36 +00:00
Ahmed Bougacha	e14a4d487e	[AArch64] Don't force MVT::Untyped when selecting LD1LANEpost. The result is either an Untyped reg sequence, on ldN with N > 1, or just the type of the input vector, on ld1. Don't force Untyped. Instead, just use the type of the reg sequence. This mirrors the behavior of createTuple, which feeds the LD1*_POST. The narrow code path wasn't actually covered by tests, because V64 insert_vector_elt are widened to V128 before the LD1LANEpost combine has the chance to run, usually. The only case where it does run on V64 vectors is if the vector ops legalizer ran. So, tickle the code with a ctpop. Fixes PR23265. llvm-svn: 235243	2015-04-17 23:43:33 +00:00
Andrew Kaylor	761fb44efe	Fix build wanrings and line endings llvm-svn: 235241	2015-04-17 23:20:24 +00:00
Duncan P. N. Exon Smith	ed557b55ee	DebugInfo: Remove DIDescriptor from the DebugInfo API Stop using `DIDescriptor` and its subclasses in the `DebugInfoFinder` API, as well as the rest of the API hanging around in `DebugInfo.h`. llvm-svn: 235240	2015-04-17 23:20:10 +00:00
Andrew Kaylor	ea8df61d4d	[WinEH] Fixes for a few cppeh failures. Differential Review: http://reviews.llvm.org/D9065 llvm-svn: 235239	2015-04-17 23:05:43 +00:00
Adam Nemet	8dcb3b6a59	[LoopAccesses] Improve debug output llvm-svn: 235238	2015-04-17 22:43:10 +00:00
Zachary Turner	4b08354b0e	[PDB] Support executables and source/line info. Previously DebugInfoPDB could only load data for a PDB given a path to the PDB. It could not open an EXE and find the matching PDB and verify it matched, etc. This patch adds support for that so that we can simply load debug information for a PDB directly. Additionally, this patch extends DebugInfoPDB to support getting source and line information for symbols. llvm-svn: 235237	2015-04-17 22:40:36 +00:00
David Blaikie	d0a2482870	[opaque pointer type] Access the pointee of the result type from the GEP rather than pulling it out of the pointer result type The implementation of this GEP::getResultElementType will be refactored to either rely on a member variable, or recompute the value from the indicies (any preferences?). llvm-svn: 235236	2015-04-17 22:32:20 +00:00
David Blaikie	cc2cd581cf	[opaque pointer type] Query the GEP for its source element type directly rather than finding it through the pointer type of the first operand in the Verifier llvm-svn: 235235	2015-04-17 22:32:17 +00:00
David Blaikie	d33bad3e87	[opaque pointer type] Use the parsed explicit pointee type when error-checking geps during LL parsing llvm-svn: 235233	2015-04-17 22:32:13 +00:00
Ahmed Bougacha	83f78a459a	[MemCpyOpt] Optimize double-storing by memset+memcpy. A common idiom in some code is to do the following: memset(dst, 0, dst_size); memcpy(dst, src, src_size); Some of the memset is redundant; instead, we can do: memcpy(dst, src, src_size); memset(dst + src_size, 0, dst_size <= src_size ? 0 : dst_size - src_size); Original patch by: Joel Jones Differential Revision: http://reviews.llvm.org/D498 llvm-svn: 235232	2015-04-17 22:20:57 +00:00
Duncan P. N. Exon Smith	364a3005f2	AsmPrinter: Create a unified .debug_loc stream This commit removes `DebugLocList` and replaces it with `DebugLocStream`. - `DebugLocEntry` no longer contains its byte/comment streams. - The `DebugLocEntry` list for a variable/inlined-at pair is allocated on the stack, and released right after `DebugLocEntry::finalize()` (possible because of the refactoring in r231023). Now, only one list is in memory at a time now. - There's a single unified stream for the `.debug_loc` section that persists, stored in the new `DebugLocStream` data structure. The last point is important: this collapses the nested `SmallVector<>`s from `DebugLocList` into unified streams. We previously had something like the following: vec<tuple<Label, CU, vec<tuple<BeginSym, EndSym, vec<Value>, vec<char>, vec<string>>>>> A `SmallVector` can avoid allocations, but is statically fairly large for a vector: three pointers plus the size of the small storage, which is the number of elements in small mode times the element size). Nesting these is expensive, since an inner vector's size contributes to the element size of an outer one. (Nesting any vector is expensive...) In the old data structure, the outer vector's element size was 632B, excluding allocation costs for when the middle and inner vectors exceeded their small sizes. 312B of this was for the "three" pointers in the vector-tree beneath it. If you assume 1M functions with an average of 10 variable/inlined-at pairs each (in an LTO scenario), that's almost 6GB (besides inner allocations), with almost 3GB for the "three" pointers. This came up in a heap profile a little while ago of a `clang -flto -g` bootstrap, with `DwarfDebug::collectVariableInfo()` using something like 10-15% of the total memory. With this commit, we have: tuple<vec<tuple<Label, CU, Offset>>, vec<tuple<BeginSym, EndSym, Offset, Offset>>, vec<char>, vec<string>> The offsets are used to create `ArrayRef` slices of adjacent `SmallVector`s. This reduces the number of vectors to four (unrelated to the number of variable/inlined-at pairs), and caps the number of allocations at the same number. Besides saving memory and limiting allocations, this is NFC. I don't know my way around this code very well yet, but I wonder if we could go further: why stream to a side-table, instead of directly to the output stream? llvm-svn: 235229	2015-04-17 21:34:47 +00:00
Rafael Espindola	35d6189f0f	Compute A-B when A or B is weak. Similar to r235222, but for the weak symbol case. In an "ideal" assembler/object format an expression would always refer to the final value and A-B would only be computed from a section in the same comdat as A and B with A and B strong. Unfortunately that is not the case with debug info on ELF, so we need an heuristic. Since we need an heuristic, we may as well use the same one as gas: * call weak_sym : produces a relocation, even if in the same section. * A - weak_sym and weak_sym -A: don't produce a relocation if we can compute it. This fixes pr23272 and changes the fix of pr22815 to match what gas does. llvm-svn: 235227	2015-04-17 21:15:17 +00:00
Duncan P. N. Exon Smith	237662429d	Remove dead code, NFC llvm-svn: 235225	2015-04-17 21:06:49 +00:00
Ahmed Bougacha	2448ef5f33	[AArch64] Avoid vector->load dependency cycles when creating LD1post. They would break the SelectionDAG. Note that the opposite load->vector dependency is already obvious in: (LD1post vec, ..) llvm-svn: 235224	2015-04-17 21:02:30 +00:00
David Majnemer	dcd89368cb	[WinEH] Reusing HandlerType entries leads to small CatchHigh values CatchHigh may be smaller than TryHigh if we reuse an outlined catch handler for two different invokes with different EH states. We have no evidence which shows that CatchHigh must be greater than TryHigh or TryLow. We can revisit this if we turn out to be wrong. llvm-svn: 235223	2015-04-17 20:12:09 +00:00
Rafael Espindola	db8a58688d	Compute A-B if both A and B are in the same comdat section. Part of pr23272. A small annoyance with the assembly syntax we implement is that given an expression there is no way to know if what is desired is the value of that expression for the symbols in this file or for the final values of those symbols in a link. The first case is useful for use in sections that get discarded or ignored if the section they are describing is discarded. For axample, consider A-B where A and B are in the same comdat section. We can compute the value of the difference in the section that is present in the current .o and if that section survives to the final DSO the value will still will be correct. But the section is in a comdat. Another section from another object file might be used istead. We know that that section will define A and B, but we have no idea what the value of A-B might be. In practice we have to assume that the intention is to compute the value in the current section since otherwise the is no way to create something like the debug aranges section. llvm-svn: 235222	2015-04-17 20:05:17 +00:00
David Blaikie	b7a0298731	[opaque pointer types] Use the pointee type loaded from bitcode when constructing a LoadInst Now (with a few carefully placed suppressions relating to general type serialization, etc) we can round trip a simple load through bitcode and textual IR without calling getElementType on a PointerType. llvm-svn: 235221	2015-04-17 19:56:21 +00:00
Pirama Arumuga Nainar	50604a69e9	Fix build errors introduced by r235215 Summary: - Handle TypePromoteFloat in switch statements - Move an expression into an assert to avoid unused variable in non-assert builds. Reviewers: srhines, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9086 llvm-svn: 235220	2015-04-17 19:51:44 +00:00
Pirama Arumuga Nainar	db7c07e2bf	Add support to promote f16 to f32 Summary: This patch adds legalization support to operate on FP16 as a load/store type and do operations on it as floats. Tests for ARM are added to test/CodeGen/ARM/fp16-promote.ll Reviewers: srhines, t.p.northover Differential Revision: http://reviews.llvm.org/D8755 llvm-svn: 235215	2015-04-17 18:36:25 +00:00
Vasileios Kalintiris	816ea84e7a	[mips][FastISel] Implement FastMaterializeAlloca in Mips fast-isel. Summary: Implement the method FastMaterializeAlloca in Mips fast-isel Based on a patch by Reed Kotler. Test Plan: Passes test-suite at O0/O2 for mips32 r1/r2 fastalloca.ll Reviewers: dsanders, rkotler Subscribers: rfuhler, llvm-commits Differential Revision: http://reviews.llvm.org/D6742 llvm-svn: 235213	2015-04-17 17:29:58 +00:00
David Majnemer	2be05eef31	[WinEH] Allow CatchHigh to be equal to TryHigh Catch blocks which are empty may be in the same state as their try blocks. It is not meaningful to give the catch block its own state number in this case because it can't do anything exceptional. llvm-svn: 235212	2015-04-17 17:20:30 +00:00
Manman Ren	ce0a066524	[LTO API] add lto_codegen_set_should_internalize. When debugging LTO issues with ld64, we use -save-temps to save the merged optimized bitcode file, then invoke ld64 again on the single bitcode file. The saved bitcode file is already internalized, so we can call lto_codegen_set_should_internalize and skip running internalization again. rdar://20227235 llvm-svn: 235211	2015-04-17 17:10:09 +00:00
Sanjay Patel	2161c49a4e	[X86, AVX] add an exedepfix entry for vmovq == vmovlps == vmovlpd This is the AVX extension of r235014: http://llvm.org/viewvc/llvm-project?view=revision&revision=235014 Review: http://reviews.llvm.org/D8691 llvm-svn: 235210	2015-04-17 17:02:37 +00:00
Duncan P. N. Exon Smith	c0f7dd72b7	AsmPrinter: Store MDExpression directly instead of MDNode, NFC Clean up `DebugLocEntry::Value::Expression`'s type while I'm messing around in here anyway. llvm-svn: 235203	2015-04-17 16:36:10 +00:00
Duncan P. N. Exon Smith	546c8be967	AsmPrinter: Stop storing MDLocalVariable in DebugLocEntry Stop storing the `MDLocalVariable` in the `DebugLocEntry::Value`s. We generate the list of `DebugLocEntry`s separately for each variable/inlined-at pair, so the variable never actually changes here. This is effectively NFC (aside from saving some memory and CPU time). llvm-svn: 235202	2015-04-17 16:33:37 +00:00
Duncan P. N. Exon Smith	fba25d6e9b	AsmPrinter: Calculate type upfront for location lists, NFC We can calculate the variable type up front before calling `DebugLocEntry::finalize()`. In fact, since we only care about the type if it's an `MDBasicType`, don't even bother resolving it using the type identifier map. llvm-svn: 235201	2015-04-17 16:28:58 +00:00
David Blaikie	561a157233	[opaque pointer type] Serialize the type of an llvm::Function as a function type rather than a function pointer type llvm-svn: 235200	2015-04-17 16:28:26 +00:00
Kit Barton	f4669f5905	Add support for v1i128 type. The v1i128 type is needed for the quadword add/substract instructions introduced in POWER8. Futhermore, the PowerPC ABI specifies that parameters of type v1i128 are to be passed in a single vector register, while parameters of type i128 are passed in pairs of GPRs. Thus, it is necessary to be able to differentiate between v1i128 and i128 in LLVM. http://reviews.llvm.org/D8564 llvm-svn: 235198	2015-04-17 16:11:05 +00:00
Kit Barton	7291802533	Add the i128 builtin type to LLVM. The i128 type is needed as a builtin type in order to support the v1i128 vector type. The PowerPC ABI requires that the i128 and v1i128 types are handled differently when passed as parameters to functions (i128 is passed in pairs of GPRs, v1i128 is passed in a single vector register). http://reviews.llvm.org/D8564 llvm-svn: 235196	2015-04-17 15:32:15 +00:00
Vasileios Kalintiris	a4035e6284	[mips][FastISel] Implement shift ops for Mips fast-isel. Summary: Add shift operators implementation to fast-isel for Mips. These are shift ops for non legal forms, i.e. i8 and i16. Based on a patch by Reed Kotler. Test Plan: Reviewers: dsanders Subscribers: echristo, rfuhler, llvm-commits Differential Revision: http://reviews.llvm.org/D6726 llvm-svn: 235194	2015-04-17 14:29:21 +00:00
James Molloy	a4ff7b2713	Fix TRUNCATE splitting helper logic. This is a followon to r233681 - I'd misunderstood the semantics of FTRUNC, and had confused it with (FP_ROUND ..., 0). Thanks for Ahmed Bougacha for his post-commit review! llvm-svn: 235191	2015-04-17 13:51:40 +00:00
Rafael Espindola	7f4e07befc	Move AliasedSymbol to MachObjectWriter. It was only used by MachO. Part of pr19627. llvm-svn: 235185	2015-04-17 12:28:43 +00:00
Yaron Keren	97de57343a	Revert r235177 as the Handle is used to fail GetExitCodeProcess on purpose. Avoid double closing of the handle by testing GetLastErr for ERROR_INVALID_HANDLE and not calling CloseHandle(PI.ProcessHandle) then. llvm-svn: 235184	2015-04-17 12:11:15 +00:00
Vasileios Kalintiris	bb60cfb5c4	[mips] Teach the delay slot filler to remove needless KILL instructions. Summary: Previously, the presence of KILL instructions would block valid candidates from filling a specific delay slot. With the elimination of the KILL instructions, in the appropriate range, we are able to fill more slots and keep the information from future def/use analysis consistent. Reviewers: dsanders Reviewed By: dsanders Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D7724 llvm-svn: 235183	2015-04-17 12:01:02 +00:00

1 2 3 4 5 ...

79047 Commits