llvm-project

Commit Graph

Author	SHA1	Message	Date
Jingyue Wu	3286ec1484	[NVPTX] run SeparateConstOffsetFromGEP before SLSR Summary: We pick this order because SeparateConstOffsetFromGEP may create more opportunities for SLSR. Test Plan: reassociate-geps-and-slsr.ll no performance regression on internal benchmarks Reviewers: meheff Subscribers: llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D9230 llvm-svn: 235632	2015-04-23 20:00:04 +00:00
Tom Stellard	d1f0f0268c	R600/SI: Add assembler support for all CI and VI VOP1 instructions llvm-svn: 235629	2015-04-23 19:33:54 +00:00
Tom Stellard	4b3e755480	R600/SI: v_mov_fed_b32 does not exist on VI llvm-svn: 235628	2015-04-23 19:33:52 +00:00
Tom Stellard	21cce29041	R600/SI: Use a better error message for unsupported instructions in the assembler llvm-svn: 235627	2015-04-23 19:33:51 +00:00
Tom Stellard	7130ef49cb	R600/SI: Improve AsmParser support for forced e64 encoding We can now force e64 encoding even when the operands would be legal for e32 encoding. llvm-svn: 235626	2015-04-23 19:33:48 +00:00
Andrew Kaylor	5f715522f1	[WinEH] Handle stubs for outlined functions that have only unreached terminators. llvm-svn: 235618	2015-04-23 18:37:39 +00:00
Reid Kleckner	909ea7e6b8	Revert "[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works" We still have some "uses remain after removal" issues in -O0 builds. This reverts commit r235557. llvm-svn: 235617	2015-04-23 18:34:01 +00:00
Hal Finkel	7c5cb066d0	[PowerPC] Enable printing instructions using aliases TableGen had been nicely generating code to print a number of instructions using shorter aliases (and PowerPC has plenty of short mnemonics), but we were not calling it. For some of the aliases we support in the parser, TableGen can't infer the "inverse" alias relationship, so there is still more to do. Thus, after some hours of updating test cases... llvm-svn: 235616	2015-04-23 18:30:38 +00:00
Zachary Turner	6489d7b949	Move DIContext.h to common DebugInfo location. This will enable us to create a PDBContext so as to expose some amount of debug info functionality through a common interace. Differential Revision: http://reviews.llvm.org/D9205 Reviewed by: Alexey Samsonov llvm-svn: 235612	2015-04-23 17:37:47 +00:00
Philip Reames	5461d45abf	Move Value.isDereferenceablePointer to ValueTracking [NFC] Move isDereferenceablePointer function to Analysis. This function recursively tracks dereferencability over a chain of values like other functions in ValueTracking. This refactoring is motivated by further changes to support dereferenceable_or_null attribute (http://reviews.llvm.org/D8650). isDereferenceablePointer will be extended to perform context-sensitive analysis and IR is not a good place to have such functionality. Patch by: Artur Pilipenko <apilipenko@azulsystems.com> Differential Revision: reviews.llvm.org/D9075 llvm-svn: 235611	2015-04-23 17:36:48 +00:00
Pirama Arumuga Nainar	745615ca00	[AArch64] Add nvcast patterns for v4f16 and v8f16 Summary: Constant stores of f16 vectors can create NvCast nodes from various operand types to v4f16 or v8f16 depending on patterns in the stored constants. This patch adds nvcast rules with v4f16 and v8f16 values. AArchISelLowering::LowerBUILD_VECTOR has the details on which constant patterns generate the nvcast nodes. Reviewers: jmolloy, srhines, ab Subscribers: rengolin, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D9201 llvm-svn: 235610	2015-04-23 17:32:25 +00:00
Pirama Arumuga Nainar	b18815354d	[AArch64] Handle vec4, vec8, vec16 *itofp for half Summary: Set operation action for SINT_TO_FP and UINT_TO_FP nodes with v4i32, v8i8, v8i16 inputs to allow promotion of v4f16 results. Add tests for sitofp and uitofp for vec4, vec8, vec16, and i8, i16, i32, and i64 vectors. Only missing tests are for v16i8 and v16i16 as the shift operations are too complicated to write a proper check sequence. The conversions from v4i64 to v4f16 do not depend on this patch - v4i64 is split and the conversion gets handled while lowering v2i64. I am adding a test here for completeness. Reviewers: aemerson, rengolin, ab, jmolloy, srhines Subscribers: rengolin, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D9166 llvm-svn: 235609	2015-04-23 17:16:27 +00:00
Hans Wennborg	0867b151c9	Re-commit r235560: Switch lowering: extract jump tables and bit tests before building binary tree (PR22262) Third time's the charm. The previous commit was reverted as a reverse for-loop in SelectionDAGBuilder::lowerWorkItem did 'I--' on an iterator at the beginning of a vector, causing asserts when using debugging iterators. This commit fixes that. llvm-svn: 235608	2015-04-23 16:45:24 +00:00
Krzysztof Parzyszek	876a19d855	[Hexagon] Shrink-wrap stack frame (Hexagon-specific) llvm-svn: 235603	2015-04-23 16:05:39 +00:00
Toma Tabacu	7fc89d2141	[mips] [IAS] Move NOP emission after pseudo-instruction expansion. NFC. As suggested in the review for http://reviews.llvm.org/D8537. llvm-svn: 235601	2015-04-23 14:48:38 +00:00
Aaron Ballman	0be238cebd	Revert r235560; this commit was causing several failed assertions in Debug builds using MSVC's STL. The iterator is being used outside of its valid range. llvm-svn: 235597	2015-04-23 13:41:59 +00:00
Filipe Cabecinhas	6621cb7478	Be more strict about the operand for the array type in BitcodeReader Summary: Bug found with AFL fuzz. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9016 llvm-svn: 235596	2015-04-23 13:38:21 +00:00
Filipe Cabecinhas	ee48feadfd	Verify sizes when trying to read a BitcodeAbbrevOp Summary: Make sure the abbrev operands are valid and that we can read/skip them afterwards. Bug found with AFL fuzz. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9030 llvm-svn: 235595	2015-04-23 13:25:35 +00:00
Simon Pilgrim	86b034bae9	[DAGCombiner] Remove extra bitcasts surrounding vector shuffles Patch to remove extra bitcasts from shuffles, this is often a legacy of XformToShuffleWithZero being used to combine bitmaskings (of float vectors bitcast to integer vectors) into shuffles: bitcast(shuffle(bitcast(s0),bitcast(s1))) -> shuffle(s0,s1) Differential Revision: http://reviews.llvm.org/D9097 llvm-svn: 235578	2015-04-23 08:43:13 +00:00
Karthik Bhat	24e6cc2de4	Move common loop utility function isInductionPHI into LoopUtils.cpp This patch refactors the definition of common utility function "isInductionPHI" to LoopUtils.cpp. This fixes compilation error when configured with -DBUILD_SHARED_LIBS=ON llvm-svn: 235577	2015-04-23 08:29:20 +00:00
Karthik Bhat	8210fdf26e	Add support to interchange loops with reductions. This patch enables interchanging of tightly nested loops with reductions. Differential Revision: http://reviews.llvm.org/D8314 llvm-svn: 235571	2015-04-23 04:51:44 +00:00
Andrew Kaylor	43e1d76278	[WinEH] Don't skip landing pads that end with an unreachable instruction. llvm-svn: 235563	2015-04-23 00:20:44 +00:00
Hans Wennborg	15823d49b6	Switch lowering: extract jump tables and bit tests before building binary tree (PR22262) This is a re-commit of r235101, which also fixes the problems with the previous patch: - Switches with only a default case and non-fallthrough were handled incorrectly - The previous patch tickled a bug in PowerPC Early-Return Creation which is fixed here. > This is a major rewrite of the SelectionDAG switch lowering. The previous code > would lower switches as a binary tre, discovering clusters of cases > suitable for lowering by jump tables or bit tests as it went along. To increase > the likelihood of finding jump tables, the binary tree pivot was selected to > maximize case density on both sides of the pivot. > > By not selecting the pivot in the middle, the binary trees would not always > be balanced, leading to performance problems in the generated code. > > This patch rewrites the lowering to search for clusters of cases > suitable for jump tables or bit tests first, and then builds the binary > tree around those clusters. This way, the binary tree will always be balanced. > > This has the added benefit of decoupling the different aspects of the lowering: > tree building and jump table or bit tests finding are now easier to tweak > separately. > > For example, this will enable us to balance the tree based on profile info > in the future. > > The algorithm for finding jump tables is quadratic, whereas the previous algorithm > was O(n log n) for common cases, and quadratic only in the worst-case. This > doesn't seem to be major problem in practice, e.g. compiling a file consisting > of a 10k-case switch was only 30% slower, and such large switches should be rare > in practice. Compiling e.g. gcc.c showed no compile-time difference. If this > does turn out to be a problem, we could limit the search space of the algorithm. > > This commit also disables all optimizations during switch lowering in -O0. > > Differential Revision: http://reviews.llvm.org/D8649 llvm-svn: 235560	2015-04-22 23:14:56 +00:00
David Majnemer	7d0e99c601	[InstCombine] Use a more targeted fix instead of r235544 Only clear out the NSW/NUW flags if we are optimizing 'add'/'sub' while taking advantage that the sign bit is not set. We do this optimization to further shrink the mask but shrinking the mask isn't NSW/NUW preserving in this case. llvm-svn: 235558	2015-04-22 22:42:05 +00:00
Reid Kleckner	64a2a6a473	[SEH] Remove the old __C_specific_handler code now that WinEHPrepare works This removes the -sehprepare flag and makes __C_specific_handler functions always to use WinEHPrepare. This was tested by building all of chromium_builder_tests and running a few tests that use SEH, but if something breaks, we can revert this. llvm-svn: 235557	2015-04-22 22:13:09 +00:00
Lang Hames	34cfa49b66	[RuntimeDyld][COFF] Add external symbol resolution support to RuntimeDyldCOFF. Patch by Andy Ayers. Thanks Andy! llvm-svn: 235554	2015-04-22 21:38:37 +00:00
Krzysztof Parzyszek	952d951418	[Hexagon] Some cleanup of instruction selection code llvm-svn: 235552	2015-04-22 21:17:00 +00:00
Reid Kleckner	fd7df284b8	[WinEH] Demote values and phis live across exception handlers up front In particular, this handles SSA values that are live out of a handler. The existing code only handles values that are live in to a handler. It also handles phi nodes in the block where normal control should resume after the end of a catch handler. When EH return points have phi nodes, we need to split the return edge. It is impossible for phi elimination to emit copies in the previous block if that block gets outlined. The indirectbr that we leave in the function is only notional, and is eliminated from the MachineFunction CFG early on. Reviewers: majnemer, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D9158 llvm-svn: 235545	2015-04-22 21:05:21 +00:00
David Majnemer	fe58d13a17	[InstCombine] Clear out nsw/nuw if we modify computation in the chain An nsw/nuw operation relies on the values feeding into it to not overflow if 'poison' is not to be produced. This means that optimizations which make modifications to the bottom of a chain (like SimplifyDemandedBits) must strip out nsw/nuw if they cannot ensure that they will be preserved. This fixes PR23309. llvm-svn: 235544	2015-04-22 20:59:28 +00:00
Krzysztof Parzyszek	cd97c985c7	[Hexagon] Use A2_tfrsi for constant pool and jump table addresses llvm-svn: 235535	2015-04-22 18:25:53 +00:00
David Blaikie	d2db881e85	Revert "[opaque pointer type] Avoid using PointerType::getElementType for a few cases of CallInst" This reverts commit r235458. It looks like this might be breaking something LTO-ish. Looking into it & will recommit with a fix/test case/etc once I've got more to go on. llvm-svn: 235533	2015-04-22 18:16:49 +00:00
Pete Cooper	037b700b7f	[AArch64] Use MachineRegisterInfo instead of LiveIntervals to calculate liveness. NFC. The CondOpt pass currently uses LiveIntervals to set the dead flag on a def. This patch uses MachineRegisterInfo::use_empty instead as that is equivalent to the def being dead. This removes an instance of LiveIntervals in the pass manager pipeline and saves 3.8% of compile time on llc conpiled for AArch64. Reviewed by Chad Rosier and Zhaoshi. llvm-svn: 235532	2015-04-22 18:05:13 +00:00
Sanjay Patel	c96ee08016	don't repeat function names in comments; NFC llvm-svn: 235531	2015-04-22 18:04:46 +00:00
Krzysztof Parzyszek	05902163b6	[Hexagon] Consider constant-extended offsets to be valid llvm-svn: 235529	2015-04-22 17:51:26 +00:00
Luqman Aden	c76f470c2d	Test commit: fix typo in comment. llvm-svn: 235526	2015-04-22 17:42:37 +00:00
Krzysztof Parzyszek	9ee04e401a	Fix Windows build break: use LLVM_FUNCTION_NAME instead of __func__. llvm-svn: 235525	2015-04-22 17:19:44 +00:00
Matt Arsenault	deaef8e24b	R600: Fix always inline pass breaking noinline functions No test since calls are not actually supported yet. llvm-svn: 235524	2015-04-22 17:10:44 +00:00
Krzysztof Parzyszek	4fa2a9f7fd	[Hexagon] Overhaul of stack object allocation - Use static allocation for aligned stack objects. - Simplify dynamic stack object allocation. - Simplify elimination of frame-indices. llvm-svn: 235521	2015-04-22 16:43:53 +00:00
David Blaikie	e169e8206b	[opaque pointer type] Use pointee type retrieved from asm, rather than accessing it via the pointer type llvm-svn: 235520	2015-04-22 16:37:35 +00:00
Sanjay Patel	cab567873f	[x86] Add store-folded memop patterns for vcvtps2ph Differential Revision: http://reviews.llvm.org/D7296 llvm-svn: 235517	2015-04-22 16:11:19 +00:00
Krzysztof Parzyszek	6bbcb31fda	[Hexagon] Treat CFI as solo instructions llvm-svn: 235516	2015-04-22 15:47:35 +00:00
Krzysztof Parzyszek	badf3a6356	[Hexagon] Implement HexagonInstPrinter::printRegName llvm-svn: 235514	2015-04-22 15:38:17 +00:00
Brendon Cahoon	f9751ad1b0	Fix a type mismatch assert in SCEV division An assert was triggered when attempting to create a new SCEV with operands of different types in the visitAddRecExpr. In this test case, the operand types of the numerator and denominator are different. The SCEV division code should generate a conservative answer when this happens. Differential Revision: http://reviews.llvm.org/D9021 llvm-svn: 235511	2015-04-22 15:06:40 +00:00
Andrea Di Biagio	6cd2f42fac	[X86][AVX] Fix failure due to a missing ISel pattern to select VBROADCAST nodes (PR23259). This fixes a regression introduced at revision 218263. On AVX, if we optimize for size, a splat build_vector of a load is lowered into a VBROADCAST node. This is done even if the value type of the splat build_vector node is v2i64. Since AVX doesn't support v2f64/v2i64 broadcasts, revision 218263 added two extra tablegen patterns to allow selecting a VMOVDDUPrm from an X86VBroadcast where the scalar element comes from a loadi64/loadf64. However, revision 218263 forgot to add an extra fallback pattern for the case where we have a X86VBroadcast of a loadi64 with multiple uses. This patch adds the missing tablegen pattern in X86InstrSSE.td. This patch also adds an extra test to 'splat-for-size.ll' to verify that ISel doesn't crash with a 'fatal error in the backend' due to a missing AVX pattern to select v2i64 X86ISD::BROADCAST nodes. llvm-svn: 235509	2015-04-22 14:53:39 +00:00
Olivier Sallenave	c587bee405	Fixed logic to enable complex FMA formation. llvm-svn: 235508	2015-04-22 14:07:26 +00:00
Zoran Jovanovic	b59a541926	[mips][microMIPSr6] Implement mips32 to microMIPSr6 mapping support Differential Revision: http://reviews.llvm.org/D8661 llvm-svn: 235505	2015-04-22 13:27:34 +00:00
Hal Finkel	0d49cf2645	[DAGCombine] Disable select(c, load,load) for indexed loads This turned up after r235333, but was a pre-existing bug. The optimization which transforms select(c, load, load) into a load of a select of the addresses does not handle indexed loads (pre/post inc/dec). However, it did not check for them either, leading to a crash if it tried to transform one of them. llvm-svn: 235497	2015-04-22 11:32:25 +00:00
Vasileios Kalintiris	e7508c9fc7	Revert "[mips][FastISel] Implement shift ops for Mips fast-isel." This reverts commit r235194. It was causing a failure in FastISel buildbots due to sign-extension issues. llvm-svn: 235495	2015-04-22 10:08:46 +00:00
James Molloy	cd2334e86e	[AArch64] Disable complex GEP optimization by default. Enough concerns were raised that this optimization is pessimising some code patterns. The obvious fix, to add a Reassociate run afterwards, causes even more pessimisation in some cases due to fewer complex addressing modes being matched. As there isn't a trivial fix for this, backing this out by default until someone gets a chance to fix the addressing mode matcher. llvm-svn: 235491	2015-04-22 09:11:38 +00:00
Filipe Cabecinhas	ea79c5b4f7	Have more strict type checks when creating BinOp nodes in BitcodeReader Summary: Bug found with AFL. Reviewers: rafael, bkramer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9015 llvm-svn: 235489	2015-04-22 09:06:21 +00:00

1 2 3 4 5 ...

78969 Commits