llvm-project

Commit Graph

Author	SHA1	Message	Date
Kostya Serebryany	7ec0c56e07	[libFuzzer] get rid of UserSuppliedFuzzer; NFC llvm-svn: 260798	2016-02-13 03:25:16 +00:00
Kostya Serebryany	a399221c32	[libFuzzer] simplify the code around Random. NFC llvm-svn: 260797	2016-02-13 03:00:53 +00:00
Kostya Serebryany	ecab57b3ce	[libFuzzer] remove UserSuppliedFuzzer from the interface (it was a bad idea). llvm-svn: 260796	2016-02-13 02:39:30 +00:00
Kostya Serebryany	22cc5e2375	[libFuzzer] provide a plain C interface for custom mutators (experimental) llvm-svn: 260794	2016-02-13 02:29:38 +00:00
Tom Stellard	4409051d00	AMDGPU/SI: Add llvm.amdgcn.mov.dpp intrinsic This intrinsic will be used to expose dpp functionality to higher-level languages. It will map to the dpp version of v_mov_b32. llvm-svn: 260792	2016-02-13 02:09:49 +00:00
Keno Fischer	7c7c3e3591	[Cloning] Clone every Function's Debug Info Summary: Export the CloneDebugInfoMetadata utility, which clones all debug info associated with a function into the first module. Also use this function in CloneModule on each function we clone (the CloneFunction entrypoint already does this). Without this, cloning a module will lead to DI quality regressions, especially since r252219 reversed the Function <-> DISubprogram edge (before we could get lucky and have this edge preserved if the DISubprogram itself was, e.g. due to location metadata). This was verified to fix missing debug information in julia and a unittest to verify the new behavior is included. Patch by Yichao Yu! Thanks! Reviewers: loladiro, pcc Differential Revision: http://reviews.llvm.org/D17165 llvm-svn: 260791	2016-02-13 02:04:29 +00:00
Matt Arsenault	5e845e54f5	Add AMDGPU related triple vendors/OSes As support expands to more runtimes, we'll need to distinguish between more than just HSA and unknown. This also lets us stop using unknown everywhere. llvm-svn: 260790	2016-02-13 01:56:21 +00:00
Matt Arsenault	d2759212b8	AMDGPU: Cleanup includes and random macros llvm-svn: 260784	2016-02-13 01:24:08 +00:00
Matt Arsenault	ce56a0ef54	AMDGPU: Add intrinsics for sin/cos These provide direct access to the hardware instruction without the unit conversion that llvm.sin/llvm.cos lowering requires. llvm-svn: 260782	2016-02-13 01:19:56 +00:00
Matt Arsenault	79963e80b8	AMDGPU: Rename intrinsic to better match instruction name Also fixes missing f32 test. llvm-svn: 260780	2016-02-13 01:03:00 +00:00
Tom Stellard	607051640c	AMDGPU/SI: Add instruction defs for VOP1 DPP instructions Reviewers: nhaustov, cfang, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17159 llvm-svn: 260774	2016-02-13 00:51:31 +00:00
Matt Arsenault	16f48d7c55	AMDGPU: Fix broken condition causing warning llvm-svn: 260773	2016-02-13 00:36:10 +00:00
Pirama Arumuga Nainar	7476bc89e9	Don't combine fp_round (fp_round x) if f80 to f16 is generated Summary: This patch skips DAG combine of fp_round (fp_round x) if it results in an fp_round from f80 to f16. fp_round from f80 to f16 always generates an expensive (and as yet, unimplemented) libcall to __truncxfhf2. This prevents selection of native f16 conversion instructions from f32 or f64. Moreover, the first (value-preserving) fp_round from f80 to either f32 or f64 may become a NOP in platforms like x86. Reviewers: ab Subscribers: srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D17221 llvm-svn: 260769	2016-02-13 00:08:05 +00:00
Alexey Samsonov	7217d27ee6	Fix Windows buildbot breakage. llvm-svn: 260766	2016-02-12 23:51:06 +00:00
Tom Stellard	bc4497b13c	AMDGPU/SI: Detect uniform branches and emit s_cbranch instructions Reviewers: arsenm Subscribers: mareko, MatzeB, qcolombet, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16603 llvm-svn: 260765	2016-02-12 23:45:29 +00:00
Yunzhong Gao	0de36ec169	Disable the vzeroupper insertion pass on PS4. Differential Revision: http://reviews.llvm.org/D16837 llvm-svn: 260764	2016-02-12 23:37:57 +00:00
Derek Schuff	51699a83cd	[WebAssembly] Report more meaningful error messages for some unsupported ops. Computed gotos and RETURNADDR may never be supported; we can do FRAMEADDR in the future. llvm-svn: 260759	2016-02-12 22:56:03 +00:00
Krzysztof Parzyszek	7793ddb043	[Hexagon] Optimize stack slot spills Replace spills to memory with spills to registers, if possible. This applies mostly to predicate registers (both scalar and vector), since they are very limited in number. A spill of a predicate register may happen even if there is a general-purpose register available. In cases like this the stack spill/reload may be eliminated completely. This optimization will consider all stack objects, regardless of where they came from and try to match the live range of the stack slot with a dead range of a register from an appropriate register class. llvm-svn: 260758	2016-02-12 22:53:35 +00:00
Krzysztof Parzyszek	abb17e5f41	[Hexagon] Mark HVX registers as volatile llvm-svn: 260753	2016-02-12 22:26:44 +00:00
Derek Schuff	3114fc14e0	[WebAssembly] Update test expectations after r260737 llvm-svn: 260750	2016-02-12 22:05:08 +00:00
Krzysztof Parzyszek	79a886be06	[Hexagon] Recognize more cases in copyPhysReg and stack slot load/store llvm-svn: 260748	2016-02-12 21:56:41 +00:00
Reid Kleckner	876330d53a	[codeview] Describe local variables in registers llvm-svn: 260746	2016-02-12 21:48:30 +00:00
Rong Xu	bb49490de1	[PGO] Add another interface for annotateValueSite Add another interface to function annotateValueSite() which directly uses the ValueData array. Differential Revision: http://reviews.llvm.org/D17108 llvm-svn: 260741	2016-02-12 21:36:17 +00:00
Dan Gohman	a6771b37f8	[WebAssembly] Fix byval for empty types. llvm-svn: 260740	2016-02-12 21:30:18 +00:00
Chad Rosier	026f15e687	[AArch64] Enable post-RA MI scheduler for Kryo. This should have landed in r260686. llvm-svn: 260739	2016-02-12 21:27:33 +00:00
Dan Gohman	a187ab2aeb	[WebAssembly] Fix insertion of a BLOCK in a loop header that also ends a BLOCK. llvm-svn: 260737	2016-02-12 21:19:25 +00:00
Andrew Kaylor	d1188ddd33	[WinEH] Prevent EH state numbering from skipping nested cleanup pads that never return Differential Revision: http://reviews.llvm.org/D17208 llvm-svn: 260733	2016-02-12 21:10:16 +00:00
Chad Rosier	81362a8599	[LIR] Allow merging of memsets in negatively strided loops. Last part of PR25166. llvm-svn: 260732	2016-02-12 21:03:23 +00:00
Justin Lebar	6086c6a387	Fix typo in comment. llvm-svn: 260731	2016-02-12 21:01:37 +00:00
Justin Lebar	db63949e8d	[SimplifyCFG] Don't fold conditional branches that contain calls to convergent functions. Summary: Performing this optimization duplicates the call to the convergent function and adds new control-flow dependencies, which is a no-no. Reviewers: jingyue Subscribers: broune, hfinkel, tra, resistor, joker.eph, arsenm, llvm-commits, mzolotukhin Differential Revision: http://reviews.llvm.org/D17128 llvm-svn: 260730	2016-02-12 21:01:36 +00:00
Justin Lebar	df04d2a1f1	[LoopRotate] Don't perform loop rotation if the loop header calls a convergent function. Summary: Calls to convergent functions can be duplicated, but only if the duplicates are not control-flow dependent on any additional values. Loop rotation doesn't meet the bar. Reviewers: jingyue Subscribers: mzolotukhin, llvm-commits, arsenm, joker.eph, resistor, tra, hfinkel, broune Differential Revision: http://reviews.llvm.org/D17127 llvm-svn: 260729	2016-02-12 21:01:33 +00:00
Justin Lebar	144c5a6c15	Add convergent property to CodeMetrics. Summary: No functional changes. Reviewers: jingyue, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17126 llvm-svn: 260728	2016-02-12 21:01:31 +00:00
Krzysztof Parzyszek	feb65a3f8b	[Hexagon] Recognize more instructions in isLoadFromStackSlot/isStoreToStackSlot llvm-svn: 260725	2016-02-12 20:54:15 +00:00
Quentin Colombet	232f447782	Get rid of some GLOBAL_ISEL ifdefs that should be harmless for code size. More to come, but those were easy. llvm-svn: 260723	2016-02-12 20:41:24 +00:00
David Majnemer	01674939b2	Remove unused variable llvm-svn: 260722	2016-02-12 20:33:51 +00:00
Philip Reames	96fccc2d09	[GVN] Common code for local and non-local load availability [NFCI] The attached patch removes all of the block local code for performing X-load forwarding by reusing the code used in the non-local case. The motivation here is to remove duplication and in the process increase our test coverage of some fairly tricky code. I have some upcoming changes I'll be proposing in this area and wanted to have the code cleaned up a bit first. Note: The review for this mostly happened in email which didn't make it to phabricator on the 258882 commit thread. Differential Revision: http://reviews.llvm.org/D16608 llvm-svn: 260711	2016-02-12 19:24:57 +00:00
Chad Rosier	4acff96646	[LIR] Partially revert r252926(NFC), which introduced a very subtle change. In short, before r252926 we were comparing an unsigned (StoreSize) against an APInt (Stride), which is fine and well. After we were zero extending the Stride and then converting to an unsigned, which is not the same thing. Obviously, Strides can also be negative. This commit just restores the original behavior. AFAICT, it's not possible to write a test case to expose the issue because the code already has checks to make sure the StoreSize can't overflow an unsigned (which prevents the Stride from overflowing an unsigned as well). llvm-svn: 260706	2016-02-12 19:05:27 +00:00
Philip Reames	2b9100dfbd	[LVI] Exploit nsw/nuw when computing constant ranges As the title says. Modelled after similar code in SCEV. This is useful when analysing induction variables in loops which have been canonicalized by other passes. I wrote the tests as non-loops specifically to avoid the generality introduced in http://reviews.llvm.org/D17174. While that can handle many induction variables without needing to exploit nsw, there's no reason not to use it if we've already proven it. Differential Revision: http://reviews.llvm.org/D17177 llvm-svn: 260705	2016-02-12 19:05:16 +00:00
Mehdi Amini	40b369cf5a	GlobalISel is always built since r260566, reflect it in LLVMBuild.txt Other components cannot depend on an optional library in llvm-config From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260701	2016-02-12 18:43:14 +00:00
Krzysztof Parzyszek	fd02aad8fd	[Hexagon] Add utility functions to detect sign- and zero-extending loads llvm-svn: 260698	2016-02-12 18:37:23 +00:00
Krzysztof Parzyszek	996ad1fa00	[Hexagon] Replace expansion of spill pseudo-instructions in frame lowering Rewrite the code to handle all pseudo-instructions in a single pass. This temporarily reverts spill slot optimization that used general-purpose registers to hold values of spilled predicate registers. llvm-svn: 260696	2016-02-12 18:19:53 +00:00
David Majnemer	0f0abc7bc2	[InstCombine] Don't aggressively replace xor with icmp For some cases, InstCombine replaces the sequence of xor/sub instruction followed by cmp instruction with a single cmp instruction. However, this replacement may produce a suboptimal result, especially when the xor/sub has more than one use, as discussed in bug 26465 (https://llvm.org/bugs/show_bug.cgi?id=26465). This patch makes the replacement happen only when xor/sub has only one use. Differential Revision: http://reviews.llvm.org/D16915 Patch by Taewook Oh! llvm-svn: 260695	2016-02-12 18:12:38 +00:00
Tom Stellard	46937ca4e7	[AMDGPU] Assembler: Swap operands of flat_store instructions to match AMD assembler Historically, AMD internal sp3 assembler has flat_store* addr, data format. To match existing code and to enable reuse, change LLVM definitions to match. Also update MC and CodeGen tests. Differential Revision: http://reviews.llvm.org/D16927 Patch by: Nikolay Haustov llvm-svn: 260694	2016-02-12 17:57:54 +00:00
Changpeng Fang	e07f1aa8fa	AMDGPU/SI: Annotate Loops with Constant Condition in SIAnnotateControlFlow pass. Summary: It is possible that the loop condition can be a boolean constant (infinite loop, for example). So we should handle constant condition in annotating a loop. This patch adds this functionality to support annotating constant condition. Reviewers: tstellarAMD, arsenm Subscribers: llvm-commits, arsenm Differential Revision: http://reviews.llvm.org/D15093 llvm-svn: 260692	2016-02-12 17:11:04 +00:00
Krzysztof Parzyszek	7ce3dbcb57	[Hexagon] Remove HexagonExpandPredSpillCode pass This code is dead. The expansion is now done in HexagonFrameLowering. llvm-svn: 260691	2016-02-12 17:09:58 +00:00
Krzysztof Parzyszek	7d5b4db7f9	[Hexagon] Eliminate pseudo instructions for circ/brev loads and stores We can generate the actual instructions from the intrinsics without the need for pseudo-instructions. Also, since the intrinsics have a side-effect in a form of a store, attempt to optimize away loads from the store location. llvm-svn: 260690	2016-02-12 17:01:51 +00:00
Geoff Berry	c25d3bd238	[AArch64] Reduce number of callee-save save/restores. Summary: Before this change, callee-save registers would be rounded up to even pairs of GPRs and FPRs. This change eliminates these extra padding load/stores, though it does keep the stack allocation the same size unless both the GPR and FPR sets have an odd size, in which case one full pair stack slot (16 bytes) is saved. This optimization cannot currently be done for MachO targets since they rely on a fast-path .debug_frame equivalent that can only encode callee-save registers as pairs. Reviewers: t.p.northover, rengolin, mcrosier, jmolloy Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D17000 llvm-svn: 260689	2016-02-12 16:31:41 +00:00
Krzysztof Parzyszek	bdb04d9032	[Hexagon] Handle out-of-range offsets in eliminateFrameIndex Create a virtual register that will hold the actual address and use it with the offset of 0 in the place of the original FI. llvm-svn: 260688	2016-02-12 16:27:23 +00:00
Chad Rosier	cd2be7f084	[AArch64] Add support for Qualcomm Kryo CPU. Machine model description by Dave Estes <cestes@codeaurora.org>. llvm-svn: 260686	2016-02-12 15:51:51 +00:00
Rafael Espindola	cbc31d699b	Delete the deprecated LLVMLinkModules. llvm-svn: 260683	2016-02-12 15:28:45 +00:00
Jun Bum Lim	397eb7b0b3	[AArch64] Merge two adjacent str WZR into str XZR Summary: This change merges adjacent 32 bit zero stores into a 64 bit zero store. e.g., str wzr, [x0] str wzr, [x0, #4] becomes str xzr, [x0] Therefore, four adjacent 32 bit zero stores will be a single stp. e.g., str wzr, [x0] str wzr, [x0, #4] str wzr, [x0, #8] str wzr, [x0, #12] becomes stp xzr, xzr, [x0] Reviewers: mcrosier, jmolloy, gberry, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16933 llvm-svn: 260682	2016-02-12 15:25:39 +00:00
Krzysztof Parzyszek	e59964377c	[Hexagon] Specify vector alignment in DataLayout string The DataLayout can calculate alignment of vectors based on the alignment of the element type and the number of elements. In fact, it is the product of these two values. The problem is that for vectors of N x i1, this will return the alignment of N bytes, since the alignment of i1 is 8 bits. The vector types of vNi1 should be aligned to N bits instead. Provide explicit alignment for HVX vectors to avoid such complications. llvm-svn: 260678	2016-02-12 14:47:38 +00:00
Benjamin Kramer	ac5e36f52e	Fix uninitialized memory read. Found by msan. llvm-svn: 260676	2016-02-12 12:37:21 +00:00
Chandler Carruth	3937bc70f9	[attrs] Simplify the convergent removal to directly use the pre-built node set rather than walking the SCC directly. This directly exposes the functions and has already had null entries filtered out. We also don't need to handle optnone as it has already been handled in the caller -- we never try to remove convergent when there are optnone functions in the SCC. With this change, the code for removing convergent should work with the new pass manager and a different SCC analysis. llvm-svn: 260668	2016-02-12 09:47:49 +00:00
Chandler Carruth	057df3d423	[attrs] Consolidate the test for a non-SCC, non-convergent function call with the test for a non-convergent intrinsic call. While it is possible to use the call records to search for function calls, we're going to do an instruction scan anyways to find the intrinsics, we can handle both cases while scanning instructions. This will also make the logic more amenable to the new pass manager which doesn't use the same call graph structure. My next patch will remove use of CallGraphNode entirely and allow this code to work with both the old and new pass manager. Fortunately, it should also get strictly simpler without changing functionality. llvm-svn: 260666	2016-02-12 09:23:53 +00:00
Matt Arsenault	296b849163	AMDGPU: Set flat_scratch from flat_scratch_init reg This was hardcoded to the static private size, but this would be missing the offset and additional size for someday when we have dynamic sizing. Also stops always initializing flat_scratch even when unused. In the future we should stop emitting this unless flat instructions are used to access private memory. For example this will initialize it almost always on VI because flat is used for global access. llvm-svn: 260658	2016-02-12 06:31:30 +00:00
Mehdi Amini	f71d653879	C API: Remove LLVMGetDataLayout that was deprecated in 3.7 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260657	2016-02-12 06:22:00 +00:00
Chandler Carruth	bbbbec0b54	[attrs] Run clang-format over a newly added routine in function-attrs before I update it to be friendly with the new pass manager. llvm-svn: 260653	2016-02-12 03:07:50 +00:00
Matt Arsenault	24ee0785dd	AMDGPU: Set element_size in private resource descriptor Introduce a subtarget feature for this, and leave the default with the current behavior which assumes up to 16-byte loads/stores can be used. The field also seems to have the ability to be set to 2 bytes, but I'm not sure what that would be used for. llvm-svn: 260651	2016-02-12 02:40:47 +00:00
Kostya Serebryany	9d14e4bb15	[libFuzzer] make -runs=N flag also affect the simple runner (will execute every input N times) llvm-svn: 260649	2016-02-12 02:32:03 +00:00
Matt Arsenault	16f7bcb661	AMDGPU: Fix mishandling alignment when scalarizing vector loads/stores I don't think this was causing any real problems, so I'm not sure how to test for this. llvm-svn: 260646	2016-02-12 02:22:21 +00:00
Matt Arsenault	55d49cfe2d	AMDGPU: Initialize SILowerControlFlow llvm-svn: 260645	2016-02-12 02:16:10 +00:00
Matt Arsenault	806dd0a532	AMDGPU: Remove trailing whitespace llvm-svn: 260644	2016-02-12 02:16:07 +00:00
Evgeniy Stepanov	ba6ca87ffb	[msan] Put msan constructor in a comdat. MSan adds a constructor to each translation unit that calls __msan_init, and does nothing else. The idea is to run __msan_init before any instrumented code. This results in multiple constructors and multiple .init_array entries in the final binary, one per translation unit. This is absolutely unnecessary; one would be enough. This change moves the constructors to a comdat group in order to drop the extra ones. llvm-svn: 260632	2016-02-12 00:37:52 +00:00
Philip Reames	854a84c0b0	[LVI] Improve select handling to use condition This patch teaches LVI to recognize clamp idioms (e.g. select(a > 5, a, 5) will always produce something greater than or equal to 5). The tests end up being somewhat simplistic because trying to exercise the case I actually care about (a loop with a range check on a clamped secondary induction variable) ends up tripping across a couple of other imprecisions in the analysis. Ah, the joys of LVI... Differential Revision: http://reviews.llvm.org/D16827 llvm-svn: 260627	2016-02-12 00:09:18 +00:00
Tim Northover	94bdbd09d7	ARMv7k: use Cortex-A7 by default even for tvOS Also actually test the default CPU from those triples. llvm-svn: 260621	2016-02-11 23:49:08 +00:00
Matthew Simpson	a4e43c5b51	[SLP] Add debug output for extract cost (NFC) llvm-svn: 260614	2016-02-11 23:06:40 +00:00
Quentin Colombet	490cfbe2a2	Re-apply r238452, the bug was in clang and was fixed in r260567. Original commit message: [InstCombine] Fold IntToPtr and PtrToInt into preceding loads. Currently we only fold a BitCast into a Load when the BitCast is its only user. Do the same for any no-op cast. Patch by Philip Pfaffe! Differential Revision: http://reviews.llvm.org/D9152 llvm-svn: 260612	2016-02-11 22:30:41 +00:00
Mike Aizatsky	fcb06b4aa5	[libfuzzer] Removing coverage-related flags from asan options. Summary: Reasons to remove are twofold: - we don't really need coverage=1 for libfuzzer operation - makes controlling coverage for fuzzer processes non-trivial. Differential Revision: http://reviews.llvm.org/D17168 llvm-svn: 260611	2016-02-11 22:20:34 +00:00
Sanjay Patel	ac42fecf74	[x86] simplify getZeroVector() ; NFCI Let DAG.getConstant() handle the splatting; there's no need to repeat that logic here. See also: http://reviews.llvm.org/rL258833 http://reviews.llvm.org/rL260582 llvm-svn: 260609	2016-02-11 22:17:04 +00:00
Mehdi Amini	9c1c3ac627	Revert "Refactor the PassManagerBuilder: extract a "addFunctionSimplificationPasses()"" This reverts commit r260603. I didn't intend to push it :( From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260607	2016-02-11 22:09:11 +00:00
Mehdi Amini	c5bf5ecc1b	Revert "Define the ThinLTO Pipeline" This reverts commit r260604. I didn't intend to push this now. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260606	2016-02-11 22:09:07 +00:00
Mehdi Amini	1462b76cdc	Revert "Add a new insert_as() method to DenseMap and use it for ConstantUniqueMap" This reverts commit r260458. It was backported on an internal branch and broke stage2 build. Since this can lead to weird random crash I'm reverting upstream as well while investigating. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260605	2016-02-11 22:00:36 +00:00
Mehdi Amini	484470d605	Define the ThinLTO Pipeline Summary: Contrary to full LTO, ThinLTO can afford to shift compile time from the frontend to the linker: both phases are parallel. This pipeline is based on the proposal in D13443 for full LTO. We didn't move forward on this proposal because the link was far too long after that. This patch refactors the "function simplification" passes that are part of the inliner loop into a helper function (this part is NFC and can be committed separately to simplify the diff). The ThinLTO pipeline integrates in the regular O2/O3 flow: - The compile phase runs the inliner with a somewhat lighter function simplification. (TODO: tune the inliner thresholds here) This is intended to simplify the IR and get rid of obvious things like linkonce_odr that will be inlined. - The link phase will run the pipeline from the start, extended with some specific passes that leverage the augmented knowledge we have during LTO. Especially after the inliner is done, a sequence of globalDCE/globalOpt is performed, followed by another run of the "function simplification" passes. The measurements on the public test suite as well as on our internal suite show an overall net improvement. The binary size for the clang executable is reduced by 5%. We're still tuning it with the bringup of ThinLTO but this should provide a good starting point. Reviewers: tejohnson Subscribers: joker.eph, llvm-commits, dexonsmith Differential Revision: http://reviews.llvm.org/D17115 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260604	2016-02-11 22:00:31 +00:00
Mehdi Amini	f9a3718c5a	Refactor the PassManagerBuilder: extract a "addFunctionSimplificationPasses()" It is intended to contains the passes run over a function after the inliner is done with a function and before it moves to its callers. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 260603	2016-02-11 22:00:25 +00:00
Quentin Colombet	ccd7725808	[IRTranslator] Use a single virtual register to represent any Value. PR26161. llvm-svn: 260602	2016-02-11 21:48:32 +00:00
Quentin Colombet	1cb8fac171	[AArch64] Implements the lowering of formal arguments for GlobalISel. This is just a trivial implementation: - Support only arguments passed in registers. - Support only "plain" arguments, i.e., no sext/zext attribute. At this point, it is possible to play with the IRTranslator on AArch64: llc -mtriple arm64-<vendor>-<os> -print-machineinstrs <input.ll> -o - -global-isel For now, we only support the translation of program with adds and returns. Follow-up patches are on their way to add a test case (the MIRParser is not ready as it is). llvm-svn: 260600	2016-02-11 21:45:08 +00:00
Tom Stellard	1397d49ef5	AMDGPU/SI: Make sure MIMG descriptors and samplers stay in SGPRs Summary: It's possible to have resource descriptors and samplers stored in VGPRs, either by a VMEM instruction or in the case of samplers, floating-point calculations. When this happens, we need to use v_readfirstlane to copy these values back to sgprs. Reviewers: mareko, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17102 llvm-svn: 260599	2016-02-11 21:45:07 +00:00
Amaury Sechet	2f43208c9a	Add support for phi nodes in the LLVM C API test Summary: This required to add binding to Instruction::removeFromParent so that instruction can be forward declared and then moved at the right place. Reviewers: bogner, chandlerc, echristo, dblaikie, joker.eph, Wallbraker Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17057 llvm-svn: 260597	2016-02-11 21:37:54 +00:00
Quentin Colombet	8fd6718700	[Target] Add a helper function to check if an opcode is invalid after isel. llvm-svn: 260590	2016-02-11 21:16:56 +00:00
Tom Stellard	fa8f204c5b	AMDGPU/SI: When splitting SMRD instructions, add its users to VALU worklist Summary: When we split SMRD instructions into two MUBUFs we were adding the users of the newly created MUBUFs to the VALU worklist. However, the only users these instructions had was the REG_SEQUENCE that was inserted by splitSMRD when the original SMRD instruction was split. We need to make sure to add the users of the original SMRD to the VALU worklist before it is split. I have a test case, but it requires one other bug fix, so it will be added in a later commit. Reviewers: mareko, arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17101 llvm-svn: 260588	2016-02-11 21:14:34 +00:00
Pete Cooper	5562c333b8	Set load alignment on aggregate loads. When optimizing a extractvalue(load), we generate a load from the aggregate type. This load didn't have alignment set and so would get the alignment of the type. This breaks when the type is packed and so the alignment should be lower. For example, loading { int, int } would give us alignment of 4, but the original load from this type may have an alignment of 1 if packed. Reviewed by David Majnemer Differential revision: http://reviews.llvm.org/D17158 llvm-svn: 260587	2016-02-11 21:10:40 +00:00
Matthias Braun	c67f5a6ab1	Revert "LiveIntervalAnalysis: Support moving of subregister defs in handleMove" This broke a bot: http://lab.llvm.org:8011/builders/clang-cmake-aarch64-quick/builds/4703/steps/test-suite/logs/test.log Reverting while I investigate. This reverts commit r260565. llvm-svn: 260586	2016-02-11 21:07:44 +00:00
Derek Schuff	3f0632958b	[WebAssembly] Reformat WebAssemblyFrameLowering and WebAssemblyISelLowering Reviewers: sunfish, jfb Subscribers: jfb, dschuff Differential Revision: http://reviews.llvm.org/D17156 llvm-svn: 260585	2016-02-11 20:57:09 +00:00
Sanjay Patel	e5df1dfb14	[SelectionDAG] change getConstant() to use the input SDLoc when building splat vectors The code change is simple enough: instead of attaching an anonymous SDLoc to splatted vector constants, use the scalar constant's existing SDLoc since that is what is passed into getConstant() as a param. But this changes instruction scheduling, so I'll explain why that happens. The motivation for this patch starts near: http://reviews.llvm.org/rL258833 ...x86's getZeroVector() could be similarly cleaned up and I thought it would be 'NFC'. But when I made that change locally, several x86 codegen tests wiggled. It turns out that the lack of SDLoc consistency in getConstant() changes the way ScheduleDAGRRList behaves. This is because the SDLoc contains 'IROrder' and some DAG scheduler algorithms use IROrder for tie-breaking. Differential Revision: http://reviews.llvm.org/D16972 llvm-svn: 260582	2016-02-11 20:21:24 +00:00
Quentin Colombet	fd9d0a07d8	[GlobalISel] Add the necessary plumbing to lower formal arguments. llvm-svn: 260579	2016-02-11 19:59:41 +00:00
Peter Collingbourne	7c384ccea2	DwarfDebug: emit type units immediately. Rather than storing type units in a vector and emitting them at the end of code generation, emit them immediately and destroy them, reclaiming the memory we were using for their DIEs. In one benchmark carried out against Chromium's 50 largest (by bitcode file size) translation units, total peak memory consumption with type units decreased by median 17%, or by 7% when compared against disabling type units. Tested using check-{llvm,clang}, the GDB 7.5 test suite (with '-fdebug-types-section') and by eyeballing llvm-dwarfdump output on those Chromium translation units with split DWARF both disabled and enabled, and verifying that the only changes were to addresses and abbreviation ordering. Differential Revision: http://reviews.llvm.org/D17118 llvm-svn: 260578	2016-02-11 19:57:46 +00:00
Rafael Espindola	cf98162574	Use copy initialization. We can do it since getMemBuffer returns a unique_ptr. llvm-svn: 260576	2016-02-11 19:54:18 +00:00
Quentin Colombet	5cf7b415cc	[AArch64] Trivial implementation of lower return for the IRTranslator. llvm-svn: 260574	2016-02-11 19:45:27 +00:00
Kevin B. Smith	6a83350bee	[X86] New pass to change byte and word instructions to zero-extending versions. Differential Revision: http://reviews.llvm.org/D17032 llvm-svn: 260572	2016-02-11 19:43:04 +00:00
Reid Kleckner	829365aeef	[codeview] Fix bug around multi-level wrapper inlining If there were wrapper functions with no instructions of their own in the inlining tree, we would fail to emit InlineSite records for them. llvm-svn: 260571	2016-02-11 19:41:47 +00:00
Quentin Colombet	d96f49543d	[AArch64] Plug the beginning of the GlobalISel pipeline. llvm-svn: 260569	2016-02-11 19:35:06 +00:00
Quentin Colombet	2e00253750	Play nice with Visual Studio and attributes llvm-svn: 260568	2016-02-11 19:33:21 +00:00
Quentin Colombet	bde158cbc7	[CMake] Produce an empty library for GlobalISel when not building it. The rationale for this change is that LLVMBuild cannot express conditional dependencies. Therefore, when we start optionally using GlobalISel library for say AArch64, without that change, all the tools that use the AArch64 library would need to explicitly link with GlobalISel when we ask for it. This does not scale. Instead, we will set the dependencies between the target and GlobalISel and if we did not ask to build GlobalISel, the library will just be empty. Thanks to Chris Bieneman and Mehdi Amini for the idea. llvm-svn: 260566	2016-02-11 19:18:27 +00:00
Matthias Braun	33c641bddf	LiveIntervalAnalysis: Support moving of subregister defs in handleMove If two definitions write to independent subregisters then they can be put in any order. LiveIntervalAnalysis::handleMove() did not support this previously because it looks like moving a definition of a vreg past another one. This is a modified version of a patch proposed (two years ago) by Vincent Lejeune! This version does not touch the read-undef flags and is extended for the case of moving a subregister def behind all uses - this can happen for subregister defs that are completely unused. Differential Revision: http://reviews.llvm.org/D9067 llvm-svn: 260565	2016-02-11 19:03:53 +00:00
Quentin Colombet	74d7d2f00b	[GlobalISel] Teach the IRTranslator how to lower returns. llvm-svn: 260562	2016-02-11 18:53:28 +00:00
Tom Stellard	e993451f5c	[AMDGPU] Fix for "v_div_scale_f64 reg, vcc, ..." parsing Summary: Added support for "VOP3Only" attribute in VOP3bInst encoding. Set VOP3Only=1 for V_DIV_SCALE_F64/32 insns. Added support for multi-dest instructions in AMDGPUAs::cvt*(). Added lit test for "V_DIV_SCALE_F64\|F32 vreg,vcc\|sreg,vreg,vreg,vreg". Reviewers: tstellarAMD, arsenm Subscribers: arsenm, SamWot, nhaustov, vpykhtin Differential Revision: http://reviews.llvm.org/D16995 Patch By: Artem Tamazov llvm-svn: 260560	2016-02-11 18:25:26 +00:00
Quentin Colombet	9855111b77	[GlobalISel] Add a type to MachineInstr. We actually need that information only for generic instructions, therefore it would be nice not to have to pay the extra memory consumption for all instructions. Especially because a typed non-generic instruction does not make sense. The question is then, is it possible to have that information in a union or something? My initial thought was that we could have a derived class GenericMachineInstr with additional information, but in practice it makes little to no sense since generic MachineInstrs are likely turned into non-generic ones by just switching the opcode. In other words, we don't want to go through the process of creating a new, non-generic MachineInstr, object each time we do this switch. The memory benefit probably is not worth the extra compile time. Another option would be to keep the type of the MachineInstr in a side table. This would induce an extra indirection though. Anyway, I will file a PR to discuss about it and remember we need to come back to it at some point. llvm-svn: 260558	2016-02-11 18:22:37 +00:00
Artem Belevich	a8455f2e2b	[NVPTX] emit .file directives for files referenced by subprograms. .. so .loc directives referring to those files work correctly. Differential Revision: http://reviews.llvm.org/D17086 llvm-svn: 260557	2016-02-11 18:21:47 +00:00
Quentin Colombet	37a09a8428	[GlobalISel] Add a hook in TargetConfigPass to run GlobalISel. llvm-svn: 260553	2016-02-11 17:57:22 +00:00
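
Note on 4acff96646 ([LIR] partial revert of r252926): the message says that zero extending a Stride and converting it to an unsigned is not the same as comparing it as a signed quantity. The sketch below illustrates only that arithmetic point. It is not the LLVM code: the 32-bit width and the helper names zeroExtend, signExtend and strideMatchesStore are assumptions made up for illustration, and no APInt is involved.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helpers (not LLVM APIs) that model a 32-bit stride.

// Zero extension: reinterpret the bits as unsigned, then widen.
// A negative stride turns into a large positive number.
constexpr std::uint64_t zeroExtend(std::int32_t stride) {
    return static_cast<std::uint32_t>(stride);
}

// Sign extension: widen while keeping the numeric value.
constexpr std::int64_t signExtend(std::int32_t stride) {
    return stride;
}

// True when the stride's magnitude equals the store size,
// regardless of the loop direction.
constexpr bool strideMatchesStore(std::int32_t stride, std::uint32_t storeSize) {
    const std::int64_t s = signExtend(stride);
    return (s < 0 ? -s : s) == static_cast<std::int64_t>(storeSize);
}

// Small self-check for a stride of -4 and a 4-byte store.
inline void selfCheck() {
    assert(zeroExtend(-4) != 4);          // zero extension loses the match
    assert(strideMatchesStore(-4, 4));    // signed handling keeps it
}
```

With a stride of -4 and a 4-byte store, the zero-extended value no longer equals the store size, while the sign-preserving comparison still matches.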

87228 Commits