llvm-project

Commit Graph

Author	SHA1	Message	Date
Stanislav Mekhanoshin	478b81982f	[AMDGPU] Unroll more to eliminate phis and conditions Increase threshold to unroll a loop which contains an "if" statement whose condition defined by a PHI belonging to the loop. This may help to eliminate if region and potentially even PHI itself, saving on both divergence and registers used for the PHI. Add a small bonus for each of such "if" statements. Differential Revision: https://reviews.llvm.org/D31693 llvm-svn: 299779	2017-04-07 16:26:28 +00:00
Rafael Espindola	e4c8b9b78c	Simplify this further. Thanks to Rui for noticing it. llvm-svn: 299777	2017-04-07 16:10:46 +00:00
Dehao Chen	58fa724494	Use PMADDWD to expand reduction in a loop Summary: PMADDWD can help improve 8/16 bit integer mutliply-add operation performance for cases like: for (int i = 0; i < count; i++) a += x[i] * y[i]; Reviewers: wmi, davidxl, hfinkel, RKSimon, zvi, mkuper Reviewed By: mkuper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31679 llvm-svn: 299776	2017-04-07 15:41:52 +00:00
Reid Kleckner	d3c87b5332	[lit] Try using process pools by default again Both pickling errors encountered on clang bots and Darwin compiler-rt should now be fixed. This has no impact on testing time on Linux, and on Windows goes from 88s to 63s for 'check'. The tests pass on Mac, but I haven't compared execution time. llvm-svn: 299775	2017-04-07 15:28:32 +00:00
Saleem Abdulrasool	b51bcaf2f0	Sema: prevent __declspec(naked) use on x64 MSDN (https://msdn.microsoft.com/en-us/library/h5w10wxs.aspx) indicates that `__declspec(naked)` is only permitted on x86 and ARM targets. Testing with cl does confirm this behaviour. Provide a warning for use of `__declspec(naked)` on x64. llvm-svn: 299774	2017-04-07 15:13:47 +00:00
Simon Atanasyan	60d0e982f0	[ELF] Do not pass GOT section as an argument to handleARMTlsRelocation and handleMipsTlsRelocation functions. NFC Both functions always use the same GOT sections In<ELFT>::Got and In<ELFT>::MipsGot respectively, so we do not need to pass them as an argument. llvm-svn: 299773	2017-04-07 15:05:44 +00:00
Igor Breger	2953788c36	[GlobalISel] implement narrowing for G_CONSTANT. Summary: [GlobalISel] implement narrowing for G_CONSTANT. Reviewers: bogner, zvi, t.p.northover Reviewed By: t.p.northover Subscribers: llvm-commits, dberris, rovka, kristof.beyls Differential Revision: https://reviews.llvm.org/D31744 llvm-svn: 299772	2017-04-07 14:41:59 +00:00
Gor Nishanov	138ad6c9c0	[coroutines] Insert spills of PHI instructions correctly Summary: Fix a bug where we were inserting a spill in between the PHIs in the beginning of the block. Consider this fragment: ``` begin: %phi1 = phi i32 [ 0, %entry ], [ 2, %alt ] %phi2 = phi i32 [ 1, %entry ], [ 3, %alt ] %sp1 = call i8 @llvm.coro.suspend(token none, i1 false) switch i8 %sp1, label %suspend [i8 0, label %resume i8 1, label %cleanup] resume: call i32 @print(i32 %phi1) ``` Unless we are spilling the argument or result of the invoke, we were always inserting the spill immediately following the instruction. The fix adds a check that if the spilled instruction is a PHI Node, select an appropriate insert point with `getFirstInsertionPt()` that skips all the PHI Nodes and EH pads. Reviewers: majnemer, rnk Reviewed By: rnk Subscribers: qcolombet, EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D31799 llvm-svn: 299771	2017-04-07 14:16:49 +00:00
Matthew Simpson	11fe2e9f2b	Reapply r298620: [LV] Vectorize GEPs This patch reapplies r298620. The original patch was reverted because of two issues. First, the patch exposed a bug in InstCombine that caused the Chromium builds to fail (PR32414). This issue was fixed in r299017. Second, the patch introduced a bug in the vectorizer's scalars analysis that caused test suite builds to fail on SystemZ. The scalars analysis was too aggressive and marked a memory instruction scalar, even though it was going to be vectorized. This issue has been fixed in the current patch and several new test cases for the scalars analysis have been added. llvm-svn: 299770	2017-04-07 14:15:34 +00:00
Simon Dardis	9f6a5cd91d	[mips] Remove usage of debug only variable (NFC) Fix the lld-x86_64-darwin13 buildbot by removing the declaration of a debug only variable and instead moving the value into the debug statement. llvm-svn: 299769	2017-04-07 13:49:12 +00:00
Petar Jovanovic	bc54eb89ad	[mips][msa] Fix generation of bm(n)zi and bins[lr]i instructions We have two cases here, the first one being the following instruction selection from the builtin function: bm(n)zi builtin -> vselect node -> bins[lr]i machine instruction In case of bm(n)zi having an immediate which has either its high or low bits set, a bins[lr] instruction can be selected through the selectVSplatMask[LR] function. The function counts the number of bits set, and that value is being passed to the bins[lr]i instruction as its immediate, which in turn copies immediate modulo the size of the element in bits plus 1 as per specs, where we get the off-by-one-error. The other case is: bins[lr]i -> vselect node -> bsel.v In this case, a bsel.v instruction gets selected with a mask having one bit less set than required. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D30579 llvm-svn: 299768	2017-04-07 13:31:36 +00:00
Dmitry Preobrazhensky	e5147247b8	[AMDGPU][MC] Fix for Bug 28211 + LIT tests - corrected DS_GWS_* opcodes (see VI_Shader_Programming#16.pdf for detailed description) - address operand is not used - several opcodes have data operand - all opcodes have offset modifier - DS_AND_SRC2_B32: corrected typo in mnemo - DS_WRAP_RTN_F32 replaced with DS_WRAP_RTN_B32 - added CI/VI opcodes: - DS_CONDXCHG32_RTN_B64 - DS_GWS_SEMA_RELEASE_ALL - added VI opcodes: - DS_CONSUME - DS_APPEND - DS_ORDERED_COUNT Differential Revision: https://reviews.llvm.org/D31707 llvm-svn: 299767	2017-04-07 13:07:13 +00:00
Simon Dardis	6470ff0b24	[SelectionDAG] Enable target specific vector scalarization of calls and returns By target hookifying getRegisterType, getNumRegisters, getVectorBreakdown, backends can request that LLVM to scalarize vector types for calls and returns. The MIPS vector ABI requires that vector arguments and returns are passed in integer registers. With SelectionDAG's new hooks, the MIPS backend can now handle LLVM-IR with vector types in calls and returns. E.g. 'call @foo(<4 x i32> %4)'. Previously these cases would be scalarized for the MIPS O32/N32/N64 ABI for calls and returns if vector types were not legal. If vector types were legal, a single 128bit vector argument would be assigned to a single 32 bit / 64 bit integer register. By teaching the MIPS backend to inspect the original types, it can now implement the MIPS vector ABI which requires a particular method of scalarizing vectors. Previously, the MIPS backend relied on clang to scalarize types such as "call @foo(<4 x float> %a) into "call @foo(i32 inreg %1, i32 inreg %2, i32 inreg %3, i32 inreg %4)". This patch enables the MIPS backend to take either form for vector types. Reviewers: zoran.jovanovic, jaydeep, vkalintiris, slthakur Differential Revision: https://reviews.llvm.org/D27845 llvm-svn: 299766	2017-04-07 13:03:52 +00:00
Philip Pfaffe	78265cd237	Fix missing .git/indexloadPolly in ensure-correct-tile-sizes testcase llvm-svn: 299765	2017-04-07 12:55:26 +00:00
Haojian Wu	0bc5e02799	Fix compiler warnings: "ISO c99 requires rest arguments to be used" on the test file. llvm-svn: 299764	2017-04-07 12:37:32 +00:00
Jonas Paulsson	cad72efee6	[SystemZ] Check for presence of vector support in SystemZISelLowering A test case was found with llvm-stress that caused DAGCombiner to crash when compiling for an older subtarget without vector support. SystemZTargetLowering::combineTruncateExtract() should do nothing for older subtargets. This check was placed in canTreatAsByteVector(), which also helps in a few other places. Review: Ulrich Weigand llvm-svn: 299763	2017-04-07 12:35:11 +00:00
Jonas Paulsson	16100c637e	[SystemZ] Remove confusing comment in combineEXTRACT_VECTOR_ELT() It isn't just one-element vectors that can appear here. llvm-svn: 299762	2017-04-07 12:11:41 +00:00
Diana Picus	fed80723c0	[ARM] GlobalISel: Test hard float properly It turns out -float-abi=hard doesn't set the hard float calling convention for libcalls. We need to use a hard float triple instead (e.g. gnueabihf). llvm-svn: 299761	2017-04-07 12:04:24 +00:00
NAKAMURA Takumi	a77d32b7af	CloneDetection.h: Fix warnings. [-Wdocumentation] llvm-svn: 299760	2017-04-07 11:06:31 +00:00
Laszlo Nagy	6d9a7e8aaf	[scan-build-py] merge runner module to analyzer Differential Revision: https://reviews.llvm.org/D31237 llvm-svn: 299759	2017-04-07 11:04:49 +00:00
Krasimir Georgiev	50117372db	[clangd] Extract FsPath from file:// uri Patch contributed by stanionascu! rfc8089#appendix-E.2 specifies that paths can begin with a drive letter e.g. as file:///c:/. In this case just consuming front file:// is not enough and the 3rd slash must be consumed to produce a valid path on windows. The patch introduce a generic way of converting an uri to a filesystem path and back. Differential Revision: https://reviews.llvm.org/D31401 llvm-svn: 299758	2017-04-07 11:03:26 +00:00
Sam Kolton	6e79529db4	[AMDGPU] Move SiShrinkInstruction and SDWAPeephole to SSAOptimization passes Summary: Difference beetween PreRegAlloc() and MachineSSAOptimization() are that the former is run despite of -O0 optimization level. In my undestanding SiShrinkInstructions and SDWAPeephole shouldn't run when optimizations are disabled. With this change order of passes will not change. Reviewers: arsenm, vpykhtin, rampitec Subscribers: qcolombet, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31705 llvm-svn: 299757	2017-04-07 10:53:12 +00:00
Diana Picus	3c608448e1	[ARM] GlobalISel: Support frem for 64-bit values Legalize to a libcall. llvm-svn: 299756	2017-04-07 10:50:02 +00:00
James Henderson	9d9a663731	[ELF] Recommit r299635 to pad x86 executable sections with 0xcc This follows r299748 which fixed a latent bug the original commit exposed. llvm-svn: 299755	2017-04-07 10:36:42 +00:00
Sanne Wouda	db1bdf472a	Skip Unicode character expansion in assembly files Summary: When using the C preprocessor with assembly files, either with a capital `S` file extension, or with `-xassembler-with-cpp`, the Unicode escape sequence `\u` is ignored. The `\u` pattern can be used for expanding a macro argument that starts with `u`. Author: Salman Arif <salman.arif@arm.com> Reviewers: rengolin, olista01 Reviewed By: olista01 Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D31765 llvm-svn: 299754	2017-04-07 10:13:00 +00:00
Diana Picus	a5bab61a8d	[ARM] GlobalISel: Support frem for 32-bit values Legalize to a libcall. On this occasion, also start allowing soft float subtargets. For the moment G_FREM is the only legal floating point operation for them. llvm-svn: 299753	2017-04-07 09:41:39 +00:00
Alexander Kornienko	434b333afd	[clang-tidy] A couple of minor fixes in modernize-use-using tests llvm-svn: 299752	2017-04-07 09:41:27 +00:00
Peter Smith	99a992343c	[ELF] Fix ARM TLS global dynamic TlsOffsetRel for non-preemtible symbols When the target of the TlsOffsetRel is non-preemptible we can write the offset directly into the GOT without needing a dynamic relocation. This is optional for dynamically linked executables but is required for static linking. This change adds the relocation to the GOT entry and a test case for non-0 offsets so that if we miss out the offset the test won't spuriously pass by virtue of the default value being 0. Differential Revision: https://reviews.llvm.org/D31749 llvm-svn: 299751	2017-04-07 09:37:30 +00:00
Peter Smith	a630d0f01b	[ELF] Split handleNoRelaxTlsRelocation into ARM and Mips specific impls The handleNoRelaxTlsRelocation handled both ARM and Mips as at a high-level the actions of what to do when encountering a local dynamic or global dynamic TLS relocation are the same. However due to Mips using a custom GOT the differences of the implementation are enough that the function became difficult to understand. This change replaces handleNotRelaxTlsRelocation into handleARMTlsRelocation() and handleMipsTlsRelocation() so that the ARM and Mips specific code is isolated. Differential Revision: https://reviews.llvm.org/D31748 llvm-svn: 299750	2017-04-07 09:09:48 +00:00
James Henderson	db664b6c86	Fix signed/unsigned comparison warning in a unit test. This was appearing on lld-x86_64-darwin13. llvm-svn: 299749	2017-04-07 08:48:17 +00:00
James Henderson	8abda20a9f	[Core] Fix parallel_for for Linux r299635 exposed a latent bug in the Linux implementation of parallel_for, which resulted in it calling the function outside of the range requested, resulting later in a segmentation fault. This change fixes this issue and adds a unit test. llvm-svn: 299748	2017-04-07 08:11:28 +00:00
Craig Topper	33e0dbcc58	[InstCombine] Handle more commuted cases of ((A & B) \| ~A) -> (~A \| B) llvm-svn: 299747	2017-04-07 07:32:00 +00:00
Craig Topper	ccf85f24c8	[InstCombine] Add additional tests with varied commuting to show missing combines. NFC llvm-svn: 299746	2017-04-07 07:31:55 +00:00
Craig Topper	60dd9cd8e4	[InstSimplify] Use Instruction::BinaryOps instead of unsigned for a few function operands to remove some casts. NFC llvm-svn: 299745	2017-04-07 05:57:51 +00:00
Akira Hatanaka	6055ae88e9	Revert "Allow a standard library to implement conditional noexcept for optional and unique_ptr hash functions." This reverts commit r299734. llvm-svn: 299744	2017-04-07 05:55:28 +00:00
Shoaib Meenai	17af695f2b	[libc++] Drop support for CRTs older than VS 2015 LLVM dropped support for Visual Studio versions older than 2015 quite some time ago, so I consider it safe to drop libc++'s support for older CRTs. The CRT in Visual Studio 2015 provides a lot of previously missing functions, so targeting it requires less special casing. Differential Revision: https://reviews.llvm.org/D31798 llvm-svn: 299743	2017-04-07 02:20:52 +00:00
Rafael Espindola	dc8eb810db	Move a cast out of a function. NFC. The argument was always casted, so cast it in the caller. llvm-svn: 299742	2017-04-07 01:40:21 +00:00
Daniel Berlin	d952ceae2f	AliasAnalysis: Be less conservative about volatile than atomic. Summary: getModRefInfo is meant to answer the question "what impact does this instruction have on a given memory location" (not even another instruction). Long debate on this on IRC comes to the conclusion the answer should be "nothing special". That is, a noalias volatile store does not affect a memory location just by being volatile. Note: DSE and GVN and memdep currently believe this, because memdep just goes behind AA's back after it says "modref" right now. see line 635 of memdep. Prior to this patch we would get modref there, then check aliasing, and if it said noalias, we would continue. getModRefInfo already has this same AA check, it just wasn't being used because volatile was lumped in with ordering. (I am separately testing whether this code in memdep is now dead except for the invariant load case) Reviewers: jyknight, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31726 llvm-svn: 299741	2017-04-07 01:28:36 +00:00
Rafael Espindola	7f91d44fba	Remove unnecessary cast. llvm-svn: 299740	2017-04-07 01:25:01 +00:00
Reid Kleckner	78495ea7c0	Add missing import llvm-svn: 299739	2017-04-07 01:24:48 +00:00
Reid Kleckner	f6e857c402	[lit] Fix Darwin pickling errors with process pools For a function to be pickle-able, it has to be in the top-level of a real Python module. So, I made one for this code snippet. llvm-svn: 299738	2017-04-07 01:23:15 +00:00
Craig Topper	72a622cac7	[InstCombine] Add more commuted patterns to support folding ((~A & B) \| A) -> (A \| B). llvm-svn: 299737	2017-04-07 00:29:47 +00:00
Derek Schuff	9bb494caf4	[WebAssembly] Fix -Wcovered-switch-default warning llvm-svn: 299736	2017-04-06 23:52:01 +00:00
Billy Robert O'Neal III	8ac1fa16b2	Add noexcept(false) to more strongly indicate that not being noexcept is important for hash tests. llvm-svn: 299735	2017-04-06 23:50:33 +00:00
Billy Robert O'Neal III	eaeeaaf375	Allow a standard library to implement conditional noexcept for optional and unique_ptr hash functions. These tests were unconditionally asserting that optional and unique_ptr declare throwing hashes, but MSVC++ implements conditional noexcept forwarding that of the underlying hash function. As a result we were failing these tests but there's nothing forbidding strengthening noexcept in that way. Changed the ASSERT_NOT_NOEXCEPT asserts to use types which themselves have non-noexcept hash functions. llvm-svn: 299734	2017-04-06 23:50:21 +00:00
Zachary Turner	10169b6d0d	Allow specification of what kinds of class members to dump. Previously when dumping class definitions, there were only two modes - on or off. But it's useful to sometimes get a little more fine-grained. For example, you might only want to see the record layout (for example to look for extraneous padding). This patch adds a third mode, layout mode, which does exactly that. Only this-relative data members are displayed in this mode. Differential Revision: https://reviews.llvm.org/D31794 llvm-svn: 299733	2017-04-06 23:43:39 +00:00
Zachary Turner	63230a4e71	[llvm-pdbdump] Allow pretty to only dump specific types of types. Previously we just had the -types option, which would dump all classes, typedefs, and enums. But this produces a lot of output if you only want to view classes, for example. This patch breaks this down into 3 additional options, -classes, -enums, and -typedefs, and keeps the -types option around which implies all 3 more specific options. Differential Revision: https://reviews.llvm.org/D31791 llvm-svn: 299732	2017-04-06 23:43:12 +00:00
Adrian Prantl	5dcbbbc1c6	Add an end-to-end testcase for address sanitizer. llvm-svn: 299731	2017-04-06 23:36:44 +00:00
Adrian Prantl	37b72b428a	Add a test for debug info with the safestack sanitizer enabled. llvm-svn: 299730	2017-04-06 23:30:51 +00:00
Eugene Zemtsov	5753710248	Add more tests for ExtractContextAndIdentifier llvm-svn: 299729	2017-04-06 23:12:43 +00:00

1 2 3 4 5 ...

259151 Commits All Branches Search

259151 Commits

All Branches