llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	747bf8afa8	AMDGPU: Re-use TM.getNullPointerValue llvm-svn: 297662	2017-03-13 20:18:14 +00:00
Rafael Espindola	d31f04b319	Bring back r297624. The issue was just a missing REQUIRES in the test. llvm-svn: 297661	2017-03-13 20:00:25 +00:00
Matt Arsenault	971c85ebb4	AMDGPU: Treat 0 as private null pointer in addrspacecast lowering llvm-svn: 297658	2017-03-13 19:47:31 +00:00
Rafael Espindola	3978b877d7	Revert "Fix crash when multiple raw_fd_ostreams to stdout are created." This reverts commit r297624. It was failing on the bots. llvm-svn: 297657	2017-03-13 19:38:32 +00:00
Jessica Paquette	c984e21394	[Outliner] Add tail call support This commit adds tail call support to the MachineOutliner pass. This allows the outliner to insert jumps rather than calls in areas where tail calling is possible. Outlined tail calls include the return or terminator of the basic block being outlined from. Tail call support allows the outliner to take returns and terminators into consideration while finding candidates to outline. It also allows the outliner to save more instructions. For example, in the X86-64 outliner, a tail called outlined function saves one instruction since no return has to be inserted. llvm-svn: 297653	2017-03-13 18:39:33 +00:00
Craig Topper	616641632e	[X86] Lower AVX2 gather intrinsics similar to AVX-512. Apply the same input source optimizations to break execution dependencies. For AVX-512 we force the input to zero if the input is undef or the mask is all ones to break an execution dependency. This patch brings the same behavior to AVX2. llvm-svn: 297652	2017-03-13 18:34:46 +00:00
Craig Topper	eb7ea28bdd	[AVX-512] If gather mask is all ones, force the input to a zero vector. We were already forcing undef inputs to become a zero vector, this now catches an all ones mask too. Ideally we'd use undef and let execution dep fix handle picking the best register/clearance for the undef, but I don't think it can handle the early clobber today. llvm-svn: 297651	2017-03-13 18:17:46 +00:00
Matt Arsenault	d81f557fe2	AMDGPU: Fold icmp/fcmp into icmp intrinsic The typical use is a library vote function which compares to 0. Fold the user condition into the intrinsic. llvm-svn: 297650	2017-03-13 18:14:02 +00:00
Jonas Devlieghere	5eb9c81d82	[Linker] Provide callback for internalization Differential Revision: https://reviews.llvm.org/D30738 llvm-svn: 297649	2017-03-13 18:08:11 +00:00
Adrian Prantl	140a8569ce	API gardening: Rename FindAllocaDbgValue to findDbgValue (NFC) and have it use SmallVectorImpl. There is nothing specific about allocas in this function. llvm-svn: 297643	2017-03-13 17:20:47 +00:00
Zachary Turner	407dec59a4	[llvm-pdbdump] Add support for dumping symbols from Yaml -> PDB. Previously we could round-trip type records from PDB -> Yaml -> PDB, but for symbols we could only go from PDB -> Yaml. This completes the round-tripping for symbols as well. llvm-svn: 297625	2017-03-13 14:57:45 +00:00
Rafael Espindola	82d55239ea	Fix crash when multiple raw_fd_ostreams to stdout are created. If raw_fd_ostream is constructed with the path of "-", it claims ownership of the stdout file descriptor. This means that it closes stdout when it is destroyed. If there are multiple users of raw_fd_ostream wrapped around stdout, then a crash can occur because of operations on a closed stream. An example of this would be running something like "clang -S -o - -MD -MF - test.cpp". Alternatively, using outs() (which creates a local version of raw_fd_stream to stdout) anywhere combined with such a stream usage would cause the crash. The fix duplicates the stdout file descriptor when used within raw_fd_ostream, so that only that particular descriptor is closed when the stream is destroyed. Patch by James Henderson! llvm-svn: 297624	2017-03-13 14:45:06 +00:00
Diana Picus	94db2e288b	[ARM] GlobalISel: Support SP in regbankselect We used to hit an unreachable in getRegBankFromRegClass when dealing with the stack pointer. This commit adds support for the GPRsp reg class. llvm-svn: 297621	2017-03-13 14:28:34 +00:00
Aaron Ballman	345012dfa0	Reverting r297617 because it broke some bots: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/49970 llvm-svn: 297618	2017-03-13 12:24:51 +00:00
Aaron Ballman	f5cba91591	Add support for getting file system permissions and implement sys::fs::permissions to set them. Patch by James Henderson. llvm-svn: 297617	2017-03-13 12:17:14 +00:00
Balaram Makam	cacc08bb46	[AArch64] Map Sched Read/Write resources for Falkor. llvm-svn: 297611	2017-03-13 10:42:17 +00:00
Gil Rapaport	00cb43908c	[LV] Set memcheck metadata also for VF==1 This commit is a follow-up on r297580. It fixes the FIXME added temporarily by that commit to keep the removal of Unroller's specialized version of scalarizeInstruction() an NFC. See https://reviews.llvm.org/D30715 for details. llvm-svn: 297610	2017-03-13 10:23:46 +00:00
Sjoerd Meijer	aea3a990a2	ARMDisassembler: loop over ARM decode tables Loop over the ARM decode tables; this is a clean-up to reduce some code duplication. Differential Revision: https://reviews.llvm.org/D30814 llvm-svn: 297608	2017-03-13 09:41:10 +00:00
Craig Topper	48ba1e2d66	[AVX-512] Add VEX_WIG to VEX vcvtsd2ss/vcvtss2sd intrinsic instructions so they can be correctly matched by EVEX2VEX table generation. llvm-svn: 297601	2017-03-13 05:14:47 +00:00
Craig Topper	08b413acf2	[AVX-512] Use sse_loadf32/f64 for vcvtss2sd and vcvtsd2ss intrinsic patterns. llvm-svn: 297600	2017-03-13 05:14:44 +00:00
Craig Topper	5a63ca2ad2	[AVX-512] Use sse_load_f64/f32 in VCVTSS2SI/VCVTSD2SI patterns. llvm-svn: 297599	2017-03-13 03:59:06 +00:00
Craig Topper	111b2d6997	[X86] Remove unused SDTypeProfile. NFC llvm-svn: 297594	2017-03-12 23:05:03 +00:00
Craig Topper	2b92542908	[X86] Lower SSE/AVX cmpps/pd intrinsics directly to X86ISD::CMPP SDNodes. This allows us to remove a duplicate set of patterns. llvm-svn: 297593	2017-03-12 23:05:00 +00:00
Craig Topper	7d56c8315b	[AVX-512] Fix the valid immediates for the scatter/gather prefetch intrinsics. The immediate should be 1 or 2, not 0 or 1. This was found while adding bounds checking to clang. In fact the existing clang builtin test failed if we ran it all the way to assembly. llvm-svn: 297591	2017-03-12 22:29:12 +00:00
Sanjay Patel	f06b963a2b	[x86] don't blindly transform SETB into SBB I noticed unnecessary 'sbb' instructions in D30472 and while looking at 'ptest' codegen recently. This happens because we were transforming any 'setb' - even when we only wanted a single-bit result. This patch moves those transforms under visitAdd/visitSub, so we're only creating sbb/adc when it is a win. I don't know why we need a SETCC_CARRY node type, but I'm not proposing to change that existing behavior in this patch. Also, I'm skeptical that sbb/adc are a win for all micro-arches, so I added comments to the test files where this transform still fires. The test changes here are all cases where we no longer produce sbb/adc. Avoiding partial register stalls (generating an xor to clear a register) is not handled in some cases, but that's a separate issue. Differential Revision: https://reviews.llvm.org/D30611 llvm-svn: 297586	2017-03-12 18:28:48 +00:00
Anna Thomas	a10e3e4c34	[LVI] Add Datalayout to the class LazyValueInfo since all its Impls require it. NFC llvm-svn: 297583	2017-03-12 14:06:41 +00:00
Azharuddin Mohammed	473b75c3d5	Remove CRC32 instructions from AArch64InstrInfo::hasShiftedReg Summary: A53 scheduler causes an assertion failure on all CRC instructions: include/llvm/CodeGen/MachineInstr.h:280: const llvm::MachineOperand &llvm::MachineInstr::getOperand(unsigned int) const: Assertion `i < getNumOperands() && "getOperand() out of range!"' failed. The case statements corresponding to CRC instructions are incorrect and should be removed. Also adding a testcase while on this. Reviewers: t.p.northover, javed.absar, apazos, rengolin Reviewed By: rengolin Subscribers: evandro, aemerson, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D30274 llvm-svn: 297582	2017-03-12 14:02:32 +00:00
Gil Rapaport	a1e5a37d3f	[LV] A unified scalarizeInstruction() for Vectorizer and Unroller; NFC Unroller's specialized scalarizeInstruction() is mostly duplicating Vectorizer's variant. OTOH Vectorizer's scalarizeInstruction() already supports the special case of VF==1 except for avoiding mask-bit extraction in that case. This patch removes Unroller's specialized version in favor of a unified method. The only functional difference between the two variants seems to be setting memcheck metadata for loads and stores only in Vectorizer's variant, which is a bug in Unroller. To keep this patch an NFC the unified method doesn't set memcheck metadata for VF==1. Differential Revision: https://reviews.llvm.org/D30715 llvm-svn: 297580	2017-03-12 12:31:38 +00:00
Ayal Zaks	09cf3121d8	Test commit. llvm-svn: 297579	2017-03-12 09:48:06 +00:00
Daniel Berlin	64e689938d	Split NewGVN class into a legacy pass and an impl, instead of a merged class. llvm-svn: 297576	2017-03-12 04:46:45 +00:00
Craig Topper	58647b16e5	[AVX-512] Fix a bad use of a high GR8 register after copying from a mask register during fast isel. This ends up extracting from bits 15:8 instead of the lower bits of the mask. I'm pretty sure there are more problems lurking here. But I think this fixes PR32241. I've added the test case from that bug and added asserts that will fail if we ever try to copy between high registers and mask registers again. llvm-svn: 297574	2017-03-12 03:37:37 +00:00
Craig Topper	6ab5edfa73	[AVX-512] Remove unused field in X86VectorVTInfo tablegen class. llvm-svn: 297572	2017-03-12 03:37:32 +00:00
Simon Pilgrim	18debfa5b4	[X86][SSE] Improve extraction of elements from v16i8 (pre-SSE41) Without SSE41 (pextrb) we currently extract byte elements from a vector by spilling to stack and reloading the byte. This patch is an initial attempt at using MOVD/PEXTRW to extract the relevant DWORD/WORD from the vector and then shift+truncate to collect the correct byte. Extraction of multiple bytes this way would result in code bloat, but as explained in the patch we could probably afford to be more aggressive with the supported extractions before again falling back on spilling - possibly through counting the number of extracts and which DWORD/WORD they originate? Differential Revision: https://reviews.llvm.org/D29841 llvm-svn: 297568	2017-03-11 20:42:31 +00:00
Simon Pilgrim	9ff5732c92	Remove unnecessary whitespace. llvm-svn: 297567	2017-03-11 20:23:59 +00:00
Craig Topper	02b463270c	[X86] Remove unnecessary commented out code. NFC llvm-svn: 297563	2017-03-11 18:25:56 +00:00
Simon Pilgrim	fa97699d09	Fix -Wsentinel warning llvm-svn: 297560	2017-03-11 12:56:02 +00:00
Amaury Sechet	d1ec5d54cf	Use setBits in SelectionDAG Summary: As per title. Reviewers: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30836 llvm-svn: 297559	2017-03-11 11:24:03 +00:00
Matt Arsenault	dd905b0e9b	AMDGPU: Remove packf16 intrinsic llvm-svn: 297557	2017-03-11 05:51:16 +00:00
Matt Arsenault	3cb9ff8863	AMDGPU: Keep track of modifiers when converting v_mac to v_mad Since v_max_f32_e64/v_max_f16_e64 can be folded if the target instruction supports the clamp bit, we also need to maintain modifiers when converting v_mac to v_mad. This fixes a rendering issue with Dirt Rally because a v_mac instruction with the clamp bit set was converted to a v_mad but that bit was lost during the conversion. Fixes: e184e01dd79 ("AMDGPU: Fold FP clamp as modifier bit") Patch by Samuel Pitoiset <samuel.pitoiset@gmail.com> llvm-svn: 297556	2017-03-11 05:40:40 +00:00
Kostya Serebryany	d481e1c361	[libFuzzer] add more iterations to LLVMFuzzer-Memcmp64BytesTest llvm-svn: 297554	2017-03-11 05:14:49 +00:00
Kostya Serebryany	5dfa9642a8	[libFuzzer] reduce the number of vector resizes during merge (https://github.com/google/oss-fuzz/issues/445 ) llvm-svn: 297551	2017-03-11 02:50:47 +00:00
Kostya Serebryany	81d1744519	[libFuzzer] print how much memory is consumed by the outer merge process (https://github.com/google/oss-fuzz/issues/445 ) llvm-svn: 297546	2017-03-11 02:26:20 +00:00
Kostya Serebryany	b6b2f18ea8	[libFuzzer] add test/LargeTest.cpp, mostly for manual experiments with large number of edges, not yet suitable for unit testing llvm-svn: 297544	2017-03-11 01:54:06 +00:00
Kostya Serebryany	7939ee4d51	[libFuzzer] remove fuzzer-jobs.test which is flaky and not very useful llvm-svn: 297543	2017-03-11 01:48:54 +00:00
Daniel Berlin	14f5c47c1d	Remove opt-bisect support for "cases" in favor of debug counters Summary: The "cases" support was not quite finished, is unused, and is really just debug counters. (well, almost, debug counters are slightly more powerful, in that they can skip things at the start, too). Note, opt-bisect itself could also be implemented as a wrapper around debug counters, but not sure it's worth it ATM. I'll shove it on a todo list if we think it is. Reviewers: MatzeB, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30856 llvm-svn: 297542	2017-03-11 01:41:03 +00:00
Sanjoy Das	3f1e8e0102	Use a WeakVH for UnknownInstructions in AliasSetTracker Summary: This change solves the same problem as D30726, except that this only throws out the bathwater. AST was not correctly tracking and deleting UnknownInstructions via handles. The existing code only tracks "pointers" in its `ASTCallbackVH`, so an UnknownInstruction (that isn't also def'ing a pointer used by another memory instruction) never gets a `ASTCallbackVH`. There are two other ways to solve this problem: - Use the `PointerRec` scheme for both known and unknown instructions. - Use a `CallbackVH` that erases the offending Instruction from the UnknownInstruction list. Both of the above changes seemed to be significantly (and unnecessarily IMO) more complex than this. Reviewers: chandlerc, dberlin, hfinkel, reames Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D30849 llvm-svn: 297539	2017-03-11 01:15:48 +00:00
Daniel Berlin	cd07a0f685	VNCoercion: Make the function signatures all consistent llvm-svn: 297537	2017-03-11 00:51:01 +00:00
Stanislav Mekhanoshin	79da2a7698	[AMDGPU] Remove getBidirectionalReasonRank This method inverts the Reason field of a scheduling candidate. It does right comparison between RegCritical and RegExcess, but everything else is broken. In fact it can prefer less strong reason such as Weak over RegCritical because Weak > -RegCritical. The CandReason enum is properly sorted, so just remove artificial ranking. Differential Revision: https://reviews.llvm.org/D30557 llvm-svn: 297536	2017-03-11 00:29:27 +00:00
Quentin Colombet	ee8a4f51c4	[IRTranslator] Simplify error handling for translating constants. NFC. We don't need to check whether the fallback path is enabled to return false. Just do that all the time on error cases, the caller knows (or at least should know!) how to handle the failing case. llvm-svn: 297535	2017-03-11 00:28:33 +00:00
Stanislav Mekhanoshin	b546174b0e	Fix subreg value numbers in handleMoveUp The problem can occur in presence of subregs. If we are swapping two instructions defining different subregs of the same register we will get a new liveout from a block. We need to preserve value number for block's liveout for successor block's livein to match. Differential Revision: https://reviews.llvm.org/D30558 llvm-svn: 297534	2017-03-11 00:14:52 +00:00

100587 Commits