llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	a45da385f8	[X86][AVX] Peek through bitcasts to find the source of broadcasts AVX1 can only broadcast vectors as floats/doubles, so for 256-bit vectors we insert bitcasts if we are shuffling v8i32/v4i64 types. Unfortunately the presence of these bitcasts prevents the current broadcast lowering code from peeking through cases where we have concatenated / extracted vectors to create the 256-bit vectors. This patch allows us to peek through bitcasts as long as the number of elements doesn't change (i.e. element bitwidth is the same) so the broadcast index is not affected. Note this bitcast peek is different from the stage later on which doesn't care about the type and is just trying to find a load node. Differential Revision: http://reviews.llvm.org/D21660 llvm-svn: 273848	2016-06-27 07:44:32 +00:00
Rafael Espindola	1ac1fa818e	Mips: Fix access to private functions. llvm-svn: 273843	2016-06-27 03:19:40 +00:00
Rafael Espindola	e715172791	Use isPositionIndependent. NFC. llvm-svn: 273829	2016-06-26 22:32:53 +00:00
Rafael Espindola	405e25a970	Use isPositionIndependent predicate. NFC. llvm-svn: 273827	2016-06-26 22:24:01 +00:00
Rafael Espindola	ae0d866f56	Refactor a duplicated predicate. NFC. llvm-svn: 273826	2016-06-26 22:13:55 +00:00
Craig Topper	8f577fd5b5	[X86] Rewrite lowerVectorShuffleWithPSHUFB to not require a ZeroableMask to be created. We can do everything with the starting mask and zeroable bit vector. This removes the last usage of isSingleInputShuffleMask. NFC llvm-svn: 273804	2016-06-26 05:10:56 +00:00
Craig Topper	8bba749a48	[X86] Replace calls to isSingleInputShuffleMask with just checking if V2 is UNDEF. Canonicalization and creation of shuffle vector ensures this is equivalent. llvm-svn: 273803	2016-06-26 05:10:53 +00:00
Craig Topper	9a2e979b3d	[X86] Convert ==/!= comparisons with -1 for checking undef in shuffle lowering to comparisons of <0 or >=0. While there do the same for other kinds of index checks that can just check for greater than 0. No functional change intended. llvm-svn: 273788	2016-06-25 19:05:29 +00:00
Craig Topper	53a39d1a63	[X86] Pull similar bitcasts on different paths to earlier shared point. NFC llvm-svn: 273787	2016-06-25 19:05:23 +00:00
Jan Vesely	3bc1af2be4	AMDGPU/R600: Fix GlobalValue regressions. Don't cast GV expression to MCSymbolRefExpr. r272705 changed GV to binary expressions by including offset even if the offset is 0 (we haven't hit this sooner since tested workloads don't include static offsets). We don't really care about the type of expression, so set it directly. Fixes: r272705 Consider section relative relocations. Since all const as data is in one buffer, section relative is equivalent to abs32. Fixes: r273166 Differential Revision: http://reviews.llvm.org/D21633 llvm-svn: 273785	2016-06-25 18:24:16 +00:00
Konstantin Zhuravlyov	f2f3d14774	[AMDGPU] Emit debugger prologue and emit the rest of the debugger fields in the kernel code header Debugger prologue is emitted if -mattr=+amdgpu-debugger-emit-prologue. Debugger prologue writes work group IDs and work item IDs to scratch memory at fixed location in the following format: - offset 0: work group ID x - offset 4: work group ID y - offset 8: work group ID z - offset 16: work item ID x - offset 20: work item ID y - offset 24: work item ID z Set - amd_kernel_code_t::debug_wavefront_private_segment_offset_sgpr to scratch wave offset reg - amd_kernel_code_t::debug_private_segment_buffer_sgpr to scratch rsrc reg - amd_kernel_code_t::is_debug_supported to true if all debugger features are enabled Differential Revision: http://reviews.llvm.org/D20335 llvm-svn: 273769	2016-06-25 03:11:28 +00:00
Tom Stellard	b164a9843b	AMDGPU/SI: Make sure not to fold offsets into local address space globals Summary: Offset folding only works if you are emitting relocations, and we don't emit relocations for local address space globals. Reviewers: arsenm, nhaustov Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D21647 llvm-svn: 273765	2016-06-25 01:59:16 +00:00
Matthias Braun	1e374a7aa6	AMDGPU: Define a schedule class for COPY. COPY was lacking a scheduling class, define it to avoid regressions in the upcoming change to the bidirectional MachineScheduler. Approved by tstellar on IRC. Differential Revision: http://reviews.llvm.org/D21540 llvm-svn: 273751	2016-06-24 23:52:11 +00:00
Rafael Espindola	a686f12705	Simplify. NFC. Also delete out of date comment. This code was always returning .data since r253436. llvm-svn: 273739	2016-06-24 22:19:54 +00:00
Krzysztof Parzyszek	709a626015	[Hexagon] Simplify (+fix) instruction selection for indexed loads/stores llvm-svn: 273733	2016-06-24 21:27:17 +00:00
Rafael Espindola	a895a0cd01	Add support for musl-libc on ARM Linux. Patch by Lei Zhang! llvm-svn: 273726	2016-06-24 21:14:33 +00:00
Ahmed Bougacha	241e74cbc2	[ARM] Remove dead SDNodes. NFC. The opcodes are used, but only by DAG->DAG. llvm-svn: 273717	2016-06-24 20:38:00 +00:00
Ahmed Bougacha	0851ecd1b0	[X86] Remove dead ISD opcodes. NFC. llvm-svn: 273716	2016-06-24 20:37:55 +00:00
Evandro Menezes	3830479f41	[AArch64] Adjust the model for the vector by element FP multiplies on Exynos M1. (NFC) llvm-svn: 273708	2016-06-24 18:58:54 +00:00
Rafael Espindola	f092cc8a14	Use existing predicate. NFC. This doesn't handle ELF, but neither did the previous code. llvm-svn: 273677	2016-06-24 13:28:26 +00:00
Rafael Espindola	01cdf31cab	Merge two identical if branches. NFC. llvm-svn: 273674	2016-06-24 13:08:06 +00:00
Rafael Espindola	41d308689c	Merge two identical if branches. NFC. llvm-svn: 273673	2016-06-24 13:05:20 +00:00
Rafael Espindola	ce37f03273	clang-format a region. NFC. llvm-svn: 273672	2016-06-24 12:58:25 +00:00
Matt Arsenault	86de486d31	AMDGPU: Add stub custom CodeGenPrepare pass This will do various things including ones CodeGenPrepare does, but with knowledge of uniform values. llvm-svn: 273657	2016-06-24 07:07:55 +00:00
Matt Arsenault	c581611e11	AMDGPU: Remove disable-irstructurizer subtarget feature The only real reason to use it is for testing, so replace it with a command line option instead of a potentially function dependent feature. llvm-svn: 273653	2016-06-24 06:30:22 +00:00
Matt Arsenault	43e92fe306	AMDGPU: Cleanup subtarget handling. Split AMDGPUSubtarget into amdgcn/r600 specific subclasses. This removes most of the static_casting of the basic codegen classes everywhere, and tries to restrict the features visible on the wrong target. llvm-svn: 273652	2016-06-24 06:30:11 +00:00
David Majnemer	d770877328	Switch more loops to be range-based This makes the code a little more concise, no functional change is intended. llvm-svn: 273644	2016-06-24 04:05:21 +00:00
Craig Topper	024402dcdf	[X86] Combine two nearby calls to isSingleInputShuffleVector. NFC llvm-svn: 273643	2016-06-24 03:06:11 +00:00
Ahmed Bougacha	f0b46ee0aa	[ARM] Use aapcs_vfp for ___truncdfhf2 on v7k. r215348 overrode the f16 libcalls to be soft-float, but v7k uses the default (hard-float) calling convention. llvm-svn: 273631	2016-06-24 00:08:01 +00:00
Evandro Menezes	62c70101c3	[AArch64] Model the cost of vector by element FP multiplies on Exynos M1. (NFC) llvm-svn: 273630	2016-06-23 23:43:23 +00:00
Tom Stellard	14416ae6cd	Support/ELF: Add R_AMDGPU_GOTPCREL relocation Summary: We will start generating this in a future patch. Reviewers: arsenm, kzhuravl, rafael, ruiu, tony-tye Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D21482 llvm-svn: 273628	2016-06-23 23:11:29 +00:00
Kyle Butt	991df7889b	Codegen: [X86] preserve memory refs for folded umul_lohi Memory references were not being propagated for this folded load. This prevented optimizations like LICM from hoisting the load. Added test to verify that this allows LICM to proceed. llvm-svn: 273617	2016-06-23 21:40:35 +00:00
Rafael Espindola	2d3cce71ee	Uses shouldAssumeDSOLocal. With that SystemZ knows to avoid a GOT for PIE. llvm-svn: 273614	2016-06-23 21:18:59 +00:00
Rafael Espindola	65787a9e01	Refactor to use shouldAssumeDSOLocal. NFC. llvm-svn: 273612	2016-06-23 20:50:42 +00:00
Matt Arsenault	8d4b0eddd6	AMDGPU: Add option to disable spilling SGPRs to VGPRs. This can help debug spilling problems. llvm-svn: 273605	2016-06-23 20:00:34 +00:00
Rafael Espindola	53fd425e06	Refactor duplicated code. NFC. llvm-svn: 273595	2016-06-23 18:43:06 +00:00
Michael Kuperstein	0194d30e09	[X86] Extract HiPE prologue constants into metadata X86FrameLowering::adjustForHiPEPrologue() contains a hard-coded offset into an Erlang Runtime System-internal data structure (the PCB). As the layout of this data structure is prone to change, this poses problems for maintaining compatibility. To address this problem, the compiler can produce this information as module-level named metadata. For example (where P_NSP_LIMIT is the offending offset): !hipe.literals = !{ !2, !3, !4 } !2 = !{ !"P_NSP_LIMIT", i32 152 } !3 = !{ !"X86_LEAF_WORDS", i32 24 } !4 = !{ !"AMD64_LEAF_WORDS", i32 24 } Patch by Magnus Lang Differential Revision: http://reviews.llvm.org/D20363 llvm-svn: 273593	2016-06-23 18:17:25 +00:00
Reid Kleckner	8f4bd1fdf2	Fix the wasm build by including EndianStream.h llvm-svn: 273591	2016-06-23 18:12:31 +00:00
Nirav Dave	bfdb483755	Preserve DebugInfo when replacing values in DAGCombiner Recommitting after correcting over-eager Debug Value transfer fixing PR28270. [DAG] Previously debug values would transfer debuginfo for the selected start node for a replacement which allows for debug to be dropped. Push debug value transfer to occur with node/value replacement in SelectionDAG, remove now extraneous transfers of debug values. This refixes PR9817 which was being incompletely checked in the testsuite. Reviewers: jyknight Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D21037 llvm-svn: 273585	2016-06-23 17:52:57 +00:00
Pablo Barrio	7a64346533	[ARM] Lower (select_cc k k (select_cc ~k ~k x)) into (SSAT l_k x) Summary: SSAT saturates an integer, making sure that its value lies within an interval [-k, k]. Since the constant is given to SSAT as the number of bytes set to one, k + 1 must be a power of 2, otherwise the optimization is not possible. Also, the select_cc must use < and > respectively so that they define an interval. Reviewers: mcrosier, jmolloy, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D21372 llvm-svn: 273581	2016-06-23 16:53:49 +00:00
Hans Wennborg	a8b7d4f73f	Revert r273567 "[SystemZ] Let z13 also support FeatureMiscellaneousExtensions." It broke test/CodeGen/SystemZ/vec-extract-02.ll llvm-svn: 273575	2016-06-23 16:13:26 +00:00
Jonas Paulsson	b1a2b5a708	[SystemZ] Let z13 also support FeatureMiscellaneousExtensions. This processor feature had been left out by mistake from the z13 ProcessorModel. Reviewed by Ulrich Weigand. llvm-svn: 273567	2016-06-23 15:12:06 +00:00
Valery Pykhtin	a852d695b8	[AMDGPU] Enable absolute expression initializer for amd_kernel_code_t fields. Differential Revision: http://reviews.llvm.org/D21380 llvm-svn: 273561	2016-06-23 14:13:06 +00:00
Daniel Sanders	de393329b9	[mips] Don't derive the default ABI from the CPU in the backend. Summary: The backend has no reason to behave like a driver and should generally do as it's told (and error out if it can't) instead of trying to figure out what the API user meant. The default ABI is still derived from the arch component as a concession to backwards compatibility. API-users that previously passed an explicit CPU and a triple that was inconsistent with the CPU (e.g. mips-linux-gnu and mips64r2) may get a different ABI to what they got before. However, it's expected that there are no such users on the basis that CodeGen has been asserting that the triple is consistent with the selected ABI for several releases. API-users that were consistent or passed '' or 'generic' as the CPU will see no difference. Reviewers: sdardis, rafael Subscribers: rafael, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D21466 llvm-svn: 273557	2016-06-23 12:42:53 +00:00
Diana Picus	eb3dd14b95	[ARM] Use member initializers in ARMSubtarget. NFCI Move most of the initializations in ARMSubtarget::initializeEnvironment to member initializers. Change suggested by Matthias Braun (see http://reviews.llvm.org/D21432). llvm-svn: 273556	2016-06-23 12:04:33 +00:00
Daniel Sanders	8e17bea7d5	[mips][ias] Integers are not registers. Summary: When parseAnyRegister() encounters a symbol alias, it parses integers and adds a corresponding expression to the operand list. This is clearly wrong since the only operands that parseAnyRegister() should be accepting are registers. It's not clear why this code was added and there are no test cases that cover it. I think it might be leftover from when searchSymbolAlias() was more widely used. Reviewers: sdardis Subscribers: dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D21377 llvm-svn: 273555	2016-06-23 10:54:09 +00:00
Diana Picus	e440f99913	[AMDGPU] Remove exit-on-error in test (PR27761) The exit-on-error flag was necessary in order to avoid an assertion when handling DYNAMIC_STACKALLOC nodes in SelectionDAGLegalize. We can avoid the assertion by creating some dummy nodes. This enables us to remove the exit-on-error flag on the first 2 run lines (SI), but on the third run line (R600) we would run into another assertion when trying to reserve indirect registers. This patch also replaces that assertion with an early exit from the function. Fixes PR27761. Differential Revision: http://reviews.llvm.org/D20852 llvm-svn: 273550	2016-06-23 09:19:16 +00:00
Simon Dardis	724e530296	[mips] Fix dext/dins definitions dext and dins, along with their 'm' and 'u' variants are defined in mips64r2, not mips64. Reviewers: dsanders, vkalintiris Differential Review: http://reviews.llvm.org/D21608 llvm-svn: 273549	2016-06-23 09:06:20 +00:00
Diana Picus	c5baa43f53	[ARM] Do not test for CPUs, use SubtargetFeatures (Part 1). NFCI This is a cleanup commit similar to r271555, but for ARM. The end goal is to get rid of the isSwift / isCortexXY / isWhatever methods. Since the ARM backend seems to have quite a lot of calls to these methods, I intend to submit 5-6 subtarget features at a time, instead of one big lump. Differential Revision: http://reviews.llvm.org/D21432 llvm-svn: 273544	2016-06-23 07:47:35 +00:00
Craig Topper	597aa42fec	[AVX512] Remove masked unpack intrinsics and autoupgrade to vectorshuffle and selects. llvm-svn: 273543	2016-06-23 07:37:33 +00:00
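
The [ARM] SSAT entry above (llvm-svn: 273581) describes a pair of nested selects that clamp a value between a constant k and its complement ~k, and notes that the rewrite is only possible when k + 1 is a power of 2. The following is a minimal C++ sketch of that condition and of the clamp itself. It is not the code from the patch: the names isSaturationBound and clampToSignedRange are hypothetical, and the bounds [~k, k] are taken from the commit title rather than from the backend source.

```cpp
#include <cstdint>

// Hypothetical helper (not from the LLVM patch): returns true when k is a
// positive constant such that k + 1 is a power of two, which is the
// precondition the commit message states for the SSAT rewrite.
constexpr bool isSaturationBound(int64_t k) {
  // If k + 1 is a power of two, k has the form 0b0...01...1, so (k + 1) and k
  // share no set bits.
  return k > 0 && ((k + 1) & k) == 0;
}

// Hypothetical reference model of the nested select pattern: values above k
// become k, values below ~k become ~k, everything else passes through.
// For example, with k = 127 the lower bound ~k is -128.
constexpr int64_t clampToSignedRange(int64_t x, int64_t k) {
  return x > k ? k : (x < ~k ? ~k : x);
}
```

For example, k = 127 satisfies the precondition (127 + 1 = 128), while k = 100 does not, so under this model only the former pattern would be a candidate for a single saturating instruction.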

38071 Commits