llvm-project

History

Scott Douglass 953f908173 [ARM] Modify codegen for memcpy intrinsic to prefer LDM/STM. We were previously codegen'ing memcpy as regular load/store operations and hoping that the register allocator would allocate registers in ascending order so that we could apply an LDM/STM combine after register allocation. According to the commit that first introduced this code (r37179), we planned to teach the register allocator to allocate the registers in ascending order. This never got implemented, and up to now we've been stuck with very poor codegen. A much simpler approach for achieving better codegen is to create MEMCPY pseudo instructions, attach scratch virtual registers to them and then, post register allocation, expand the MEMCPYs into LDM/STM pairs using the scratch registers. The register allocator will have picked arbitrary registers which we sort when expanding the MEMCPY. This approach also avoids the need to repeatedly calculate offsets which ultimately ought to be eliminated pre-RA in order to decrease register pressure. Fixes PR9199 and PR23768. [This is based on Peter Collingbourne's r238473 which was reverted.] Differential Revision: http://reviews.llvm.org/D13239 Change-Id: I727543c2e94136e0f80b8e22d5642d7b9ee5b458 Author: Peter Collingbourne <peter@pcc.me.uk> llvm-svn: 249322		2015-10-05 14:49:54 +00:00
..
AArch64	[AArch64] Remove an unnecessary run line and other cleanup. NFC.	2015-09-30 21:10:02 +00:00
AMDGPU	AMDGPU/SI: Remove assert from AMDGPUOpenCLImageTypeLowering pass	2015-10-01 21:16:05 +00:00
ARM	[ARM] Modify codegen for memcpy intrinsic to prefer LDM/STM.	2015-10-05 14:49:54 +00:00
BPF	[bpf] rename triple names bpf_be -> bpfeb	2015-06-05 16:11:14 +00:00
CPP	Fix CPP Backend for GEP API changes for opaque pointer types	2015-09-08 18:42:29 +00:00
Generic	Make the default triple optional by allowing an empty string	2015-09-16 05:34:32 +00:00
Hexagon	Update edge weights properly when merging blocks in if-conversion.	2015-09-18 20:22:41 +00:00
Inputs	DI: Require subprogram definitions to be distinct	2015-08-28 20:26:49 +00:00
MIR	Fix PR 24724 - The implicit register verifier shouldn't assume certain operand	2015-09-10 14:04:34 +00:00
MSP430	…
Mips	[mips][microMIPS] Fix an issue with selecting sqrt instruction in LLVM backend	2015-10-02 13:06:02 +00:00
NVPTX	[NVPTX] Let NVPTX backend detect integer min and max patterns.	2015-08-26 23:22:02 +00:00
PowerPC	[PowerPC] Disable shrink wrapping	2015-09-30 17:29:03 +00:00
SPARC	[SPARC] Switch to the Machine Scheduler.	2015-09-10 21:49:06 +00:00
SystemZ	[SystemZ] Add some generic (floating point support) load instructions.	2015-10-01 18:12:28 +00:00
Thumb	[ARM] Modify codegen for memcpy intrinsic to prefer LDM/STM.	2015-10-05 14:49:54 +00:00
Thumb2	[ARM][NEON] Use address space in vld([1234]\|[234]lane) and vst([1234]\|[234]lane) instructions	2015-09-30 10:56:37 +00:00
WebAssembly	[WebAssembly] Implement the remaining conversion operations.	2015-10-03 02:10:28 +00:00
WinEH	[WinEH] Teach AsmPrinter about funclets	2015-09-29 20:12:33 +00:00
X86	[X86][SSE4A] Added shuffle decode tests for 'special case' SSE4A EXTRQI/INSERTQI ops.	2015-10-04 10:12:53 +00:00
XCore	[opaque pointer type] Add textual IR support for explicit type parameter for global aliases	2015-09-11 03:22:04 +00:00