llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	08a6857c82	[AVX512] Fix copy/paste mistake a I made in a comment. llvm-svn: 270331	2016-05-21 22:50:04 +00:00
Michael Zuckerman	a63a129749	[Clang][AVX512][intrinsics] Fix rcp and sqrt intrinsics. Differential Revision: http://reviews.llvm.org/D20438 llvm-svn: 270322	2016-05-21 14:44:18 +00:00
Michael Zuckerman	11b55b29d1	[Clang][AVX512][intrinsics] Fix vscalef intrinsics. Differential Revision: http://reviews.llvm.org/D20324 llvm-svn: 270321	2016-05-21 11:09:53 +00:00
Craig Topper	02626c076b	[AVX512] Add patterns for VEXTRACT v16i16->v8i16 and v32i8->v16i8. Disable AVX2 versions of vector extract when AVX512VL is enabled. llvm-svn: 270318	2016-05-21 07:08:56 +00:00
Craig Topper	22ae353207	[AVX512] Disable AVX2 VPERMD, VPERMQ, VPERMPS, and VPERMPD patterns when AVX512VL is enabled. Also add shuffle comment printing for AVX512VL VPERMPD/VPERMQ to keep some tests that now use these instructions instead of the AVX2 ones. llvm-svn: 270317	2016-05-21 06:07:18 +00:00
Craig Topper	6be70deda3	[AVX512] Disable AVX/AVX2 VBROADCASTSS/VBROADCASTSD patterns when AVX512VL is enabled. llvm-svn: 270316	2016-05-21 05:47:25 +00:00
Matt Arsenault	7f9eabd2c2	AMDGPU: Define priorities for register classes Allocating larger register classes first should give better allocation results (and more importantly for myself, make the lit tests more stable with respect to scheduler changes). Patch by Matthias Braun llvm-svn: 270312	2016-05-21 03:55:07 +00:00
Craig Topper	97565ded80	[AVX512] Disable AVX/AVX2 patterns for VPSADBW and VPMULUDQ when the AVX512VL/AVX512BWI equivalents are available. llvm-svn: 270311	2016-05-21 03:52:32 +00:00
Craig Topper	b395105584	[X86] Convert some SSE2/AVX2 intrinsics to ISD opcodes during lowering instead of pattern matching the intrinsics. This unifies handling with AVX512 and allows these intrinsics to select EVEX encoded instructions to increase available registers. llvm-svn: 270310	2016-05-21 03:52:28 +00:00
Matt Arsenault	71e6676169	AMDGPU: Cleanup lowering actions These are kind of a mess and hard to follow, particularly for loads and stores. Fix various redundant, unnecessary and dead settings. llvm-svn: 270307	2016-05-21 02:27:49 +00:00
Matt Arsenault	81a709503d	AMDGPU: Fix high bits after division optimization This is essentially doing a 24-bit signed division with FP. We need to truncate to the N bit result. llvm-svn: 270305	2016-05-21 01:53:33 +00:00
Dylan McKay	52ed0aa203	[AVR] Add AVRMCAsmInfo llvm-svn: 270302	2016-05-21 01:06:37 +00:00
Matt Arsenault	b6e1cc2a92	AMDGPU: Fix verifier error when spilling SGPRs The current SGPR spilling test does not stress this because it is using s_buffer_load instructions to increase SGPR pressure and spill, but their output operands have the same SReg_32_XM0 constraint. This fixes an error when the SReg_32 output from most instructions is spilled. llvm-svn: 270301	2016-05-21 00:53:42 +00:00
Matt Arsenault	8f5e008534	AMDGPU: Fix relationship between SReg_32 and SReg_32_XM0 llvm-svn: 270300	2016-05-21 00:53:28 +00:00
Dylan McKay	28ae31731e	[AVR] Fix header files in MCTargetDesc Everything now compiles successfully, but there are still undefined references. llvm-svn: 270298	2016-05-21 00:35:07 +00:00
Matt Arsenault	4945905f5f	AMDGPU: Handle cbranch vccz/vccnz llvm-svn: 270297	2016-05-21 00:29:40 +00:00
Matt Arsenault	72fcd5f597	AMDGPU: Implement ReverseBranchCondition llvm-svn: 270296	2016-05-21 00:29:34 +00:00
Matt Arsenault	6d09380532	AMDGPU: Implement AnalyzeBranch Original patch by Tom Stellard llvm-svn: 270295	2016-05-21 00:29:27 +00:00
Dan Gohman	b7c2400fa7	[WebAssembly] Optimize away return instructions using fallthroughs. This saves a small amount of code size, and is a first small step toward passing values on the stack across block boundaries. Differential Review: http://reviews.llvm.org/D20450 llvm-svn: 270294	2016-05-21 00:21:56 +00:00
Dylan McKay	be8e2e0fa8	[AVR] Fix signuature of AVRTargetMachine constructor llvm-svn: 270292	2016-05-20 23:39:04 +00:00
Justin Bogner	dc8af06b27	SDAG: Implement Select instead of SelectImpl in PPCDAGToDAGISel - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 270283	2016-05-20 21:43:23 +00:00
Jacques Pienaar	5ffdef55f0	[lanai] Change reloc to use PIC_ by default and cleanup. * Change reloc to PIC_; * Cleanup (clang-format & modify test); llvm-svn: 270282	2016-05-20 21:41:53 +00:00
David Majnemer	498f2fd11b	Address post-review for r270246 This gets rid of some unnecessary SmallStrings in X86TargetMachine::getSubtargetImpl. No functionality change is intended. llvm-svn: 270270	2016-05-20 20:41:24 +00:00
Jun Bum Lim	b21d4e17a2	[AArch64] Disable narrow load merge by default Summary: As this optimization converts two loads into one load with two shift instructions, it could potentially hurt performance if a loop is arithmetic operation intensive. Reviewers: t.p.northover, mcrosier, jmolloy Subscribers: evandro, jmolloy, aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D20172 llvm-svn: 270251	2016-05-20 18:45:49 +00:00
David Majnemer	ca29023b02	[X86] Reduce memory allocations in X86TargetMachine::getSubtargetImpl We performed a number of memory allocations each time getTTI was called, remove them by using SmallString. No functionality change intended. llvm-svn: 270246	2016-05-20 18:16:06 +00:00
Sanjay Patel	5496a23316	fix comments; NFC llvm-svn: 270237	2016-05-20 17:07:19 +00:00
Sanjay Patel	1dc57cb944	use range-loops; NFCI llvm-svn: 270236	2016-05-20 17:00:10 +00:00
Sanjay Patel	8bc63b2f47	fix documentation comments; NFC llvm-svn: 270234	2016-05-20 16:46:01 +00:00
Simon Pilgrim	55ef3da27b	[X86][AVX] Generalized matching for target shuffle combines This patch is a first step towards a more extendible method of matching combined target shuffle masks. Initially this just pulls out the existing basic mask matches and adds support for some 256/512 bit equivalents. Future patterns will require a number of features to be added but I wanted to keep this patch simple. I hope we can avoid duplication between shuffle lowering and combining and share more complex pattern match functions in future commits. Differential Revision: http://reviews.llvm.org/D19198 llvm-svn: 270230	2016-05-20 16:19:30 +00:00
Rafael Espindola	c7e9813228	Refactor X86 symbol access classification. This refactors the logic in X86 to avoid code duplication. It also splits it in two steps: it first decides if a symbol is local to the DSO and then uses that information to decide how to access it. The first part is implemented by shouldAssumeDSOLocal. It is not in any way specific to X86. In a followup patch I intend to move it to somewhere common and reused it in other backends. llvm-svn: 270209	2016-05-20 12:20:10 +00:00
Rafael Espindola	8571aa3d5d	Simplify handling of hidden stubs on PowerPC. We now handle them just like non hidden ones. This was already the case on x86 (r207518) and arm (r207517). llvm-svn: 270205	2016-05-20 12:00:52 +00:00
NAKAMURA Takumi	0e57b13743	SparcISelLowering.cpp: Add missing StringSwitch.h llvm-svn: 270200	2016-05-20 10:53:56 +00:00
Chris Dewhurst	ad74117af4	[Sparc] Implement getRegisterByName. Allows Sparc registers to be specifically referred to in inline assembly. llvm-svn: 270198	2016-05-20 10:21:01 +00:00
Chris Dewhurst	0dfa6bc004	[Sparc] Enable more inline assembly constraints. Note: This is specifically to allow GCC's test pr44707 to pass. Trivial change, not put for differential revision. Test included. llvm-svn: 270192	2016-05-20 09:03:01 +00:00
Craig Topper	b182715a52	[X86] Fix another AVX pattern to only be disable if VLX and BWI are supported. llvm-svn: 270182	2016-05-20 05:10:27 +00:00
Jacques Pienaar	813e83734d	[lanai] Use Optional<Reloc> in LanaiTargetMachine. Follow r269988 and use Optional<Reloc>. llvm-svn: 270176	2016-05-20 03:21:37 +00:00
Craig Topper	0a7a8dee2b	[X86] Fix some AVX patterns to only be disabled if VLX and BWI are supported. Without this we get isel failures on the avx-intrinsics-x86.ll test in AVX512VL. llvm-svn: 270174	2016-05-20 02:00:08 +00:00
Dylan McKay	7ec6f56040	Add AVRTargetStreamers Reviewed by Matt Arsenault in http://reviews.llvm.org/D16311 llvm-svn: 270171	2016-05-20 01:17:38 +00:00
Rafael Espindola	ab03eb007c	Record a TargetMachine instead of a Reloc::Model. Addresses r270095's code review. llvm-svn: 270147	2016-05-19 22:07:57 +00:00
Matt Arsenault	4e3d383c46	AMDGPU: Remove pointless conversions llvm-svn: 270139	2016-05-19 21:09:58 +00:00
Dan Gohman	847afa2231	[WebAssembly] Simplify code that never has to handle physical registers. NFC. llvm-svn: 270137	2016-05-19 21:07:20 +00:00
David Blaikie	bc744272f8	Fix -Wunused-variable in non-Asserts build llvm-svn: 270118	2016-05-19 20:44:22 +00:00
David Blaikie	f869d3190c	Simplify conditional unreachable into an assertion llvm-svn: 270111	2016-05-19 20:28:40 +00:00
Hans Wennborg	172eee9cfc	X86: Don't reset the stack after calls that don't return (PR27117) Since the calls don't return, the instruction afterwards will never run, and is just taking up unnecessary space in the binary. Differential Revision: http://reviews.llvm.org/D20406 llvm-svn: 270109	2016-05-19 20:15:33 +00:00
Rafael Espindola	46107b9e62	Remember the relocation model. NFC. This avoids passing a TargetMachine in a few places. llvm-svn: 270095	2016-05-19 18:49:29 +00:00
Rafael Espindola	cb2d266360	Style fixes. NFC. llvm-svn: 270093	2016-05-19 18:34:20 +00:00
Zhan Jun Liau	e327fa12a1	[SystemZ] Test commit - remove idea from README Remove a comment about not supporting LRVH/STRVH from the README LRVH/STRVH are being generated as of r269688 llvm-svn: 270092	2016-05-19 18:30:17 +00:00
Matt Arsenault	4318ea354a	AMDGPU: Also look for s_cbranch_vccz llvm-svn: 270091	2016-05-19 18:20:25 +00:00
Ron Lieberman	562e19eecb	Fix a covnersion from string to bool issue used in an assert Problem Was exposed by -Wstring-conversion llvm-svn: 270087	2016-05-19 18:05:56 +00:00
Chad Rosier	02f25a9565	[AArch64 ] Generate a BFXIL from 'or (and X, Mask0Imm),(and Y, Mask1Imm)'. Mask0Imm and ~Mask1Imm must be equivalent and one of the MaskImms is a shifted mask (e.g., 0x000ffff0). Both 'and's must have a single use. This changes code like: and w8, w0, #0xffff000f and w9, w1, #0x0000fff0 orr w0, w9, w8 into lsr w8, w1, #4 bfi w0, w8, #4, #12 llvm-svn: 270063	2016-05-19 14:19:47 +00:00

1 2 3 4 5 ...

37575 Commits