Commit Graph

Daniel Sanders a73d8fe2ad [mips] Distinguish 'R', 'ZC', and 'm' inline assembly memory constraint.
Summary:
The previous behaviour of 'R' and 'm' has been preserved for now. They will be
improved in subsequent commits.

The offset permitted by ZC varies according to the subtarget since it is
intended to match the restrictions of the pref, ll, and sc instructions.

The restrictions on these instructions are:
* For microMIPS: 12-bit signed offset.
* For Mips32r6/Mips64r6: 9-bit signed offset.
* Otherwise: 16-bit signed offset.
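
For illustration, a 'ZC' operand might be used from C/C++ like this (a
hedged sketch; the hint value and the function are made up for the example):

    // Prefetch through a 'ZC' memory operand; the constraint guarantees
    // the printed offset fits the pref/ll/sc range for the subtarget.
    void prefetch_word(const int *p) {
      __asm__("pref 0, %0" : : "ZC"(*p));
    }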

Reviewers: vkalintiris

Reviewed By: vkalintiris

Subscribers: llvm-commits

Differential Revision: http://reviews.llvm.org/D8414

llvm-svn: 233063
2015-03-24 11:26:34 +00:00
Michael Kuperstein 774b441b5e Use std::bitset for SubtargetFeatures
Previously, subtarget features were a bitfield with the underlying type being uint64_t.
Since several targets (X86 and ARM, in particular) have hit or come very close to hitting this bound, switch the features to use a bitset.
No functional change.
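
A minimal sketch of the change's shape (the width 192 and the names here
are illustrative assumptions, not the in-tree constants):

    #include <bitset>
    #include <cstdint>

    // Before: a plain 64-bit mask, capped at 64 subtarget features.
    uint64_t OldFeatureBits = 0;

    // After: a bitset that can grow past 64 features without changing
    // the surrounding API.
    std::bitset<192> FeatureBits;

    void enableFeature(unsigned Bit) { FeatureBits.set(Bit); }
    bool hasFeature(unsigned Bit) { return FeatureBits.test(Bit); }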

The first time this was committed (r229831), it caused several buildbot failures. 
At least some of the ARM ones were due to gcc/binutils issues, and should now be fixed.

Differential Revision: http://reviews.llvm.org/D8542

llvm-svn: 233055
2015-03-24 09:17:25 +00:00
Ahmed Bougacha d1655cb1c0 [AArch64, ARM] Enable GlobalMerge with -O3 rather than -O1.
The pass used to be enabled by default with CodeGenOpt::Less (-O1).
This is too aggressive, considering the pass indiscriminately merges
all globals together.

Currently, performance doesn't always improve, and, on code that uses
few globals (e.g., the odd file- or function-static), more often than
not it is degraded by the optimization.  Lengthy discussion can be found
on llvmdev (AArch64-focused; ARM has similar problems):
  http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-February/082800.html
Also, it makes tooling and debuggers less useful when dealing with
globals and data sections.

GlobalMerge needs to better identify those cases that benefit, and this
will be done separately.  In the meantime, move the pass to run with
-O3 rather than -O1, on both ARM and AArch64.
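
A sketch of what the gating looks like at a backend's pass-setup site
(hedged; the exact call site and signature differ per target):

    // Was CodeGenOpt::Less (-O1); the pass now only runs at -O3.
    if (TM->getOptLevel() == CodeGenOpt::Aggressive)
      PM.add(createGlobalMergePass(TM));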

llvm-svn: 233024
2015-03-23 21:17:36 +00:00
David Blaikie 4eaa79c8d9 Refactor: Simplify boolean expressions in R600 target
Simplify boolean expressions with `true` and `false` using `clang-tidy`
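
A representative before/after of the kind of simplification applied
(illustrative, not a specific hunk from this patch):

    // Before:
    //   return Cond == true ? true : false;
    // After the clang-tidy simplification:
    bool isEnabled(bool Cond) { return Cond; }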

Patch by Richard Thomson.

Differential Revision: http://reviews.llvm.org/D8520

llvm-svn: 233020
2015-03-23 20:56:44 +00:00
David Blaikie 50e4f9e4c8 Refactor: Simplify boolean expressions in x86 target
Simplify boolean expressions with `true` and `false` with `clang-tidy`

Patch by Richard Thomson.

Differential Revision: http://reviews.llvm.org/D8519

llvm-svn: 233002
2015-03-23 19:42:36 +00:00
Benjamin Kramer 799003bf8c Re-sort includes with sort-includes.py and insert raw_ostream.h where it's used.
llvm-svn: 232998
2015-03-23 19:32:43 +00:00
Matt Arsenault 88a13c6c8d R600/SI: Merge tables for commuting
Don't use a separate table for compares anymore,
and use the same VOP2_REV class.

llvm-svn: 232992
2015-03-23 18:45:41 +00:00
Matt Arsenault 0943b0e30f R600/SI: Only use one range of isCommutable for compares
Also don't count the class instructions as isCompare anymore.

llvm-svn: 232991
2015-03-23 18:45:38 +00:00
Matt Arsenault 448dac05cd R600/SI: Remove redundant unsetting of hasSideEffects
These are already set in the base class for the instruction.

llvm-svn: 232990
2015-03-23 18:45:36 +00:00
Matt Arsenault 42f39e1a3f R600/SI: Move hasSideEffects setting into VOPCX classes
llvm-svn: 232989
2015-03-23 18:45:35 +00:00
Matt Arsenault f5b2cd891a R600/SI: Allow commuting compares
This enables very common cases to switch to the
smaller encoding.

All of the standard LLVM canonicalizations of comparisons
are the opposite of what we want. Compares with constants
are moved to the RHS, but the first operand can be an inline
immediate, literal constant, or SGPR using the 32-bit VOPC
encoding.

There are additional bad canonicalizations that should
also be fixed, such as canonicalizing ge x, k to gt x, (k + 1)
if this makes k no longer an inline immediate value.

llvm-svn: 232988
2015-03-23 18:45:30 +00:00
Matt Arsenault 05b617fed5 R600/SI: Use right class for cmpsx f64 instructions
Use VOPCX_F64 so the explicit let Defs = [EXEC] is not needed.

llvm-svn: 232987
2015-03-23 18:45:23 +00:00
Matt Arsenault a2dd76f41c R600/SI: Remove cond operand to VOPCX classes
It isn't used, and these will probably never be directly selected.

llvm-svn: 232986
2015-03-23 18:45:20 +00:00
Benjamin Kramer 16132e6faa Purge unused includes throughout libSupport.
NFC.

llvm-svn: 232976
2015-03-23 18:07:13 +00:00
Chad Rosier affe181b39 [AArch64] Enable rematerialization of float 0 values.
Patch by Geoff Berry <gberry@codeaurora.org>.

llvm-svn: 232967
2015-03-23 17:19:34 +00:00
Bradley Smith ae0ad9c95d Revert "[ARM] Add more pattern matching for f16 <-> f64 conversions"
This change is incorrect since it converts double rounding into single rounding,
which can produce different results. Instead this optimization will be done by
modifying Clang's codegen to not produce double rounding in the first place.
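
A worked example of the difference in the narrowing direction (the value
is chosen for this note, not taken from the patch; __fp16 is a Clang/GCC
extension, so toolchain support is an assumption):

    #include <cmath>
    #include <cstdio>

    int main() {
      // Just above the midpoint of two adjacent f16 values, but close
      // enough that rounding to f32 first lands exactly on that midpoint.
      double d = 1.0 + std::ldexp(1.0, -11) + std::ldexp(1.0, -25);
      __fp16 two_step = (__fp16)(float)d; // ties-to-even at the midpoint:
                                          // yields 1.0
      __fp16 one_step = (__fp16)d;        // rounds up: yields 1.0009765625
      std::printf("%.10f vs %.10f\n", (double)two_step, (double)one_step);
      return 0;
    }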

This reverts commit r232954.

llvm-svn: 232962
2015-03-23 16:52:52 +00:00
Eli Bendersky 3e84019a39 Simplify boolean expressions with true and false using clang-tidy
Patch by Richard (legalize@xmission.com)

Differential Revision: http://reviews.llvm.org/D8521

llvm-svn: 232961
2015-03-23 16:26:23 +00:00
James Molloy fa041153e5 [ARM] Remove target-specific ITOFP/FPTOI nodes
Anton tried this 5 years ago but it was reverted due to extra VMOVs
being emitted. This can be easily fixed with a liberal application
of patterns - matching loads/stores and extractelts.

llvm-svn: 232958
2015-03-23 16:15:16 +00:00
Tom Stellard f0a575f6be R600/SI: Fix crash in SIInstrInfo::areLoadsFromSameBasePtr()
This function assumed that SMRD instructions always have immediate
offsets, which is not always the case.
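
The defensive shape of such a fix, sketched (the exact in-tree code is
an assumption):

    #include "llvm/CodeGen/SelectionDAGNodes.h"
    using namespace llvm;

    // Verify the SMRD offset operands really are constants before use.
    bool sameBasePtrOffsets(SDNode *Load1, SDNode *Load2,
                            int64_t &Offset1, int64_t &Offset2) {
      auto *Off1 = dyn_cast<ConstantSDNode>(Load1->getOperand(1));
      auto *Off2 = dyn_cast<ConstantSDNode>(Load2->getOperand(1));
      if (!Off1 || !Off2)
        return false; // non-immediate offset: this used to crash
      Offset1 = Off1->getZExtValue();
      Offset2 = Off2->getZExtValue();
      return true;
    }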

llvm-svn: 232957
2015-03-23 16:06:01 +00:00
Colin LeMahieu 473e34782d [Hexagon] Simplify boolean expression
Patch by Richard
http://reviews.llvm.org/D8523

llvm-svn: 232955
2015-03-23 16:01:03 +00:00
Bradley Smith bc0f0d8c49 [ARM] Add more pattern matching for f16 <-> f64 conversions
Specifically when the conversion is done in two steps, f16 -> f32 -> f64.

For example:

%1 = tail call float @llvm.convert.from.fp16.f32(i16 %0)
%conv = fpext float %1 to double

to:

vcvtb.f64.f16

llvm-svn: 232954
2015-03-23 15:59:54 +00:00
Benjamin Kramer 51f6096cf8 Move private classes into anonymous namespaces
NFC.

llvm-svn: 232944
2015-03-23 12:30:58 +00:00
Petar Jovanovic 5b4362276b Fix sign extension for MIPS64 in makeLibCall function
Fix sign extension in makeLibCall for MIPS64. In the MIPS64 architecture, all
32-bit arguments (int, unsigned int, and 32-bit float under soft float) must
be sign-extended. This fixes the test "MultiSource/Applications/oggenc/".
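
A hedged sketch of the hook-style shape such a fix takes (whether this
patch used exactly this mechanism is an assumption):

    // Mips override: 32-bit values are sign-extended for libcalls on
    // 64-bit subtargets.
    bool MipsTargetLowering::shouldSignExtendTypeInLibCall(EVT Type,
                                                           bool IsSigned) const {
      if (Subtarget.isGP64bit() && Type == MVT::i32)
        return true;
      return IsSigned;
    }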

Patch by Strahinja Petrovic.

Differential Revision: http://reviews.llvm.org/D7791

llvm-svn: 232943
2015-03-23 12:28:13 +00:00
Daniel Sanders f731eee322 [aarch64] Distinguish the 'Q' and 'm' inline assembly memory constraints.
Summary:
But still handle them the same way since I don't know how they differ on
this target.

Clang also has code for 'Ump', 'Utf', 'Usa', and 'Ush' but calls
llvm_unreachable() on this code path so they are not converted to a
constraint id at the moment.

No functional change intended.
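
For reference, 'Q' asks for a memory operand addressed by a single base
register with no offset; a common (hedged) use from C/C++:

    // Store-release through a 'Q'-constrained operand on AArch64.
    void store_release(long *p, long v) {
      __asm__ volatile("stlr %1, %0" : "=Q"(*p) : "r"(v) : "memory");
    }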

Reviewers: t.p.northover

Subscribers: aemerson, llvm-commits

Differential Revision: http://reviews.llvm.org/D8177

llvm-svn: 232941
2015-03-23 11:33:15 +00:00
David Majnemer abd9f5bfb6 Silence a GCC warning
llvm-svn: 232923
2015-03-22 21:27:10 +00:00
Simon Pilgrim 3f229eaf3f Fixed MSVC compile warning issue introduced in r232837
- was reporting 'warning C4715: 'getType32' : not all control paths return a value'

llvm-svn: 232913
2015-03-22 13:38:36 +00:00
Benjamin Kramer 7857d723f1 [SimplifyLibCalls] Turn memchr(const, C, const) into a bitfield check.
strchr("123!", C) != nullptr is a common pattern to check if C is one
of 1, 2, 3 or !. If the largest element of the string is smaller than
the target's register size we can easily create a bitfield and just
do a simple test for set membership.

int foo(char C) { return strchr("123!", C) != nullptr; } now becomes

	cmpl	$64, %edi ## range check
	sbbb	%al, %al
	movabsq	$0xE000200000001, %rcx
	btq	%rdi, %rcx ## bit test
	sbbb	%cl, %cl
	andb	%al, %cl ## and the two conditions
	andb	$1, %cl
	movzbl	%cl, %eax ## returning an int
	ret

(imho the backend should expand this into a series of branches, but
that's a different story)

The code is currently limited to bit fields that fit in a register, so
usually 64 or 32 bits. Sadly, this misses anything using alpha chars
or {}. This could be fixed by just emitting an i128 bit field, but that
can generate really ugly code so we have to find a better way. To some
degree this is also recreating switch lowering logic, but we can't
simply emit a switch instruction and thus change the CFG within
instcombine.
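
How the 0xE000200000001 constant above is derived, as a small sketch
(note that strchr also matches the terminating nul, hence bit 0):

    #include <cstdint>

    // One bit per byte of "123!" plus the trailing '\0':
    // '\0' = 0, '!' = 33, '1' = 49, '2' = 50, '3' = 51.
    uint64_t buildMask(const char *S) {
      uint64_t Mask = 1; // bit 0 covers the terminating nul
      for (; *S; ++S)
        Mask |= uint64_t(1) << (unsigned char)*S;
      return Mask; // 0xE000200000001 for "123!"
    }

    // Membership test mirroring the generated code: range check + bit test.
    bool contains(uint64_t Mask, unsigned char C) {
      return C < 64 && ((Mask >> C) & 1);
    }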

llvm-svn: 232902
2015-03-21 21:09:33 +00:00
Eric Christopher 4d0f35a901 Remove the target independent TargetMachine::getSubtarget and
TargetMachine::getSubtargetImpl routines.

This keeps the target-independent code free of bare subtarget
calls while the remainder of the backends are migrated (or not,
if they don't wish to support per-function subtargets as would
be needed for function multiversioning or LTO of disparate
cpu subarchitecture types), e.g.

clang -msse4.2 -c foo.c -emit-llvm -o foo.bc
clang -c bar.c -emit-llvm -o bar.bc
llvm-link foo.bc bar.bc -o baz.bc
llc baz.bc

and get appropriate code for what the command lines requested.

llvm-svn: 232885
2015-03-21 04:22:23 +00:00
Eric Christopher faad620569 Remove the bare getSubtargetImpl call from the AArch64 port. As part
of this add a test that shows we can generate code for functions
that specifically enable a subtarget feature.

llvm-svn: 232884
2015-03-21 04:04:50 +00:00
Eric Christopher 83eb13c967 Remove the bare getSubtargetImpl call from the PPC port. As part
of this add a test that shows we can generate code for functions
that differ by subtarget feature.

llvm-svn: 232882
2015-03-21 03:36:02 +00:00
Eric Christopher 8024d030fb Grab a subtarget off of an AMDGPUTargetMachine rather than a
bare target machine in preparation for the TargetMachine bare
getSubtarget/getSubtargetImpl calls going away.

llvm-svn: 232880
2015-03-21 03:17:25 +00:00
Eric Christopher c5a85af3b2 Cache the Function dependent subtarget on the MachineFunction.
As preparation for removing the getSubtargetImpl() call from
TargetMachine go ahead and flip the switch on caching the function
dependent subtarget and remove the bare getSubtargetImpl call
from the X86 port. As part of this add a few tests that show we
can generate code and assemble on X86 based on features/cpu on
the Function.
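
The resulting query pattern in a backend pass, sketched (hedged):

    // Instead of reaching through the TargetMachine:
    //   const X86Subtarget &ST = TM.getSubtarget<X86Subtarget>(); // removed
    // query the subtarget cached on the MachineFunction:
    const X86Subtarget &ST = MF.getSubtarget<X86Subtarget>();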

llvm-svn: 232879
2015-03-21 03:13:10 +00:00
Eric Christopher cba722f8c1 Grab the cached subtarget off of the MachineFunction.
llvm-svn: 232878
2015-03-21 03:13:07 +00:00
Eric Christopher 948bdf996b Grab a subtarget off of a MipsTargetMachine rather than a
bare target machine in preparation for the TargetMachine bare
getSubtarget/getSubtargetImpl calls going away.

llvm-svn: 232877
2015-03-21 03:13:05 +00:00
Eric Christopher 5c3dffc459 Simplify the query for a subtarget in the NVPTX pass manager.
llvm-svn: 232876
2015-03-21 03:13:03 +00:00
Eric Christopher cd53d6eda7 Change getISAEncoding to use the target triple to determine
thumb-ness, similar to the rest of the Module-level asm printing
infrastructure, since debug info finalization happens after the
function has been emitted and the function may be missing by then.
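
Determining thumb-ness from the triple, roughly (a sketch):

    #include "llvm/ADT/Triple.h"
    using namespace llvm;

    bool isThumbTriple(const Triple &TT) {
      return TT.getArch() == Triple::thumb || TT.getArch() == Triple::thumbeb;
    }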

llvm-svn: 232875
2015-03-21 03:13:01 +00:00
Eric Christopher 23a7d1e6f4 Make the Hexagon ISelDAGToDAG pass set the subtarget dynamically
on each runOnMachineFunction invocation.
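
The per-function pattern described, as a sketch (the member name is an
assumption):

    bool HexagonDAGToDAGISel::runOnMachineFunction(MachineFunction &MF) {
      // Re-derive the subtarget for each function rather than caching one
      // from the TargetMachine at construction time.
      HST = &MF.getSubtarget<HexagonSubtarget>();
      return SelectionDAGISel::runOnMachineFunction(MF);
    }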

llvm-svn: 232874
2015-03-21 03:12:59 +00:00
Ahmed Bougacha e6bb09ac3f [AArch64] Prefer UZP for concat_vector of illegal truncs.
Follow-up to r232459: prefer a UZP shuffle to the intermediate truncs.

llvm-svn: 232871
2015-03-21 01:08:39 +00:00
Sanjay Patel c88f724fed [X86] Prefer blendps over insertps codegen for one special case
With this patch, for this one exact case, we'll generate:

  blendps %xmm0, %xmm1, $1

instead of:

  insertps %xmm0, %xmm1, $0

If there's a memory operand available for load folding and we're
optimizing for size, we'll still generate the insertps.

The detailed performance data motivating this change may be found in D7866;
in summary, blendps has 2-3x the throughput of insertps on widely used chips.

Differential Revision: http://reviews.llvm.org/D8332

llvm-svn: 232850
2015-03-20 21:19:52 +00:00
Benjamin Kramer 063667cea2 X86: Make helper functions static. NFC.
llvm-svn: 232848
2015-03-20 21:07:30 +00:00
Rafael Espindola 36a15cb975 Don't declare all text sections at the start of the .s
The code this patch removes was there to make sure the text sections went
before the dwarf sections. That is necessary because MachO uses offsets
relative to the start of the file, so adding a section can change relaxations.

The dwarf sections were being printed at the start just to produce symbols
pointing at the start of those sections.

The underlying issue was fixed in r231898. The dwarf sections are now printed
when they are about to be used, which is after we printed the text sections.

To make sure we don't regress, the patch makes the MachO streamer assert
if CodeGen puts anything unexpected after the DWARF sections.

llvm-svn: 232842
2015-03-20 20:00:01 +00:00
Rafael Espindola bdfbde56e0 Reorganize the x86 ELF relocation selection logic.
The main differences are:

* Split in 32 and 64 bit functions.
* First switch on the Modifier so that we have only one non-fully-covered
  switch.
* Map the fixup kind first to a x86_64 (or i386) specific enum, to make
  it easy to handle cases like X86::reloc_riprel_4byte_movq_load.
* Switch on IsPCRel last, which reduces code duplication.

Fixes pr22308.

llvm-svn: 232837
2015-03-20 19:48:54 +00:00
John Brawn 1f26a47630 [ARM] Fix handling of thumb1 out-of-range frame offsets
LocalStackSlotPass assumes that isFrameOffsetLegal doesn't change its
answer when the base register changes. Unfortunately this isn't true
in thumb1, where SP-based loads allow a larger offset than
non-SP-based loads, and this causes the base register reuse code to
generate instructions that are unencodable, causing an assertion
failure. 

Solve this by adding a BaseReg parameter to isFrameOffsetLegal, which
ARMBaseRegisterInfo can then make use of to give the correct answer. 
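
Why the answer depends on the base register, in a simplified sketch (the
thumb1 encoding ranges are real; the function shape is illustrative):

    // Thumb1 SP-relative loads (tLDRspi) take an 8-bit immediate scaled
    // by 4 (0..1020); other bases (tLDRi) get a 5-bit immediate scaled by
    // 4 (0..124), so legality depends on BaseReg.
    bool isFrameOffsetLegalSketch(bool BaseIsSP, int64_t Offset) {
      if (Offset < 0 || (Offset & 3) != 0)
        return false;
      return BaseIsSP ? Offset <= 1020 : Offset <= 124;
    }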

Differential Revision: http://reviews.llvm.org/D8419

llvm-svn: 232825
2015-03-20 17:20:07 +00:00
Simon Pilgrim 180cad2e57 Stripped trailing whitespace. NFC.
llvm-svn: 232822
2015-03-20 16:08:17 +00:00
Tom Stellard 3b0dab9f3f R600/SI: Refactor VOP2 instruction defs
llvm-svn: 232817
2015-03-20 15:14:23 +00:00
Tom Stellard 23c2c3d0f4 R600/SI: Refactor VOP1 instruction defs
llvm-svn: 232816
2015-03-20 15:14:21 +00:00
Rafael Espindola 8c8d15879f Reduce indentation after return. NFC.
llvm-svn: 232814
2015-03-20 14:33:25 +00:00
Rafael Espindola 2d74274017 Use early returns. NFC.
llvm-svn: 232813
2015-03-20 14:23:46 +00:00
Rafael Espindola 9e77cba164 Fold a llvm_unreachable into an assert. NFC.
llvm-svn: 232811
2015-03-20 13:50:15 +00:00
Rafael Espindola 5f5e24bb92 clang-format a function. NFC.
llvm-svn: 232810
2015-03-20 13:47:40 +00:00