llvm-project

Commit Graph

Author	SHA1	Message	Date
Matthias Braun	27b6692fe2	AArch64Subtarget: Use default member initializers llvm-svn: 271057	2016-05-27 22:14:09 +00:00
Benjamin Kramer	3e9a5d3468	Apply clang-tidy's misc-static-assert where it makes sense. Also fold conditions into assert(0) where it makes sense. No functional change intended. llvm-svn: 270982	2016-05-27 11:36:04 +00:00
Chad Rosier	14aa2ad1f4	[AArch64] Generate rev16/rev32 from bswap + srl when upper bits are known zero. Canonicalize (srl (bswap i32 x), 16) to (rotr (bswap i32 x), 16), if the high 16-bits of x are zero. Similarly, canonicalize (srl (bswap i64 x), 32) to (rotr (bswap i64 x), 32), if the high 32-bits of x are zero. test_rev_w_srl16: test_rev_w_srl16: and w8, w0, #0xffff and w8, w0, #0xffff rev w8, w8 ---> rev16 w0, w8 lsr w0, w8, #16 test_rev_x_srl32: test_rev_x_srl32: rev x8, x8 ---> rev32 x0, x8 lsr x0, x8, #32 llvm-svn: 270896	2016-05-26 19:41:33 +00:00
Chad Rosier	816a67da49	[AArch64] Generate a BFI/BFXIL from 'or (and X, MaskImm), OrImm'. If and only if the value being inserted sets only known zero bits. This combine transforms things like and w8, w0, #0xfffffff0 movz w9, #5 orr w0, w8, w9 into movz w8, #5 bfxil w0, w8, #0, #4 The combine is tuned to make sure we always reduce the number of instructions. We avoid churning code for what is expected to be performance neutral changes (e.g., converted AND+OR to OR+BFI). Differential Revision: http://reviews.llvm.org/D20387 llvm-svn: 270846	2016-05-26 13:27:56 +00:00
Rafael Espindola	a224de06bc	Use shouldAssumeDSOLocal on AArch64. This reduces code duplication and now AArch64 also handles PIE. llvm-svn: 270844	2016-05-26 12:42:55 +00:00
Rafael Espindola	6b93bf5783	Don't repeat name in comment and git-clang-format. llvm-svn: 270785	2016-05-25 22:44:06 +00:00
Rafael Espindola	6b4baa5f58	Sort includes. llvm-svn: 270769	2016-05-25 21:37:29 +00:00
Jun Bum Lim	b21d4e17a2	[AArch64] Disable narrow load merge by default Summary: As this optimization converts two loads into one load with two shift instructions, it could potentially hurt performance if a loop is arithmetic operation intensive. Reviewers: t.p.northover, mcrosier, jmolloy Subscribers: evandro, jmolloy, aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D20172 llvm-svn: 270251	2016-05-20 18:45:49 +00:00
Chad Rosier	02f25a9565	[AArch64 ] Generate a BFXIL from 'or (and X, Mask0Imm),(and Y, Mask1Imm)'. Mask0Imm and ~Mask1Imm must be equivalent and one of the MaskImms is a shifted mask (e.g., 0x000ffff0). Both 'and's must have a single use. This changes code like: and w8, w0, #0xffff000f and w9, w1, #0x0000fff0 orr w0, w9, w8 into lsr w8, w1, #4 bfi w0, w8, #4, #12 llvm-svn: 270063	2016-05-19 14:19:47 +00:00
Chad Rosier	e006202a4d	[AArch64] Push comment into function. NFC. llvm-svn: 270003	2016-05-18 23:51:17 +00:00
Rafael Espindola	8c34dd8257	Delete Reloc::Default. Having an enum member named Default is quite confusing: Is it distinct from the others? This patch removes that member and instead uses Optional<Reloc> in places where we have a user input that still hasn't been mapped to the default value, which is now clear has to be one of the remaining 3 options. llvm-svn: 269988	2016-05-18 22:04:49 +00:00
Chad Rosier	91294c5bdc	[AArch64] Minor refactoring. NFC. llvm-svn: 269963	2016-05-18 17:43:11 +00:00
Rafael Espindola	38af4d6347	Trivial cleanups. This just clang formats and cleans comments in an area I am about to post a patch for review. llvm-svn: 269946	2016-05-18 16:00:24 +00:00
Geoff Berry	74cb718ea9	[AArch64] Fix bug in large stack spill slot handling (PR27717) Summary: Fix bug in MachO path where a frame index offset would not be reserved for handling large frames when an extra non-used callee-save register was saved. In the case where the extra register is reserved or not a GPR (e.g. %FP in the MachO case), this would lead to the register scavenger later failing when called from PrologEpilogInserter. Reviewers: t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D20185 llvm-svn: 269697	2016-05-16 20:52:28 +00:00
Chad Rosier	c73d559df4	Use proper capitalization and punctuation per coding standards. NFC. llvm-svn: 269652	2016-05-16 12:55:01 +00:00
Benjamin Kramer	a65b610bd2	Move helper classes into anonymous namespaces. NFC. llvm-svn: 269591	2016-05-15 15:18:11 +00:00
Chad Rosier	7e8dd51d50	[AArch64] Update local variable names to conform to coding standard. NFC. llvm-svn: 269573	2016-05-14 18:56:28 +00:00
Chad Rosier	08d9908ea9	[AArch64] Simplify logic to reduce vertical space. NFC. llvm-svn: 269512	2016-05-13 22:53:13 +00:00
Paul Osmialowski	4f5b3be7f1	add support for -print-imm-hex for AArch64 Most immediates are printed in Aarch64InstPrinter using 'formatImm' macro, but not all of them. Implementation contains following rules: - floating point immediates are always printed as decimal - signed integer immediates are printed depends on flag settings (for negative values 'formatImm' macro prints the value as i.e -0x01 which may be convenient when imm is an address or offset) - logical immediates are always printed as hex - the 64-bit immediate for advSIMD, encoded in "a:b:c:d:e:f:g:h" is always printed as hex - the 64-bit immediate in exception generation instructions like: brk, dcps1, dcps2, dcps3, hlt, hvc, smc, svc is always printed as hex - the rest of immediates is printed depends on availability of -print-imm-hex Signed-off-by: Maciej Gabka <maciej.gabka@arm.com> Signed-off-by: Paul Osmialowski <pawel.osmialowski@arm.com> Differential Revision: http://reviews.llvm.org/D16929 llvm-svn: 269446	2016-05-13 18:00:09 +00:00
Justin Bogner	283e3bd793	SDAG: Implement Select instead of SelectImpl in AArch64DAGToDAGISel This one has a lot of code churn, but it's all mechanical and straightforward. - Where we were returning a node before, call ReplaceNode instead. - Where we would return null to fall back to another selector, rename the method to try* and return a bool for success. - Where we were calling SelectNodeTo, just return afterwards. Part of llvm.org/pr26808. llvm-svn: 269379	2016-05-12 23:10:30 +00:00
Justin Bogner	3525da7466	SDAG: Clean up dangling nodes in AArch64ISelDAGToDAG::SelectImpl When we convert to the void Select interface, leaving unreferenced nodes around won't be allowed anymore. Part of llvm.org/pr26808. llvm-svn: 269345	2016-05-12 20:54:27 +00:00
Chad Rosier	179480a4ea	[AArch64] Give function a more appropriate name. llvm-svn: 269335	2016-05-12 19:51:58 +00:00
Chad Rosier	042ac2c17f	[AArch64] Minor refactoring to simplify future patch. NFC. llvm-svn: 269329	2016-05-12 19:38:18 +00:00
Chad Rosier	39481ace40	[AArch64] Remove command-line option use for testing. The EXTR combine has been in tree for over 2 years without complain, so go ahead and remove the option. llvm-svn: 269292	2016-05-12 13:27:24 +00:00
Chad Rosier	9926a5e31d	[AArch64] Add support for unscaled narrow stores in getUsefulBitsForUse. llvm-svn: 269263	2016-05-12 01:42:01 +00:00
Chad Rosier	fe7bba4ee4	[AArch64] Remove floating-point narrow stores from getUsefulBitsForUse. While not impossible, it's unlikely we'd be performing bitwise operations on FP values. llvm-svn: 269260	2016-05-12 01:04:15 +00:00
Chad Rosier	23a1a9a66d	[AArch64] Improve getUsefulBitsForUse for narrow stores. For narrow stores (e.g., strb, srth) we know the upper bits of the register are unused/not useful. In some cases we can use this information to eliminate unnecessary instructions. For example, without this patch we generate (from the 2nd test case): ldr w8, [x0] and w8, w8, #0xfff0 bfxil w8, w2, #16, #4 strh w8, [x1] and after the patch the 'and' is removed: ldr w8, [x0] bfxil w8, w2, #16, #4 strh w8, [x1] ret During the lowering of the bitfield insert instruction the 'and' is eliminated because we know the upper 16-bits that are masked off are unused and the lower 4-bits that are masked off are overwritten by the insert itself. Therefore, the 'and' is unnecessary. Differential Revision: http://reviews.llvm.org/D20175 llvm-svn: 269226	2016-05-11 20:19:54 +00:00
Weiming Zhao	095c271131	[AArch64] Fix DAG selection for cmps for fp16 type Summary: When emitting comparison for fp16, in addition to promote the LHS and RHS to fp32, we need to change the VT as well. Reviewers: t.p.northover Subscribers: t.p.northover, aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D19922 llvm-svn: 269151	2016-05-11 01:26:32 +00:00
Tim Northover	9508a70adc	AArch64: allow vN to represent 64-bit registers in inline asm. Unlike xN/wN, the size of vN is genuinely ambiguous in the assembly, so we should try to infer what was intended from the type. But only down to 64-bits (vN can never represent sN, hN or bN). llvm-svn: 269132	2016-05-10 22:26:45 +00:00
Jonas Paulsson	8e5b0c65cc	[foldMemoryOperand()] Pass LiveIntervals to enable liveness check. SystemZ (and probably other targets as well) can fold a memory operand by changing the opcode into a new instruction that as a side-effect also clobbers the CC-reg. In order to do this, liveness of that reg must first be checked. When LIS is passed, getRegUnit() can be called on it and the right LiveRange is computed on demand. Reviewed by Matthias Braun. http://reviews.llvm.org/D19861 llvm-svn: 269026	2016-05-10 08:09:37 +00:00
Matthias Braun	31d19d43c7	CodeGen: Move TargetPassConfig from Passes.h to an own header; NFC Many files include Passes.h but only a fraction needs to know about the TargetPassConfig class. Move it into an own header. Also rename Passes.cpp to TargetPassConfig.cpp while we are at it. llvm-svn: 269011	2016-05-10 03:21:59 +00:00
Silviu Baranga	f60be28ed8	[AArch64] Implement lowering of the X constraint on AArch64 Summary: This implements the lowering of the X constraint on AArch64. The default behaviour of the X constraint lowering is to restrict it to "f". This is a problem because the "f" constraint is not implemented on AArch64 and would be too restrictive anyway. Therefore, the AArch64 hook will lower this to "w" (if the operand is a floating point or vector) or "r" otherwise. The implementation is similar with the one added for ARM (r267411). This is the AArch64 side of the fix for http://llvm.org/PR26493 Reviewers: rengolin Subscribers: aemerson, rengolin, llvm-commits, t.p.northover Differential Revision: http://reviews.llvm.org/D19967 llvm-svn: 268907	2016-05-09 11:10:44 +00:00
Geoff Berry	a5335647d5	[AArch64] Combine callee-save and local stack SP adjustment instructions. Summary: If a function needs to allocate both callee-save stack memory and local stack memory, we currently decrement/increment the SP in two steps: first for the callee-save area, and then for the local stack area. This changes the code to allocate them both at once at the very beginning/end of the function. This has two benefits: 1) there is one fewer sub/add micro-op in the prologue/epilogue 2) the stack adjustment instructions act as a scheduling barrier, so moving them to the very beginning/end of the function increases post-RA scheduler's ability to move instructions (that only depend on argument registers) before any of the callee-save stores This change can cause an increase in instructions if the original local stack SP decrement could be folded into the first store to the stack. This occurs when the first local stack store is to stack offset 0. In this case we are trading off one more sub instruction for one fewer sub micro-op (along with benefits (2) and (3) above). Reviewers: t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18619 llvm-svn: 268746	2016-05-06 16:34:59 +00:00
Jun Bum Lim	33be4997ed	[AArch64] Decouple zero store promotion from narrow ld merge. NFC. Summary: This change refactors to decouple the zero store promotion from the narrow ld merge and add a flag (enable-narrow-ld-merge=true) to control the narrow ld merge optimization. Reviewers: jmolloy, t.p.northover, mcrosier Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D19885 llvm-svn: 268744	2016-05-06 15:08:57 +00:00
Justin Bogner	b012699741	SDAG: Rename Select->SelectImpl and repurpose Select as returning void This is a step towards removing the rampant undefined behaviour in SelectionDAG, which is a part of llvm.org/PR26808. We rename SelectionDAGISel::Select to SelectImpl and update targets to match, and then change Select to return void and consolidate the sketchy behaviour we're trying to get away from there. Next, we'll update backends to implement `void Select(...)` instead of SelectImpl and eventually drop the base Select implementation. llvm-svn: 268693	2016-05-05 23:19:08 +00:00
Chad Rosier	777dc513a0	[AArch64] Remove unused MBP headers/dependency. NFC. llvm-svn: 268682	2016-05-05 20:58:38 +00:00
Evandro Menezes	d23324aab1	[AArch64] Add cheap as move instructions for Exynos M1 llvm-svn: 268549	2016-05-04 20:47:25 +00:00
Evandro Menezes	bcb95cd0ed	[AArch64] Use the reciprocal estimation machinery This patch adds support for estimating the square root, its reciprocal and division or reciprocal using the combiner generic reciprocal machinery. llvm-svn: 268539	2016-05-04 20:18:27 +00:00
Matthias Braun	e25bbd0bb8	AArch64/optimizeCondBranch: Remove earlier kill flag when forming TBZ This fixes -verify-machineinstrs complaints when compiling test-suite/SingleSource/Benchmarks/Shootout-C++/wordfreq.cpp llvm-svn: 268360	2016-05-03 04:54:16 +00:00
Matthias Braun	d1aabb2813	livePhysRegs: Pass MBB by reference in addLive{Ins|Outs}(); NFC The block must not be nullptr for the addLiveIns()/addLiveOuts() function. llvm-svn: 268340	2016-05-03 00:24:32 +00:00
Matthias Braun	24f26e6d91	LivePhysRegs: Automatically determine presence of pristine regs. Remove the AddPristinesAndCSRs parameters from addLiveIns()/addLiveOuts(). We need to respect pristine registers after prologue epilogue insertion, Seeing that we got this wrong in at least two commits already, we should rather pay the small price to query MachineFrameInfo for it. There are three cases that did not set AddPristineAndCSRs to true even after register allocation: - ExecutionDepsFix: live-out registers are used as a hint that the register is used soon. This is not true for pristine registers so use the new addLiveOutsNoPristines() to maintain this behaviour. - SystemZShortenInst: Not setting AddPristineAndCSRs to true looks like a bug, should do the right thing automatically now. - StackMapLivenessAnalysis: Not adding pristine registers looks like a bug to me. Added a FIXME comment but maintain the current behaviour as a change may need to get coordinated with GC runtimes. llvm-svn: 268336	2016-05-03 00:08:46 +00:00
Chad Rosier	9d1a556125	Cleanup comments. NFC. llvm-svn: 268236	2016-05-02 14:56:21 +00:00
Chad Rosier	7b6001ee0f	Cleanup comments. NFC. llvm-svn: 268235	2016-05-02 14:50:30 +00:00
Craig Topper	33772c5375	[CodeGen] Default CTTZ_ZERO_UNDEF/CTLZ_ZERO_UNDEF to Expand in TargetLoweringBase. This is what the majority of the targets want and removes a bunch of code. Set it to Legal explicitly in the few cases where that's the desired behavior. llvm-svn: 267853	2016-04-28 03:34:31 +00:00
Craig Topper	3b4842b56f	[AArch64] Expand CTTZ for all vector types. llvm-svn: 267837	2016-04-28 01:58:21 +00:00
Ahmed Bougacha	5a3bf6a4a9	[AArch64] Set AddPristinesAndCSRs to expandCMP_SWAP LivePhysRegs. We run after PEI. Found via inspection; no obvious testcase. Follow-up to r266339. llvm-svn: 267780	2016-04-27 20:33:05 +00:00
Ahmed Bougacha	9e71425f54	[AArch64] Set correct successors in CMPXCHG pseudo expansion. transferSuccessors() would LoadCmpBB a successor of DoneBB, whereas it should be a successor of the original MBB. Follow-up to r266339. Unfortunately, it's tricky to catch this in the verifier. llvm-svn: 267779	2016-04-27 20:33:02 +00:00
Gerolf Hoflehner	50426191d7	[DAGCombiner] Follow coding convention for function name (NFC) llvm-svn: 267745	2016-04-27 17:27:16 +00:00
Matthew Simpson	47bd3994b7	Add parentheses to silence buildbot warning llvm-svn: 267734	2016-04-27 16:25:04 +00:00
Matthew Simpson	e5dfb08fcb	[TTI] Add hook for vector extract with extension This change adds a new hook for estimating the cost of vector extracts followed by zero- and sign-extensions. The motivating example for this change is the SMOV and UMOV instructions on AArch64. These instructions move data from vector to general purpose registers while performing the corresponding extension (sign-extend for SMOV and zero-extend for UMOV) at the same time. For these operations, TargetTransformInfo can assume the extensions are free and only report the cost of the vector extract. The SLP vectorizer has been updated to make use of the new hook. Differential Revision: http://reviews.llvm.org/D18523 llvm-svn: 267725	2016-04-27 15:20:21 +00:00


1612 Commits