llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	e4d7ac4cb1	[InstCombine] Add test cases showing failures to handle commuted patterns after tricking the operand complexity sorting. llvm-svn: 301296	2017-04-25 06:22:17 +00:00
Craig Topper	c4b48a32f0	[InstCombine] Use commutable matchers to reduce some code. NFC llvm-svn: 301294	2017-04-25 06:02:11 +00:00
Gil Rapaport	860f0a2bad	[LV] Remove redundant basic block split This patch is part of D28975's breakdown. Genreating the control-flow to guard predicated instructions modified to only use SplitBlockAndInsertIfThen() for producing the if-then construct. Differential Revision: https://reviews.llvm.org/D32224 llvm-svn: 301293	2017-04-25 05:57:22 +00:00
Serge Guelton	376508ad8d	Update doc of the variadic version of getOrInsertFunction It no longer needs a null terminator. llvm-svn: 301292	2017-04-25 05:45:37 +00:00
Xinliang David Li	f12a0faf88	[CodeExtractor]: Fixup use refs of the old phi. Differential Revision: http://reviews.llvm.org/D32468 llvm-svn: 301291	2017-04-25 04:51:19 +00:00
Akira Hatanaka	490397fc08	[ObjCARC] Do not sink an objc_retain past a clang.arc.use. We need to do this to prevent a miscompile which sinks an objc_retain past an objc_release that releases the object objc_retain retains. This happens because the top-down and bottom-up traversals each determines the insert point for retain or release individually without knowing where the other instruction is moved. For example, when the following IR is fed to the ARC optimizer, the top-down traversal decides to insert objc_retain right before objc_release and the bottom-up traversal decides to insert objc_release right after clang.arc.use. (IR before ARC optimizer) %11 = call i8* @objc_retain(i8* %10) call void (...) @clang.arc.use(%0* %5) call void @llvm.dbg.value(...) call void @objc_release(i8* %6) This reverses the order of objc_release and objc_retain, which causes the object to be destructed prematurely. (IR after ARC optimizer) call void (...) @clang.arc.use(%0* %5) call void @objc_release(i8* %6) call void @llvm.dbg.value(...) %11 = call i8* @objc_retain(i8* %10) rdar://problem/30530580 llvm-svn: 301289	2017-04-25 04:06:35 +00:00
Davide Italiano	5b65f12bfa	[SimplifyLibCalls] Remove a cl::opt that's been `true` for a long time. llvm-svn: 301288	2017-04-25 03:48:47 +00:00
Sanjoy Das	bbebcb6c4d	Teach SCEV normalization to de/normalize non-affine add recs Summary: Before this change, SCEV Normalization would incorrectly normalize non-affine add recurrences. To work around this there was (still is) a check in place to make sure we only tried to normalize affine add recurrences. We recently found a bug in aforementioned check to bail out of normalizing non-affine add recurrences. However, instead of fixing the bailout, I have decided to teach SCEV normalization to work correctly with non-affine add recurrences, making the bailout unnecessary (I'll remove it in a subsequent change). I've also added some unit tests (which would have failed before this change). Reviewers: atrick, sunfish, efriedma Reviewed By: atrick Subscribers: mcrosier, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D32104 llvm-svn: 301281	2017-04-25 00:09:19 +00:00
Matt Arsenault	6d7f01e3d8	InferAddressSpaces: Use reference arguments instead of pointers llvm-svn: 301276	2017-04-24 23:42:41 +00:00
Eugene Zelenko	1df42fac54	[Object] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 301275	2017-04-24 23:21:38 +00:00
Matt Arsenault	e8d0539f20	InferAddressSpaces: Remove redundant assert This is just asserting all the operations are handled in the switch, which the unreachable already handles. llvm-svn: 301270	2017-04-24 23:02:57 +00:00
Sanjay Patel	6b01b4f5a6	[ARM, x86] add more vector tests for bool math; NFC I'm proposing a fold for increment-of-sexted-bool in: https://reviews.llvm.org/D31944 ...so we need to know what happens in more cases like these. llvm-svn: 301269	2017-04-24 22:42:34 +00:00
Reid Kleckner	df7263567a	[git-llvm] Remove CR from middle of svn propget output llvm-svn: 301268	2017-04-24 22:26:46 +00:00
Reid Kleckner	63b26f0eea	Make getSlotAttributes return an AttributeSet instead of a wrapper list Remove the temporary, poorly named getSlotSet method which did the same thing. Also remove getSlotNode, which is a hold-over from when we were dealing with AttributeSetNode* instead of AttributeSet. llvm-svn: 301267	2017-04-24 22:25:02 +00:00
Reid Kleckner	4534097b0b	[git-llvm] Make `push` work on CRLF files with svn:eol-style=native Summary: `git apply` on Windows doesn't work for files that SVN checks out as CRLF. There is no way to force SVN to check everything out with Unix line endings on Windows. Files with svn:eol-style=native will always come out with CRLF, breaking `git apply`, which wants Unix line endings. My workaround is to list all files with this property set in the change, and run `dos2unix` on them. SVN doesn't commit a massive line ending change because the svn:eol-style property indicates that these are text files. Tested on r301245. Reviewers: zturner, jlebar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32452 llvm-svn: 301262	2017-04-24 22:09:08 +00:00
Sanjay Patel	35c362ebbb	[InstSimplify] use ConstantRange to simplify more and-of-icmps We can simplify (and (icmp X, C1), (icmp X, C2)) to one of the icmps in many cases. I had to check some of these with Alive to prove to myself it's right, but everything seems to check out. Eg, the code in instcombine was completely ignoring predicates with mismatched signedness. Handling or-of-icmps would be a follow-up step. Differential Revision: https://reviews.llvm.org/D32143 llvm-svn: 301260	2017-04-24 21:52:39 +00:00
Simon Pilgrim	93da6660a2	[DAGCombiner] Use APInt::intersects to avoid tmp variable. NFCI. llvm-svn: 301258	2017-04-24 21:43:21 +00:00
Matt Arsenault	e22184940b	AMDGPU: Slightly simplify prolog reserved register handling Rely on MachineRegisterInfo's knowledge of used physical registers. Move flat_scratch initialization earlier, so the uses are visible when making these decisions. This will make it easier to add another reserved register at the end for the stack pointer rather than handling another special case. llvm-svn: 301254	2017-04-24 21:08:32 +00:00
Galina Kistanova	5fda6a90e0	Cosmetic change. llvm-svn: 301253	2017-04-24 21:06:29 +00:00
Saleem Abdulrasool	53972d60cb	ProfileData: clean up some stale declarations (NFC) These were removed in SVN r300381. Remove the declarations. llvm-svn: 301252	2017-04-24 21:05:05 +00:00
Galina Kistanova	c7524f05b2	Small addition on how to add a builder. llvm-svn: 301248	2017-04-24 20:48:40 +00:00
Artem Tamazov	d6656b945e	[AMDGPU][mc][tests][NFC] Bulk ISA tests: update for Gfx7/Gfx8, add for Gfx9. llvm-svn: 301247	2017-04-24 20:42:27 +00:00
Reid Kleckner	b4a2d18777	[Bitcode] Refactor attribute group writing to avoid getSlotAttributes Summary: That API creates a temporary AttributeList to carry an index and a single AttributeSet. We need to carry the index in addition to the set, because that is how attribute groups are currently encoded. NFC Reviewers: pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32262 llvm-svn: 301245	2017-04-24 20:38:30 +00:00
Teresa Johnson	b2c390e9f5	Update profile during memory instrinsic optimization Summary: Ensure that the new merge BB (which contains the rest of the original BB after the mem op being optimized) gets a profile frequency, in case there are additional mem ops later in the BB. Otherwise they get skipped as the merge BB looks cold. Reviewers: davidxl, xur Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32447 llvm-svn: 301244	2017-04-24 20:30:42 +00:00
Matt Arsenault	4474652c95	Revert "StructurizeCFG: Directly invert cmp instructions" This reverts commit r300732. This breaks a few tests. I think the problem is related to adding more uses of the condition that don't yet exist at this point. llvm-svn: 301242	2017-04-24 20:25:01 +00:00
Davide Italiano	ca81fbcadb	[LoopUnroll] Remove spurious newline. Eli pointed out in the review, but I didn't squash the two commits correctly. Pointy-hat to me. llvm-svn: 301241	2017-04-24 20:17:38 +00:00
Frederich Munch	fd96d5e1c9	Revert "Refactor DynamicLibrary so searching for a symbol will have a defined order" The i686-mingw32-RA-on-linux bot is still having errors. This reverts commit r301236. llvm-svn: 301240	2017-04-24 20:16:01 +00:00
Davide Italiano	0f62eea7ff	[LoopUnroll] Don't try to unroll non canonical loops. The current Loop Unroll implementation works with loops having a single latch that contains a conditional branch to a block outside the loop (the other successor is, by defition of latch, the header). If this precondition doesn't hold, avoid unrolling the loop as the code is not ready to handle such circumstances. Differential Revision: https://reviews.llvm.org/D32261 llvm-svn: 301239	2017-04-24 20:14:11 +00:00
Sanjoy Das	206f65c049	[LIR] Obey non-integral pointer semantics Summary: See http://llvm.org/docs/LangRef.html#non-integral-pointer-type Reviewers: haicheng Reviewed By: haicheng Subscribers: mcrosier, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D32196 llvm-svn: 301238	2017-04-24 20:12:10 +00:00
Saleem Abdulrasool	d056cb4b74	Avoid unnecessary copies in some for loops Use constant references rather than `const auto` which will cause the copy constructor. These particular cases cause issues for the swift compiler. llvm-svn: 301237	2017-04-24 20:01:03 +00:00
Frederich Munch	70c377a362	Refactor DynamicLibrary so searching for a symbol will have a defined order and libraries are properly unloaded when llvm_shutdown is called. Summary: This was mostly affecting usage of the JIT, where storing the library handles in a set made iteration unordered/undefined. This lead to disagreement between the JIT and native code as to what the address and implementation of particularly on Windows with stdlib functions: JIT: putenv_s("TEST", "VALUE") // called msvcrt.dll, putenv_s JIT: getenv("TEST") -> "VALUE" // called msvcrt.dll, getenv Native: getenv("TEST") -> NULL // called ucrt.dll, getenv Also fixed is the issue of DynamicLibrary::getPermanentLibrary(0,0) on Windows not giving priority to the process' symbols as it did on Unix. Reviewers: chapuni, v.g.vassilev, lhames Reviewed By: lhames Subscribers: danalbert, srhines, mgorny, vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D30107 llvm-svn: 301236	2017-04-24 19:55:16 +00:00
Krzysztof Parzyszek	c8e8e2a046	Move value type list from TargetRegisterClass to TargetRegisterInfo Differential Revision: https://reviews.llvm.org/D31937 llvm-svn: 301234	2017-04-24 19:51:12 +00:00
Krzysztof Parzyszek	98ab4c64c4	Revert r301231: Accidentally committed stale files I forgot to commit local changes before commit. llvm-svn: 301232	2017-04-24 19:48:51 +00:00
Krzysztof Parzyszek	c0197066d7	Move value type list from TargetRegisterClass to TargetRegisterInfo Differential Revision: https://reviews.llvm.org/D31937 llvm-svn: 301231	2017-04-24 19:43:45 +00:00
Matt Arsenault	0774ea267a	AMDGPU: Select scratch mubuf offsets when pointer is a constant In call sequence setups, there may not be a frame index base and the pointer is a constant offset from the frame pointer / scratch wave offset register. llvm-svn: 301230	2017-04-24 19:40:59 +00:00
Matt Arsenault	df6539f44b	AMDGPU: Set StackGrowsUp in MCAsmInfo Not sure what this does though. llvm-svn: 301229	2017-04-24 19:40:51 +00:00
Stanislav Mekhanoshin	bd5394be3d	[AMDGPU] Merge M0 initializations Merges equivalent initializations of M0 and hoists them into a common dominator block. Technically the same code can be used with any register, physical or virtual. Differential Revision: https://reviews.llvm.org/D32279 llvm-svn: 301228	2017-04-24 19:37:54 +00:00
Piotr Padlewski	610c966a4e	Handle invariant.group.barrier in BasicAA Summary: llvm.invariant.group.barrier returns pointer that mustalias pointer it takes. It can't be marked with `returned` attribute, because it would be remove easily. The other reason is that only Alias Analysis can know about this, because if any other pass would know it, then the result would be replaced with it's argument, which would be invalid. We can think about returned pointer as something that mustalias, but it doesn't have to be bitwise the same as the argument. Reviewers: dberlin, chandlerc, hfinkel, sanjoy Subscribers: reames, nlewycky, rsmith, anna, amharc Differential Revision: https://reviews.llvm.org/D31585 llvm-svn: 301227	2017-04-24 19:37:17 +00:00
Evgeniy Stepanov	9e536081fe	[asan] Let the frontend disable gc-sections optimization for asan globals. Also extend -asan-globals-live-support flag to all binary formats. llvm-svn: 301226	2017-04-24 19:34:13 +00:00
Mandeep Singh Grang	799a2edb3d	[SimplifyCFG] Fix for non-determinism in codegen Summary: This patch fixes issues in codegen uncovered due to https://reviews.llvm.org/D26718 Reviewers: majnemer, chenli, davide Reviewed By: davide Subscribers: davide, arsenm, llvm-commits Differential Revision: https://reviews.llvm.org/D26726 llvm-svn: 301222	2017-04-24 19:20:45 +00:00
Krzysztof Parzyszek	44e25f37ae	Move size and alignment information of regclass to TargetRegisterInfo 1. RegisterClass::getSize() is split into two functions: - TargetRegisterInfo::getRegSizeInBits(const TargetRegisterClass &RC) const; - TargetRegisterInfo::getSpillSize(const TargetRegisterClass &RC) const; 2. RegisterClass::getAlignment() is replaced by: - TargetRegisterInfo::getSpillAlignment(const TargetRegisterClass &RC) const; This will allow making those values depend on subtarget features in the future. Differential Revision: https://reviews.llvm.org/D31783 llvm-svn: 301221	2017-04-24 18:55:33 +00:00
Dimitry Andric	49e033f41d	Don't test setting sticky bits on files for modern BSDs Summary: In rL297945, jhenderson added methods for setting permissions to sys::fs, but some of the unittests that attempt to set sticky bits (01000) on files fail on modern BSDs, such as FreeBSD, NetBSD and OpenBSD. This is because those systems do not allow regular users to set sticky bits on files, only on directories. Fix it by disabling these particular tests on modern BSDs. Reviewers: emaste, brad, jhenderson Reviewed By: jhenderson Subscribers: joerg, krytarowski, llvm-commits Differential Revision: https://reviews.llvm.org/D32120 llvm-svn: 301220	2017-04-24 18:54:48 +00:00
Adrian Prantl	083e6a5b5c	Don't emit CFI instructions at the end of a function When functions are terminated by unreachable instructions, the last instruction might trigger a CFI instruction to be generated. However, emitting it would be be illegal since the function (and thus the FDE the CFI is in) has already ended with the previous instruction. Darwin's dwarfdump --verify --eh-frame complains about this and the specification supports this. Relevant bits from the DWARF 5 standard (6.4 Call Frame Information): "[The] address_range [field in an FDE]: The number of bytes of program instructions described by this entry." "Row creation instructions: [...] The new location value is always greater than the current one." The first quotation implies that a CFI cannot describe a target address outside of the enclosing FDE's range. rdar://problem/26244988 Differential Revision: https://reviews.llvm.org/D32246 llvm-svn: 301219	2017-04-24 18:45:59 +00:00
George Karpenkov	0d447d514a	Updates documentation for a syntax sugar libfuzzer flag, as implemented in https://reviews.llvm.org/D32193 llvm-svn: 301217	2017-04-24 18:39:52 +00:00
Yaxun Liu	fd23a0c095	CodeGen: Add a hook for getFenceOperandTy Currently the operand type for ATOMIC_FENCE assumes value type of a pointer in address space 0. This is fine for most targets. However for amdgcn target, the size of pointer in address space 0 depends on triple environment. For amdgiz environment, it is 64 bit but for other environment it is 32 bit. On the other hand, amdgcn target expects 32 bit fence operands independent of the target triple environment. Therefore a hook is need in target lowering for getting the fence operand type. This patch has no effect on targets other than amdgcn. Differential Revision: https://reviews.llvm.org/D32186 llvm-svn: 301215	2017-04-24 18:26:27 +00:00
Evgeniy Stepanov	58ccc0949a	Revert "Compute safety information in a much finer granularity." Use-after-free in llvm::isGuaranteedToExecute. llvm-svn: 301214	2017-04-24 18:25:07 +00:00
Sanjay Patel	0889225f51	[InstSimplify] move (A & ~B) \| (A ^ B) -> (A ^ B) from InstCombine This is a straight cut and paste, but there's a bigger problem: if this fold exists for simplifyOr, there should be a DeMorganized version for simplifyAnd. But more than that, we have a patchwork of ad hoc logic optimizations in InstCombine. There should be some structure to ensure that we're not missing sibling folds across and/or/xor. llvm-svn: 301213	2017-04-24 18:24:36 +00:00
Matthias Braun	f9796b76e9	X86RegisterInfo: eliminateFrameIndex: Avoid code duplication; NFC Re-Commit of r300922 and r300923 with less aggressive assert (see discussion at the end of https://reviews.llvm.org/D32205) X86RegisterInfo::eliminateFrameIndex() and X86FrameLowering::getFrameIndexReference() both had logic to compute the base register. This consolidates the code. Also use MachineInstr::isReturn instead of manually enumerating tail call instructions (return instructions were not included in the previous list because they never reference frame indexes). Differential Revision: https://reviews.llvm.org/D32206 llvm-svn: 301211	2017-04-24 18:15:00 +00:00
Adrian Prantl	f2c7997013	Use DW_OP_stack_value when reconstructing variable values with arithmetic. When the location description of a source variable involves arithmetic on the value itself, it needs to be marked with DW_OP_stack_value since it is not describing the variable's location, but rather its value. This is a follow-up to r297971 and fixes the source testcase quoted in the comment in debuginfo-dce.ll. rdar://problem/30725338 This reapplies r301093 without modifications. llvm-svn: 301210	2017-04-24 18:11:42 +00:00
Adrian Prantl	283833d022	Add a testcase for DIExpression(DW_OP_stack_value) and relax the assertion that prohibited its emission. This fixes the assertion failure uncovered by r301093. llvm-svn: 301209	2017-04-24 18:11:38 +00:00
Matt Arsenault	1c0ae3972f	AMDGPU: Add StackPtr and FramePtr registers to MFI These will be necessary for setting up call sequences. llvm-svn: 301208	2017-04-24 18:05:16 +00:00
Matt Arsenault	3e02538a02	AMDGPU: Move trap lowering to DAG Fixes traps in any block besides the entry block, and fixes depending on a live-in physical register by using a virtual register copy. Also happens to stop emitting a nop in the case debug trap is not supported. llvm-svn: 301206	2017-04-24 17:49:13 +00:00
Davide Italiano	ebd77645cc	[DomPrinter] Add a way to programmatically dump a dot representation. Differential Revision: https://reviews.llvm.org/D32145 llvm-svn: 301205	2017-04-24 17:48:44 +00:00
Zachary Turner	da949c1804	[llvm-pdbdump] Merge functionality of graphical and text dumpers. The real difference between these two was that a) The "graphical" dumper could recurse, while the text one could not. b) The "text" dumper could display nested types and functions, while the graphical one could not. Merge these two so that there is only one dumper that can recurse arbitrarily deep and optionally display nested types or not. llvm-svn: 301204	2017-04-24 17:47:52 +00:00
Zachary Turner	1690164cac	[llvm-pdbdump] Re-write the record layout code to be more resilient. This reworks the way virtual bases are handled, and also the way padding is detected across multiple levels of aggregates, producing a much more accurate result. llvm-svn: 301203	2017-04-24 17:47:24 +00:00
Craig Topper	1dec281104	[APInt] Simplify the zext and sext methods This replaces a hand written copy loop with a call to memcpy for both zext and sext. For sext, it replaces multiple if/else blocks propagating sign information forward. Now we just do a copy, a sign extension on the last copied word, a memset, and clearUnusedBits. Differential Revision: https://reviews.llvm.org/D32417 llvm-svn: 301201	2017-04-24 17:37:10 +00:00
George Karpenkov	0ab4f06bf1	Testing commit credentials llvm-svn: 301200	2017-04-24 17:28:32 +00:00
Matt Arsenault	02907f3039	InstCombine: Fix assert when reassociating fsub with undef There is logic to track the expected number of instructions produced. It thought in this case an instruction would be necessary to negate the result, but here it folded into a ConstantExpr fneg when the non-undef value operand was cancelled out by the second fsub. I'm not sure why we don't fold constant FP ops with undef currently, but I think that would also avoid this problem. llvm-svn: 301199	2017-04-24 17:24:37 +00:00
Craig Topper	8b37326ae2	[APInt] Add ashrInPlace method and rewrite ashr to make a copy and then call ashrInPlace. This patch adds an in place version of ashr to match lshr and shl which were recently added. I've tried to make this similar to the lshr code with additions to handle the sign extension. I've also tried to do this with less if checks than the current ashr code by sign extending the original result to a word boundary before doing any of the shifting. This removes a lot of the complexity of determining where to fill in sign bits after the shifting. Differential Revision: https://reviews.llvm.org/D32415 llvm-svn: 301198	2017-04-24 17:18:47 +00:00
Nicolai Haehnle	5dea645138	AMDGPU: Move v_readlane lane select from VGPR to SGPR Summary: Fix a compiler bug when the lane select happens to end up in a VGPR. Clarify the semantic of the corresponding intrinsic to be that of the corresponding GLSL: the lane select must be uniform across a wave front, otherwise results are undefined. Reviewers: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D32343 llvm-svn: 301197	2017-04-24 17:17:36 +00:00
Xin Tong	a266923d57	Compute safety information in a much finer granularity. Summary: Instead of keeping a variable indicating whether there are early exits in the loop. We keep all the early exits. This improves LICM's ability to move instructions out of the loop based on is-guaranteed-to-execute. I am going to update compilation time as well soon. Reviewers: hfinkel, sanjoy, efriedma, mkuper Reviewed By: hfinkel Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D32433 llvm-svn: 301196	2017-04-24 17:12:22 +00:00
Nicolai Haehnle	9c66185315	InstCombine/AMDGPU: Fix constant folding of llvm.amdgcn.{icmp,fcmp} Summary: The return value of these intrinsics should always have 0 bits for inactive threads. This means that when all arguments are constant and the comparison evaluates to true, the intrinsic should return the current exec mask. Fixes some GL_ARB_shader_ballot tests. Reviewers: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D32344 llvm-svn: 301195	2017-04-24 17:08:43 +00:00
Igor Breger	87aafa073f	[GlobalISel][X86] Lower FormalArgument/Ret using G_MERGE_VALUES/G_UNMERGE_VALUES. Summary: [GlobalISel][X86] Lower FormalArgument/Ret using G_MERGE_VALUES/G_UNMERGE_VALUES. Reviewers: zvi, t.p.northover, guyblank Reviewed By: t.p.northover Subscribers: dberris, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D32288 llvm-svn: 301194	2017-04-24 17:05:52 +00:00
Simon Pilgrim	f60f57e6e8	[DAGCombiner] Updated bswap byte offset variable names to be more descriptive. NFC As discussed on D32039, use MaskByteOffset to describe the variable and also pull out repeated getOpcode() calls. llvm-svn: 301193	2017-04-24 17:05:14 +00:00
Craig Topper	c6b05684c6	[APInt] Fix repeated word in comments. NFC llvm-svn: 301192	2017-04-24 17:00:22 +00:00
Nicolai Haehnle	ef449787d8	AMDGPU: Fix crash when scheduling non-memory SMRD instructions Summary: Fixes piglit spec/arb_shader_clock/execution/* Reviewers: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D32345 llvm-svn: 301191	2017-04-24 16:53:52 +00:00
Nirav Dave	c799f3a809	[SDAG] Teach Chain Analysis about BaseIndexOffset addressing. While we use BaseIndexOffset in FindBetterNeighborChains to appropriately realize they're almost the same address and should be improved concurrently we do not use it in isAlias using the non-index understanding FindBaseOffset instead. Adding a BaseIndexOffset check in isAlias like should allow indexed stores to be merged. FindBaseOffset to be excised in subsequent patch. Reviewers: jyknight, aditya_nandakumar, bogner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31987 llvm-svn: 301187	2017-04-24 15:37:20 +00:00
Simon Pilgrim	9111cd950d	[X86][AVX] Add scheduling latency/throughput tests for missing AVX1 instructions Had to split btver2/znver1 checks as only btver2 suppresses zeroupper llvm-svn: 301181	2017-04-24 14:26:30 +00:00
Jonas Paulsson	1e8648577c	[SystemZ] Update kill-flag in splitMove(). EarlierMI needs to clear the kill flag on the first operand in case of a store. Review: Ulrich Weigand llvm-svn: 301177	2017-04-24 12:40:28 +00:00
Renato Golin	54c736f833	[DWARF] Move test to x86 directory llvm-svn: 301176	2017-04-24 12:37:11 +00:00
Philip Pfaffe	f1200648bd	[RegionInfo] Fix dangling references created by moving RegionInfo objects Summary: Region objects capture the address of the creating RegionInfo instance. Because the RegionInfo class is movable, moving a RegionInfo object creates dangling references. This patch fixes these references by walking the Regions post-move, and updating references to the new parent. Reviewers: Meinersbur, grosser Reviewed By: Meinersbur, grosser Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31719 llvm-svn: 301175	2017-04-24 11:54:37 +00:00
Ismail Donmez	6dda31729c	Add SUSE vendor Summary: SUSE's ARM triples end with -gnueabi even though they are hard-float. This requires special handling of SUSE ARM triples. Hence we need a way to differentiate the SUSE as vendor. This CL adds that. Reviewers: chandlerc, compnerd, echristo, rengolin Reviewed By: rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D32426 llvm-svn: 301174	2017-04-24 11:18:29 +00:00
Nitesh Jain	0032fae179	[LLVM][MIPS] Fix different definition of off_t in LLDB and LLVM. Reviewers: beanz Subscribers: jaydeep, bhushan, lldb-commits, slthakur, llvm-commits, krytarowski, emaste Differential Revision: https://reviews.llvm.org/D32125 llvm-svn: 301171	2017-04-24 10:36:46 +00:00
George Rimar	ca53211beb	[DWARF] - Take relocations in account when extracting ranges from .debug_ranges I found this when investigated "Bug 32319 - .gdb_index is broken/incomplete" for LLD. When we have object file with .debug_ranges section it may be filled with zeroes. Relocations are exist in file to relocate this zeroes into real values later, but until that a pair of zeroes is treated as terminator. And DWARF parser thinks there is no ranges at all when I am trying to collect address ranges for building .gdb_index. Solution implemented in this patch is to take relocations in account when parsing ranges. Differential revision: https://reviews.llvm.org/D32228 llvm-svn: 301170	2017-04-24 10:19:45 +00:00
Diana Picus	f53865daa4	[ARM] GlobalISel: Legalize s8 and s16 G_(S\|U)DIV We have to widen the operands to 32 bits and then we can either use hardware division if it is available or lower to a libcall otherwise. At the moment it is not enough to set the Legalizer action to WidenScalar, since for libcalls it won't know what to do (it won't be able to find what size to widen to, because it will find Libcall and not Legal for 32 bits). To hack around this limitation, we request Custom lowering, and as part of that we widen first and then we run another legalizeInstrStep on the widened DIV. llvm-svn: 301166	2017-04-24 09:12:19 +00:00
Sjoerd Meijer	e5b8557d5b	[Arch64AsmParser] better diagnostic for isb Instruction isb takes as an operand either 'sy' or an immediate value. This improves the diagnostic when the string is not 'sy' and adds a test case for this which was missing. This also adds tests to check invalid inputs for dsb and dmb. Differential Revision: https://reviews.llvm.org/D32227 llvm-svn: 301165	2017-04-24 08:22:20 +00:00
Diana Picus	b70e88bdec	[ARM] GlobalISel: Support G_(S\|U)DIV for s32 Add support for both targets with hardware division and without. For hardware division we have to add support throughout the pipeline (legalizer, reg bank select, instruction select). For targets without hardware division, we only need to mark it as a libcall. llvm-svn: 301164	2017-04-24 08:20:05 +00:00
Diana Picus	e97822e1b7	[GlobalISel] Legalize G_(S\|U)DIV libcalls Treat them the same as the other binary operations that we have so far, but on integers rather than floating point types. Extract the common code into a helper. This will be used in the ARM backend. llvm-svn: 301163	2017-04-24 07:22:31 +00:00
Diana Picus	95a8aa93e2	[ARM] GlobalISel: Select G_CONSTANT with CImm operands When selecting a G_CONSTANT to a MOVi, we need the value to be an Imm operand. We used to just leave the G_CONSTANT operand unchanged, which works in some cases (such as the GEP offsets that we create when referring to stack slots). However, in many other places the G_CONSTANTs are created with CImm operands. This patch makes sure to handle those as well, and to error out gracefully if in the end we don't end up with an Imm operand. Thanks to Oliver Stannard for reporting this issue. llvm-svn: 301162	2017-04-24 06:30:56 +00:00
Dean Michael Berris	01b880a954	[XRay][tools] Fixup for pedantic and permissive errors/warnings Remove extraneous semicolons and fully qualify the Trace type. Follow-up to D29320. llvm-svn: 301161	2017-04-24 06:15:53 +00:00
Dean Michael Berris	ca780b5a27	[XRay] A tool for Comparing xray function call graphs Summary: This is a tool for comparing the function graphs produced by the llvm-xray graph too. It takes the form of a new subcommand of the llvm-xray tool 'graph-diff'. This initial version of the patch is very rough, but it is close to feature complete. Depends on D29363 Reviewers: dblaikie, dberris Reviewed By: dberris Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D29320 llvm-svn: 301160	2017-04-24 05:54:33 +00:00
Craig Topper	fc03d2d21f	[APInt] Make behavior of ashr by BitWidth consistent between single and multi word. Previously single word would always return 0 regardless of the original sign. Multi word would return all 0s or all 1s based on the original sign. Now single word takes into account the sign as well. llvm-svn: 301159	2017-04-24 05:38:26 +00:00
Frederich Munch	b8c236a6e4	Revert "Refactor DynamicLibrary so searching for a symbol will have a defined order.” The changes are causing the i686-mingw32 build to fail. This reverts commit r301153, and the changes for a separate warning on i686-mingw32 in r301155 and r301156. llvm-svn: 301157	2017-04-24 03:33:30 +00:00
Frederich Munch	799259f320	Fix warning converting from boolean to pointer introduced in r301153. This reverts commit r301155, which was incorrect. llvm-svn: 301156	2017-04-24 03:12:16 +00:00
Frederich Munch	c152a96350	Fix warning converting from void* to boolean introduced in r301153. llvm-svn: 301155	2017-04-24 02:51:40 +00:00
Sanjoy Das	0cdcdf018e	Revert "[SCEV] Enable SCEV verification by default in EXPENSIVE_CHECKS builds" This reverts commit r301150. It breaks CodeGen/Hexagon/hwloop-wrap2.ll, reverting while I investigate. llvm-svn: 301154	2017-04-24 02:35:19 +00:00
Frederich Munch	9f40457d61	Refactor DynamicLibrary so searching for a symbol will have a defined order and libraries are properly unloaded when llvm_shutdown is called. Summary: This was mostly affecting usage of the JIT, where storing the library handles in a set made iteration unordered/undefined. This lead to disagreement between the JIT and native code as to what the address and implementation of particularly on Windows with stdlib functions: JIT: putenv_s("TEST", "VALUE") // called msvcrt.dll, putenv_s JIT: getenv("TEST") -> "VALUE" // called msvcrt.dll, getenv Native: getenv("TEST") -> NULL // called ucrt.dll, getenv Also fixed is the issue of DynamicLibrary::getPermanentLibrary(0,0) on Windows not giving priority to the process' symbols as it did on Unix. Reviewers: chapuni, v.g.vassilev, lhames Reviewed By: lhames Subscribers: danalbert, srhines, mgorny, vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D30107 llvm-svn: 301153	2017-04-24 02:30:12 +00:00
Lang Hames	fe3c21c879	[Orc] Fix a warning by removing an unused lambda capture. llvm-svn: 301152	2017-04-24 01:21:23 +00:00
Sanjoy Das	25972aa82e	Fix unused variables / fields warnings in release builds llvm-svn: 301151	2017-04-24 00:46:40 +00:00
Sanjoy Das	8919303b0a	[SCEV] Enable SCEV verification by default in EXPENSIVE_CHECKS builds llvm-svn: 301150	2017-04-24 00:41:58 +00:00
Sanjoy Das	bdbc4938f9	[SCEV] Fix exponential time complexity by caching llvm-svn: 301149	2017-04-24 00:09:46 +00:00
Xinliang David Li	db8d09b6c2	[PartialInine]: add triaging options There are more bugs (runtime failures) triggered when partial inlining is turned on. Add options to help triaging problems. llvm-svn: 301148	2017-04-23 23:39:04 +00:00
Lang Hames	70eccdc727	[Orc] Use recursive mutexes for Error serialization. Errors can be nested, so we need recursive locking for serialization / deserialization. llvm-svn: 301147	2017-04-23 23:36:13 +00:00
Sanjoy Das	148e49f3c8	[SCEV] Move towards a verifier without false positives This change reboots SCEV's current (off by default) verification logic to avoid false failures. Instead of stringifying trip counts, it maps old and new trip counts to the same ScalarEvolution "universe" and asks ScalarEvolution to compute the difference between them. If the difference comes out to be a non-zero constant, then (barring some corner cases) we know we messed up. I've not yet enabled this by default since it hits an exponential time issue in SCEV, but once I fix that, I'll flip it on by default in EXPENSIVE_CHECKS builds. llvm-svn: 301146	2017-04-23 23:04:45 +00:00
Simon Pilgrim	12df01c3c7	[X86][AVX] Add scheduling latency/throughput tests for some AVX1 instructions More instructions will be added in future commits llvm-svn: 301145	2017-04-23 22:08:17 +00:00
Sanjay Patel	e0c26e0640	[InstCombine] add/move folds for [not]-xor We handled all of the commuted variants for plain xor already, although they were scattered around and sometimes folded less efficiently using distributive laws. We had no folds for not-xor. Handling all of these patterns consistently is part of trying to reinstate: https://reviews.llvm.org/rL300977 llvm-svn: 301144	2017-04-23 22:00:02 +00:00
Xinliang David Li	15744ad87b	[PartialInlining] Add optimization remark support Differential Revision: http://reviews.llvm.org/D32387 llvm-svn: 301143	2017-04-23 21:40:58 +00:00
Simon Pilgrim	06d6263309	[X86][SSE] Add scheduler class support for SSE42 (PCMPGT) instructions llvm-svn: 301142	2017-04-23 21:23:27 +00:00
Simon Pilgrim	7d71ed503d	[X86][SSE] Add scheduling latency/throughput tests for (most) SSE42 instructions llvm-svn: 301141	2017-04-23 21:00:25 +00:00
Sanjay Patel	afa371fd1d	[InstCombine] add tests for not-xor and remove redundant tests; NFC llvm-svn: 301140	2017-04-23 20:59:00 +00:00
Xin Tong	f98602a1ab	[JumpThread] We want to fold (not thread) when all predecessor go to single BB's successor. Summary: In case all predecessor go to a single successor of current BB. We want to fold (not thread). I failed to update the phi nodes properly in the last patch https://reviews.llvm.org/rL300657. Phi nodes values are per predecessor in LLVM. Reviewers: sanjoy Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32400 llvm-svn: 301139	2017-04-23 20:56:29 +00:00
Simon Pilgrim	19a173ac23	[X86][SSE] Add scheduling latency/throughput tests for (most) SSE41 instructions llvm-svn: 301137	2017-04-23 20:05:21 +00:00
Simon Pilgrim	57fea6879b	[X86][SSE] Add missing scheduling latency/throughput test for PINSRW llvm-svn: 301136	2017-04-23 19:56:49 +00:00
Xin Tong	b7b081262a	Correct grammar. NFC llvm-svn: 301135	2017-04-23 17:36:25 +00:00
Craig Topper	4e590e59a0	[APInt] Make clearUnusedBits branch free. This makes the WordBits calculation calculate a value between 1 and 64 for the number of bits in the last word. Previously if the BitWidth was a multiple of 64 bits the WordBits value was 0 and we had to bail out early to avoid an undefined shift. Now with a value of 64 we no longer have an undefined shift issue. This shows a 15-16k reduction in the size of the opt binary on my local x86-64 build. llvm-svn: 301134	2017-04-23 17:16:26 +00:00
Craig Topper	652ca99622	[APInt] In sext single word case, use SignExtend64 and let the APInt constructor mask off any excess bits. The current code is trying to be clever with shifts to avoid needing to clear unused bits. But it looks like the compiler is unable to optimize out the unused bit handling in the APInt constructor. Given this its better to just use SignExtend64 and have more readable code. llvm-svn: 301133	2017-04-23 17:16:24 +00:00
Sanjay Patel	42a84ac710	[InstCombine] add tests for or-to-xor; NFC llvm-svn: 301131	2017-04-23 16:37:36 +00:00
Sanjay Patel	d13b0bfdac	[InstCombine] add pattern matches for commuted variants of xor-to-xor There's probably some better way to write this that eliminates the code duplication without hurting readability, but at least this eliminates the logic holes and is hopefully slightly more efficient than creating new instructions. llvm-svn: 301129	2017-04-23 16:03:00 +00:00
Sanjay Patel	9081808521	[InstCombine] add tests for xor-to-xor; NFC Besides missing 2 commuted patterns, the way we handle these folds is inefficient. llvm-svn: 301128	2017-04-23 14:51:03 +00:00
Simon Pilgrim	c781c0f630	[X86][SSE] Add scheduling latency/throughput tests for SSSE3 instructions llvm-svn: 301127	2017-04-23 14:01:55 +00:00
Simon Pilgrim	e8f8422fe5	[X86][SSE] Add scheduling latency/throughput tests for SSE3 instructions llvm-svn: 301126	2017-04-23 13:59:29 +00:00
Sanjay Patel	794c34dc35	[InstCombine] add tests for add-to-xor commuted variants; NFC 1 out of the 4 tests commuted the operands, so there's an asymmetry somewhere under this in how we handle these transforms. llvm-svn: 301125	2017-04-23 13:37:05 +00:00
Renato Golin	4abfb3d741	Revert "[APInt] Fix a few places that use APInt::getRawData to operate within the normal API." This reverts commit r301105, 4, 3 and 1, as a follow up of the previous revert, which broke even more bots. For reference: Revert "[APInt] Use operator<<= where possible. NFC" Revert "[APInt] Use operator<<= instead of shl where possible. NFC" Revert "[APInt] Use ashInPlace where possible." PR32754. llvm-svn: 301111	2017-04-23 12:15:30 +00:00
Renato Golin	cc4a9120f6	Revert "[APInt] Add ashrInPlace method and implement ashr using it. Also fix a bug in the shift by BitWidth handling." This reverts commit r301094, as it broke all ARM self-hosting bots. PR32754. llvm-svn: 301110	2017-04-23 12:02:07 +00:00
Ayman Musa	137c44fe64	[X86][MPX] Add load & store instructions of bnd values to getLoadStoreRegOpcode function. This is needed for a follow up patch that generates the memory folding tables. Differential Revision: https://reviews.llvm.org/D32232 llvm-svn: 301109	2017-04-23 08:28:42 +00:00
Ayman Musa	544988f34d	[X86] Convert test checks to generated checks of update_llc_test_checks.py. NFC llvm-svn: 301107	2017-04-23 07:41:40 +00:00
Artyom Skrobov	53cf1897cc	[ARM] ScheduleDAGRRList::DelayForLiveRegsBottomUp must consider OptionalDefs Summary: D30400 has enabled tADC and tSBC instructions to be unglued, thereby allowing CPSR to remain live between Thumb1 scheduling units. Most Thumb1 instructions have an OptionalDef for CPSR; but the scheduler ignored the OptionalDefs, and could unwittingly insert a flag-setting instruction in between an ADDS and the corresponding ADC. Reviewers: javed.absar, atrick, MatzeB, t.p.northover, jmolloy, rengolin Reviewed By: javed.absar Subscribers: rogfer01, efriedma, aemerson, rengolin, llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D31081 llvm-svn: 301106	2017-04-23 06:58:08 +00:00
Craig Topper	474e5de72d	[APInt] Fix a few places that use APInt::getRawData to operate within the normal API. getRawData exposes the internal type of the APInt class directly to its users. Ideally we wouldn't expose such an implementation detail. This patch fixes a few of the easy cases by using truncate, extract, or a rotate. llvm-svn: 301105	2017-04-23 06:41:11 +00:00
Craig Topper	cdd5ae6676	[APInt] Use operator<<= where possible. NFC llvm-svn: 301104	2017-04-23 05:43:02 +00:00
Craig Topper	5f68af0806	[APInt] Use operator<<= instead of shl where possible. NFC llvm-svn: 301103	2017-04-23 05:18:31 +00:00
Davide Italiano	5da7090256	[ThinLTO/Summary] Rename anonymous globals as last action ... ... in the per-TU -O0 pipeline. The problem is that there could be passes registered using `addExtensionsToPM()` introducing unnamed globals. Asan is an example, but there may be others. Building cppcheck with `-flto=thin` and `-fsanitize=address` triggers an assertion while we're reading bitcode (in lib/LTO), as the BitcodeReader assumes there are no unnamed globals (because the namer has run). Unfortunately I wasn't able to find an easy way to test this. I added a comment in the hope nobody moves this again. llvm-svn: 301102	2017-04-23 04:49:34 +00:00
Craig Topper	ae9672c96d	[APInt] Use ashInPlace where possible. llvm-svn: 301101	2017-04-23 03:45:59 +00:00
Adrian Prantl	4677205010	Revert "Use DW_OP_stack_value when reconstructing variable values with arithmetic." This reverts commit r301093 while investigating stage2 bot breakage. llvm-svn: 301099	2017-04-23 00:44:40 +00:00
Jonathan Roelofs	1233fe5ac3	Fix testcase: s/CHECKNEXT/CHECK-NEXT/ llvm-svn: 301098	2017-04-22 23:43:44 +00:00
Sanjay Patel	ceff20fe50	[InstCombine] clean up tests and regenerate checks; NFC llvm-svn: 301097	2017-04-22 23:36:47 +00:00
Craig Topper	26af2a993a	[APInt] Add ashrInPlace method and implement ashr using it. Also fix a bug in the shift by BitWidth handling. For single word, shift by BitWidth was always returning 0, but for multiword it was based on original sign. Now single word matches multi word. llvm-svn: 301094	2017-04-22 22:00:03 +00:00
Adrian Prantl	a2d25ac14a	Use DW_OP_stack_value when reconstructing variable values with arithmetic. When the location description of a source variable involves arithmetic on the value itself, it needs to be marked with DW_OP_stack_value since it is not describing the variable's location, but rather its value. This is a follow-up to r297971 and fixes the source testcase quoted in the comment in debuginfo-dce.ll. rdar://problem/30725338 llvm-svn: 301093	2017-04-22 20:54:06 +00:00
Simon Pilgrim	f27a714a9e	[X86] Regenerate TLS tests Use the correct check prefix for X86/X32/X64 target types. llvm-svn: 301092	2017-04-22 20:13:58 +00:00
Craig Topper	3a29e3b8e7	[APInt] Remove unnecessary min with BitWidth from countTrailingOnesSlowCase. The unused upper bits are guaranteed to be 0 so we don't need to worry about accidentally counting them. llvm-svn: 301091	2017-04-22 19:59:11 +00:00
Xinliang David Li	016a82ba51	[PartialInlining] Using existing hasAddressTaken interface to legality check/NFC llvm-svn: 301090	2017-04-22 19:24:19 +00:00
Sanjay Patel	3b863f8a1e	[InstCombine] use 'match' to reduce code; NFCI The later uses of dyn_castNotVal in this block are either incomplete (doesn't handle vector constants) or overstepping (shouldn't handle constants at all), but this first use is just unnecessary. 'I' is obviously not a constant, and it can't be a not-of-a-not because that would already be instsimplified. llvm-svn: 301088	2017-04-22 18:05:35 +00:00
Kamil Rytarowski	fc32c3a2c5	Update documentation for the NetBSD target LLVM is known to work on NetBSD x86 32-bit and 64-bit. llvm-svn: 301081	2017-04-22 16:11:23 +00:00
Daniel Sanders	658541fe69	[globalisel][tablegen] Add support for RegisterOperand. Summary: It functions just like RegisterClass except that the class is obtained from a field. Depends on D31761. Reviewers: ab, qcolombet, t.p.northover, rovka, kristof.beyls, aditya_nandakumar Reviewed By: ab Subscribers: dberris, llvm-commits, igorb Differential Revision: https://reviews.llvm.org/D32229 llvm-svn: 301080	2017-04-22 15:53:21 +00:00
Daniel Sanders	2deea1878e	[globalisel][tablegen] Revise API for ComplexPattern operands to improve flexibility. Summary: Some targets need to be able to do more complex rendering than just adding an operand or two to an instruction. For example, it may need to insert an instruction to extract a subreg first, or it may need to perform an operation on the operand. In SelectionDAG, targets would create SDNode's to achieve the desired effect during the complex pattern predicate. This worked because SelectionDAG had a form of garbage collection that would take care of SDNode's that were created but not used due to a later predicate rejecting a match. This doesn't translate well to GlobalISel and the churn was wasteful. The API changes in this patch enable GlobalISel to accomplish the same thing without the waste. The API is now: InstructionSelector::OptionalComplexRendererFn selectArithImmed(MachineOperand &Root) const; where Root is the root of the match. The return value can be omitted to indicate that the predicate failed to match, or a function with the signature ComplexRendererFn can be returned. For example: return OptionalComplexRendererFn( [=](MachineInstrBuilder &MIB) { MIB.addImm(Immed).addImm(ShVal); }); adds two immediate operands to the rendered instruction. Immed and ShVal are captured from the predicate function. As an added bonus, this also reduces the amount of information we need to provide to GIComplexOperandMatcher. Depends on D31418 Reviewers: aditya_nandakumar, t.p.northover, qcolombet, rovka, ab, javed.absar Reviewed By: ab Subscribers: dberris, kristof.beyls, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D31761 llvm-svn: 301079	2017-04-22 15:11:04 +00:00
Daniel Sanders	3016d3c6c9	[globalisel][tablegen] Fix PR32733 by checking which instruction operands belong to. canMutate() was returning true when the operands were all in the same order as the matched instruction. However, it wasn't checking the operands were actually on that instruction. This worked when we could only match a single instruction but the addition of nested instruction matching led to cases where the operands could be split across multiple instructions. canMutate() now returns false if operands belong to instructions other than the root of the match. llvm-svn: 301077	2017-04-22 14:31:28 +00:00
David Blaikie	5477b97d45	Fix test to handle .rel and .rela sections (& to actually specify the target architecture as X86) llvm-svn: 301073	2017-04-22 08:17:39 +00:00
David Blaikie	85366acf15	Avoid using relocations for ref_addr in .dwo files In dwo files the fixed offset can be used - if the dwos are linked into a dwp, the dwo consumer must use the dwp tables to find out where the original range of the debug_info was and resolve the "section relative" value relative to that original range - effectively avoiding/reimplementing the relocation handling. llvm-svn: 301072	2017-04-22 07:53:44 +00:00
David Blaikie	6cce69020c	Fix test from polluting the source tree (though this seems like a "does this not crash" test - which isn't very good. Should be fixed) llvm-svn: 301071	2017-04-22 07:53:40 +00:00
Artur Pilipenko	0632bdc648	Fix for PR32740 - Invalid floating type, unreachable between r300969 and r301029 The bug was introduced by r301018 "[InstCombine] fadd double (sitofp x), y check that the promotion is valid". The patch didn't expect that fadd can be on vectors not necessarily scalars. Add vector support along with the test. llvm-svn: 301070	2017-04-22 07:24:52 +00:00
Craig Topper	5e113742e7	[APInt] Add WORD_MAX constant and use it instead of UINT64_MAX. NFC llvm-svn: 301069	2017-04-22 06:31:36 +00:00
David Blaikie	c0bb21f38e	Remove the unnecessary virtual dtor from the DIEUnit hierarchy (in favor of protected dtor in the base, final derived classes with public non-virtual dtors) These objects are never polymorphically owned/destroyed, so the virtual dtor was unnecessary. llvm-svn: 301068	2017-04-22 02:18:00 +00:00
Matt Arsenault	01d17e7c5f	LowerSwitch: Fix producing invalid IR on unreachable code If a switch was in an unreachable block that branched to a block with a phi, it would leave phis with missing predecessors. llvm-svn: 301064	2017-04-21 23:54:12 +00:00
David Blaikie	96b1ed50e8	Move Split DWARF handling to an MC option/command line argument rather than using metadata Since Split DWARF needs to name the actual .dwo file that is generated, it can't be known at the time the llvm::Module is produced as it may be merged with other Modules before the object is generated and that object may be generated with any name. By passing the Split DWARF file name when LLVM is producing object code the .dwo file name in the object file can match correctly. The support for Split DWARF for implicit modules remains the same - using metadata to store the dwo name and dwo id so that potentially multiple skeleton CUs referring to different dwo files can be generated from one llvm::Module. llvm-svn: 301062	2017-04-21 23:35:26 +00:00
Kuba Mracek	5b4293c7d9	Fixup for r301054: Use an explicit constructor. llvm-svn: 301061	2017-04-21 23:28:01 +00:00
Easwaran Raman	e1bd7cceca	Remove a repeated comment line. NFC. llvm-svn: 301059	2017-04-21 23:12:16 +00:00
Kuba Mracek	a04026232e	Fixup for r301054: Only use __attribute__((no_sanitize("memory"))) when it's available. llvm-svn: 301058	2017-04-21 22:58:55 +00:00
Matthias Braun	d78597ec08	AArch64FrameLowering: Check if the ExtraCSSpill register is actually unused The code assumed that when saving an additional CSR register (ExtraCSSpill==true) we would have a free register throughout the function. This was not true if this CSR register is also used to pass values as in the swiftself case. rdar://31451816 llvm-svn: 301057	2017-04-21 22:42:08 +00:00
Kuba Mracek	71c4043ae9	[libFuzzer] Always build libFuzzer There are two reasons why users might want to build libfuzzer: - To fuzz LLVM itself - To get the libFuzzer.a archive file, so that they can attach it to their code This change always builds libfuzzer, and supports the second use case if the specified flag is set. The point of this patch is to have something that can potentially be shipped with the compiler, and this also ensures that the version of libFuzzer is correct to use with that compiler. Patch by George Karpenkov. Differential Revision: https://reviews.llvm.org/D32096 llvm-svn: 301054	2017-04-21 22:38:24 +00:00
Craig Topper	feaa5514db	[APSInt] Use APInt::compare and APInt::compareSigned to implement APSInt::compareValue APInt just got compare methods that return -1, 0, or 1 instead of just having ult/slt and eq. This patch uses these methods to implement APSInt::compareValues so that we don't have to call do an equal comparison and then possibly a second less than comparison. Differential Revision: https://reviews.llvm.org/D32381 llvm-svn: 301053	2017-04-21 22:32:27 +00:00
Craig Topper	19ce7adc7f	[APSInt] Make use of APInt's recently acquired in place lshr and shl capabilities in APSInt's >>= and <<= operators. APInt hasn't acquired an in place ashr yet, but hopefully soon. llvm-svn: 301052	2017-04-21 22:30:06 +00:00
Adrian Prantl	ff384546f5	Add test coverage for mem2reg dbg.declare lowering. llvm-svn: 301050	2017-04-21 22:13:55 +00:00
Eugene Zelenko	9f5094df36	[Object] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 301049	2017-04-21 22:03:05 +00:00
Hans Wennborg	9b9a5358dd	Re-commit r301040 "X86: Don't emit zero-byte functions on Windows" In addition to the original commit, tighten the condition for when to pad empty functions to COFF Windows. This avoids running into problems when targeting e.g. Win32 AMDGPU, which caused test failures when this was committed initially. llvm-svn: 301047	2017-04-21 21:48:41 +00:00
Frederich Munch	5b0887025b	[Test commit] Remove extra newline. llvm-svn: 301046	2017-04-21 21:39:50 +00:00
Matt Arsenault	c07bda7b87	InferAddressSpaces: Infer for just GEPs Fixes leaving intermediate flat addressing computations where a GEP instruction's source is a constant expression. Still leaves behind a trivial addrspacecast + gep pair that instcombine is able to handle, which ideally could be folded here directly. llvm-svn: 301044	2017-04-21 21:35:04 +00:00
Xinliang David Li	0e9f6df169	[PartialInliner] Partial inliner needs to check use kind before transformation Differential Revision: https://reviews.llvm.org/D32373 llvm-svn: 301042	2017-04-21 21:20:56 +00:00
Hans Wennborg	04593000d8	Revert r301040 "X86: Don't emit zero-byte functions on Windows" This broke almost all bots. Reverting while fixing. llvm-svn: 301041	2017-04-21 21:10:37 +00:00
Hans Wennborg	cb3e810714	X86: Don't emit zero-byte functions on Windows Empty functions can lead to duplicate entries in the Guard CF Function Table of a binary due to multiple functions sharing the same RVA, causing the kernel to refuse to load that binary. We had a terrific bug due to this in Chromium. It turns out we were already doing this for Mach-O in certain situations. This patch expands the code for that in AsmPrinter::EmitFunctionBody() and renames TargetInstrInfo::getNoopForMachoTarget() to simply getNoop() since it seems it was used for not just Mach-O anyway. Differential Revision: https://reviews.llvm.org/D32330 llvm-svn: 301040	2017-04-21 20:58:12 +00:00
Zachary Turner	0fc009b008	Add a dependency from llvm/test to llvm-cvtres. llvm-svn: 301038	2017-04-21 20:45:11 +00:00
Tim Northover	1efaa3a88f	AArch64: add test for "fence singlethread" Forgot a git add yesterday. llvm-svn: 301037	2017-04-21 20:36:08 +00:00
Tim Northover	e31cf3f824	ARM: make sure we use all entries in a vector before forming a vpaddl. Otherwise there's some mismatch, and we'll either form an illegal type or an illegal node. Thanks to Eli Friedman for pointing out the problem with my original solution. llvm-svn: 301036	2017-04-21 20:35:52 +00:00
Sanjay Patel	8ce1d4cbe1	[InstCombine] revert r300977 and r301021 This can cause an inf-loop. Investigating... llvm-svn: 301035	2017-04-21 20:29:17 +00:00
Zachary Turner	f9161bd1d5	Fixed a type conversion error in BitVector. llvm-svn: 301033	2017-04-21 20:18:43 +00:00
Zachary Turner	dbd1c5cda3	[BitVector] Make BitVector store an ArrayRef. This makes certain operations on the underlying storage easier since we have access to ArrayRef methods such as drop_front, drop_back, slice, range-based for loops, etc. Differential Revision: https://reviews.llvm.org/D32367 llvm-svn: 301031	2017-04-21 20:12:08 +00:00
Adrian Prantl	1a18f1ad10	typo llvm-svn: 301030	2017-04-21 20:06:41 +00:00
Konstantin Zhuravlyov	f628406bbd	AMDGPU/GFX9: Enable FastFMAF32 Differential Revision: https://reviews.llvm.org/D32363 llvm-svn: 301029	2017-04-21 19:57:53 +00:00
Konstantin Zhuravlyov	3d1cc88c68	AMDGPU: Temporarily disable packed inlinable literals (v2f16, v2i16) Differential Revision: https://reviews.llvm.org/D32361 llvm-svn: 301028	2017-04-21 19:45:22 +00:00
Konstantin Zhuravlyov	88938d4e67	AMDGPU: Fix S_PACK_HH_B32_B16 - We really ought to zero out lower 16 bits Differential Revision: https://reviews.llvm.org/D32356 llvm-svn: 301026	2017-04-21 19:35:05 +00:00
Yaxun Liu	15a96b1dc8	[AMDGPU] Handle SI_MASKED_UNREACHABLE in instruction emitter SI_MASKED_UNREACHABLE does not have machine instruction encoding. It needs special handling in AMDGPUAsmPrinter::EmitInstruction like some other pseudo instructions. This patch fixes compilation failure of RadeonRays. Differential Revision: https://reviews.llvm.org/D32364 llvm-svn: 301025	2017-04-21 19:32:02 +00:00
Matthias Braun	1a9062408f	Revert "X86RegisterInfo: eliminateFrameIndex: Avoid code duplication; NFC" It seems we have on situation in a sanitizer enable bootstrap build where the return instruction has a frame index operand that does not point to a fixed object and fails the assert added here. This reverts commit r300923. This reverts commit r300922. llvm-svn: 301024	2017-04-21 19:26:45 +00:00
Konstantin Zhuravlyov	c4b18e7099	AMDGPU: Do not lower fast unsafe div for safe, f32, with fp32 denormals Differential Revision: https://reviews.llvm.org/D32085 llvm-svn: 301023	2017-04-21 19:25:33 +00:00
Sanjay Patel	0f001a4701	[InstCombine] use isSubsetOf() for efficiency C \| ~D == -1 ~(C \| ~D) == 0 ~C & D == 0 D & ~C == 0 D.isSubsetOf(C) llvm-svn: 301021	2017-04-21 19:16:52 +00:00
Akira Hatanaka	22e839f4b2	[AArch64] Improve code generation for logical instructions taking immediate operands. This commit adds an AArch64 dag-combine that optimizes code generation for logical instructions taking immediate operands. The optimization uses demanded bits to change a logical instruction's immediate operand so that the immediate can be folded into the immediate field of the instruction. This recommits r300932 and r300930, which was causing dag-combine to loop forever. The problem was that optimizeLogicalImm was returning true even when there was no change to the immediate node (which happened when the immediate was all zeros or ones), which caused dag-combine to push and pop the same node to the work list over and over again without making any progress. This commit fixes the bug by returning false early in optimizeLogicalImm if the immediate is all zeros or ones. Also, it changes the code to compare the immediate with 0 or Mask rather than calling countPopulation. rdar://problem/18231627 Differential Revision: https://reviews.llvm.org/D5591 llvm-svn: 301019	2017-04-21 18:53:12 +00:00
Artur Pilipenko	134d94f9a3	[InstCombine] fadd double (sitofp x), y check that the promotion is valid Doing these transformations check that the result of integer addition is representable in the FP type. (fadd double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (fadd double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This is a fix for https://bugs.llvm.org//show_bug.cgi?id=27036 Reviewed By: andrew.w.kaylor, scanon, spatel Differential Revision: https://reviews.llvm.org/D31182 llvm-svn: 301018	2017-04-21 18:45:25 +00:00
Kuba Mracek	c3ecc4b314	Fixup for r301007: Restrict the -D hack to Darwin. llvm-svn: 301017	2017-04-21 18:19:56 +00:00
Zachary Turner	492674ec2a	[BitVector] Add find_last() and find_last_unset(). Differential Revision: https://reviews.llvm.org/D32302 llvm-svn: 301014	2017-04-21 18:07:46 +00:00
Kuba Mracek	81acbf3daa	Revert r301010: Bot failures on Windows, NetBSD and even some old Darwin. llvm-svn: 301012	2017-04-21 18:02:22 +00:00
Kuba Mracek	a0ab8c2e40	[libFuzzer] Always build libFuzzer There are two reasons why users might want to build libfuzzer: - To fuzz LLVM itself - To get the libFuzzer.a archive file, so that they can attach it to their code This change always builds libfuzzer, and supports the second use case if the specified flag is set. The point of this patch is to have something that can potentially be shipped with the compiler, and this also ensures that the version of libFuzzer is correct to use with that compiler. Patch by George Karpenkov. Differential Revision: https://reviews.llvm.org/D32096 llvm-svn: 301010	2017-04-21 17:47:44 +00:00
Kuba Mracek	309182a7d3	[libFuzzer] Changing thread_local to __thread in libFuzzer Old Apple compilers do not support thread_local keyword. This patch adds -Dthread_local=__thread when the compiler doesn't support thread_local. Differential Revision: https://reviews.llvm.org/D32312 llvm-svn: 301007	2017-04-21 17:39:50 +00:00
Zachary Turner	ae68a2e82a	Add llvm-cvtres to LLVMBuild.txt It wasn't getting picked up as an implicit project, so it wasn't being built. llvm-svn: 301006	2017-04-21 17:37:31 +00:00
Joel Jones	a7c4a52188	[AArch64] Refactor instruction selection lowering for addresses. NFCI Factor out the common code used for generating addresses into common templated functions that call overloaded versions of a new function, getTargetNode. Tested with make check-llvm with targets AArch64. Differential Revision: https://reviews.llvm.org/D32169 llvm-svn: 301005	2017-04-21 17:31:03 +00:00
Zachary Turner	087edfa2e8	Add empty shell of llvm-cvtres. This marks the beginning of an effort to port remaining MSVC toolchain miscellaneous utilities to all platforms. Currently clang-cl shells out to certain additional tools such as the IDL compiler, resource compiler, and a few other tools, but as these tools are Windows-only it limits the ability of clang to target Windows on other platforms. having a full suite of these tools directly in LLVM should eliminate this constraint. The current implementation provides no actual functionality, it is just an empty skeleton executable for the purposes of making incremental changes. Differential Revision: https://reviews.llvm.org/D32095 Patch by Eric Beckmann (ecbeckmann@google.com) llvm-svn: 301004	2017-04-21 17:30:29 +00:00
Tim Northover	1061ccca8c	ARM: don't try to create an i8 -> i32 vpaddl. DAG combine was mistakenly assuming that the step-up it was looking at was always a doubling, but it can sometimes be a larger extension in which case we'd crash. llvm-svn: 301002	2017-04-21 17:21:59 +00:00
Kuba Mracek	9eb170fede	[libFuzzer] Check for target(popcnt) capability before usage Older compilers (e.g. LLVM 3.4) do not support the attribute target("popcnt"). In order to support those, this diff check the attribute support using the preprocessor. Patch by George Karpenkov. Differential Revision: https://reviews.llvm.org/D32311 llvm-svn: 300999	2017-04-21 16:57:37 +00:00
Craig Topper	72f31a8381	[ValueTracking] Use APInt::setAllBits and APInt::intersects to simplify some code. NFC llvm-svn: 300997	2017-04-21 16:43:32 +00:00
Craig Topper	1dc8fc8bfa	[APInt] Add compare/compareSigned methods that return -1, 0, 1. Reimplement slt/ult and friends using them Currently sle and ule have to call slt/ult and eq to get the proper answer. This results in extra code for both calls and additional scans of multiword APInts. This patch replaces slt/ult with a compareSigned/compare that can return -1, 0, or 1 so we can cover all the comparison functions with a single call. While I was there I removed the activeBits calls and other checks at the start of the slow part of ult. Both of the activeBits calls potentially scan through each of the APInts separately. I can't imagine that's any better than just scanning them in parallel and doing the compares. Now we just share the code with tcCompare. These changes seem to be good for about a 7-8k reduction on the size of the opt binary on my local x86-64 build. Differential Revision: https://reviews.llvm.org/D32339 llvm-svn: 300995	2017-04-21 16:13:15 +00:00
Juergen Ributzka	a66f42caa9	Remove empty and unused header file. llvm-svn: 300994	2017-04-21 16:05:01 +00:00
Daniel Sanders	e7b0d66080	[globalisel][tablegen] Import SelectionDAG's rule predicates and support the equivalent in GIRule. Summary: The SelectionDAG importer now imports rules with Predicate's attached via Requires, PredicateControl, etc. These predicates are implemented as bitset's to allow multiple predicates to be tested together. However, unlike the MC layer subtarget features, each target only pays for it's own predicates (e.g. AArch64 doesn't have 192 feature bits just because X86 needs a lot). Both AArch64 and X86 derive at least one predicate from the MachineFunction or Function so they must re-initialize AvailableFeatures before each function. They also declare locals in <Target>InstructionSelector so that computeAvailableFeatures() can use the code from SelectionDAG without modification. Reviewers: rovka, qcolombet, aditya_nandakumar, t.p.northover, ab Reviewed By: rovka Subscribers: aemerson, rengolin, dberris, kristof.beyls, llvm-commits, igorb Differential Revision: https://reviews.llvm.org/D31418 llvm-svn: 300993	2017-04-21 15:59:56 +00:00
Craig Topper	7af078847c	[SimplifyCFG] Fix the determination of PostBB in conditional store merging to handle the targets on the second branch being commuted Currently we choose PostBB as the single successor of QFB, but its possible that QTB's single successor is QFB which would make QFB the correct choice. Differential Revision: https://reviews.llvm.org/D32323 llvm-svn: 300992	2017-04-21 15:53:42 +00:00
Wei Mi	337d4d95c2	[ConstHoisting] Add BFI in constanthoisting pass and select the best insertion places based on it. Existing constant hoisting pass will merge a group of contants in a small range and hoist the const materialization code to the common dominator of their uses. However, if the uses are all in cold pathes, existing implementation may hoist the materialization code from cold pathes to a hot place. This may hurt performance. The patch introduces BFI to the pass and selects the best insertion places based on it. The change is controlled by an option consthoist-with-block-frequency which is off by default for now. Differential Revision: https://reviews.llvm.org/D28962 llvm-svn: 300989	2017-04-21 15:50:16 +00:00
Chad Rosier	428556c536	[AArch64][Falkor] Refine modeling of store-release exclusive instructions. llvm-svn: 300987	2017-04-21 14:58:32 +00:00
Joel Jones	97aaa23aec	[Mips] Document Mips Backend Relocation Principles This revision documents the combination of C++ and table-gen code that handles relocations and addresses. Thanks for Simon Dardis for the careful reviews. Differential Revision: https://reviews.llvm.org/D31628 llvm-svn: 300986	2017-04-21 14:49:27 +00:00
Chad Rosier	d631b9e500	[AArch64][Falkor] Refine resource needs of STRQ with register offset. llvm-svn: 300984	2017-04-21 14:33:13 +00:00
Matthew Simpson	e2037d24f9	[LV] Model if-converted phi node costs Phi nodes in non-header blocks are converted to select instructions after if-conversion. This patch updates the cost model to account for the selects. Differential Revision: https://reviews.llvm.org/D31906 llvm-svn: 300980	2017-04-21 14:14:54 +00:00
Daniel Sanders	419efdd55b	Revert r300964 + r300970 - [globalisel][tablegen] Import SelectionDAG's rule predicates and support the equivalent in GIRule. It's causing llvm-clang-x86_64-expensive-checks-win to fail to compile and I haven't worked out why. Reverting to make it green while I figure it out. llvm-svn: 300978	2017-04-21 14:09:20 +00:00
Sanjay Patel	347b54b093	[InstCombine] prefer xor with -1 because 'not' is easier to understand (PR32706) This matches the demanded bits behavior in the DAG and should fix: https://bugs.llvm.org/show_bug.cgi?id=32706 Differential Revision: https://reviews.llvm.org/D32255 llvm-svn: 300977	2017-04-21 14:03:54 +00:00
Chad Rosier	537defeeb5	[AArch64][Falkor] Refine loads/stores that require an extra LD pipe. llvm-svn: 300976	2017-04-21 13:55:41 +00:00
Chad Rosier	bbcc828833	[AArch64][Falkor] Fix number of microops for WriteSTIdx missed in r300892. llvm-svn: 300975	2017-04-21 13:37:01 +00:00
Chad Rosier	4f2e9e237f	[AArch64] Fix a few missed pre/post-inc in Falkor. llvm-svn: 300974	2017-04-21 13:36:57 +00:00
Diana Picus	64a33431eb	[ARM] GlobalISel: Add support for G_TRUNC Select them as copies. We only select if both the source and the destination are on the same register bank, so this shouldn't cause any trouble. llvm-svn: 300971	2017-04-21 13:16:50 +00:00
Daniel Sanders	4df04ec227	[globalisel][tablegen] Try again to fix builds on old MSVC's after r300964 This should fix llvm-clang-x86_64-expensive-checks-win I reproduced the error using the following code: namespace llvm { // Moving this out of the llvm namespace fixes the error. template<unsigned NumBits> class PredicateBitsetImpl {}; } namespace { const unsigned MAX_SUBTARGET_PREDICATES = 11; // This works on Clang but is broken on MSVC // using PredicateBitset = PredicateBitsetImpl<MAX_SUBTARGET_PREDICATES>; // Some versions emit a syntax error here ("error C2061: syntax error: identifier // 'PredicateBitsetImpl'") but others accept it and only emit the C3646 below. // // This works on Clang and MSVC using PredicateBitset = llvm::PredicateBitsetImpl<MAX_SUBTARGET_PREDICATES>; class Foo { private: PredicateBitset A; // error C3646: 'A': unknown override specifier }; } llvm-svn: 300970	2017-04-21 12:51:43 +00:00
Daniel Sanders	ed30f5b274	Revert: r300966 - [globalisel][tablegen] Attempt to fix builds on old MSVC's after r300964 It didn't fix the builder. llvm-svn: 300968	2017-04-21 12:08:25 +00:00
Diana Picus	f941ec0ecc	[ARM] GlobalISel: Make struct arguments fail elegantly The condition in isSupportedType didn't handle struct/array arguments properly. Fix the check and add a test to make sure we use the fallback path in this kind of situation. The test deals with some common cases where the call lowering should error out. There are still some issues here that need to be addressed (tail calls come to mind), but they can be addressed in other patches. llvm-svn: 300967	2017-04-21 11:53:01 +00:00
Daniel Sanders	0cd1b539bc	[globalisel][tablegen] Attempt to fix builds on old MSVC's after r300964 This should fix llvm-clang-x86_64-expensive-checks-win llvm-svn: 300966	2017-04-21 11:29:29 +00:00
Daniel Sanders	279d03527e	[globalisel][tablegen] Import SelectionDAG's rule predicates and support the equivalent in GIRule. Summary: The SelectionDAG importer now imports rules with Predicate's attached via Requires, PredicateControl, etc. These predicates are implemented as bitset's to allow multiple predicates to be tested together. However, unlike the MC layer subtarget features, each target only pays for it's own predicates (e.g. AArch64 doesn't have 192 feature bits just because X86 needs a lot). Both AArch64 and X86 derive at least one predicate from the MachineFunction or Function so they must re-initialize AvailableFeatures before each function. They also declare locals in <Target>InstructionSelector so that computeAvailableFeatures() can use the code from SelectionDAG without modification. Reviewers: rovka, qcolombet, aditya_nandakumar, t.p.northover, ab Reviewed By: rovka Subscribers: aemerson, rengolin, dberris, kristof.beyls, llvm-commits, igorb Differential Revision: https://reviews.llvm.org/D31418 llvm-svn: 300964	2017-04-21 10:27:20 +00:00
Clement Courbet	41b4333066	typo llvm-svn: 300963	2017-04-21 09:21:05 +00:00
Clement Courbet	00b51bf123	add skylake llvm-svn: 300962	2017-04-21 09:21:01 +00:00
Clement Courbet	2430e25ba3	add 32 bit tests llvm-svn: 300961	2017-04-21 09:20:58 +00:00
Clement Courbet	d5f6182bec	use repmovsb when optimizing forminsize llvm-svn: 300960	2017-04-21 09:20:55 +00:00
Clement Courbet	203fc17797	Rename FastString flag. llvm-svn: 300959	2017-04-21 09:20:50 +00:00
Clement Courbet	a7c233fbe0	add more tests llvm-svn: 300958	2017-04-21 09:20:44 +00:00
Clement Courbet	1ce3b82dea	X86 memcpy: use REPMOVSB instead of REPMOVS{Q,D,W} for inline copies when the subtarget has fast strings. This has two advantages: - Speed is improved. For example, on Haswell thoughput improvements increase linearly with size from 256 to 512 bytes, after which they plateau: (e.g. 1% for 260 bytes, 25% for 400 bytes, 40% for 508 bytes). - Code is much smaller (no need to handle boundaries). llvm-svn: 300957	2017-04-21 09:20:39 +00:00
George Rimar	f8a9642526	[DWARF] - Refactoring: localize handling of relocations in a single place. This is splitted from D32228, currently DWARF parsers code has few places that applied relocations values manually. These places has similar duplicated code. Patch introduces separate method that can be used to obtain relocated value. That helps to reduce code and simplifies things. Differential revision: https://reviews.llvm.org/D32284 llvm-svn: 300956	2017-04-21 09:12:18 +00:00
Clement Courbet	8177fee513	Delete dead code llvm-svn: 300952	2017-04-21 07:40:59 +00:00
Artyom Skrobov	8d9643009f	[Thumb1] The recently added tADCS and tSBCS pseudo-instructions were missing `Uses = [CPSR]` Summary: Thanks to Oliver Stannard for helping catch this. Reviewers: olista01, efriedma Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D31815 llvm-svn: 300951	2017-04-21 07:35:21 +00:00
Serguei Katkov	c9a752c5b7	[AsmWriter] Eliminate warning. NFC This patch eliminates the following warning lib/IR/AsmWriter.cpp:1128:57: warning: suggest parentheses around '&&' within '\|\|' [-Wparentheses] (StrVal[1] >= '0' && StrVal[1] <= '9')) && Reviewers: timshen, rnk, davide Reviewed By: davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D32337 llvm-svn: 300950	2017-04-21 06:14:38 +00:00
George Burgess IV	56169ed753	[MSSA] Clean up the updater a bit. NFC - Mark an internal function static - Remove the llvm namespace (just holding on to the `using namespace llvm;` Works on My Machine(TM)) llvm-svn: 300947	2017-04-21 04:54:52 +00:00
Davide Italiano	fa15de34b7	[PartialInliner] Fix crash when inlining functions with unreachable blocks. CodeExtractor looks up the dominator node corresponding to return blocks when splitting them. If one of these blocks is unreachable, there's no node in the Dom and CodeExtractor crashes because it doesn't check for domtree node validity. In theory, we could add just a check for skipping null DTNodes in `splitReturnBlock` but the fix I propose here is slightly different. To the best of my knowledge, unreachable blocks are irrelevant for the algorithm, therefore we can just skip them when building the candidate set in the constructor. Differential Revision: https://reviews.llvm.org/D32335 llvm-svn: 300946	2017-04-21 04:25:00 +00:00
Serguei Katkov	5b817d02bf	[BPI] Add multiplication by scalar operators to BranchProbability This patch just adds two operators to BranchProbability class: (BP * scalar) and (BP *= scalar). Reviewers: junbuml, chandlerc, sanjoy, vsk Reviewed By: chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32334 llvm-svn: 300945	2017-04-21 03:14:30 +00:00
Serguei Katkov	3a46eb4442	[AsmWriter/APFloat] FP constant printing: Avoid usage of locale dependent snprinf This should fix the bug https://bugs.llvm.org/show_bug.cgi?id=12906 To print the FP constant AsmWriter does the following: 1) convert FP value to String (actually using snprintf function which is locale dependent). 2) Convert String back to FP Value 3) Compare original and got FP values. If they are not equal just dump as hex. The problem happens on the 2nd step when APFloat does not expect group delimiter or fraction delimiter other than period symbol and so on, which can be produced on the first step if LLVM library is used in an environment with corresponding locale set. To fix this issue the locale independent APFloat:toString function is used. However it prints FP values slightly differently than snprintf does. Specifically it suppress trailing zeros in significant, use capital E and so on. It results in 117 test failures during make check. To avoid this I've also updated APFloat.toString a bit to pass make check at least. Reviewers: sberg, bogner, majnemer, sanjoy, timshen, rnk Reviewed By: timshen, rnk Subscribers: rnk, llvm-commits Differential Revision: https://reviews.llvm.org/D32276 llvm-svn: 300943	2017-04-21 02:52:17 +00:00
Akira Hatanaka	78ccba6a20	Revert r300932 and r300930. It seems that r300930 was creating an infinite loop in dag-combine when compling the following file: MultiSource/Benchmarks/MiBench/consumer-typeset/z21.c llvm-svn: 300940	2017-04-21 01:31:50 +00:00
Akira Hatanaka	e52caddae8	[AArch64] Use suffix ULL to shift a 64-bit value. llvm-svn: 300932	2017-04-21 00:35:27 +00:00
Davide Italiano	059574c537	[CodeExtractor] Remove an unneeded level of indirection. NFCI. llvm-svn: 300931	2017-04-21 00:21:09 +00:00
Akira Hatanaka	19077aaee0	[AArch64] Improve code generation for logical instructions taking immediate operands. This commit adds an AArch64 dag-combine that optimizes code generation for logical instructions taking immediate operands. The optimization uses demanded bits to change a logical instruction's immediate operand so that the immediate can be folded into the immediate field of the instruction. This recommits r300913, which broke bots because I didn't fix a call to ShrinkDemandedConstant in SIISelLowering.cpp after changing the APIs of TargetLoweringOpt and TargetLowering. rdar://problem/18231627 Differential Revision: https://reviews.llvm.org/D5591 llvm-svn: 300930	2017-04-21 00:05:16 +00:00
Eli Friedman	d0e6ae5678	Revert r300746 (SCEV analysis for or instructions). There have been multiple reports of this causing problems: a compile-time explosion on the LLVM testsuite, and a stack overflow for an opencl kernel. llvm-svn: 300928	2017-04-20 23:59:05 +00:00
Craig Topper	358cd9ae3a	[InstCombine] Remove the zextOrTrunc from ShrinkDemandedConstant. The demanded mask and the constant should always be the same width for all callers today. Also stop copying the demanded mask as its passed in. We should avoid allocating memory unless we are going to do something. The final AND to create the new constant will take care of it. llvm-svn: 300927	2017-04-20 23:58:27 +00:00
Matthias Braun	9610a26251	X86RegisterInfo: eliminateFrameIndex: Avoid code duplication; NFC X86RegisterInfo::eliminateFrameIndex() and X86FrameLowering::getFrameIndexReference() both had logic to compute the base register. This consolidates the code. Also use MachineInstr::isReturn instead of manually enumerating tail call instructions (return instructions were not included in the previous list because they never reference frame indexes). Differential Revision: https://reviews.llvm.org/D32206 llvm-svn: 300923	2017-04-20 23:34:50 +00:00
Matthias Braun	63e3e8ce72	X86RegisterInfo: eliminateFrameIndex: Force SP for AfterFPPop; NFC AfterFPPop is used for tailcall/tailjump instructions. We shouldn't ever have frame-pointer/base-pointer relative addressing for those. After all the frame/base pointer should already be restored to their previous values at the return. Make this fact explicit in preparation for an upcoming refactoring. Differential Revision: https://reviews.llvm.org/D32205 llvm-svn: 300922	2017-04-20 23:34:46 +00:00
Sanjoy Das	3e748e9220	Fix typo in comment llvm-svn: 300918	2017-04-20 23:07:00 +00:00
Akira Hatanaka	7b06cebe73	Revert "[AArch64] Improve code generation for logical instructions taking" This reverts r300913. This broke bots. llvm-svn: 300916	2017-04-20 23:03:30 +00:00
Craig Topper	1cb0e5afb0	[Simplify] Add testcase to show that merging conditional stores for triangles is sensitive to the order of the branch targets on the conditional branches. NFC llvm-svn: 300915	2017-04-20 22:57:36 +00:00
Akira Hatanaka	e327f09832	[AArch64] Improve code generation for logical instructions taking immediate operands. This commit adds an AArch64 dag-combine that optimizes code generation for logical instructions taking immediate operands. The optimization uses demanded bits to change a logical instruction's immediate operand so that the immediate can be folded into the immediate field of the instruction. rdar://problem/18231627 Differential Revision: https://reviews.llvm.org/D5591 llvm-svn: 300913	2017-04-20 22:47:56 +00:00
Sanjay Patel	cc663b82fa	[InstCombine] function names start with lower-case letter; NFC Forgot to make this fix with the signature change in r300911. llvm-svn: 300912	2017-04-20 22:37:01 +00:00
Sanjay Patel	c9485ca895	[InstCombine] allow shl+shr demanded bits folds with splat constants llvm-svn: 300911	2017-04-20 22:33:54 +00:00
Sanjay Patel	be2dcaf45a	[InstCombine] add tests for shl+shr demanded bits splat vector folds; NFC llvm-svn: 300907	2017-04-20 22:18:47 +00:00
Tim Northover	100b7f6eae	AArch64: lower "fence singlethread" to a pure compiler barrier. Single-threaded fences aren't required to provide any synchronization with other processing elements so there's no need for a DMB. They should still be a barrier for compiler optimizations though. llvm-svn: 300905	2017-04-20 21:57:45 +00:00
Tim Northover	46e58354da	ARM: lower "fence singlethread" to a pure compiler barrier. Single-threaded fences aren't required to provide any synchronization with other processing elements so there's no need for a DMB. They should still be a barrier for compiler optimizations though. llvm-svn: 300904	2017-04-20 21:56:52 +00:00
Xinliang David Li	99e3ca1526	Use basicblock split block utility function Instead of calling BasicBlock::SplitBasicBlock directly in CodeExtractor. Differential Revision: https://reviews.llvm.org/D32308 llvm-svn: 300899	2017-04-20 21:40:22 +00:00
Sanjay Patel	3e1ae72fcf	[InstCombine] allow shl demanded bits folds with splat constants More fixes are needed to enable the helper SimplifyShrShlDemandedBits(). llvm-svn: 300898	2017-04-20 21:33:02 +00:00
Craig Topper	ff23889609	[InstCombine] Use APInt::intersects and APInt::isSubsetOf to improve a few more places in SimplifyDemandedBits. llvm-svn: 300896	2017-04-20 21:24:37 +00:00
Chad Rosier	4279c58ec4	[AArch64] Whitespace/ordering fixes for Falkor machine description. NFC. llvm-svn: 300893	2017-04-20 21:11:17 +00:00
Chad Rosier	a56bdbe62d	[AArch64] Refine Falkor machine description for pre/post-inc and stores. llvm-svn: 300892	2017-04-20 21:11:09 +00:00
Sanjay Patel	fb5b3e773a	[InstCombine] allow ashr/lshr demanded bits folds with splat constants llvm-svn: 300888	2017-04-20 20:59:02 +00:00
Craig Topper	17f37ba3b9	[InstCombine] Use APInt::isSubsetOf to simplify some code in SimplifyDemandedBits. NFC This allows us to use less temporary APInt for And and Invert operations. llvm-svn: 300885	2017-04-20 20:47:35 +00:00
Sanjay Patel	7e77bed813	[InstCombine] add tests for demanded bits ashr/lshr splat constants; NFC llvm-svn: 300884	2017-04-20 20:44:54 +00:00
Adrian Prantl	ada104888e	Don't emit locations that need a DW_OP_stack_value in DWARF 2 & 3. https://bugs.llvm.org/show_bug.cgi?id=32382 llvm-svn: 300883	2017-04-20 20:42:33 +00:00
Benjamin Kramer	9ed1371adb	[Support] Make asan poisoning for recyclers more aggressive by also poisoning the 'next' pointer. llvm-svn: 300882	2017-04-20 20:28:18 +00:00
Benjamin Kramer	49f1c3297b	Remove stray ^S. NFC. llvm-svn: 300880	2017-04-20 20:03:36 +00:00
Paul Robinson	e5dcdb8558	[DWARF] Fix a couple of typos llvm-svn: 300879	2017-04-20 20:03:03 +00:00
Tim Northover	8b1240b0f0	ARM: handle post-indexed NEON ops where the offset isn't the access width. Before, we assumed that any ConstantInt offset was precisely the access width, so we could use the "[rN]!" form. ISelLowering only ever created that kind, but further simplification during combining could lead to unexpected constants and incorrect codegen. Should fix PR32658. llvm-svn: 300878	2017-04-20 19:54:02 +00:00
Adrian McCarthy	175d70ee5c	VarStreamArrayIterator needed non-const operator* overload. Without this change, the operator-> provided by iterator_facade lost type qualifiers. Differential Revision: https://reviews.llvm.org/D32235 llvm-svn: 300877	2017-04-20 19:34:06 +00:00
Craig Topper	0ec3f2f39a	[InstCombine] Remove redundant code from SimplifyDemandedBits handling for Or. The code above it is equivalent if you work through the bitwise math. llvm-svn: 300876	2017-04-20 19:31:22 +00:00
Paul Robinson	70b34533c2	[DWARF] Versioning for DWARF constants; verify FORMs Associate the version-when-defined with definitions of standard DWARF constants. Identify the "vendor" for DWARF extensions. Use this information to verify FORMs in .debug_abbrev are defined as of the DWARF version specified in the associated unit. Removed two tests that had specified DWARF v1 (which essentially does not exist). Differential Revision: http://reviews.llvm.org/D30785 llvm-svn: 300875	2017-04-20 19:16:51 +00:00
Benjamin Kramer	4f1bed1bcf	[go bindings] Rmove duplicated conversion function definitions after r300843. llvm-svn: 300872	2017-04-20 19:06:11 +00:00
Chad Rosier	9f25dd56a8	[AArch64] Improve scheduling of logical operations on Falkor. llvm-svn: 300871	2017-04-20 18:50:21 +00:00
Weiming Zhao	962c5a3aec	[Thumb-1] Fix corner cases for compressed jump tables Summary: When synthesized TBB/TBH is expanded, we need to avoid the case of: BaseReg is redefined after the load of branching target. E.g.: %R2 = tLEApcrelJT <jt#1> %R1 = tLDRr %R1, %R2 ==> %R2 = tLEApcrelJT <jt#1> %R2 = tLDRspi %SP, 12 %R2 = tLDRspi %SP, 12 tBR_JTr %R1 tTBB_JT %R2, %R1 ` Reviewers: jmolloy Reviewed By: jmolloy Subscribers: llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D32250 llvm-svn: 300870	2017-04-20 18:37:14 +00:00
Davide Italiano	b965121ba8	[CodeExtractor] Remove a bunch of unneeded constructors. Differential Revision: https://reviews.llvm.org/D32305 llvm-svn: 300869	2017-04-20 18:33:40 +00:00
Benjamin Kramer	997fd5eeb4	[Recycler] Add asan/msan annotations. This enables use after free and uninit memory checking for memory returned by a recycler. SelectionDAG currently relies on the opcode of a free'd node being ISD::DELETED_NODE, so poke a hole in the asan poison for SDNode opcodes. This means that we won't find some issues, but only in SDag. llvm-svn: 300868	2017-04-20 18:29:37 +00:00
Benjamin Kramer	58dadd59d9	Fix use-after-frees on memory allocated in a Recycler. This will become asan errors once the patch lands that poisons the memory after free. The x86 change is a hack, but I don't see how to solve this properly at the moment. llvm-svn: 300867	2017-04-20 18:29:14 +00:00
Artyom Skrobov	938fc1341d	Fixing outdated comment [NFC] Since r32105 back in 2006, RegisterPass doesn't support passes without a default constructor. llvm-svn: 300866	2017-04-20 18:20:02 +00:00
Andrew Kaylor	73b4a9a4a4	Fix formatting of constrained FP intrinsic documentation llvm-svn: 300865	2017-04-20 18:18:36 +00:00
Yaxun Liu	5d977f8ed4	CodeGen: Let frame index value type match alloca addr space Recently alloca address space has been added to data layout. Due to this change, pointer returned by alloca may have different size as pointer in address space 0. However, currently the value type of frame index is assumed to be of the same size as pointer in address space 0. This patch fixes that. Most targets assume alloca returning pointer in address space 0, which is the default alloca address space. Therefore it is NFC for them. AMDGCN target with amdgiz environment requires this change since it assumes alloca returning pointer to addr space 5 and its size is 32, which is different from the size of pointer in addr space 0 which is 64. Differential Revision: https://reviews.llvm.org/D32021 llvm-svn: 300864	2017-04-20 18:15:34 +00:00
Reid Kleckner	62731e1c89	Remove duplicate AttributeList::removeAttributes implementation Have the AttributeList overload delegate to the AttrBuilder one. Simplify the AttrBuilder overload by avoiding getSlotAttributes, which creates temporary AttributeLists. Simplify `AttrBuilder::removeAttributes(AttributeList, unsigned)` by using getAttributes instead of manually iterating over slots. Extracted from https://reviews.llvm.org/D32262 NFC llvm-svn: 300863	2017-04-20 18:08:36 +00:00
Sanjay Patel	13985cd111	[DAGCombiner] use more local variables in isAlias(); NFCI llvm-svn: 300860	2017-04-20 18:02:27 +00:00
Sam Clegg	90d99413ac	[WebAssembly] Add known failures for wasm object file backend Subscribers: jfb, dschuff Differential Revision: https://reviews.llvm.org/D32300 llvm-svn: 300859	2017-04-20 17:18:15 +00:00
Zachary Turner	2a593bc508	Resubmit "[BitVector] Add operator<<= and operator>>=." This was failing due to the use of assigning a Mask to an unsigned, rather than to a BitWord. But most systems do not have sizeof(unsigned) == sizeof(unsigned long), so the mask was getting truncated. llvm-svn: 300857	2017-04-20 16:56:54 +00:00
Craig Topper	bcfd2d1789	[APInt] Rename getSignBit to getSignMask getSignBit is a static function that creates an APInt with only the sign bit set. getSignMask seems like a better name to convey its functionality. In fact several places use it and then store in an APInt named SignMask. Differential Revision: https://reviews.llvm.org/D32108 llvm-svn: 300856	2017-04-20 16:56:25 +00:00
Amara Emerson	20f2b01222	[SVE] Fix mismatched sign comparison warning in unit test from r300842. llvm-svn: 300855	2017-04-20 16:54:49 +00:00
Sanjay Patel	2d0e88fb9b	[DAGCombiner] fix variable names in isAlias(); NFCI We started with zero-based params and switched to one-based locals... Also, variables start with a capital and functions do not. llvm-svn: 300854	2017-04-20 16:36:37 +00:00
Zachary Turner	b3dac3816f	Revert "[BitVector] Add operator<<= and operator>>=." This is causing test failures on Linux / BSD systems. Reverting while I investigate. llvm-svn: 300852	2017-04-20 16:35:22 +00:00
Craig Topper	a8129a1122	[APInt] Add isSubsetOf method that can check if one APInt is a subset of another without creating temporary APInts This question comes up in many places in SimplifyDemandedBits. This makes it easy to ask without allocating additional temporary APInts. The BitVector class provides a similar functionality through its (IMHO badly named) test(const BitVector&) method. Though its output polarity is reversed. I've provided one example use case in this patch. I plan to do more as a follow up. Differential Revision: https://reviews.llvm.org/D32258 llvm-svn: 300851	2017-04-20 16:17:13 +00:00
Sanjay Patel	b7701bc9af	[DAGCombiner] give names to repeated calcs in isAlias(); NFCI llvm-svn: 300850	2017-04-20 16:15:08 +00:00
Craig Topper	83dc1c60aa	In SimplifyDemandedUseBits, use computeKnownBits directly to handle Constants Currently we don't explicitly process ConstantDataSequential, ConstantAggregateZero, or ConstantVector, or Undef before applying the Depth limit. Instead they occur after the depth check in the non-instruction path. For the constant types that we do handle, the code is replicated from computeKnownBits. This patch fixes the missing constant handling and the reduces the amount of code by just using computeKnownBits directly for any type of Constant. Differential Revision: https://reviews.llvm.org/D32123 llvm-svn: 300849	2017-04-20 16:14:58 +00:00
Zachary Turner	7500b0ece8	[BitVector] Add operator<<= and operator>>=. Differential Revision: https://reviews.llvm.org/D32244 llvm-svn: 300848	2017-04-20 15:57:58 +00:00
Daniel Sanders	5377fb3419	[globalisel] Enable tracing the legalizer with --debug-only=legalize-mir Reviewers: t.p.northover, ab, qcolombet, aditya_nandakumar, rovka, kristof.beyls Reviewed By: kristof.beyls Subscribers: dberris, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D31750 llvm-svn: 300847	2017-04-20 15:46:12 +00:00
Amaury Sechet	a52b03d2ea	Introduce LLVMDIBuilderRef Summary: This patch adds a definition of `LLVMDIBuilderRef` that represents an `llvm::DIBuilder`. Authored by Harlan Haskins Reviewers: deadalnix, aprantl, probinson, dblaikie, echristo, whitequark Reviewed By: deadalnix, whitequark Subscribers: CodaFi, loladiro Differential Revision: https://reviews.llvm.org/D32122 llvm-svn: 300843	2017-04-20 14:22:47 +00:00
Amara Emerson	23e79ec2b3	[MVT][SVE] Scalable vector MVTs (3/3) Adds MVT::ElementCount to represent the length of a vector which may be scalable, then adds helper functions that work with it. Patch by Graham Hunter. Differential Revision: https://reviews.llvm.org/D32019 llvm-svn: 300842	2017-04-20 13:54:09 +00:00
Amara Emerson	bfbdebd00e	[MVT][SVE] Scalable vector MVTs (2/3) Adds scalable vector machine value types, and updates the switch statements required for tablegen. Patch by Graham Hunter. Differential Revision: https://reviews.llvm.org/D32018 llvm-svn: 300840	2017-04-20 13:36:58 +00:00
Petar Jovanovic	2b6fe3ffa6	[mips][msa] Mask vectors holding shift amounts Masked vectors which hold shift amounts when creating the following nodes: ISD::SHL, ISD::SRL or ISD::SRA. Instructions that use said nodes, which have had their arguments altered are sll, srl, sra, bneg, bclr and bset. For said instructions, the shift amount or the bit position that is specified in the corresponding vector elements will be interpreted as the shift amount/bit position modulo the size of the element in bits. The problem lies in compiling with -O2 enabled, where the instructions for formats .w and .d are not generated, but are instead optimized away. In this case, having shift amounts that are either negative or greater than the element bit size results in generation of incorrect results when constant folding. We remedy this by masking the operands for the nodes mentioned above before actually creating them, so that the final result is correct before placed into the constant pool. Patch by Stefan Maksimovic. Differential Revision: https://reviews.llvm.org/D31331 llvm-svn: 300839	2017-04-20 13:26:46 +00:00
Amara Emerson	5054782052	[MVT][SVE] Scalable vector MVTs (1/3) This patch adds a few helper functions to obtain new vector value types based on existing ones without needing to care about whether they are scalable or not. I've confined their use to a few common locations right now, and targets that don't have scalable vectors should never need to care about these. Patch by Graham Hunter. Differential Revision: https://reviews.llvm.org/D32017 llvm-svn: 300838	2017-04-20 13:08:17 +00:00
John Brawn	66719f63d0	[ARM] Fix handling of mapping symbols when changing sections ChangeSection incorrectly registers LastEMSInfo as belonging to the previous section, not the current section. This happens to work when changing sections using .section, as the previous section is set to the current section before the call to ChangeSection, but not when using .popsection. Differential Revision: https://reviews.llvm.org/D32225 llvm-svn: 300831	2017-04-20 10:18:13 +00:00
John Brawn	5ca5daa6b9	[AArch64] Fix handling of zero immediate in fmov instructions Currently fmov #0 with a vector destination is handle incorrectly and results in fmov #-1.9375 being emitted but should instead give an error. This is due to the way we cope with fmov #0 with a scalar destination being an alias of fmov zr, so fix this by actually doing it through an alias. Differential Revision: https://reviews.llvm.org/D31949 llvm-svn: 300830	2017-04-20 10:13:54 +00:00
John Brawn	dcf037a6f0	[AArch64] Fix handling of integer fp immediates When an integer is used as an fp immediate we're failing to check the return value of getFP64Imm, so invalid values are silently permitted. Fix this by merging together the integer and real handling. llvm-svn: 300828	2017-04-20 10:10:10 +00:00
Diana Picus	7c6dee9f16	[ARM] Rename HW div feature to HW div Thumb. NFCI. The hardware div feature refers only to Thumb, but because of its name it is tempting to use it to check for hardware division in general, which may cause problems in ARM mode. See https://reviews.llvm.org/D32005. This patch adds "Thumb" to its name, to make its scope clear. One notable place where I haven't made the change is in the feature flag (used with -mattr), which is still hwdiv. Changing it would also require changes in a lot of tests, including clang tests, and it doesn't seem like it's worth the effort. Differential Revision: https://reviews.llvm.org/D32160 llvm-svn: 300827	2017-04-20 09:38:25 +00:00
Craig Topper	6a63842708	[APInt] In slt/sgt(uint64_t), only call getMinSignedBits if the APInt is not a single word. llvm-svn: 300824	2017-04-20 06:04:03 +00:00
Craig Topper	fded30584e	[APInt] Call the slow case counting methods directly in isMask/isShiftedMask. We already handled the single word case. NFC llvm-svn: 300823	2017-04-20 06:04:01 +00:00
Craig Topper	9ce5ef9475	[SelectionDAG] Fix another place that was passing a large value to APInt::lshrInPlace. llvm-svn: 300821	2017-04-20 04:55:01 +00:00
Craig Topper	d3884b8402	[SelectionDAG] Use getActiveBits() and countTrailingZeros() to avoid creating temporary APInts with lshr and trunc. NFCI llvm-svn: 300819	2017-04-20 04:23:43 +00:00
Craig Topper	4db0c69373	Recommit "[APInt] Add back the asserts that check that the APInt shift methods aren't called with values larger than BitWidth." This includes a fix to clamp a right shift of larger than BitWidth in DAG combining. llvm-svn: 300816	2017-04-20 03:49:18 +00:00
Craig Topper	6fd0a5c99d	Revert r300811 "[APInt] Add back the asserts that check that the APInt shift methods aren't called with values larger than BitWidth." This is failing a self host debug build. llvm-svn: 300813	2017-04-20 02:46:21 +00:00
Craig Topper	baa392e4e0	[APInt] Implement APInt::intersects without creating a temporary APInt in the multiword case Summary: This is a simple question we should be able to answer without creating a temporary to hold the AND result. We can also get an early out as soon as we find a word that intersects. Reviewers: RKSimon, hans, spatel, davide Reviewed By: hans, davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D32253 llvm-svn: 300812	2017-04-20 02:11:27 +00:00
Craig Topper	e49252cea1	[APInt] Add back the asserts that check that the APInt shift methods aren't called with values larger than BitWidth. The underlying tcShiftRight/tcShiftLeft functions support the larger bit widths but the APInt interface shouldn't rely on that. llvm-svn: 300811	2017-04-20 02:03:09 +00:00
Serge Pavlov	802aa667d5	Do not run frame verification if target does not use frame instructions llvm-svn: 300807	2017-04-20 01:34:04 +00:00
Ahmed Bougacha	db2c16aebb	Revert "[libFuzzer] XFAIL fuzzer-oom.test on Darwin." This reverts commit r300127. r300759 implemented StopTheWorld for Darwin, so the test passes again. llvm-svn: 300801	2017-04-20 00:16:13 +00:00
Kostya Serebryany	f60f61d0b3	[libFuzzer] extend help for -minimize_crash to cover ASAN_OPTIONS=dedup_token_length=3 llvm-svn: 300800	2017-04-19 23:58:05 +00:00
Craig Topper	b3624e4f45	[APInt] Implement operator==(uint64_t) similar to ugt/ult(uint64_t) to remove one of the out of line EqualsSlowCase methods. llvm-svn: 300799	2017-04-19 23:57:51 +00:00
Craig Topper	0e7f6fb6b4	[APInt] Don't call getActiveBits() in ult/ugt(uint64_t) if its a single word. The compiled code already needs to check single/multi word for the countLeadingZeros call inside of getActiveBits, but it isn't able to optimize out the leadingZeros call in the single word case that can't produce a value larger than 64. This shrank the opt binary by about 5-6k on my local x86-64 build. llvm-svn: 300798	2017-04-19 23:55:48 +00:00
Sanjoy Das	25e71d87b5	Statepoint Docs: fix incorrect uses of it's llvm-svn: 300797	2017-04-19 23:55:03 +00:00
Craig Topper	9700a60a61	[APInt] Use ugt(uint64_t) for the compare in getLimitedValue(uint64_t) since the code is identical to it. NFC llvm-svn: 300796	2017-04-19 23:52:59 +00:00
Reid Kleckner	f1de9e83c2	[DAE] Simplify attribute list creation, NFC Removes a use of getSlotAttributes, which I intend to change. llvm-svn: 300795	2017-04-19 23:45:45 +00:00

... 4 5 6 7 8 ...

148261 Commits