llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	ac234b604d	AMDGPU: Rename enums to be consistent with HSA code object terminology llvm-svn: 254330	2015-11-30 21:15:57 +00:00
Matt Arsenault	0e3d38937e	AMDGPU: Remove SIPrepareScratchRegs It does not work because of emergency stack slots. This pass was supposed to eliminate dummy registers for the spill instructions, but the register scavenger can introduce more during PrologEpilogInserter, so some would end up left behind if they were needed. The potential for spilling the scratch resource descriptor and offset register makes doing something like this overly complicated. Reserve registers to use for the resource descriptor and use them directly in eliminateFrameIndex. Also removes creating another scratch resource descriptor when directly selecting scratch MUBUF instructions. The choice of which registers are reserved is temporary. For now it attempts to pick the next available registers after the user and system SGPRs. llvm-svn: 254329	2015-11-30 21:15:53 +00:00
Matt Arsenault	ff6da2fe89	AMDGPU: Use assert zext for workgroup sizes llvm-svn: 254328	2015-11-30 21:15:45 +00:00
Quentin Colombet	cdad10f333	[ARM] For old thumb ISA like v4t, we cannot use PC directly in pop. Fix the epilogue emission to account for that. llvm-svn: 254325	2015-11-30 20:37:58 +00:00
Davide Italiano	1aeed6a955	[SimplifyLibCalls] Transform log(exp2(y)) to y*log(2) under fast-math. llvm-svn: 254317	2015-11-30 19:36:35 +00:00
David Majnemer	bf4119faf6	[X86] Add RIP to GR64_TCW64 The MachineVerifier wants to check that the register operands of an instruction belong to the instruction's register class. RIP-relative control flow instructions violated this by referencing RIP. While this was fixed for SysV, it was never fixed for Win64. llvm-svn: 254315	2015-11-30 19:04:19 +00:00
Kit Barton	f4ce2f3a9e	Enable shrink wrapping for PPC64 Re-enable shrink wrapping for PPC64 Little Endian. One minor modification to PPCFrameLowering::findScratchRegister was necessary to handle fall-thru blocks (blocks with no terminator) correctly. Tested with all LLVM test, clang tests, and the self-hosting build, with no problems found. PHabricator: http://reviews.llvm.org/D14778 llvm-svn: 254314	2015-11-30 18:59:41 +00:00
Rafael Espindola	c98b20b0d6	Fix another llvm.ctors merging bug. We were not looking past casts to see if an element should be included or not. llvm-svn: 254313	2015-11-30 18:54:24 +00:00
Dan Gohman	96029f7880	[WebAssembly] Fix a few minor compiler warnings. NFC. llvm-svn: 254311	2015-11-30 18:42:08 +00:00
Sanjay Patel	239be1fb0d	fix formatting; NFC llvm-svn: 254310	2015-11-30 17:52:02 +00:00
Colin LeMahieu	e6241798c9	[Hexagon] NFC Reordering headers. llvm-svn: 254307	2015-11-30 17:32:34 +00:00
Matt Arsenault	ea03cf2fa1	AMDGPU: Don't reserve SCRATCH_PTR input register This hasn't been doing anything since using relocations was added. llvm-svn: 254304	2015-11-30 15:46:47 +00:00
Aaron Ballman	33c95f08b0	Silencing a 32-bit to 64-bit implicit conversion warning; NFC. llvm-svn: 254302	2015-11-30 14:52:33 +00:00
Hrvoje Varga	c03957f049	[mips][microMIPS] Implement LBUX, LHX, LWX, MAQ_S[A].W.PHL, MAQ_S[A].W.PHR, MFHI, MFLO, MTHI and MTLO instructions Differential Revision: http://reviews.llvm.org/D14436 llvm-svn: 254297	2015-11-30 12:58:39 +00:00
Zoran Jovanovic	a887b36167	[mips][microMIPS] Fix issue with offset operand of BALC and BC instructions Value of offset operand for microMIPS BALC and BC instructions is currently shifted 2 bits, but it should be 1 bit. Differential Revision: http://reviews.llvm.org/D14770 llvm-svn: 254296	2015-11-30 12:56:18 +00:00
Zlatko Buljan	56f3b0e410	[mips][microMIPS] Implement PRECR.QB.PH, PRECR_SRA[_R].PH.W, PRECRQ.PH.W, PRECRQ.QB.PH, PRECRQU_S.QB.PH and PRECRQ_RS.PH.W instructions Differential Revision: http://reviews.llvm.org/D14605 llvm-svn: 254291	2015-11-30 08:37:38 +00:00
Craig Topper	27e2912fa8	Revert r254279 "[X86] Use ArrayRef. NFC". It seems to have upset an MSVC build bot. llvm-svn: 254280	2015-11-30 02:28:19 +00:00
Craig Topper	b84f39865f	[X86] Use ArrayRef. NFC llvm-svn: 254279	2015-11-30 02:08:05 +00:00
Craig Topper	aad5f11e5f	[AVX512] The vpermi2 instructions require an integer vector for the index vector. This is reflected correctly in the intrinsics, but was not refelected in the isel patterns. For the floating point types, this requires adding a bitcast to the index vector when its passed through to the output. llvm-svn: 254277	2015-11-30 00:13:24 +00:00
Sanjoy Das	9b0015f77d	[SCEV] Use lambda instead of std::bind; NFC The lambda is more readable. llvm-svn: 254276	2015-11-29 23:40:57 +00:00
Sanjoy Das	3b827c7028	[SCEV] Use range version of all_of; NFC llvm-svn: 254275	2015-11-29 23:40:53 +00:00
Craig Topper	fbde7aa13a	[X86] Remove duplicate entries from intrinsics tables and add asserts to verify there are no others. llvm-svn: 254274	2015-11-29 23:18:32 +00:00
Dan Gohman	9551a44d0c	[WebAssembly] Delete an obsolete TODO comment. llvm-svn: 254272	2015-11-29 23:09:41 +00:00
Dan Gohman	174b2d83ee	[WebAssembly] Set several MCInstrDesc flags. llvm-svn: 254271	2015-11-29 22:59:19 +00:00
Craig Topper	ecae476e4c	[X86] int_x86_avx2_permps and X86ISD::VPERMV should take an integer vector for its shuffle indices. llvm-svn: 254269	2015-11-29 22:53:22 +00:00
Dan Gohman	5237b3991d	[WebAssembly] Delete unused functions. NFC. llvm-svn: 254268	2015-11-29 22:48:57 +00:00
Dan Gohman	7a6b9825ce	[WebAssembly] Minor clang-format and selected clang-tidy cleanups. NFC. llvm-svn: 254267	2015-11-29 22:32:02 +00:00
Sanjay Patel	b67076c0f8	fix typos in comments; NFC llvm-svn: 254266	2015-11-29 22:09:34 +00:00
Davide Italiano	0b14f29285	[SimplifyLibCalls] Don't crash if the function doesn't have a name. llvm-svn: 254265	2015-11-29 21:58:56 +00:00
Davide Italiano	e2db58cfb8	[SimplifyLibCalls] Cross out implemented transformations. llvm-svn: 254264	2015-11-29 21:00:43 +00:00
Davide Italiano	b8b7133c94	[SimplifyLibCalls] Tranform log(pow(x, y)) -> ylog(x). This one is enabled only under -ffast-math. There are cases where the difference between the value computed and the correct value is huge even for ffast-math, e.g. as Steven pointed out: x = -1, y = -4 log(pow(-1), 4) = 0 4log(-1) = NaN I checked what GCC does and apparently they do the same optimization (which result in the dramatic difference). Future work might try to make this (slightly) less worse. Differential Revision: http://reviews.llvm.org/D14400 llvm-svn: 254263	2015-11-29 20:58:04 +00:00
Diego Novillo	7ff0a174d1	SamplePGO - Do not use std::to_string in diagnostics. This fixes buildbots in systems that std::to_string is not present. It also tidies the output of the diagnostic to render doubles a bit better (thanks Ben Kramer for help with string streams and format). llvm-svn: 254261	2015-11-29 18:23:26 +00:00
Craig Topper	6066164454	Use a lambda instead of std::bind and std::mem_fn I introduced in r254242. NFC llvm-svn: 254260	2015-11-29 18:05:22 +00:00
Simon Pilgrim	88aa627c0b	[X86][SSE] Added support for lowering to ADDSUBPS/ADDSUBPD with commuted inputs We could already recognise shuffle(FSUB, FADD) -> ADDSUB, this allow us to recognise shuffle(FADD, FSUB) -> ADDSUB by commuting the shuffle mask prior to matching. llvm-svn: 254259	2015-11-29 16:41:04 +00:00
Rafael Espindola	eb5e0a77b4	Simplify. NFC. llvm-svn: 254254	2015-11-29 14:33:06 +00:00
Igor Breger	e293e83f5d	AVX512:Implemented encoding for the vmovq.s instruction. Differential Revision: http://reviews.llvm.org/D14810 llvm-svn: 254248	2015-11-29 07:41:26 +00:00
Craig Topper	d896b03e4c	Remove an intermediate lambda. NFC llvm-svn: 254246	2015-11-29 05:38:08 +00:00
Craig Topper	b4b66d06df	Remove unnecessary intermediate lambda. NFC llvm-svn: 254243	2015-11-29 04:37:14 +00:00
Craig Topper	d0573179dc	[SelectionDAG] Use std::any_of instead of a manually coded loop. NFC llvm-svn: 254242	2015-11-29 04:37:11 +00:00
Rafael Espindola	c945c8d22e	Correctly handle llvm.global_ctors merging. We were not handling the case where an entry must be dropped and the destination module has no llvm.global_ctors. llvm-svn: 254241	2015-11-29 03:29:42 +00:00
Rafael Espindola	9f30fac4d8	Fix a crash when writing merged bitcode. Playing with mutateType in here was making getValueType and getType incompatible. llvm-svn: 254240	2015-11-29 03:21:30 +00:00
Davide Italiano	da3beebad1	[SimplifyLibCalls] Use any_of(). Suggested by David Blaikie! llvm-svn: 254239	2015-11-28 22:27:48 +00:00
Benjamin Kramer	89766e5b1d	[SimplifyLibCalls] Fix inverted condition that lead to an uninitialized memory read below. Found by msan! llvm-svn: 254238	2015-11-28 21:43:12 +00:00
Xinliang David Li	b75544a6df	[PGO] Move value profile format related structures and APIs to common file This is the last step to enable profile runtime to share the same value prof data format and reader/writer code with llvm host tools. The VP related data structures are moved to a section in InstrProfData.inc enabled with macro INSTR_PROF_VALUE_PROF_DATA, and common API implementations are enabled with INSTR_PROF_COMMON_API_IMPL. There should be no functional change. llvm-svn: 254235	2015-11-28 19:07:09 +00:00
Renato Golin	5dbc8a5283	Revert "[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM." This reverts commit r254201 and r254202, as it broke test-suite, self-hosting and sanitizer tests on ARM buildbots. llvm-svn: 254234	2015-11-28 17:23:46 +00:00
Jonas Paulsson	f12b925bb1	[Stack realignment] Handling of aligned allocas. This patch implements dynamic realignment of stack objects for targets with a non-realigned stack pointer. Behaviour in FunctionLoweringInfo is changed so that for a target that has StackRealignable set to false, over-aligned static allocas are considered to be variable-sized objects and are handled with DYNAMIC_STACKALLOC nodes. It would be good to group aligned allocas into a single big alloca as an optimization, but this is yet todo. SystemZ benefits from this, due to its stack frame layout. New tests SystemZ/alloca-03.ll for aligned allocas, and SystemZ/alloca-04.ll for "no-realign-stack" attribute on functions. Review and help from Ulrich Weigand and Hal Finkel. llvm-svn: 254227	2015-11-28 11:02:32 +00:00
Craig Topper	e471cf32a0	Use range-based for loops. NFC llvm-svn: 254222	2015-11-28 08:23:04 +00:00
Xinliang David Li	017fffbd2a	[PGO] Add return code for vp rt record init routine to indicate error condition llvm-svn: 254220	2015-11-28 05:47:34 +00:00
Xinliang David Li	4cccee52ce	[PGO] Allow value profile writer interface to allocated target buffer Raw profile writer needs to write all data of one kind in one continuous block, so the buffer needs to be pre-allocated and passed to the writer method in pieces for function profile data. The change adds the support for raw value data writing. llvm-svn: 254219	2015-11-28 05:37:01 +00:00
Xinliang David Li	be969c2942	Function name cleanup (NFC) llvm-svn: 254218	2015-11-28 05:06:00 +00:00
Xinliang David Li	8e32f4d5b6	[PGO] Extract VP data integrity check code into a helper function (NFC) llvm-svn: 254217	2015-11-28 04:56:07 +00:00
Diego Novillo	84f06cc835	SamplePGO - Add initial support for inliner annotations. This adds two thresholds to the sample profiler to affect inlining decisions: the concept of global hotness and coldness. Functions that have accumulated more than a certain fraction of samples at runtime, are annotated with the InlineHint attribute. Conversely, functions that accumulate less than a certain fraction of samples, are annotated with the Cold attribute. This is very similar to the hints emitted by Clang when using instrumentation profiles. Notice that this is a very blunt instrument. A function may have globally collected a significant fraction of samples, but that does not necessarily mean that every callsite for that function is hot. Ideally, we would annotate each callsite with the samples collected at that callsite. This way, the inliner can incorporate all these weights into its cost model. Once the inliner offers this functionality, we can change the hints emitted here to a more precise per-callsite annotation. For now, this is providing some measure of speedups with our internal benchmarks. I've observed speedups of up to 23% (though the geo mean is about 3%). I expect these numbers to improve as the inliner gets better annotations. llvm-svn: 254212	2015-11-27 23:14:51 +00:00
Diego Novillo	b579240875	SamplePGO - Fix default threshold for hot callsites. Based on testing of internal benchmarks, I'm lowering this threshold to a value of 0.1%. This means that SamplePGO will respect 99.9% of the original inline decisions when following a profile. The performance difference is noticeable in some tests. With the previous threshold, the speedups over baseline -O2 was about 0.63%. With the new default, the speedups are around 3% on average. The point of this threshold is not to do more aggressive inlining. When an inlined callsite crosses this threshold, SamplePGO will redo the inline decision so that it can better apply the input profile. By respecting most original inline decisions, we can apply more of the input profile because the shape of the code follows the profile more closely. In the next series, I'll be looking at adding some inline hints for the cold callsites and for toplevel functions that are hot/cold as well. llvm-svn: 254211	2015-11-27 23:14:49 +00:00
Rafael Espindola	19b52383c5	Simplify the linking of recursive data. Now the ValueMapper has two callbacks. The first one maps the declaration. The ValueMapper records the mapping and then materializes the body/initializer. llvm-svn: 254209	2015-11-27 20:28:19 +00:00
Artyom Skrobov	f01a59f9fb	Follow-up fix for r254201 llvm-svn: 254202	2015-11-27 16:20:34 +00:00
Artyom Skrobov	b955b90509	[ARM] Generate ABI_optimization_goals build attribute, as described in the ARM ARM. Summary: Since this build attribute corresponds to a whole module, and different functions in a module may differ in the optimizations enabled for them, this attribute is emitted after all functions, and only in the case that the optimization goals for all functions match. Reviewers: logan, hans Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14934 llvm-svn: 254201	2015-11-27 15:30:51 +00:00
Oliver Stannard	b25914e03f	[AArch64] Add ARMv8.2-A FP16 scalar instructions ARMv8.2-A adds 16-bit floating point versions of all existing VFP floating-point instructions. This is an optional extension, so all of these instructions require the FeatureFullFP16 subtarget feature. Most of these instructions are the same as the 32- and 64-bit versions, but with the type field (bits 23-22) set to 0b11. Previously the top bit of the size field was always 0, so the instruction classes only provided a 1-bit size field, which I have widened to 2 bits. Differential Revision: http://reviews.llvm.org/D15014 llvm-svn: 254198	2015-11-27 13:04:48 +00:00
Adhemerval Zanella	d93c0c4dc4	[sanitizer] [dfsan] Unify aarch64 mapping This patch changes the DFSan instrumentation for aarch64 to instead of using fixes application mask defined by SANITIZER_AARCH64_VMA to read the application shadow mask value from compiler-rt. The value is initialized based on runtime VAM detection. Along with this patch a compiler-rt one will also be added to export the shadow mask variable. llvm-svn: 254196	2015-11-27 12:42:39 +00:00
Davide Italiano	ac0953a2e6	[SimplifyLibCalls] Use range-based loop. NFC. llvm-svn: 254193	2015-11-27 08:05:40 +00:00
Craig Topper	e38c57a4b8	[X86] Pair a NoVLX with HasAVX512 to match the others and remove a unique predicate check in the isel tables. NFC llvm-svn: 254191	2015-11-27 05:44:02 +00:00
Peter Collingbourne	8359a6a83e	MC: Simplify handling of temporary symbols in COFF writer. The COFF object writer was previously adding unnecessary symbols to its temporary data structures and cleaning them up later. This made the code harder to understand and caused a bug (aliases classed as temporary symbols would cause an assertion failure). A much simpler way of handling such symbols is to ask the layout for their section-relative position when needed. Tested with a bootstrap on Windows and by building Chrome. Differential Revision: http://reviews.llvm.org/D14975 llvm-svn: 254183	2015-11-26 23:29:27 +00:00
Charlie Turner	54336a5a4e	[LoopVectorize] Use MapVector rather than DenseMap for MinBWs. The order in which instructions are truncated in truncateToMinimalBitwidths effects code generation. Switch to a map with a determinisic order, since the iteration order over a DenseMap is not defined. This code is not hot, so the difference in container performance isn't interesting. Many thanks to David Blaikie for making me aware of MapVector! Fixes PR25490. Differential Revision: http://reviews.llvm.org/D14981 llvm-svn: 254179	2015-11-26 20:39:51 +00:00
Craig Topper	a47576f297	[X86] Now that X86VPermt2 is used in all the avx512_perm_t_sizes just hardcode it into the patterns instead of passing as an argument. NFC llvm-svn: 254177	2015-11-26 20:21:29 +00:00
Craig Topper	05858f52fe	[X86] Merge X86VPermt2Fp and X86VPermt2Int back together by weakening them just enough. The SDTCisSameSizeAs introduced in r254138 helps here. llvm-svn: 254176	2015-11-26 20:02:01 +00:00
Craig Topper	0009656335	[X86] Split ISD node for Vfpclass and Vfpclasss so that we can write strong type constraints for each that don't cause ambiguous isel. llvm-svn: 254172	2015-11-26 19:41:34 +00:00
Rafael Espindola	8934577171	Disallow aliases to available_externally. They are as much trouble as aliases to declarations. They are requiring the code generator to define a symbol with the same value as another symbol, but the second symbol is undefined. If representing this is important for some optimization, we could add support for available_externally aliases. They would be required to point to a declaration (or available_externally definition). llvm-svn: 254170	2015-11-26 19:22:59 +00:00
Craig Topper	ff2f14731a	[X86] Revert part of r254167 to recover bots. llvm-svn: 254169	2015-11-26 19:13:05 +00:00
Krzysztof Parzyszek	08ff8883fd	[Hexagon] Lowering of V60/HVX vector types llvm-svn: 254168	2015-11-26 18:38:27 +00:00
Craig Topper	9d1deb4b72	[X86] Strengthen more type constraints to reduce isel table size. llvm-svn: 254167	2015-11-26 18:31:19 +00:00
Krzysztof Parzyszek	4eb6d4d1f2	[Hexagon] Hexagon V60 HVX intrinsic defintions Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 254165	2015-11-26 16:54:33 +00:00
Daniel Sanders	daa4b6fbd9	[mips][ias] Range check uimm5 operands and fix several bugs this revealed. Summary: The bugs were: * append, prepend, and balign were not tested * balign takes a uimm2 not a uimm5. * drotr32 was correctly implemented with a uimm5 but the tests expected '52' to be valid. * li/la were implemented with a uimm5 instead of simm32. simm32 isn't completely correct either but I'll fix that when I get to simm32. A notable omission are some of the shift instructions. Several of these have been implemented using a single uimm6 instruction (rather than two uimm5 instructions and a CodeGen-only uimm6 pseudo). These will be updated in the uimm6 patch. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14712 llvm-svn: 254164	2015-11-26 16:35:41 +00:00
Oliver Stannard	64c167db7a	[AArch64] Add ARMv8.2-A new AT instruction variants ARMv8.2-A adds new variants of the "at" (address translate) system instruction, which take the PSTATE.PAN bit (added in ARMv8.1-A). These are a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15018 llvm-svn: 254159	2015-11-26 15:34:44 +00:00
Martell Malone	d12292480a	ARM: address WOA unsigned division overflow crash Building on r253865 the crash is not limited to signed overflows. Disable custom handling of unsigned 32-bit and 64-bit integer divide. Add test cases for both 32-bit and 64-bit unsigned integer overflow. llvm-svn: 254158	2015-11-26 15:34:03 +00:00
Oliver Stannard	911ea20f07	[AArch64] Add ARMv8.2-A UAO PSTATE bit ARMv8.2-A adds a new PSTATE bit, PSTATE.UAO, which allows the LDTR/STTR instructions to behave the same as LDR/STR with respect to execute-only pages at higher privilege levels. New variants of the MSR/MRS instructions are added to allow reading and writing this bit. It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15020 llvm-svn: 254157	2015-11-26 15:32:30 +00:00
Oliver Stannard	1a81cc9f43	[AArch64] Add ARMv8.2-A persistent memory instruction ARMv8.2-A adds the "dc cvap" instruction, which is a system instruction that cleans caches to the point of persistence (for systems that have persistent memory). It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15016 llvm-svn: 254156	2015-11-26 15:28:47 +00:00
Oliver Stannard	48b43741d0	[AArch64] Add ARMv8.2-A ID_A64MMFR2_EL1 register ARMv8.2-A adds a new ID register, ID_A64MMFR2_EL1, which behaves in the same way as ID_A64MMFR0_EL1 and ID_A64MMFR1_EL1. It is a required part of ARMv8.2-A, so no additional subtarget features are required. Differential Revision: http://reviews.llvm.org/D15017 llvm-svn: 254155	2015-11-26 15:26:10 +00:00
Oliver Stannard	7cc0c4e675	[AArch64] Add subtarget features for ARMv8.2-A This adds subtarget features for ARMv8.2-A, which builds on (and requires the features from) ARMv8.1-A. Most assembler-visible features of ARMv8.2-A are system instructions, and are all required parts of the architecture, so just depend on the HasV8_2aOps subtarget feature. There is also one large, optional feature, which adds 16-bit floating point versions of all existing floating-point instructions (VFP and SIMD), this is represented by the FeatureFullFP16 subtarget feature. Differential Revision: http://reviews.llvm.org/D15013 llvm-svn: 254154	2015-11-26 15:23:32 +00:00
Benjamin Kramer	fb419e71f4	[SimplifyLibCalls] Don't depend on a called function having a name, it might be an indirect call. Fixes the crasher in PR25651 and related crashers using the same pattern. llvm-svn: 254145	2015-11-26 09:51:17 +00:00
Craig Topper	a3ac738725	[X86] Strengthen more type constraints to reduce isel table size. llvm-svn: 254142	2015-11-26 07:58:20 +00:00
Vyacheslav Klochkov	ed865dfcc5	X86-FMA3: Improved/enabled the memory folding optimization for scalar loads generated for _mm_losd_s{s,d}() intrinsics and used in scalar FMAs generated for FMA intrinsics _mm_f{madd,msub,nmadd,nmsub}_s{s,d}(). Reviewer: David Kreitzer Differential Revision: http://reviews.llvm.org/D14762 llvm-svn: 254140	2015-11-26 07:45:30 +00:00
Craig Topper	4c175cdc8e	[X86] Strengthen the type constraints on X86psadbw and X86dbpsadbw to reduce some of the type checks in the isel matching tables. llvm-svn: 254139	2015-11-26 07:02:21 +00:00
Krzysztof Parzyszek	195dc8d0db	[Hexagon] HVX vector register classes and more isel patterns llvm-svn: 254132	2015-11-26 04:33:11 +00:00
Tom Stellard	48f29f21ee	AMDGPU: Add llvm.amdgcn.dispatch.ptr intrinsic Summary: This returns a pointer to the dispatch packet, which can be used to load information about the kernel dispach. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14898 llvm-svn: 254116	2015-11-26 00:43:29 +00:00
Xinliang David Li	ed966771da	[PGO] Implement ValueProfiling Closure interfaces for runtime value profile data This is one of the many steps to commonize value profiling support between profile runtime and compiler/llvm tools. After this change, profiler runtime now can share the same C APIs to do VP serialization/deseriazation with LLVM host tools (and produces value data in identical format between indexed and raw profile). It is not yet enabled in profiler runtime yet. Also added a unit test case to test runtime profile data serialization/deserialization interfaces implemented using common closure code. llvm-svn: 254110	2015-11-25 23:31:18 +00:00
Evgeniy Stepanov	9842d61ca4	[safestack] Fix alignment of dynamic allocas. Fixes PR25588. llvm-svn: 254109	2015-11-25 22:52:30 +00:00
Dan Gohman	a774d719a0	[WebAssembly] Fix inline asm support for i64 operands. llvm-svn: 254106	2015-11-25 22:28:50 +00:00
Dan Gohman	d9b4218831	[WebAssembly] Fold setne and seteq comparisons into selects. llvm-svn: 254104	2015-11-25 22:13:48 +00:00
Kostya Serebryany	2d0ef14f5d	[libFuzzer] add a flag -exact_artifact_path llvm-svn: 254100	2015-11-25 21:40:46 +00:00
Krzysztof Parzyszek	70a134d29f	[Hexagon] Treat transfers of FP immediates are pseudo instructions This is a temporary fix to address ICE on 2005-10-21-longlonggtu.ll. The proper fix will be to use A2_tfrsi, but it will need more work to teach all users of A2_tfrsi to also expect a floating-point operand. llvm-svn: 254099	2015-11-25 21:40:03 +00:00
Dan Gohman	5941bde03c	[WebAssembly] Add some comments. NFC. llvm-svn: 254096	2015-11-25 21:32:06 +00:00
Marek Olsak	7ed6b2f414	AMDGPU/SI: select S_ABS_I32 when possible (v2) v2: added more tests, moved the SALU->VALU conversion to a separate function It looks like it's not possible to get subregisters in the S_ABS lowering code, and I don't feel like guessing without testing what the correct code would look like. llvm-svn: 254095	2015-11-25 21:22:45 +00:00
Dan Gohman	80e34e0a18	[WebAssembly] Fix WebAssembly register numbering for registers added late. If virtual registers are created late, mappings to WebAssembly registers need to be added explicitly. This patch adds a function to do so and teaches WebAssemblyPeephole to use it. This fixes an out-of-bounds access on the WARegs vector. llvm-svn: 254094	2015-11-25 21:13:02 +00:00
Davide Italiano	dd04fee8a6	[SCCP] More informative message if we don't know how to handle a terminator. llvm-svn: 254093	2015-11-25 21:03:36 +00:00
Matt Arsenault	49affb8462	AMDGPU: Check feature attributes in SIMachineFunctionInfo llvm-svn: 254091	2015-11-25 20:55:12 +00:00
Krzysztof Parzyszek	207c13f254	Add hexagonv55 and hexagonv60 as recognized CPUs, make v60 the default llvm-svn: 254089	2015-11-25 20:30:59 +00:00
Matt Arsenault	61001bbc03	AMDGPU: Make v2i64/v2f64 legal types. They can be loaded and stored, so count them as legal. This is mostly to fix a number of common cases for load/store merging. llvm-svn: 254086	2015-11-25 19:58:34 +00:00
Artyom Skrobov	314ee04268	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC) Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085	2015-11-25 19:41:11 +00:00
Dan Gohman	fb3e0594e4	[WebAssembly] Use a physical register to describe ARGUMENT liveness. Instead of trying to move ARGUMENT instructions back up to the top after they've been scheduled or sunk down, use a fake physical register to create a liveness constraint that prevents ARGUMENT instructions from moving down in the first place. This is still not entirely ideal, however it is more robust than letting them move and moving them back. llvm-svn: 254084	2015-11-25 19:36:19 +00:00
Xinliang David Li	e809231b22	[PGO] Regroup functions in better order (NFC) llvm-svn: 254080	2015-11-25 19:13:00 +00:00
Dan Gohman	9c54d3b4c6	[WebAssembly] Clean up several FIXME comments. llvm-svn: 254079	2015-11-25 18:13:18 +00:00
Dan Gohman	81719f8555	[WebAssembly] Support for register stackifying with load and store instructions. llvm-svn: 254076	2015-11-25 16:55:01 +00:00
Dan Gohman	2c8fe6a428	[WebAssembly] Codegen support for ISD::ExternalSymbol llvm-svn: 254075	2015-11-25 16:44:29 +00:00
Dan Gohman	fd4a88c376	[WebAssembly] Add 'final' to some classes. NFC. llvm-svn: 254073	2015-11-25 16:29:24 +00:00
Dan Gohman	04c0401f28	[WebAssembly] Whitespace consistency. NFC. llvm-svn: 254071	2015-11-25 16:26:14 +00:00
Sanjay Patel	25150784ae	fix typo; NFC llvm-svn: 254069	2015-11-25 15:33:36 +00:00
Hal Finkel	005f840959	[PowerPC] Don't generate mfocrf on the e500mc The e500mc does not actually support the mfocrf instruction; update the processor definitions to reflect that fact. Patch by Tom Rix (with some test-case cleanup by me). llvm-svn: 254064	2015-11-25 10:14:31 +00:00
Eric Christopher	4675c439aa	Fix some places where we were assuming that memory type had been legalized to a simple type when lowering a truncating store of a vector type. In this case for an EVT we'll return Expand as we should in all of the cases anyhow. The testcase triggered at the one in VectorLegalizer::LegalizeOp, inspection found the rest. llvm-svn: 254061	2015-11-25 09:11:53 +00:00
Elena Demikhovsky	f07df9fcac	AVX-512: Fixed a bug in VPERMT2* intrinsic. It was wrong order of operands (from intrinsic to DAG node). I added more strict type specification for instruction selection. Differential Revision: http://reviews.llvm.org/D14942 llvm-svn: 254059	2015-11-25 08:17:56 +00:00
Xinliang David Li	f47cf5505f	[PGO] Convert InstrProfRecord based serialization methods to use common C methods 1. Convert serialization methods using InstrProfRecord as source into C (impl) interfaces using Closure. 2. Reimplement InstrProfRecord serialization method to use new C interface as dummy wrapper. Now it is ready to implement wrapper for runtime value profile data. (The new code need better source location -- but not changed in this patch to minimize diffs. ) llvm-svn: 254057	2015-11-25 06:23:38 +00:00
Xinliang David Li	ac5b860633	[PGO] convert a subset of C++ interfaces into C (for sharing) (NFC) llvm-svn: 254056	2015-11-25 04:29:24 +00:00
Xinliang David Li	4f18bef998	Move member functions closer to others of the same class (NFC) llvm-svn: 254055	2015-11-25 03:24:37 +00:00
Peter Collingbourne	463ff6d823	AsmParser: Make the code for parsing unnamed aliases more closely resemble that for unnamed globals. This fixes parsing of forward references to unnamed aliases. While here, remove an unnecessary isa check. llvm-svn: 254054	2015-11-25 02:54:07 +00:00
Sanjoy Das	c521c7bea5	[OperandBundles] Extract duplicated code into a helper function, NFC llvm-svn: 254047	2015-11-25 00:42:24 +00:00
Sanjoy Das	7629346193	[InstCombine] Don't drop operand bundles Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14857 llvm-svn: 254046	2015-11-25 00:42:19 +00:00
Xinliang David Li	4945b16708	Fix function naming (NFC) llvm-svn: 254045	2015-11-25 00:08:49 +00:00
Hans Wennborg	e412b71f95	Revert r253528: "[X86] Enable shrink-wrapping by default." This caused PR25607 and also caused Chromium to crash on start-up. (Also had to update test/CodeGen/X86/avx-splat.ll, which was committed after shrink wrapping was enabled.) llvm-svn: 254044	2015-11-25 00:05:13 +00:00
Kaelyn Takata	d0955312d9	Fix an asan error where NumElements > 32 for at least one case in test/CodeGen/X86/avg.ll. llvm-svn: 254043	2015-11-25 00:03:29 +00:00
Rong Xu	25c106b347	[PGO] Revert revision r254021,r254028,r254035 Revert the above revision due to multiple issues. llvm-svn: 254040	2015-11-24 23:49:08 +00:00
Xinliang David Li	28b700373e	[PGO] Add mapper callback to interfaces retrieving value data for site (NFC) This allows cleaner implementation and merging retrieving/mapping in one pass. llvm-svn: 254038	2015-11-24 23:36:52 +00:00
Teresa Johnson	3930361969	[ThinLTO] Add option to limit importing based on instruction count Add a simple initial heuristic to control importing based on the number of instructions recorded in the function's summary. Add option to control the limit, and test using option. llvm-svn: 254036	2015-11-24 22:55:46 +00:00
Diego Novillo	0b6985a3c6	SamplePGO - Add test for hot/cold inlined functions. When the original binary is executed and sampled, the resulting profile contains information on the original inline stack. We currently follow the original inline plan if we notice that the inlined callsite has more than 0 samples to it. A better way is to determine whether the callsite is actually worth inlining. If the callsite accumulates a small fraction of the samples spent in the parent function, then we don't want to bother inlining it (as it means that the callsite is actually cold). This patch introduces a threshold expressed in percentage of samples in relation to the parent function. If the callsite uses less than N% of the total samples used by its parent, the original inline decision is not re-applied. I've set the threshold to the very arbitrary value of 5%. I'm yet to do any actual experiments to see what's a good value. I wanted to separate the basic mechanism from the tuning. llvm-svn: 254034	2015-11-24 22:38:37 +00:00
Rong Xu	4dd22b8d2b	[PGO] Fix build errors in x86_64-darwin Fix buildbot failure for x86_64-darwin due to r254021 llvm-svn: 254028	2015-11-24 21:55:50 +00:00
Rong Xu	1b665ca707	[PGO] MST based PGO instrumentation infrastructure This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees minimum number of CFG edges getting instrumented. An addition optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 254021	2015-11-24 21:31:25 +00:00
Teresa Johnson	d450da3281	[ThinLTO] Refactor function body scan during importing into helper (NFC) llvm-svn: 254020	2015-11-24 21:15:19 +00:00
Sanjoy Das	990914d64c	[RuntimeDyld] Fix a class of arithmetic errors introduced in r253918 r253918 had refactored expressions like "A - B.Address + C" to "A - B.getAddressWithOffset(C)". This is incorrect, since the latter really computes "A - B.Address - C". None of the tests I can run locally on x86 broke due to this bug, but it is the current suspect for breakage on the AArch64 buildbots. llvm-svn: 254017	2015-11-24 20:37:01 +00:00
Simon Pilgrim	1b4fecb098	[X86][FMA] Optimize FNEG(FMA) Patterns X86 needs to use its own FMA opcodes, preventing the standard FNEG(FMA) pattern table recognition method used by other platforms. This patch adds support for lowering FNEG(FMA(X,Y,Z)) into a single suitably negated FMA instruction. Fix for PR24364 Differential Revision: http://reviews.llvm.org/D14906 llvm-svn: 254016	2015-11-24 20:31:46 +00:00
Matthias Braun	147110da84	LiveVariables should not clobber MachineOperand::IsDead, ::IsKill on reserved physical registers Patch by Nick Johnson <Nicholas.Paul.Johnson@deshawresearch.com> Differential Revision: http://reviews.llvm.org/D14875 llvm-svn: 254012	2015-11-24 20:06:56 +00:00
Teresa Johnson	130de7af7f	[ThinLTO] Enable iterative importing in FunctionImport pass Analyze imported function bodies and add any new external calls to the worklist for importing. Currently no controls on the importing so this will end up importing everything possible in the call tree below the importing module. Basic profitability checks coming next. Update test to check for iteratively inlined functions. llvm-svn: 254011	2015-11-24 19:55:04 +00:00
Cong Hou	db6220f84d	[X86] Fix several issues related to X86's psadbw instruction. This patch fixes the following issues: 1. Fix the return type of X86psadbw: it should not be the same type of inputs. For vNi8 inputs the output should be vMi64, where M = N/8. 2. Fix the return type of int_x86_avx512_psad_bw_512 accordingly. 3. Fix the definiton of PSADBW, VPSADBW, and VPSADBWY accordingly. 4. Adjust the return type when building a DAG node of X86ISD::PSADBW type. 5. Update related tests. Differential revision: http://reviews.llvm.org/D14897 llvm-svn: 254010	2015-11-24 19:51:26 +00:00
Teresa Johnson	b098f0c133	[ThinLTO] Handle previously imported and promoted locals in module linker The new function import pass exposed an issue when we import references to local values on multiple importing passes. They are renamed on each import pass, and we need to ensure that the already promoted and renamed references existing in the dest module are correctly identified and updated so that they aren't spuriously renamed again (due to a perceived conflict with the newly linked reference). llvm-svn: 254009	2015-11-24 19:46:58 +00:00
Weiming Zhao	45d4cb9a14	[Utils] Put includes in correct order. NFC. Summary: Followed the guidelines in: http://llvm.org/docs/CodingStandards.html#include-style However, I noticed that uppercase named headers come before lowercase ones throughout the codebase. So kept them as is. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide, jmolloy, atrick Subscribers: sanjoy Differential Revision: http://reviews.llvm.org/D14939 llvm-svn: 254005	2015-11-24 18:57:06 +00:00
Xinliang David Li	759dc628c0	[PGO] Small interface change to be profile rt ready Convert two C++ static member functions to be C APIs. This is one of the many steps to get ready to share VP writer code with profiler runtime. llvm-svn: 253999	2015-11-24 18:15:46 +00:00
Sanjay Patel	968e91aea0	[InstCombine] fix propagation of fast-math-flags Noticed while working on D4583: http://reviews.llvm.org/D4583 llvm-svn: 253997	2015-11-24 17:51:20 +00:00
Sanjay Patel	739f2ce93a	use convenience function for copying IR flags; NFCI llvm-svn: 253996	2015-11-24 17:16:33 +00:00
Xinliang David Li	1b85d4c961	Minor refactor to make VP writing more efficient llvm-svn: 253994	2015-11-24 17:03:24 +00:00
Krzysztof Parzyszek	b8bb90b744	Add vector types for intrinsics Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 253992	2015-11-24 16:28:14 +00:00
Teresa Johnson	17626654fd	[ThinLTO] Fix FunctionImport alias checking and test Skip imports for weak_any aliases as well. Fix the test to check non-import of weak aliases and functions, and import of normal alias. llvm-svn: 253991	2015-11-24 16:10:43 +00:00
Sanjay Patel	a0d354541d	[x86] remove duplicate movq instruction defs (PR25554) We had duplicated definitions for the same hardware '[v]movq' instructions. For example with SSE: def MOVZQI2PQIrr : RS2I<0x6E, MRMSrcReg, (outs VR128:$dst), (ins GR64:$src), "mov{d\|q}\t{$src, $dst\|$dst, $src}", // X86-64 only [(set VR128:$dst, (v2i64 (X86vzmovl (v2i64 (scalar_to_vector GR64:$src)))))], IIC_SSE_MOVDQ>; def MOV64toPQIrr : RS2I<0x6E, MRMSrcReg, (outs VR128:$dst), (ins GR64:$src), "mov{d\|q}\t{$src, $dst\|$dst, $src}", [(set VR128:$dst, (v2i64 (scalar_to_vector GR64:$src)))], IIC_SSE_MOVDQ>, Sched<[WriteMove]>; As shown in the test case and PR25554: https://llvm.org/bugs/show_bug.cgi?id=25554 This causes us to miss reusing an operand because later passes don't know these 'movq' are the same instruction. This patch deletes one pair of these defs. Sadly, this won't fix the original test case in the bug report. Something else is still broken. Differential Revision: http://reviews.llvm.org/D14941 llvm-svn: 253988	2015-11-24 15:44:35 +00:00
Krzysztof Parzyszek	aa93575b7e	[Hexagon] Add missing include of <cctype> Lack thereof breaks Windows builds due to the use of std::isspace in HexagonInstrInfo.cpp. llvm-svn: 253987	2015-11-24 15:11:13 +00:00
Krzysztof Parzyszek	b9a1c3a32c	[Hexagon] Bring HexagonInstrInfo up to date llvm-svn: 253986	2015-11-24 14:55:26 +00:00
Krzysztof Parzyszek	d4b566d50b	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253978	2015-11-24 13:07:35 +00:00
Matt Arsenault	ff05da806c	AMDGPU: Split LDS vector loads If properly aligned this could allow using ds_read_b64. llvm-svn: 253975	2015-11-24 12:18:54 +00:00
Matt Arsenault	4d801cd357	AMDGPU: Split x8 and x16 vector loads instead of scalarize The one regression in the builtin tests is in the read2 test which now (again) has many extra copies, but this should be solved once the pass is replaced with a DAG combine. llvm-svn: 253974	2015-11-24 12:05:03 +00:00
Ismail Donmez	65487e2d7e	Fix build after r253954 llvm-svn: 253969	2015-11-24 09:48:09 +00:00
Cong Hou	1938f2eb98	Let SelectionDAG start to use probability-based interface to add successors. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes. 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights. 3. Use new interfaces in all other passes. 4. Remove old interfaces. This the second patch above. In this patch SelectionDAG starts to use probability-based interfaces in MBB to add successors but other MC passes are still using weight-based interfaces. Therefore, we need to maintain correct weight list in MBB even when probability-based interfaces are used. This is done by updating weight list in probability-based interfaces by treating the numerator of probabilities as weights. This change affects many test cases that check successor weight values. I will update those test cases once this patch looks good to you. Differential revision: http://reviews.llvm.org/D14361 llvm-svn: 253965	2015-11-24 08:51:23 +00:00
Mehdi Amini	42418aba58	Add a FunctionImporter helper to perform summary-based cross-module function importing Summary: This is a helper to perform cross-module import for ThinLTO. Right now it is importing naively every possible called functions. Reviewers: tejohnson Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14914 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253954	2015-11-24 06:07:49 +00:00
Cong Hou	bed60d35ed	[X86][SSE] Detect AVG pattern during instruction combine for SSE2/AVX2/AVX512BW. This patch detects the AVG pattern in vectorized code, which is simply c = (a + b + 1) / 2, where a, b, and c have the same type which are vectors of either unsigned i8 or unsigned i16. In the IR, i8/i16 will be promoted to i32 before any arithmetic operations. The following IR shows such an example: %1 = zext <N x i8> %a to <N x i32> %2 = zext <N x i8> %b to <N x i32> %3 = add nuw nsw <N x i32> %1, <i32 1 x N> %4 = add nuw nsw <N x i32> %3, %2 %5 = lshr <N x i32> %N, <i32 1 x N> %6 = trunc <N x i32> %5 to <N x i8> and with this patch it will be converted to a X86ISD::AVG instruction. The pattern recognition is done when combining instructions just before type legalization during instruction selection. We do it here because after type legalization, it is much more difficult to do pattern recognition based on many instructions that are doing type conversions. Therefore, for target-specific instructions (like X86ISD::AVG), we need to take care of type legalization by ourselves. However, as X86ISD::AVG behaves similarly to ISD::ADD, I am wondering if there is a way to legalize operands and result types of X86ISD::AVG together with ISD::ADD. It seems that the current design doesn't support this idea. Tests are added for SSE2, AVX2, and AVX512BW and both i8 and i16 types of variant vector sizes. Differential revision: http://reviews.llvm.org/D14761 llvm-svn: 253952	2015-11-24 05:44:19 +00:00
Davide Italiano	c304a0ddc1	[DIE] Make DIE.h NDEBUG conditional-free. Switch dump()/print() method definitions to LLVM_DUMP_METHOD instead. llvm-svn: 253945	2015-11-24 02:21:43 +00:00
Sanjoy Das	5abfbb9246	[RuntimeDyld] Avoid unused-private-field warning; NFC Fixes the no asserts -Werror,-Wunused-private-field build. llvm-svn: 253933	2015-11-23 22:59:36 +00:00
Dan Gohman	192dddc595	[WebAssembly] Don't print the types of memory_size and grow_memory This matches the current spec, for now. llvm-svn: 253931	2015-11-23 22:37:29 +00:00
Xinliang David Li	c667683d2e	[PGO] In llvm-profdata text dump, add comment lines as annotations llvm-svn: 253930	2015-11-23 22:31:22 +00:00
Krzysztof Parzyszek	d5d083ccd4	Revert r253923. Per Eric's request. llvm-svn: 253928	2015-11-23 22:19:57 +00:00
Andy Ayers	9f7501896e	findDeadCallerSavedReg needs to pay attention to calling convention Caller saved regs differ between SysV and Win64. Use the tail call available set to scavenge from. Refactor register info to create new helper to get at tail call GPRs. Added a new test case for windows. Fixed up a number of X64 tests since now RCX is preferred over RDX on SysV. Differential Revision: http://reviews.llvm.org/D14878 llvm-svn: 253927	2015-11-23 22:17:44 +00:00
Dan Gohman	2f16f25391	[WebAssembly] Don't special-case call operand order. With the '=' suffix now indicating which operands are output operands, it's no longer as important to distinguish between a call's inputs and its outputs using operand ordering, so we can go back to printing them in the normal order. llvm-svn: 253925	2015-11-23 22:04:06 +00:00
Krzysztof Parzyszek	f358bfff17	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253923	2015-11-23 22:00:17 +00:00
Dan Gohman	700515fa92	[WebAssembly] Suffix output operands with '='. This distinguishes input operands from output operands. This is something of a syntactic experiment to see whether the mild amount of clutter this adds is outweighed by the extra information it conveys to the reader. llvm-svn: 253922	2015-11-23 21:55:57 +00:00
Sanjoy Das	d5658b0896	[RuntimeDyld] Don't allocate unnecessary stub buffer space Summary: For relocation types that are known to not require stub functions, there is no need to allocate extra space for the stub functions. Reviewers: lhames, reames, maksfb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14676 llvm-svn: 253920	2015-11-23 21:47:51 +00:00
Sanjoy Das	8082592ac9	[RuntimeDyld] Add bounds checking to SectionEntry::advanceStubOffset Summary: Change SectionEntry to keep track of the size of its underlying allocation, and use that to bounds check advanceStubOffset. Reviewers: lhames, andrew.w.kaylor, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14675 llvm-svn: 253919	2015-11-23 21:47:46 +00:00
Sanjoy Das	277776a520	[RuntimeDyld] Add accessors to `SectionEntry`; NFC Summary: Remove naked access to the data members in `SectionEntry` and route accesses through accessor functions. This makes it obvious how the instances of the class are used, and will also facilitate adding bounds checking to `advanceStubOffset` in a later change. Reviewers: lhames, loladiro, andrew.w.kaylor Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14674 llvm-svn: 253918	2015-11-23 21:47:41 +00:00
Dan Gohman	7054ac1b8b	[WebAssembly] Model the return value of store instructions in wasm. llvm-svn: 253916	2015-11-23 21:16:35 +00:00
Chad Rosier	a15b4b6af2	[LIR] Put includes in correct order. NFC. llvm-svn: 253915	2015-11-23 21:09:13 +00:00
Xinliang David Li	6f7c19a494	[PGO] Add --text option for llvm-profdata show\|merge commands The new option is similar to the SampleProfile dump option. - dump raw/indexed format into text profile format - merge the profile and output into text profile format. Note that Value Profiling data text format is not yet designed. That functionality will be added later. Differential Revision: http://reviews.llvm.org/D14894 llvm-svn: 253913	2015-11-23 20:47:38 +00:00
Diego Novillo	243ea6a7d6	SamplePGO - Add coverage tracking for samples. The existing coverage tracker counts the number of records that were used from the input profile. An alternative view of coverage is to check how many available samples were applied. This way, if the profile contains several records with few samples, it doesn't really matter much that they were not applied. The more interesting records to apply are the ones that contribute many samples. llvm-svn: 253912	2015-11-23 20:12:21 +00:00
Andrew Kaylor	0615a0e65d	[WinEH] Fix a case where GVN could incorrectly PRE a load into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253908	2015-11-23 19:51:41 +00:00
Dan Gohman	aa0a4bd05b	[WebAssembly] Don't use set_local instructions explicitly. The current approach to using get_local and set_local is to use them implicitly, as register uses and defs. Introduce new copy instructions which are themselves no-ops except for the get_local and set_local that they imply, so that we use get_local and set_local consistently. llvm-svn: 253905	2015-11-23 19:30:43 +00:00
Teresa Johnson	6b92316811	[ThinLTO] Deduplicate function index loading into shared helper (NFC) Add a shared helper routine to read the function index from a file and create/return the function index object. Use it in llvm-link and llvm-lto. llvm-svn: 253903	2015-11-23 19:19:11 +00:00
Andrew Kaylor	d0430e8580	[WinEH] Fix problem where CodeGenPrepare incorrectly sinks a bitcast into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253902	2015-11-23 19:16:15 +00:00
Dan Gohman	f6857223c9	[WebAssembly] Always print loop end labels WebAssembly is currently using labels to end scopes, so for example a loop scope looks like this: BB0_0: loop BB0_1 ... BB0_1: with BB0_0 being the label of the first block not in the loop. This requires that the label be printed even when it's only reachable via fallthrough. To arrange this, insert a no-op LOOP_END instruction in such cases at the end of the loop. llvm-svn: 253901	2015-11-23 19:12:37 +00:00
Xinliang David Li	c7c1f8581a	[PGO] Introduce alignment macro for instr-prof control data(NFC) llvm-svn: 253893	2015-11-23 18:02:59 +00:00
Dan Gohman	e425c32224	[WebAssembly] Remove incomplete MCCodeEmitter bits. These are parts of a separate patch that I accidentally included in r253878. llvm-svn: 253892	2015-11-23 18:00:04 +00:00
Paul Robinson	af19bc3a9c	Add Windows error code and tidy formatting for system errors. Differential Revision: http://reviews.llvm.org/D14892 llvm-svn: 253888	2015-11-23 17:34:20 +00:00
Dan Gohman	53828fd777	[WebAssembly] Emit .param, .result, and .local through MC. This eliminates one of the main remaining uses of EmitRawText. llvm-svn: 253878	2015-11-23 16:50:18 +00:00
Diego Novillo	1ca881c4bb	SamplePGO - Clear coverage tracking when clearing per-function data. llvm-svn: 253877	2015-11-23 16:30:17 +00:00
Dan Gohman	3280793234	[WebAssembly] Use dominator information to improve BLOCK placement Always starting blocks at the top of their containing loops works, but creates unnecessarily deep nesting because it makes all blocks in a loop overlap. Refine the BLOCK placement algorithm to start blocks at nearest common dominating points instead, which significantly shrinks them and reduces overlapping. llvm-svn: 253876	2015-11-23 16:19:56 +00:00
Daniel Sanders	2b561336d9	[mips] .ent and .end should also set the type and size of the symbol respectively. Reviewers: vkalintiris Subscribers: llvm-commits, seanbruno, emaste, vkalintiris, dsanders Differential Revision: http://reviews.llvm.org/D14221 llvm-svn: 253875	2015-11-23 16:08:03 +00:00
Diego Novillo	39ab68f39b	SamplePGO - Use newly introduced local variable. NFC. llvm-svn: 253868	2015-11-23 15:24:13 +00:00
Krzysztof Parzyszek	29d23f9f4c	[Hexagon] Update instruction formats llvm-svn: 253867	2015-11-23 14:09:26 +00:00
Martell Malone	a6b867eb0d	ARM: address WoA division overflow crash Disable custom handling of signed 32-bit and 64-bit integer divide. Add test cases for both 32-bit and 64-bit integer overflow crashes. llvm-svn: 253865	2015-11-23 13:11:39 +00:00
Craig Topper	2241dfd2dc	[Mips] Remove an unnecessary wrapping of a predicate with std::ptr_fun. NFC llvm-svn: 253855	2015-11-23 07:19:06 +00:00
Davide Italiano	6f93df8105	[Analysis/CallGraph] Switch dump() definitions over to LLVM_DUMP_METHOD. llvm-svn: 253842	2015-11-23 02:58:42 +00:00
Davide Italiano	945d05f6a0	[LoopStrengthReduce] Mark dump() definitions as LLVM_DUMP_METHOD. llvm-svn: 253841	2015-11-23 02:47:30 +00:00
Mehdi Amini	8220e8a830	Add const qualifier for FunctionInfoIndex in ModuleLinker and linkInModule() (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253840	2015-11-23 01:59:16 +00:00
Sanjoy Das	0194743fad	[SCEV] Use C++11'isms llvm-svn: 253837	2015-11-22 21:20:13 +00:00
Benjamin Kramer	0969a2a74c	[MDBuilder] Simplify code using initializer lists. NFC. llvm-svn: 253826	2015-11-22 18:03:17 +00:00
Simon Pilgrim	1dfe53e180	Remove duplicate getValueType() calls. NFCI. llvm-svn: 253823	2015-11-22 16:49:38 +00:00
Krzysztof Parzyszek	6753f33388	Avoid dependency between TableGen and CodeGen Duplicate a few common definitions between DFAPacketizer.cpp and DFAPacketizerEmitter.cpp to avoid including files from CodeGen in TableGen. llvm-svn: 253820	2015-11-22 15:20:19 +00:00
Elena Demikhovsky	0fd11526e2	AVX-512: Optimized INSERT_SUBVECTOR for i1 vector types ISERT_SUBVECTOR for i1 vectors may be done with shifts, when we insert into the lower part, or into the upper part, on into all-zero vector. CONCAT_VECTORS uses ISERT_SUBVECTOR. Differential Revision: http://reviews.llvm.org/D14815 llvm-svn: 253819	2015-11-22 13:57:38 +00:00
Xinliang David Li	924e05843d	[PGO] move names of runtime sections definitions to InstrProfData.inc In profile runtime implementation for Darwin, Linux and FreeBSD, the names of sections holding profile control/counter/naming data need to be known by the runtime in order to locate the start/end of the data. Moving the name definitions to the common file to specify the connection. llvm-svn: 253814	2015-11-22 05:42:31 +00:00
Xinliang David Li	c76732396b	[PGO] Define value profiling updater API signature in InstrProfData.inc (NFC) llvm-svn: 253805	2015-11-22 00:22:07 +00:00
Rafael Espindola	d1beb07d39	Have a single way for creating unique value names. We had two code paths. One would create names like "foo.1" and the other names like "foo1". For globals it is important to use "foo.1" to help C++ name demangling. For locals there is no strong reason to go one way or the other so I kept the most common mangling (foo1). llvm-svn: 253804	2015-11-22 00:16:24 +00:00
Sanjay Patel	8066d906f1	fix formatting; NFC llvm-svn: 253802	2015-11-22 00:03:16 +00:00
Sanjoy Das	b37c4c414b	[SCEVExpander] Use C++isms; NFC llvm-svn: 253801	2015-11-21 23:20:10 +00:00
Teresa Johnson	6290dbc0f7	[ThinLTO] Handle bitcode without function summary sections gracefully Summary: Several fixes to the handling of bitcode files without function summary sections so that they are skipped during ThinLTO processing in llvm-lto and the gold plugin when appropriate instead of aborting. 1 Don't assert when trying to add a FunctionInfo that doesn't have a summary attached. 2 Skip FunctionInfo structures that don't have attached function summary sections when trying to create the combined function summary. 3 In both llvm-lto and gold-plugin, check whether a bitcode file has a function summary section before trying to parse the index, and skip the bitcode file if it does not. 4 Fix hasFunctionSummaryInMemBuffer in BitcodeReader, which had a bug where we returned to early while looking for the summary section. Also added llvm-lto and gold-plugin based tests for cases where we don't have function summaries in the bitcode file. I verified that either the first couple fixes described above are enough to avoid the crashes, or fixes 1,3,4. But have combined them all here for added robustness. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14903 llvm-svn: 253796	2015-11-21 21:55:48 +00:00
Krzysztof Parzyszek	b46557292c	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource \| combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. Reapply the previous patch, this time without circular dependencies. llvm-svn: 253793	2015-11-21 20:00:45 +00:00
Craig Topper	a5ea5289ff	Use modulo operator instead of multiplying result of a divide and subtracting from the original dividend. NFC. llvm-svn: 253792	2015-11-21 17:44:42 +00:00
Krzysztof Parzyszek	4ca21fc1aa	Revert r253790: it breaks all builds for some reason. llvm-svn: 253791	2015-11-21 17:38:33 +00:00
Krzysztof Parzyszek	220a9bc018	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource \| combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. llvm-svn: 253790	2015-11-21 17:23:52 +00:00
Sanjay Patel	04df583a42	use ternary ops; NFC llvm-svn: 253787	2015-11-21 16:51:19 +00:00
Sanjay Patel	1f3fa2133a	remove unnecessary temp variables; NFC llvm-svn: 253786	2015-11-21 16:37:09 +00:00
Sanjay Patel	5a7bdc9632	fix typo; NFC llvm-svn: 253785	2015-11-21 16:16:29 +00:00
Jonas Paulsson	8f0d2b7f1f	[DAGCombiner] Bugfix for lost chain depenedency. When MergeConsecutiveStores() combines two loads and two stores into wider loads and stores, the chain users of both of the original loads must be transfered to the new load, because it may be that a chain user only depends on one of the loads. New test case: test/CodeGen/SystemZ/dag-combine-01.ll Reviewed by James Y Knight. Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=25310#c6 llvm-svn: 253779	2015-11-21 13:25:07 +00:00
Simon Pilgrim	d5a154424b	[X86][AVX512] Added AVX512 VMOVLHPS/VMOVHLPS shuffle decode comments. llvm-svn: 253777	2015-11-21 13:04:42 +00:00
Simon Pilgrim	96cbce61b2	[X86][SSE] Legal XMM Register Class ordering for SSE1 It turns out we have a number of places that just grab the first type attached to a register class for various reasons. This is fine unless for some reason that type isn't legal on the current target, such as for SSE1 which doesn't support v16i8/v8i16/v4i32/v2i64 - all of which were included before 4f32 in the class. Given that this is such a rare situation I've just re-ordered the types and placed the float types first. Fix for PR16133 Differential Revision: http://reviews.llvm.org/D14787 llvm-svn: 253773	2015-11-21 12:38:34 +00:00
Weiming Zhao	8d5c08f591	[SimplifyLibCalls] Removed some TODOs which are already implemented. NFC. Summary: D14302 implements tan(atan(x)) -> x D14045 implements pow(exp(x), y) -> exp(x*y) Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide Differential Revision: http://reviews.llvm.org/D14882 llvm-svn: 253768	2015-11-21 06:10:20 +00:00
Teresa Johnson	16e2a9eeb6	Move new assert to correct location This assert was meant to execute at the end of parseMetadata, but we return early and never reach the end of the function. Caught by a compile-time warning since the function doesn't return a value from that location. llvm-svn: 253762	2015-11-21 03:51:23 +00:00
Kostya Serebryany	b569368a5a	[libFuzzer] don't crash when reporting a leak in test_single_input mode llvm-svn: 253761	2015-11-21 03:46:43 +00:00
Matthias Braun	5a1857b6eb	ARMLoadStoreOptimizer: Cleanup isMemoryOp(); NFC llvm-svn: 253757	2015-11-21 02:09:49 +00:00
Vinicius Tinti	67cf33d9ab	Test commit llvm-svn: 253737	2015-11-20 23:20:12 +00:00
Rong Xu	a1f61fe841	Add some constantness to GetSuccessorNumber(). llvm-svn: 253733	2015-11-20 23:02:06 +00:00
Eric Christopher	25bf4a8617	Power8 and later support fusing addis/addi and addis/ld instruction pairs that use the same register to execute as a single instruction. No Functional Change Patch by Kyle Butt! llvm-svn: 253724	2015-11-20 22:38:20 +00:00
Owen Anderson	8e85130bb9	Fix another infinite loop in Reassociate caused by Constant::isZero(). Not all zero vectors are ConstantDataVector's. llvm-svn: 253723	2015-11-20 22:34:48 +00:00
Geoff Berry	5256fcada0	[CodeGenPrepare] Create more extloads and fewer ands Summary: Add and instructions immediately after loads that only have their low bits used, assuming that the (and (load x) c) will be matched as a extload and the ands/truncs fed by the extload will be removed by isel. Reviewers: mcrosier, qcolombet, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14584 llvm-svn: 253722	2015-11-20 22:34:39 +00:00
Arnaud A. de Grandmaison	4e89e9f846	[ShrinkWrap] Teach ShrinkWrap to handle targets requiring a register scavenger. The included test only checks for a compiler crash for now. Several people are facing this issue, so we first resolve the crash, and will increase shrinkwrap's coverage later in a follow-up patch. llvm-svn: 253718	2015-11-20 21:54:27 +00:00
Diego Novillo	5fb49e5c5f	SamplePGO - Do not count never-executed inlined functions when computing coverage. If a function was originally inlined but not actually hot at runtime, its samples will not be counted inside the parent function. This throws off the coverage calculation because it expects to find more used records than it should. Fixed by ignoring functions that will not be inlined into the parent. Currently, this is inlined functions with 0 samples. In subsequent patches, I'll change this to mean "cold" functions. llvm-svn: 253716	2015-11-20 21:46:38 +00:00
Jun Bum Lim	80ec0d3f5a	[AArch64]Merge narrow zero stores to a wider store This change merges adjacent zero stores into a wider single store. For example : strh wzr, [x0] strh wzr, [x0, #2] becomes str wzr, [x0] This will fix PR25410. llvm-svn: 253711	2015-11-20 21:14:07 +00:00
Eric Christopher	c180836722	Weak non-function symbols were being accessed directly, which is incorrect, as the chosen representative of the weak symbol may not live with the code in question. Always indirect the access through the TOC instead. Patch by Kyle Butt! llvm-svn: 253708	2015-11-20 20:51:31 +00:00
Krzysztof Parzyszek	6c5ca95814	[Hexagon] Fix the return value from HexagonGenInsert::runOnMachineFunction llvm-svn: 253705	2015-11-20 20:46:23 +00:00
Reid Kleckner	437b1b3ea5	Fix the Windows build, include <tuple> for std::tie llvm-svn: 253698	2015-11-20 19:29:40 +00:00
Tilmann Scheller	925b193eed	Revert "[FunctionAttrs] Remove redundant assignment." This reverts r253661. Turns out that the assignment is not redundant (despite the Clang static analyzer claiming the opposite). The variable is being used by the lambda function AddUsersToWorklistIfCapturing(). llvm-svn: 253696	2015-11-20 19:17:10 +00:00
Nathan Slingerland	a731829788	[llvm-profdata] Add merge() to InstrProfRecord Summary: This change refactors two aspects of InstrProfRecord: 1) Add a merge() method to InstrProfRecord (previously InstrProfWriter combineInstrProfRecords()) in order to better encapsulate this functionality and to make the InstrProfRecord and SampleRecord APIs more consistent. 2) Make InstrProfRecord mergeValueProfData() a private method since it is only ever called internally by merge(). Reviewers: dnovillo, bogner, davidxl Subscribers: silvas, vsk, llvm-commits Differential Revision: http://reviews.llvm.org/D14786 llvm-svn: 253695	2015-11-20 19:12:43 +00:00
Artyom Skrobov	7f0fc9ccb7	Avoid duplicate entry for cortex-a7 in the TargetParser (NFC) Reviewers: t.p.northover, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14757 llvm-svn: 253676	2015-11-20 16:46:14 +00:00
Artyom Skrobov	91f339ab3f	Handle ARMv6-J as an alias, instead of fake architecture Summary: This follows D14577 to treat ARMv6-J as an alias for ARMv6, instead of an architecture in its own right. The functional change is that the default CPU when targeting ARMv6-J changes from arm1136j-s to arm1136jf-s, which is currently used as the default CPU for ARMv6; both are, in fact, ARMv6-J CPUs. The J-bit (Jazelle support) is irrelevant to LLVM, and it doesn't affect code generation, attributes, optimizations, or anything else, apart from selecting the default CPU. Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14755 llvm-svn: 253675	2015-11-20 16:46:09 +00:00
Diego Novillo	df544a098a	SamplePGO - Add line offset and discriminator information to sample reports. While debugging some sampling coverage problems, I found this useful: When applying samples from a profile, it helps to also know what line offset and discriminator the sample belongs to. This makes it easy to correlate against the input profile. llvm-svn: 253670	2015-11-20 15:39:42 +00:00
Teresa Johnson	d4d3dfd8ef	[ThinLTO] Add MODULE_CODE_METADATA_VALUES record Summary: This is split out from the ThinLTO metadata mapping patch http://reviews.llvm.org/D14752. To avoid needing to parse the module level metadata during function importing, a new module-level record is added which holds the number of module-level metadata values. This is required because metadata value ids are assigned implicitly during parsing, and the function-level metadata ids start after the module-level metadata ids. I made a change to this version of the code compared to D14752 in order to add more consistent and thorough assertion checking of the new record value. We now unconditionally use the record value to initialize the MDValueList size, and handle it the same in parseMetadata for all module level metadata cases (lazy loading or not). Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14825 llvm-svn: 253668	2015-11-20 14:51:27 +00:00
Tilmann Scheller	4cd1d51a4d	[Hexagon] Remove redundant assignment. Identified by the Clang static analyzer. llvm-svn: 253664	2015-11-20 13:27:30 +00:00
Daniel Sanders	b700203c8b	Partially revert r253662: some unrelated work was accidentally committed with it. Sorry. llvm-svn: 253663	2015-11-20 13:16:35 +00:00
Daniel Sanders	be9db3c00a	Revert the revert 253497 and 253539 - These commits aren't the cause of the clang-cmake-mips failures. Sorry for the noise. llvm-svn: 253662	2015-11-20 13:13:53 +00:00
Tilmann Scheller	1e929f97f6	[FunctionAttrs] Remove redundant assignment. Identified by the Clang static analyzer. llvm-svn: 253661	2015-11-20 12:51:58 +00:00
Tilmann Scheller	bfd7ce01ea	[Hexagon] Remove redundant local variable. Identified by the Clang static analyzer. llvm-svn: 253660	2015-11-20 12:10:17 +00:00
Owen Anderson	630077ef55	Fix a pair of issues that caused an infinite loop in reassociate. Terrifyingly, one of them is a mishandling of floating point vectors in Constant::isZero(). How exactly this issue survived this long is beyond me. llvm-svn: 253655	2015-11-20 08:16:13 +00:00
Craig Topper	e325e3806f	Use range-based for loops. NFC llvm-svn: 253652	2015-11-20 07:18:48 +00:00
Hrvoje Varga	b65518c15c	[mips][microMIPS] Implement MUL[_S].PH, MULEQ_S.W.PHL, MULEQ_S.W.PHR, MULEU_S.PH.QBL, MULEU_S.PH.QBR, MULQ_RS.PH, MULQ_RS.W, MULQ_S.PH and MULQ_S.W instructions Differential Revision: http://reviews.llvm.org/D14280 llvm-svn: 253651	2015-11-20 07:14:52 +00:00
Dan Gohman	d9625276a7	[WebAssembly] Remove the AsmPrinter code for printing physical registers. WebAssembly does not have physical registers, so even if LLVM uses physical registers like SP, they'll need to be lowered to virtual registers before AsmPrinter time. llvm-svn: 253644	2015-11-20 03:13:31 +00:00
Dan Gohman	dfa81d8e22	[WebAssembly] Add a few open tasks to the target README.txt. llvm-svn: 253643	2015-11-20 03:08:27 +00:00
Dan Gohman	bb7ce8e408	[WebAssembly] Rename SWITCH to TABLESWITCH to match the current wording in the spec. llvm-svn: 253642	2015-11-20 03:02:49 +00:00
Dan Gohman	2dfc3b8be5	[WebAssembly] Remove done items from the README.txt. llvm-svn: 253640	2015-11-20 02:51:38 +00:00
Dan Gohman	7bafa0eaef	[WebAssembly] Add asserts that the expression stack is used in stack order. llvm-svn: 253638	2015-11-20 02:33:24 +00:00
Dan Gohman	b0992dafb3	[WebAssemby] Enforce FIFO ordering for instructions using stackified registers. llvm-svn: 253634	2015-11-20 02:19:12 +00:00
Peter Collingbourne	c85f4ced4d	ScalarEvolution: do not set nuw when creating exprs of form <expr> + <all-ones>. The nuw constraint will not be satisfied unless <expr> == 0. This bug has been around since r102234 (in 2010!), but was uncovered by r251052, which introduced more aggressive optimization of nuw scev expressions. Differential Revision: http://reviews.llvm.org/D14850 llvm-svn: 253627	2015-11-20 01:26:13 +00:00
Eric Christopher	eb027124af	Split the argument unscheduling loop in the WebAssembly register coloring pass. Turn the logic into "look for an insert point and then move things past the insert point". No functional change intended. llvm-svn: 253626	2015-11-20 00:34:54 +00:00
Tobias Edler von Koch	4d45090659	[LTO] Add option to emit assembly from LTOCodeGenerator This adds a new API, LTOCodeGenerator::setFileType, to choose the output file format for LTO CodeGen. A corresponding change to use this new API from llvm-lto and a test case is coming in a separate commit. Differential Revision: http://reviews.llvm.org/D14554 llvm-svn: 253622	2015-11-19 23:59:24 +00:00
Eric Christopher	8c3dbcab1d	Fix a [-Werror,-Wcovered-switch-default] warning by removing the unnecessary default case. llvm-svn: 253621	2015-11-19 23:45:42 +00:00
Reid Kleckner	cc2f6c35a3	[WinEH] Disable most forms of demotion Now that the register allocator knows about the barriers on funclet entry and exit, testing has shown that this is unnecessary. We still demote PHIs on unsplittable blocks due to the differences between the IR CFG and the Machine CFG. llvm-svn: 253619	2015-11-19 23:23:33 +00:00
Dan Gohman	3192ddfeba	[WebAssembly] Implement isCheapToSpeculateCtlz and isCheapToSpeculateCttz. This unbreaks test/CodeGen/WebAssembly/i32.ll and test/CodeGen/WebAssembly/i64.ll after r224899. llvm-svn: 253617	2015-11-19 23:04:59 +00:00
Diego Novillo	379cc5e71b	SamplePGO - Tweak debugging output for function samples. NFC. llvm-svn: 253612	2015-11-19 22:18:30 +00:00
Simon Pilgrim	a9912617c8	[X86][SSE4A] Fix issue with EXTRQI shuffles not starting at the correct start index. Found during stress testing. llvm-svn: 253611	2015-11-19 22:13:56 +00:00
Reid Kleckner	ebee6129cd	Fix UMRs in Mips disassembler on invalid instruction streams The Insn and Size local variables were used without initialization. llvm-svn: 253607	2015-11-19 21:51:55 +00:00
Simon Pilgrim	ae0140d6ec	[X86] Use existing MachineInstrBuilder::addDisp to create offseted pointer. NFC. Minor code duplication tidyup to D13988 llvm-svn: 253606	2015-11-19 21:50:57 +00:00
Davide Italiano	c807f487f7	Follow up to r253591. Turn into an assertion. Reported by: David Blaikie. llvm-svn: 253605	2015-11-19 21:50:08 +00:00
Chad Rosier	1cd3da15e8	[LIR] Update some comments. NFC. llvm-svn: 253603	2015-11-19 21:33:07 +00:00

... 3 4 5 6 7 ...

85130 Commits