llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	9760f03757	AMDGPU/SI: Emit constant arrays in the .hsrodata_readonly_agent section Summary: This is done only when targeting HSA. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D13807 llvm-svn: 254587	2015-12-03 03:34:32 +00:00
Matthias Braun	2fd672a221	Revert "ScheduleDAGInstrs: Rework schedule graph builder." This works mostly fine but breaks some stage 1 builders when compiling compiler-rt on i386. Revert for further investigation as I can't see an obvious cause/fix. This reverts commit r254577. llvm-svn: 254586	2015-12-03 03:01:10 +00:00
Mehdi Amini	311fef6ea5	clang-format FunctionImport after refactoring (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254585	2015-12-03 02:58:14 +00:00
Mehdi Amini	c8c551701e	Refactor FunctionImporter::importFunctions with a helper function to process the Worklist (NFC) This precludes some more functional changes to perform bulk imports. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254583	2015-12-03 02:37:33 +00:00
Mehdi Amini	7471cf81b0	Adapt comment and rename variable in ModuleLinker to describe more accurately the actual use. Thanks Sean Silva for the suggestion. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254582	2015-12-03 02:37:30 +00:00
Mehdi Amini	9abe1089c7	Remove "ExportingModule" from ThinLTO Index (NFC) There is no real reason the index has to have the concept of an exporting Module. We should be able to have one single unique instance of the Index, and it should be read-only after creation for the whole ThinLTO processing. The linker plugin should be able to process multiple modules (in parallel or in sequence) with the same index. The only reason the ExportingModule was present seems to be to implement hasExportedFunctions() that is used by the Module linker to decide what to do with the current Module. For now I replaced it with a query to the map of Modules path to see if this module was declared in the Index and consider that if it is the case then it is probably exporting function. On the long term the Linker interface needs to evolve and this call should not be needed anymore. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254581	2015-12-03 02:37:23 +00:00
Joerg Sonnenberger	48eb197434	Add a TODO item that the nop handling before FP conditional branches is not enough for SPARCv7. llvm-svn: 254580	2015-12-03 02:35:24 +00:00
Matthias Braun	d35fe3d984	ScheduleDAGInstrs: Rework schedule graph builder. The new algorithm remembers the uses encountered while walking backwards until a matching def is found. Contrary to the previous version this: - Works without LiveIntervals being available - Allows to increase the precision to subregisters/lanemasks (not used for now) The changes in the AMDGPU tests are necessary because the R600 scheduler is not stable with respect to the order of nodes in the ready queues. Differential Revision: http://reviews.llvm.org/D9068 llvm-svn: 254577	2015-12-03 02:05:27 +00:00
Matthias Braun	b0083608b4	RegisterPressure: Use range based for, fix else style; NFC llvm-svn: 254575	2015-12-03 01:44:45 +00:00
Justin Bogner	72e81895da	MC: Make sure to clear all of MCMachOStreamer's state The CreatedADWARFSection flag was added in r232842, but isn't cleared properly when resetting the streamer's state. Fix that. llvm-svn: 254571	2015-12-03 00:52:20 +00:00
Derek Schuff	5268aaf7b6	[WebAssembly] Add a test for wasm-store-results pass Differential Revision: http://reviews.llvm.org/D15167 llvm-svn: 254570	2015-12-03 00:50:30 +00:00
Dan Gohman	ac132e9305	[WebAssembly] Assert that byval and nest are not used for return types. llvm-svn: 254567	2015-12-02 23:40:03 +00:00
David Majnemer	6f4583c511	Forgot to add this file with r254562. llvm-svn: 254565	2015-12-02 23:09:05 +00:00
Krzysztof Parzyszek	8d8b229de9	[Hexagon] Improve lowering of instructions to the MC layer - Add extenders when necessary. - Handle some basic relocations. This should fix the failure in tools/clang/test/CodeGenCXX/crash.cpp llvm-svn: 254564	2015-12-02 23:08:29 +00:00
David Majnemer	70497c696a	Move EH-specific helper functions to a more appropriate place No functionality change is intended. llvm-svn: 254562	2015-12-02 23:06:39 +00:00
Alexey Samsonov	44ff204fad	Fixup for r254547: use format_hex() to simplify code. llvm-svn: 254560	2015-12-02 22:59:22 +00:00
Rafael Espindola	4b5ec26373	Switch the linker to having a whitelist of GVs. This replaces DoNotLinkFromSource with ValuesToLink. It also moves the computation of ValuesToLink earlier. It is a bit simpler and an important step in slitting the linker into an ir mover and a linker proper. The test change is because we now avoid creating dead declarations. llvm-svn: 254559	2015-12-02 22:59:04 +00:00
Mike Aizatsky	71552ce64b	Libfuzzer: do not pass null into user function Differential Revision: http://reviews.llvm.org/D15098 llvm-svn: 254558	2015-12-02 22:43:53 +00:00
Reid Kleckner	1f11b4e3a7	Use std::string instead of strdup() and free() in WinCodeViewLineTables llvm-svn: 254557	2015-12-02 22:34:30 +00:00
Rafael Espindola	8c04472edf	Delete what is now duplicated code. Having to import an alias as declaration is not thinlto specific. The test difference are because when we already have a decl and we are not importing it, we just leave the decl alone. llvm-svn: 254556	2015-12-02 22:22:24 +00:00
Cong Hou	1a6b5a9e4f	Fix a typo in LoopVectorize.cpp. NFC. llvm-svn: 254549	2015-12-02 21:33:47 +00:00
Alexey Samsonov	39b7d65d82	[PowerPC] Remove wild call to RegScavenger::initRegState(). This call should in fact be made by RegScavenger::enterBasicBlock() called below. The first call does nothing except for triggering UB, indicated by UBSan (passing nullptr to memset()). llvm-svn: 254548	2015-12-02 21:25:28 +00:00
Alexey Samsonov	bcfabaa05b	[Hexagon] Remove std::hex in favor of format(). std::hex is not used anywhere in LLVM code base except for this place, and it has a known undefined behavior (at least in libstdc++ 4.9.3): https://llvm.org/bugs/show_bug.cgi?id=18156, which fires in UBSan bootstrap of LLVM. llvm-svn: 254547	2015-12-02 21:13:43 +00:00
Rafael Espindola	0a80da0bec	Also copy private linkage globals when needed. This was an omission when handling COFF style comdats with local keys. Should fix the sanitizer-windows bot. llvm-svn: 254543	2015-12-02 20:57:33 +00:00
Rafael Espindola	769efe621a	Don't copy information from aliasee to alias. They are independent. llvm-svn: 254541	2015-12-02 20:03:17 +00:00
Tom Stellard	00f2f91af4	AMDGPU/SI: Correctly emit agent global segment variables when targeting HSA Differential Revision: http://reviews.llvm.org/D14508 llvm-svn: 254540	2015-12-02 19:47:57 +00:00
Krzysztof Parzyszek	de25ecfa62	[Hexagon] Remove TFRI_V4 instruction, use existing A2_tfrsi instead llvm-svn: 254539	2015-12-02 19:44:35 +00:00
Rafael Espindola	f3518c955b	Fix linking when we copy over only a decl. We were failing to copy the fact that the GV is weak and in the case of an alias, producing invalid IR. llvm-svn: 254538	2015-12-02 19:30:52 +00:00
Kyle Butt	cf6a8bfe51	[CodeGen]: Fix bad interaction with AntiDep breaking and inline asm. AggressiveAntiDepBreaker was renaming registers specified by the user for inline assembly. While this will work for compiler-specified registers, it won't work for user-specified registers, and at the time this runs, I don't currently see a way to distinguish them. llvm-svn: 254532	2015-12-02 18:58:51 +00:00
Kyle Butt	015f4fc854	Test Commit: iteratee Remove whitespace from blank lines. NFC llvm-svn: 254531	2015-12-02 18:53:33 +00:00
Fiona Glaser	1075f6323f	Fix accidental off by one change Didn't break any tests, but did unnecessary extra work. llvm-svn: 254529	2015-12-02 18:46:23 +00:00
Tom Stellard	e928533dae	AMDGPU: Fix msan test failure llvm-svn: 254527	2015-12-02 18:35:23 +00:00
Fiona Glaser	e25b06fa23	Scheduler / Regalloc: use unique_ptr[] instead of std::vector vector.resize() is significantly slower than memset in many STLs and the cost of initializing these vectors is significant on targets with many registers. Since we don't need the overhead of a vector, use a simple unique_ptr instead. llvm-svn: 254526	2015-12-02 18:32:59 +00:00
Nathan Slingerland	aa5702d92b	[llvm-profdata] Change instr prof counter overflow to saturate rather than discard Summary: This changes overflow handling during instrumentation profile merge. Rathar than throwing away records that would result in counter overflow, merged counts are instead clamped to the maximum representable value. A warning about counter overflow is still surfaced to the user as before. Reviewers: dnovillo, davidxl, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14893 llvm-svn: 254525	2015-12-02 18:19:24 +00:00
Tim Northover	f520eff782	AArch64: use ldxp/stxp pair to implement 128-bit atomic loads. The ARM ARM is clear that 128-bit loads are only guaranteed to have been atomic if there has been a corresponding successful stxp. It's less clear for AArch32, so I'm leaving that alone for now. llvm-svn: 254524	2015-12-02 18:12:57 +00:00
Dan Gohman	53d1399792	[WebAssembly] Fix comments to say "LIFO" instead of "FIFO" when describing a stack. llvm-svn: 254523	2015-12-02 18:08:49 +00:00
Tom Stellard	e3b5aeaf83	AMDGPU/SI: Don't emit group segment global variables Summary: Only global or readonly segment variables should appear in object files. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15111 llvm-svn: 254519	2015-12-02 17:00:42 +00:00
David Majnemer	942003acc6	Do (A == C1 \|\| A == C2) -> (A & ~(C1 ^ C2)) == C1 rather than (A == C1 \|\| A == C2) -> (A \| (C1 ^ C2)) == C2 when C1 ^ C2 is a power of 2. Differential Revision: http://reviews.llvm.org/D14223 Patch by Amaury SECHET! llvm-svn: 254518	2015-12-02 16:15:07 +00:00
Michael Zuckerman	15152a5c41	By intel spec \|9B DD /7\| FSTSW m2byte\| Valid Valid Store FPU status word at m2byteafter checking for pending unmasked floating-point exceptions.\| \|9B DF E0\| FSTSW AX\| Valid Valid Store FPU status word in AX register after checking for pending unmasked floating-point exceptions.\| \|DD /7 \|FNSTSW m2byte\| Valid Valid Store FPU status word at m2bytewithout checking for pending unmasked floating-point exceptions.\| \|DF E0 \|FNSTSW AX\| Valid Valid Store FPU status word in AX register without checking for pending unmasked floating-point exceptions\| m2byte is word register, and therefor instruction operand need to be change from f32mem to i16mem. Differential Revision: http://reviews.llvm.org/D14953 llvm-svn: 254512	2015-12-02 14:34:34 +00:00
Christof Douma	8b5dc2c94e	[AArch64]: Add support for Cortex-A35 Adds support for the new Cortex-A35 ARMv8-A core. llvm-svn: 254503	2015-12-02 11:53:44 +00:00
Nemanja Ivanovic	74e31bc929	Patch to fix a crash in the PowerPC back end due to ISD::ROTL and ISD::ROTR not being expanded. Test case included. llvm-svn: 254501	2015-12-02 10:36:24 +00:00
Hrvoje Varga	672b0f5582	[mips][microMIPS] Implement PREPEND, RADDU.W.QB, RDDSP, REPL.PH, REPL.QB, REPLV.PH, REPLV.QB and MTHLIP instructions Differential Revision: http://reviews.llvm.org/D14527 llvm-svn: 254496	2015-12-02 09:31:24 +00:00
Simon Pilgrim	3fc3454a0c	[X86][FMA] Optimize FNEG(FMUL) Patterns On FMA targets, we can avoid having to load a constant to negate a float/double multiply by instead using a FNMSUB (-(X*Y)-0) Fix for PR24366 Differential Revision: http://reviews.llvm.org/D14909 llvm-svn: 254495	2015-12-02 09:07:55 +00:00
Elena Demikhovsky	a1a40cce9f	AVX-512: Updated cost of FP/SINT/UINT conversion operations I checked and updated the cost of AVX-512 conversion operations. Added cost of conversion operations in DQ mode. Conversion of illegal types that requires vector split is not calculated right now (like for other X86 targets). Differential Revision: http://reviews.llvm.org/D15074 llvm-svn: 254494	2015-12-02 08:59:47 +00:00
Asaf Badouh	2489f350c0	[X86][AVX512] add comi with Sae add builtin_ia32_vcomisd and builtin_ia32_vcomisd Differential Revision: http://reviews.llvm.org/D14331 llvm-svn: 254493	2015-12-02 08:17:51 +00:00
David Blaikie	20f52662d4	[llvm-dwp] Don't rely on implicit move assignment operator (MSVC won't synthesize one) llvm-svn: 254492	2015-12-02 07:09:26 +00:00
Akira Hatanaka	237916b537	[AttributeSet] Overload AttributeSet::addAttribute to reduce compile time. The new overloaded function is used when an attribute is added to a large number of slots of an AttributeSet (for example, to function parameters). This is much faster than calling AttributeSet::addAttribute once per slot, because AttributeSet::getImpl (which calls FoldingSet::FIndNodeOrInsertPos) is called only once per function instead of once per slot. With this commit, clang compiles a file which used to take over 22 minutes in just 13 seconds. rdar://problem/23581000 Differential Revision: http://reviews.llvm.org/D15085 llvm-svn: 254491	2015-12-02 06:58:49 +00:00
Craig Topper	f419a1f69a	[X86] Change getZeroVector to take an MVT instead of EVT. One minor change needed to only try to perform 256-it shuffle combines on legal vector types. llvm-svn: 254490	2015-12-02 06:39:19 +00:00
David Blaikie	b073cb9be2	[llvm-dwp] Emit a rather fictional debug_cu_index This is very rudimentary support for debug_cu_index, but it is enough to allow llvm-dwarfdump to find the offsets for contributions and correctly dump debug_info. It will need to actually find the real signature of the unit and build the real hash table with the right number of buckets, as per the DWP specification. It will also need to be expanded to cover the tu_index as well. llvm-svn: 254489	2015-12-02 06:21:34 +00:00
Craig Topper	6164297f46	[X86] Fix weird identation. NFC llvm-svn: 254487	2015-12-02 05:24:38 +00:00
Mehdi Amini	ffe2e4aae0	Change ModuleLinker to take a set of GlobalValues to import instead of a single one For efficiency reason, when importing multiple functions for the same Module, we can avoid reparsing it every time. Differential Revision: http://reviews.llvm.org/D15102 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254486	2015-12-02 04:34:28 +00:00
Kostya Serebryany	fba04273b7	[libFuzzer] add a test that is built with -fsanitize-coverage=trace-bb llvm-svn: 254484	2015-12-02 02:49:37 +00:00
Kostya Serebryany	a3c5347764	[sanitizer coverage] when adding a bb trace instrumentation, do it instead, not in addition to, regular coverage. Do the regular coverage in the run-time instead llvm-svn: 254482	2015-12-02 02:37:13 +00:00
Quentin Colombet	bbdebefff6	[X86] Fix a think-o when checking if the eflags needs to be preserved. llvm-svn: 254480	2015-12-02 02:07:00 +00:00
Mehdi Amini	a11bdc8ef7	Modify FunctionImport to take a callback to load modules When linking static archive, there is no individual module files to load. Instead they can be mmap'ed and could be initialized from a buffer directly. The callback provide flexibility to override the scheme for loading module from the summary. Differential Revision: http://reviews.llvm.org/D15101 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254479	2015-12-02 02:00:29 +00:00
Quentin Colombet	f1e91c8bf1	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This is a superset of the fix done in r254448. This fixes PR25607. llvm-svn: 254478	2015-12-02 01:22:54 +00:00
Tim Northover	f3be9d5c0b	AArch64: fix 128-bit shifts We mustn't introduce a shift of exactly 64-bits for any inputs, since that's an UNDEF value (and worse, it's not what you want with the natural Arch64 implementation). The generated code is pretty horrific, but I couldn't come up with an obviously better alternative (if the amount is constant EXTR could help). Turns out 128-bit shifts are just nasty. rdar://22491037 llvm-svn: 254475	2015-12-02 00:33:54 +00:00
Rafael Espindola	af714765e6	Use default member initializers. llvm-svn: 254473	2015-12-01 23:06:26 +00:00
Matt Arsenault	592d068198	AMDGPU: Error on addrspacecasts that aren't actually implemented llvm-svn: 254469	2015-12-01 23:04:05 +00:00
Matt Arsenault	f9bfeafd00	AMDGPU: Implement isNoopAddrSpaceCast llvm-svn: 254468	2015-12-01 23:04:00 +00:00
Rafael Espindola	6d2c313b46	Remove unnecessary getter. llvm-svn: 254466	2015-12-01 23:01:51 +00:00
Rafael Espindola	e39cd5b144	Pass down the dst GV to linkGlobalValueBody. NFC. llvm-svn: 254465	2015-12-01 22:40:40 +00:00
Cong Hou	cb07d7016a	Fix a bug in IfConversion.cpp. The bug is introduced in r254377 which failed some tests on ARM, where a new probability is assigned to a successor but the provided BB may not be a successor. llvm-svn: 254463	2015-12-01 21:50:20 +00:00
Matthias Braun	b258d794dd	ARM: Change ArchCheck field to uint64_t The values in this field are compared against getAvailableFeatures() which returns an uint64_t. This was causing problems in an internal branch. llvm-svn: 254462	2015-12-01 21:48:52 +00:00
Matt Arsenault	3b15967008	AMDGPU: Disallow flat_scr in SI assembler llvm-svn: 254459	2015-12-01 20:31:08 +00:00
Xinliang David Li	a28306db0c	[PGO] Add support for reading multiple versions of indexed profile format profile data Profile readers using incompatible on-disk hash table format can now share the same implementation and interfaces. Differential Revision: http://reviews.llvm.org/D15100 llvm-svn: 254458	2015-12-01 20:26:26 +00:00
Rafael Espindola	edf811d68f	Delete unused includes. llvm-svn: 254457	2015-12-01 20:23:19 +00:00
Justin Bogner	909e1c0135	IR: Clean up some duplicated code in ConstantDataSequential creation. NFC ConstantDataArray::getImpl and ConstantDataVector::getImpl had a lot of copy pasta in how they handled sequences of constants. Break that out into a couple of simple functions. llvm-svn: 254456	2015-12-01 20:20:49 +00:00
Rafael Espindola	e3a933af31	clang-format LinkModules.cpp. Most of the file has been changed recently and was already clang-format clean. llvm-svn: 254454	2015-12-01 20:11:43 +00:00
Sanjay Patel	0b2a94916d	use range-based for loops; NFCI llvm-svn: 254453	2015-12-01 19:57:43 +00:00
Matt Arsenault	856d1928a8	AMDGPU: Optimize VOP2 operand legalization Don't use commuteInstruction, and don't commute if doing so will not improve legality. Skip the more complex checks for literal operands and constant bus restrictions, which are not a concern for VOP2 instructions because src1 does not accept SGPRs or constants and few implicitly read vcc. This gets called quite a few times and the attempts at commuting are a significant fraction of the time spent in SIFixSGPRCopies, so it's somewhat worthwhile to optimize. With this patch and others leading up to it, this reduces the compile time of SIFixSGPRCopies on some of the LuxMark 2 kernels from ~8ms to ~5ms on my system. llvm-svn: 254452	2015-12-01 19:57:17 +00:00
Rafael Espindola	0e309fe860	Use references now that it is natural to do so. The linker never takes ownership of a module or changes which module it is refering to, making it natural to use references. llvm-svn: 254449	2015-12-01 19:50:54 +00:00
Quentin Colombet	9cb01aa30a	[X86] Make sure the prologue does not clobber EFLAGS when it lives accross it. This fixes PR25629. llvm-svn: 254448	2015-12-01 19:49:31 +00:00
Xinliang David Li	0e6a36e17e	Use nullptr (NFC) llvm-svn: 254447	2015-12-01 19:47:32 +00:00
Sanjay Patel	b53791e5a7	don't repeat function/variable names in comments; NFC llvm-svn: 254445	2015-12-01 19:32:35 +00:00
Artyom Skrobov	5d1f2524a0	Fix Thumb1 epilogue generation Summary: This had been broken for a very long time, but nobody noticed until D14357 enabled shrink-wrapping by default. Reviewers: jroelofs, qcolombet Subscribers: tyomitch, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14986 llvm-svn: 254444	2015-12-01 19:25:11 +00:00
Sanjay Patel	96824deebc	fix typo; NFC llvm-svn: 254442	2015-12-01 19:19:18 +00:00
Weiming Zhao	56ab51870c	[AArch64] Fix a corner case in BitFeild select Summary: When not useful bits, BitWidth becomes 0 and APInt will not be happy. See https://llvm.org/bugs/show_bug.cgi?id=25571 We can just mark the operand as IMPLICIT_DEF is none bits of it is used. Reviewers: t.p.northover, jmolloy Subscribers: gberry, jmolloy, mgrang, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14803 llvm-svn: 254440	2015-12-01 19:17:49 +00:00
Matt Arsenault	e830f5427b	AMDGPU: Report extractelement as free in cost model The cost for scalarized operations is computed as N * (scalar operation cost + 1 extractelement + 1 insertelement). This partially fixes inflating the cost of scalarized operations since every operation is scalarized and free. I don't think we want any cost asociated with scalarization, but for now insertelement is still counted. I'm not sure if we should pretend that insertelement is also free, or add a way to compute a custom scalarization cost. llvm-svn: 254438	2015-12-01 19:08:39 +00:00
Keno Fischer	a6c4ce43df	[Verifier] Improve error for cross-module refs By including the module name in the error message. This makes the error message much more useful and saves a trip to the debugger. Reviewers: dexonsmith Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14473 llvm-svn: 254437	2015-12-01 19:06:36 +00:00
Rafael Espindola	3b80b8854c	Delete dead code. llvm-svn: 254436	2015-12-01 18:50:35 +00:00
Rafael Espindola	4dbdceb6fc	Use a forwarding constructor instead of an init method. llvm-svn: 254435	2015-12-01 18:46:19 +00:00
Rafael Espindola	4808c6d064	Delete the setModule method from the Linker. It was only used from LTO for a debug feature, and LTO can just create another linker. It is pretty odd to have a method to reset the module in the middle of a link. It would make IdentifiedStructTypes inconsistent with the Module for example. llvm-svn: 254434	2015-12-01 18:41:30 +00:00
Tom Stellard	38b7cbe3e0	AMDGPU/SI: Remove REGISTER_STORE/REGISTER_LOAD code which is now dead Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15050 llvm-svn: 254427	2015-12-01 17:45:22 +00:00
Tom Stellard	ff63c25753	AMDGPU: Use the default strings for data emission directives Summary: This makes the assembly output look nicer and there is no reason to have custom strings for these. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14671 llvm-svn: 254426	2015-12-01 17:45:17 +00:00
Sanjay Patel	60216f6943	[x86] add a convenience method to check for FMA capability; NFCI llvm-svn: 254425	2015-12-01 17:27:55 +00:00
Rafael Espindola	6e8ab928d5	Make appending var linking less of a special case. It has to be a bit special because: * materializeInitFor is not really supposed to call replaceAllUsesWith. The caller has a plain variable with Dst and expects just the initializer to be set, not for it to be removed. * Calling mutateType as we used to do before gets some type inconsistency which breaks the bitcode writer. * If linkAppendingVarProto create a dest decl with the correct type to avoid the above problems, it needs to put the original dst init in some side table for materializeInitFor to use. In the end the simplest solution seems to be to just have linkAppendingVarProto do all the work and set ValueMap[SrcGV to avoid recursion. llvm-svn: 254424	2015-12-01 17:17:04 +00:00
Teresa Johnson	430110cc0b	[ThinLTO] Wrap dbgs() output in DEBUG macro Missed in a couple places. llvm-svn: 254422	2015-12-01 17:12:10 +00:00
Teresa Johnson	d582f5b3f8	[ThinLTO] Remove stale comment (NFC) Stale as of r254036 which added basic profitability check. llvm-svn: 254421	2015-12-01 16:45:23 +00:00
Rafael Espindola	baa3bf8f76	Bring r254336 back: The difference is that now we don't error on out-of-comdat access to internal global values. We copy them instead. This seems to match the expectation of COFF linkers (see pr25686). Original message: Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct call to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via WapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254418	2015-12-01 15:19:48 +00:00
Chad Rosier	869962f962	[LIR] Push check into helper function. NFC. llvm-svn: 254416	2015-12-01 14:26:35 +00:00
Elena Demikhovsky	0d0692d854	AVX-512: fixed asm string of vsqrtss (vvsqrtss was generated before) llvm-svn: 254411	2015-12-01 12:43:46 +00:00
Elena Demikhovsky	47fa271a9b	Fixed a failure in getSpaltValue() llvm-svn: 254409	2015-12-01 12:30:40 +00:00
Elena Demikhovsky	0781d7b2b4	Fixed a failure in cost calculation for vector GEP Cost calculation for vector GEP failed with due to invalid cast to GEP index operand. The bug is fixed, added a test. http://reviews.llvm.org/D14976 llvm-svn: 254408	2015-12-01 12:08:36 +00:00
Hrvoje Varga	e51b0e13f3	[mips][microMIPS] Implement RECIP.fmt, RINT.fmt, ROUND.L.fmt, ROUND.W.fmt, SEL.fmt, SELEQZ.fmt, SELNEQZ.fmt and CLASS.fmt Differential Revision: http://reviews.llvm.org/D13885 llvm-svn: 254405	2015-12-01 11:59:21 +00:00
Yury Gribov	d7dbb66eb8	Introduce new @llvm.get.dynamic.area.offset.i{32, 64} intrinsics. The @llvm.get.dynamic.area.offset.* intrinsic family is used to get the offset from native stack pointer to the address of the most recent dynamic alloca on the caller's stack. These intrinsics are intendend for use in combination with @llvm.stacksave and @llvm.restore to get a pointer to the most recent dynamic alloca. This is useful, for example, for AddressSanitizer's stack unpoisoning routines. Patch by Max Ostapenko. Differential Revision: http://reviews.llvm.org/D14983 llvm-svn: 254404	2015-12-01 11:40:55 +00:00
Cong Hou	4aef7ef881	Allow known and unknown probabilities coexist in MBB's successor list. Previously it is not allowed for each MBB to have successors with both known and unknown probabilities. However, this may be too strict as at this stage we could not always guarantee that. It is better to remove this restriction now, and I will work on validating MBB's successors' probabilities first (for example, check if the sum is approximate one). llvm-svn: 254402	2015-12-01 11:05:39 +00:00
Oliver Stannard	a34e47066e	[AArch64] Add ARMv8.2-A Statistical Profiling Extension The Statistical Profiling Extension is an optional extension to ARMv8.2-A. Since it is an optional extension, I have added the FeatureSPE subtarget feature to control it. The assembler-visible parts of this extension are the new "psb csync" instruction, which is equivalent to "hint #17", and a number of system registers. Differential Revision: http://reviews.llvm.org/D15021 llvm-svn: 254401	2015-12-01 10:48:51 +00:00
Oliver Stannard	4667071574	[ARM] Add ARMv8.2-A to TargetParser Add ARMv8.2-A to TargetParser, so that it can be used by the clang command-line options and the .arch directive. Most testing of this will be done in clang, checking that the command-line options that this enables work. Differential Revision: http://reviews.llvm.org/D15037 llvm-svn: 254400	2015-12-01 10:33:56 +00:00
Oliver Stannard	8addbf4350	[ARM] Add subtarget features for ARMv8.2-A This adds subtarget features for ARMv8.2-A, which builds on (and requires the features from) ARMv8.1-A. Most assembler-visible features of ARMv8.2-A are system instructions, and are all required parts of the architecture, so just depend on the HasV8_2aOps subtarget feature. There is also one large, optional feature, which adds 16-bit floating point versions of all existing floating-point instructions (VFP and SIMD), this is represented by the FeatureFullFP16 subtarget feature. Differential Revision: http://reviews.llvm.org/D15036 llvm-svn: 254399	2015-12-01 10:23:06 +00:00
Sanjoy Das	347d272c5c	Introduce a range version of std::find, and use in SCEV Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15064 llvm-svn: 254391	2015-12-01 07:49:27 +00:00
Sanjoy Das	ff3b8b4c33	Introduce a range version of std::any_of, and use it in SCEV Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15063 llvm-svn: 254390	2015-12-01 07:49:23 +00:00
Craig Topper	c458c7c6c9	[X86] Fix patterns for memory forms of FP FSUBR and FDIVR. They need to have memory on the left hand side of the fsub/fdiv operations in their patterns. Not sure how to test this. I noticed by inspection in the isel tables where the same pattern tried to produce DIV and DIVR or SUB and SUBR. llvm-svn: 254388	2015-12-01 06:13:16 +00:00
Craig Topper	271f9ded44	[X86] Use range-based for loops. NFC llvm-svn: 254387	2015-12-01 06:13:15 +00:00
Craig Topper	ba894c3c0d	[X86] Use array_lengthof instead of calculating manually. Also change index types to size_t to match. llvm-svn: 254386	2015-12-01 06:13:13 +00:00
Craig Topper	ddc76f2bed	[Hexagon] Use std::begin() and std::end() instead of doing the same manually. NFC llvm-svn: 254385	2015-12-01 06:13:10 +00:00
Craig Topper	d824f5f0d9	[Hexagon] Use array_lengthof and const correct and type correct the array and array size. NFC llvm-svn: 254384	2015-12-01 06:13:08 +00:00
Craig Topper	6261e1b94d	Use array_lengthof instead of manually calculating it. NFC llvm-svn: 254383	2015-12-01 06:13:06 +00:00
Craig Topper	3da000c07f	[Hexagon] Use ArrayRef to avoid needing to calculate an array size. Interestingly the original code may have had a bug because it was passing the byte size of a uint16_t array instead of the number of entries. llvm-svn: 254382	2015-12-01 06:13:04 +00:00
Craig Topper	8072081b63	[ARM] Use range-based for loops to avoid the need for calculating an array size that I would have otherwise cconverted to array_lengthof. NFC llvm-svn: 254381	2015-12-01 06:13:01 +00:00
Craig Topper	fac9057ef8	Use array_lengthof instead of manually calculating it. NFC llvm-svn: 254380	2015-12-01 06:12:59 +00:00
Davide Italiano	05402671b8	[Windows] Partially revert r254363 until I can test the right fix. Reported by: David Blaikie llvm-svn: 254378	2015-12-01 05:33:24 +00:00
Cong Hou	d97c100dc4	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. (This is the second attempt to submit this patch. The first caused two assertion failures and was reverted. See https://llvm.org/bugs/show_bug.cgi?id=25687) The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254377	2015-12-01 05:29:22 +00:00
Matthias Braun	50f7f585ed	RegisterPressure: If we do not collect dead defs the list must be empty llvm-svn: 254372	2015-12-01 04:20:06 +00:00
Matthias Braun	ba6b225bf9	RegisterPressure: Remove support for recede()/advance() at MBB boundaries Nobody was checking the returnvalue of recede()/advance() so we can simply replace this code with asserts. llvm-svn: 254371	2015-12-01 04:20:04 +00:00
Matthias Braun	f9f8b92d93	RegisterPressure: Split RegisterOperands analysis code from result object; NFC This is in preparation to expose the RegisterOperands class as RegisterPressure API. llvm-svn: 254368	2015-12-01 04:19:56 +00:00
Hans Wennborg	1dbaf67537	Revert r254348: "Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces." and the follow-up r254356: "Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction." Asserts were firing in Chromium builds. See PR25687. llvm-svn: 254366	2015-12-01 03:49:42 +00:00
Davide Italiano	38518e9f53	[Windows] Follow-up r254363, remove return. llvm-svn: 254364	2015-12-01 02:38:42 +00:00
Davide Italiano	b37d6bd7ae	[Windows] Simplify assertion code. NFC. llvm-svn: 254363	2015-12-01 02:35:04 +00:00
Matt Arsenault	456fdfcdc2	Squelch unused variable warning in SIRegisterInfo.cpp. Patch by Justin Lebar llvm-svn: 254362	2015-12-01 02:14:33 +00:00
Cong Hou	1ccca9e673	Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction. The root cause is the rounding behavior in BranchProbability construction. We may consider to use truncation instead in the future. llvm-svn: 254356	2015-12-01 00:55:42 +00:00
Evgeniy Stepanov	42f3b12274	[safestack] Protect byval function arguments. Detect unsafe byval function arguments and move them to the unsafe stack. llvm-svn: 254353	2015-12-01 00:40:05 +00:00
Evgeniy Stepanov	fd07995363	Extend debug info for function parameters in SDAG. SDAG currently can emit debug location for function parameters when an llvm.dbg.declare points to either a function argument SSA temp, or to an AllocaInst. This change extends this logic by adding a fallback case when neither of the above is true. This is required for SafeStack, which may copy the contents of a byval function argument into something that is not an alloca, and then describe the target as the new location of the said argument. llvm-svn: 254352	2015-12-01 00:34:30 +00:00
Evgeniy Stepanov	a4ac3f4bdf	[safestack] Fix handling of array allocas. The current code does not take alloca array size into account and, as a result, considers any access past the first array element to be unsafe. llvm-svn: 254350	2015-12-01 00:06:13 +00:00
Cong Hou	fa1917c673	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254348	2015-12-01 00:02:51 +00:00
Rafael Espindola	e9841a6bb5	This reverts commit r254336 and r254344. They broke a bot and I am debugging why. llvm-svn: 254347	2015-11-30 23:54:19 +00:00
Rafael Espindola	a891957002	Disable a consistency check. Trying to figure out why it fails on a bot but passes locally. llvm-svn: 254344	2015-11-30 23:05:25 +00:00
Simon Pilgrim	db26b3ddfa	[X86][FMA4] Prefer FMA4 to FMA We currently output FMA instructions on targets which support both FMA4 + FMA (i.e. later Bulldozer CPUS bdver2/bdver3/bdver4). This patch flips this so FMA4 is preferred; this is for several reasons: 1 - FMA4 is non-destructive reducing the need for mov instructions. 2 - Its more straighforward to commute and fold inputs (although the recent work on FMA has reduced this difference). 3 - All supported targets have FMA4 performance equal or better to FMA - Piledriver (bdver2) in particular has half the throughput when executing FMA instructions. Its looks like no future AMD processor lines will support FMA4 after the Bulldozer series so we're not causing problems for later CPUs. Differential Revision: http://reviews.llvm.org/D14997 llvm-svn: 254339	2015-11-30 22:22:06 +00:00
Rafael Espindola	c109200c53	Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct call to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via WapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254336	2015-11-30 22:01:43 +00:00
Paul Robinson	a2550a6da3	Have 'optnone' respect the -fast-isel=false option. This is primarily useful for debugging optnone v. ISel issues. Differential Revision: http://reviews.llvm.org/D14792 llvm-svn: 254335	2015-11-30 21:56:16 +00:00
Matt Arsenault	ada6cf1b22	AMDGPU: Fix unused function llvm-svn: 254333	2015-11-30 21:32:10 +00:00
Matt Arsenault	41003af292	AMDGPU: Error if too many user SGPRs used llvm-svn: 254332	2015-11-30 21:16:07 +00:00
Matt Arsenault	26f8f3db39	AMDGPU: Rework how private buffer passed for HSA If we know we have stack objects, we reserve the registers that the private buffer resource and wave offset are passed and use them directly. If not, reserve the last 5 SGPRs just in case we need to spill. After register allocation, try to pick the next available registers instead of the last SGPRs, and then insert copies from the inputs to the reserved registers in the progloue. This also only selectively enables all of the input registers which are really required instead of always enabling them. llvm-svn: 254331	2015-11-30 21:16:03 +00:00
Matt Arsenault	ac234b604d	AMDGPU: Rename enums to be consistent with HSA code object terminology llvm-svn: 254330	2015-11-30 21:15:57 +00:00
Matt Arsenault	0e3d38937e	AMDGPU: Remove SIPrepareScratchRegs It does not work because of emergency stack slots. This pass was supposed to eliminate dummy registers for the spill instructions, but the register scavenger can introduce more during PrologEpilogInserter, so some would end up left behind if they were needed. The potential for spilling the scratch resource descriptor and offset register makes doing something like this overly complicated. Reserve registers to use for the resource descriptor and use them directly in eliminateFrameIndex. Also removes creating another scratch resource descriptor when directly selecting scratch MUBUF instructions. The choice of which registers are reserved is temporary. For now it attempts to pick the next available registers after the user and system SGPRs. llvm-svn: 254329	2015-11-30 21:15:53 +00:00
Matt Arsenault	ff6da2fe89	AMDGPU: Use assert zext for workgroup sizes llvm-svn: 254328	2015-11-30 21:15:45 +00:00
Quentin Colombet	cdad10f333	[ARM] For old thumb ISA like v4t, we cannot use PC directly in pop. Fix the epilogue emission to account for that. llvm-svn: 254325	2015-11-30 20:37:58 +00:00
Davide Italiano	1aeed6a955	[SimplifyLibCalls] Transform log(exp2(y)) to y*log(2) under fast-math. llvm-svn: 254317	2015-11-30 19:36:35 +00:00
David Majnemer	bf4119faf6	[X86] Add RIP to GR64_TCW64 The MachineVerifier wants to check that the register operands of an instruction belong to the instruction's register class. RIP-relative control flow instructions violated this by referencing RIP. While this was fixed for SysV, it was never fixed for Win64. llvm-svn: 254315	2015-11-30 19:04:19 +00:00
Kit Barton	f4ce2f3a9e	Enable shrink wrapping for PPC64 Re-enable shrink wrapping for PPC64 Little Endian. One minor modification to PPCFrameLowering::findScratchRegister was necessary to handle fall-thru blocks (blocks with no terminator) correctly. Tested with all LLVM test, clang tests, and the self-hosting build, with no problems found. PHabricator: http://reviews.llvm.org/D14778 llvm-svn: 254314	2015-11-30 18:59:41 +00:00
Rafael Espindola	c98b20b0d6	Fix another llvm.ctors merging bug. We were not looking past casts to see if an element should be included or not. llvm-svn: 254313	2015-11-30 18:54:24 +00:00
Dan Gohman	96029f7880	[WebAssembly] Fix a few minor compiler warnings. NFC. llvm-svn: 254311	2015-11-30 18:42:08 +00:00
Sanjay Patel	239be1fb0d	fix formatting; NFC llvm-svn: 254310	2015-11-30 17:52:02 +00:00
Colin LeMahieu	e6241798c9	[Hexagon] NFC Reordering headers. llvm-svn: 254307	2015-11-30 17:32:34 +00:00
Matt Arsenault	ea03cf2fa1	AMDGPU: Don't reserve SCRATCH_PTR input register This hasn't been doing anything since using relocations was added. llvm-svn: 254304	2015-11-30 15:46:47 +00:00
Aaron Ballman	33c95f08b0	Silencing a 32-bit to 64-bit implicit conversion warning; NFC. llvm-svn: 254302	2015-11-30 14:52:33 +00:00
Hrvoje Varga	c03957f049	[mips][microMIPS] Implement LBUX, LHX, LWX, MAQ_S[A].W.PHL, MAQ_S[A].W.PHR, MFHI, MFLO, MTHI and MTLO instructions Differential Revision: http://reviews.llvm.org/D14436 llvm-svn: 254297	2015-11-30 12:58:39 +00:00
Zoran Jovanovic	a887b36167	[mips][microMIPS] Fix issue with offset operand of BALC and BC instructions Value of offset operand for microMIPS BALC and BC instructions is currently shifted 2 bits, but it should be 1 bit. Differential Revision: http://reviews.llvm.org/D14770 llvm-svn: 254296	2015-11-30 12:56:18 +00:00
Zlatko Buljan	56f3b0e410	[mips][microMIPS] Implement PRECR.QB.PH, PRECR_SRA[_R].PH.W, PRECRQ.PH.W, PRECRQ.QB.PH, PRECRQU_S.QB.PH and PRECRQ_RS.PH.W instructions Differential Revision: http://reviews.llvm.org/D14605 llvm-svn: 254291	2015-11-30 08:37:38 +00:00
Craig Topper	27e2912fa8	Revert r254279 "[X86] Use ArrayRef. NFC". It seems to have upset an MSVC build bot. llvm-svn: 254280	2015-11-30 02:28:19 +00:00

1 2 3 4 5 ...

85163 Commits