llvm-project

Commit Graph

Author	SHA1	Message	Date
Martin Storsjo	d3b29223a8	[ARM] Move machine operand target flags to ARMBaseInstrInfo This makes sure the flags are available for use for thumb MIR as well. A test that requires this will be added in the next commit. llvm-svn: 340450	2018-08-22 20:34:06 +00:00
Krzysztof Parzyszek	2ff9aa15e4	[Hexagon] Enable interleaving in loop vectorizer llvm-svn: 340447	2018-08-22 20:15:04 +00:00
Eli Friedman	c11e2b9470	[ARM] Handle all-ones mask explicitly in targetShrinkDemandedConstant. This avoids a potential infinite loop setting and unsetting bits in the mask. Reduced from a failure on the polly-aosp bot. Differential Revision: https://reviews.llvm.org/D51066 llvm-svn: 340446	2018-08-22 20:13:45 +00:00
Alina Sbirlea	8b83d68544	Update MemorySSA in LoopSimplifyCFG. Summary: Add MemorySSA as a dependency to LoopSimplifyCFG and preserve it. Disabled by default until all passes preserve MemorySSA. Reviewers: bogner, chandlerc Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D50911 llvm-svn: 340445	2018-08-22 20:10:21 +00:00
Alina Sbirlea	c1a216b251	Update MemorySSA in LoopInstSimplify. Summary: Add MemorySSA as a depency to LoopInstInstSimplify and preserve it. Disabled by default until all passes preserve MemorySSA. Reviewers: chandlerc Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D50906 llvm-svn: 340444	2018-08-22 20:05:21 +00:00
Philip Reames	8abf4484fe	[AA] Remove a needless variable [NFC] There's no need to track a seperate variable for argmemonly aliasing. This falls out naturally of the modinfo union. Note that we may return earlier than we would have earlier if all arguments are explicitly readnone. The overall result doesn't change, just how we get there. llvm-svn: 340443	2018-08-22 19:50:45 +00:00
Craig Topper	538f8ab438	[X86] Replace (32/64 - n) shift amounts with (neg n) since the shift amount is masked in hardware Inspired by what AArch64 does for shifts, this patch attempts to replace shift amounts with neg if we can. This is done directly as part of isel so its as late as possible to avoid breaking some BZHI patterns since those patterns need an unmasked (32-n) to be correct. To avoid manual load folding and custom instruction selection for the negate. I've inserted new nodes in the DAG above the shift node in topological order. Differential Revision: https://reviews.llvm.org/D48789 llvm-svn: 340441	2018-08-22 19:39:09 +00:00
Philip Reames	f8681cea87	[AST] Minor whitespace cleanup [NFC] llvm-svn: 340440	2018-08-22 19:30:46 +00:00
Heejin Ahn	bc6d8970bb	[WebAssembly] Remove MachineFrameInfo arg from checking functions (NFC) Summary: There are several functions in the form of `has*` or `needs*` in `WebAssemblyFrameLowering` and its `MachineFrameInfo` argument can be obtained from `MachineFunction` so it is not necessarily has to be passed from a caller. Also, it is more in line with other overriden fuctions like `hasBP` or `hasReservedCallFrame`, which also take only `MachineFunction` argument. Reviewers: dschuff Subscribers: sbc100, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51116 llvm-svn: 340438	2018-08-22 18:53:48 +00:00
Chris Bieneman	a2133f1a68	[CMake] Remove unneeded and outdated policy This was needed way back because we didn't properly handle that the SOURCES property of a target could have things that weren't source files to compile. Almost 2 years ago Takumi fixed that, and now CMake is throwing warnings that we should get off the old behavior. llvm-svn: 340436	2018-08-22 18:41:14 +00:00
Chris Bieneman	dc622702aa	[CMake] Use LLVM_ENABLE_IDE instead of CMAKE_CONFIGURATION_TYPES There are several places where we use CMAKE_CONFIGURATION_TYPES to determine if we are using an IDE generator and in turn decide not to generate some of the convenience targets (like all the install-* and check-llvm-* targets). This decision is made because IDEs don't always deal well with the thousands of targets LLVM can generate. This approach does not work for Visual Studio 15's new CMake integration. Because VS15 uses a Ninja generator, it isn't a multi-configuration build, and generating all these extra targets mucks up the UI and adds little value. With this change we still don't generate these targets by default for Visual Studio and Xcode generators, and LLVM_ENABLE_IDE becomes a switch that can be enabled on the VS15 CMake builds, to improve the IDE experience. llvm-svn: 340435	2018-08-22 18:40:24 +00:00
Craig Topper	87f78cfe15	[X86] In OptimizeLEAs pass, check that the key is in the LEAs map before accessing When the key is not already in the map, the access operator[] creates an empty value and grows the map. Resizing a map is very slow, so this needs to be avoided. Found with csmith + asserts. May help with https://bugs.llvm.org/show_bug.cgi?id=25843 Patch by Tom Rix. Differential Revision: https://reviews.llvm.org/D50780 llvm-svn: 340434	2018-08-22 18:24:13 +00:00
Heejin Ahn	ff363539c6	[WebAssembly] Add hasSideEffects flag to catch instructions Summary: `catch` instruction certainly has rather huge side effects and the flag was missing. At the moment this does not change any unit tests we currently have. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D50919 llvm-svn: 340433	2018-08-22 18:22:45 +00:00
Vedant Kumar	a85ca3de66	[CodeGenPrepare] Set debug locs when folding a comparison into a uadd.with.overflow CGP can replace a branch + select with a uadd.with.overflow. Teach it to set debug locations as it does this. llvm-svn: 340432	2018-08-22 18:15:03 +00:00
Matt Davis	9cee1adc1a	[llvm-mca] Clean up a comment about the Context class. NFC. llvm-svn: 340431	2018-08-22 18:03:58 +00:00
George Burgess IV	d61e7071cd	[MemorySSA] Move two simple getters; NFC We're calling these functions quite a bit from outside of MemorySSA.cpp now. Given that they're relatively simple one-liners, I think the style preference is to have them inline. llvm-svn: 340430	2018-08-22 18:02:46 +00:00
Aditya Nandakumar	c106183518	[GISel]: Add legalization support for widening bit counting operations https://reviews.llvm.org/D51053 Added legalization for WidenScalar of various bitcounting opcodes. Reviewed by arsenm. llvm-svn: 340429	2018-08-22 17:59:18 +00:00
Sanjay Patel	b5686c4e4e	[x86] add tests for load scalar + insertelement; NFC llvm-svn: 340425	2018-08-22 17:46:28 +00:00
Sam Clegg	f77dc2a8d1	[WebAssembly] Ensure relocation entries are ordered by offset wasm-lld expects relocation entries to be sorted by offset. In most cases llvm produces them in order, but the CODE section (which combines many MCSections) is an exception because we order the functions in Symbol order, not in section order. What is more, its not clear weather `recordRelocation` is guaranteed to be called in offset order so this sort of most likely needed in the general case too. Differential Revision: https://reviews.llvm.org/D51065 llvm-svn: 340423	2018-08-22 17:27:31 +00:00
Matt Davis	4fc7e6a1e9	[llvm-mca] Remove unused decl. NFC. llvm-svn: 340422	2018-08-22 17:15:25 +00:00
Samuel Pitoiset	7bd9dcffcd	AMDGPU: bump AS.MAX_COMMON_ADDRESS to 6 since 32-bit addr space 32-bit constant address space is declared as 6, so the maximum number of address spaces is 6, not 5. Fixes "LLVM ERROR: Pointer address space out of range". v5: rename MAX_COMMON_ADDRESS to MAX_AMDGPU_ADDRESS v4: - fix compilation issues - fix out of bounds access v3: use static_assert() v2: add a very simple test for 32-bit addr space Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106630 llvm-svn: 340417	2018-08-22 16:08:48 +00:00
Samuel Pitoiset	d81d6f7d58	AMDGPU: fix existing alias rules for constant and global Constant and global may alias, also one rules table wasn't ordered correctly. Pinpointed by Matt. v2: add a test with swapped parameters llvm-svn: 340416	2018-08-22 16:08:43 +00:00
Simon Pilgrim	ffdfe45645	[X86][SSE] LowerMULH vXi8 - use SSE shifts directly. We know these vXi16 extended cases are legal constant splat shifts. llvm-svn: 340414	2018-08-22 15:37:11 +00:00
Sam Parker	4d519fc3b5	[ARM] Rotated operand patterns for *xtb16 Add intrinsic isel patterns for sxtb16, sxtab16, uxtb16 and uxtab16 so that they can perform a ror. Differential Revision: https://reviews.llvm.org/D51034 llvm-svn: 340405	2018-08-22 12:58:36 +00:00
David Green	9dd1d451d9	[AArch64] Add Tiny Code Model for AArch64 This adds the plumbing for the Tiny code model for the AArch64 backend. This, instead of loading addresses through the normal ADRP;ADD pair used in the Small model, uses a single ADR. The 21 bit range of an ADR means that the code and its statically defined symbols need to be within 1MB of each other. This makes it mostly interesting for embedded applications where we want to fit as much as we can in as small a space as possible. Differential Revision: https://reviews.llvm.org/D49673 llvm-svn: 340397	2018-08-22 11:31:39 +00:00
Matt Arsenault	bb8e64e7f5	AMDGPU: Fix not respecting byval alignment in call frame setup This was hackily adding in the 4-bytes reserved for the callee's emergency stack slot. Treat it like a normal stack allocation so we get the correct alignment padding behavior. This fixes an inconsistency between the caller and callee. llvm-svn: 340396	2018-08-22 11:09:45 +00:00
Andrea Di Biagio	4660fd25d1	[llvm-mca] Improved code comments and moved some method definitions from Scheduler.h to Scheduler.cpp. NFC llvm-svn: 340395	2018-08-22 10:23:28 +00:00
Simon Pilgrim	b89a4f85bf	[X86][SSE] Add sdiv test case from PR38658 llvm-svn: 340393	2018-08-22 09:47:12 +00:00
Stefan Maksimovic	6ccbd16433	[mips] Handle missing CondCodes Add patterns for unhandled CondCode enumerables: SETEQ, SETGE, SETGT, SETLE, SETLT, SETNE. Stated at the ISD::CondCode enum declaration: `All of these (except for the 'always folded ops') should be handled for floating point.` Add patterns which use these nodes, same as corresponding 'ordered' CondCode nodes. Referring to 'Ordered means that neither operand is a QNAN' we assume it is safe to match ex. SETLT node to the same instruction as SETOLT. Differential Revision: https://reviews.llvm.org/D50757 llvm-svn: 340392	2018-08-22 09:34:44 +00:00
Dean Michael Berris	d764c1b656	[XRay] Refactor file header reading (NFC) Summary: This patch moves out the definition of the XRay log file header from binary logs into its own header and implementation file. This is one part of the refactoring being done in D50441. Reviewers: eizan Subscribers: mgorny, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51086 llvm-svn: 340389	2018-08-22 07:37:55 +00:00
Philip Reames	fdd73b5037	[AST] Fix a whitespace typo [NFC] llvm-svn: 340384	2018-08-22 03:36:42 +00:00
Philip Reames	5d90c14b76	[AST] Reorder code to reduce a future patch diff [NFC] llvm-svn: 340383	2018-08-22 03:33:55 +00:00
Philip Reames	825c74c241	[AST] Move a function definition into the cpp [NFC] llvm-svn: 340382	2018-08-22 03:32:52 +00:00
Max Kazantsev	611d645a08	[GuardWidening] Ignore guards with trivial conditions Guard widening should not spend efforts on dealing with guards with trivial true/false conditions. Such guards can easily be eliminated by any further cleanup pass like instcombine. However we should not unconditionally delete them because it may be profitable to widen other conditions into such guards. Differential Revision: https://reviews.llvm.org/D50247 Reviewed By: fedor.sergeev llvm-svn: 340381	2018-08-22 02:40:49 +00:00
Fangrui Song	9ba5740ba5	[gold] -thinlto-object-suffix-replace: don't append new suffix if path does not end with old suffix Summary: This is to be consistent with lld behavior since rLLD340364. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: steven_wu, eraman, mehdi_amini, inglorion, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D51060 llvm-svn: 340380	2018-08-22 02:11:36 +00:00
Vedant Kumar	4760686823	[CodeGenPrepare] Set debug loc when widening a switch condition Set a debug location on the cast instruction used to widen a switch condition. llvm-svn: 340379	2018-08-22 01:23:31 +00:00
Bob Haarman	481d224b67	[Support][CachePruning] prune least recently accessed files first Summary: Before this change, pruning order was based on size. This changes it to be based on time of last use instead, preferring to keep recently used files and prune older ones. Reviewers: pcc, rnk, espindola Reviewed By: rnk Subscribers: emaste, arichardson, hiraditya, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D51062 llvm-svn: 340374	2018-08-22 00:52:16 +00:00
Heejin Ahn	684325955c	[WebAssembly] Fix typos in mem.grow/memory.grow opcodes This should be not 0x3f but 0x40. llvm-svn: 340373	2018-08-22 00:33:34 +00:00
Heejin Ahn	c4df1d182c	[WebAssembly] Change comments on SP writing back (NFC) Summary: We now write back not to memory but to __stack_pointer global. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51074 llvm-svn: 340372	2018-08-22 00:20:02 +00:00
Vedant Kumar	1e8a2c963c	[CodeGenPrepare] Set debug locations when splitting selects When splitting a select into a diamond, set debug locations on newly-created branch instructions and phi nodes. llvm-svn: 340371	2018-08-22 00:10:37 +00:00
Vedant Kumar	30406fd789	[CodeGenPrepare] Clean up dbg.value use-before-def as late as possible CodeGenPrepare has a strategy for moving dbg.values so that a value's definition always dominates its debug users. This cleanup was happening too early (before certain CGP transforms were run), resulting in some dbg.value use-before-def errors. Perform this cleanup as late as possible to avoid use-before-def. llvm-svn: 340370	2018-08-21 23:43:08 +00:00
Vedant Kumar	8d652b756e	[CodeGenPrepare] Pre-commit debug info test for optimizeSelectInst This test shows that optimizeSelectInst splits a select and sinks a `fdiv` operation to one side of the diamond. However, the dbg.value for the operation isn't moved. llvm-svn: 340369	2018-08-21 23:42:53 +00:00
Vedant Kumar	00e7558edd	[CodeGenPrepare] Scan past debug intrinsics to find select candidates (NFC) In optimizeSelectInst, when scanning for candidate selects to rewrite into branches, scan past debug intrinsics. This makes the debug-enabled and non-debug paths through optimizeSelectInst more congruent. NFC because every select is eventually visited either way. llvm-svn: 340368	2018-08-21 23:42:38 +00:00
Vedant Kumar	fbc3873be9	[CodeGenPrepare] Exit earlier when optimizing selects (NFC) When optimizing for size, this allows optimizeSelectInst to skip a linear scan and exit early. llvm-svn: 340367	2018-08-21 23:42:23 +00:00
Vedant Kumar	a459b9f757	Avoid dbg.value use-before-def in a few tests (NFC) This is preparation for landing a use-before-def verifier for debug intrinsics (D46100). As a drive-by, remove `tail` from debug intrinsic calls because it doesn't mean anything in that context. llvm-svn: 340366	2018-08-21 23:42:08 +00:00
Alina Sbirlea	ab6f84f763	Update MemorySSA in BasicBlockUtils. Summary: Extend BasicBlocksUtils to update MemorySSA. Subscribers: sanjoy, arsenm, nhaehnle, jlebar, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D45300 llvm-svn: 340365	2018-08-21 23:32:03 +00:00
Alina Sbirlea	5f0976a6d1	[MemorySSA] Update comment for move APIs to clarify behavior (NFC). llvm-svn: 340362	2018-08-21 23:13:02 +00:00
Zachary Turner	ee09170d25	[MS Demangler] Print template constructor args. Previously if you had something like this: template<typename T> struct Foo { template<typename U> Foo(U); }; Foo F(3.7); this would mangle as ??$?0N@?$Foo@H@@QEAA@N@Z and this would be demangled as: undname: __cdecl Foo<int>::Foo<int><double>(double) llvm-undname: __cdecl Foo<int>::Foo<int>(double) Note the lack of the constructor template parameter in our demangling. This patch makes it so we print the constructor argument list. llvm-svn: 340356	2018-08-21 22:52:52 +00:00
Tom Stellard	ecd6aa5be2	MachineScheduler: Refactor setPolicy() to limit computing remaining latency Summary: Computing the remaining latency can be very expensive especially on graphs of N nodes where the number of edges approaches N^2. This reduces the compile time of a pathological case with the AMDGPU backend from ~7.5 seconds to ~3 seconds. This test case has a basic block with 2655 stores, each with somewhere between 500 and 1500 successors and predecessors. Reviewers: atrick, MatzeB, airlied, mareko Reviewed By: mareko Subscribers: tpr, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D50486 llvm-svn: 340346	2018-08-21 21:48:43 +00:00
Philip Reames	6a2a5c99c7	[LICM] Fix a test so it actualy checks what was meant [NFC] llvm-svn: 340344	2018-08-21 21:27:26 +00:00
Scott Linder	72855e36c5	[AMDGPU] Consider loads from flat addrspace to be potentially divergent In general we can't assume flat loads are uniform, and cases where we can prove they are should be handled through infer-address-spaces. Differential Revision: https://reviews.llvm.org/D50991 llvm-svn: 340343	2018-08-21 21:24:31 +00:00
Zachary Turner	df4cd7cbf9	[MS Demangler] Fix a few more edge cases. I found these by running llvm-undname over a couple hundred megabytes of object files generated as part of building chromium. The issues fixed in this patch are: 1) decltype-auto return types. 2) Indirect vtables (e.g. const A::`vftable'{for `B'}) 3) Pointers, references, and rvalue-references to member pointers. I have exactly one remaining symbol out of a few hundred MB of object files that produces a name we can't demangle, and it's related to back-referencing. llvm-svn: 340341	2018-08-21 21:23:49 +00:00
Zachary Turner	4ca11217fc	Print "invalid mangled name" when we can't demangle something. llvm-svn: 340340	2018-08-21 21:23:29 +00:00
Heejin Ahn	78d1910891	[WebAssembly] Restore __stack_pointer after catch instructions Summary: After the stack is unwound due to a thrown exception, the `__stack_pointer` global can point to an invalid address. This inserts instructions that restore `__stack_pointer` global. Reviewers: jgravelle-google, dschuff Subscribers: mgorny, sbc100, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D50980 llvm-svn: 340339	2018-08-21 21:23:07 +00:00
Anna Thomas	1d78503f6a	NFC: update the test comments in LV test about early exit loops llvm-svn: 340337	2018-08-21 21:12:02 +00:00
Thomas Lively	22442924a8	[WebAssembly] v128.const Summary: This CL implements v128.const for each vector type. New operand types are added to ensure the vector contents can be serialized without LEB encoding. Tests are added for instruction selection, encoding, assembly and disassembly. Reviewers: aheejin, dschuff, aardappel Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D50873 llvm-svn: 340336	2018-08-21 21:03:18 +00:00
Fangrui Song	61aaa3504f	[docs][gold] Fix a typo llvm-svn: 340335	2018-08-21 21:00:54 +00:00
Marcello Maggioni	883fe455f1	[LICM] Refactor some AliasSetTracker code to get rid of new/deletes. NFC Differential Revision: https://reviews.llvm.org/D51024 llvm-svn: 340333	2018-08-21 20:30:14 +00:00
Florian Hahn	7cdf52e425	[CodeExtractor] Use 'normal destination' BB as insert point to store invoke results. Currently CodeExtractor tries to use the next node after an invoke to place the store for the result of the invoke, if it is an out parameter of the region. This fails, as the invoke terminates the current BB. In that case, we can place the store in the 'normal destination' BB, as the result will only be available in that case. Reviewers: davidxl, davide, efriedma Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D51037 llvm-svn: 340331	2018-08-21 20:07:46 +00:00
Heejin Ahn	9cd7f88a35	[WebAssembly] Don't make wasm cleanuppads into funclet entries Summary: Catchpads and cleanuppads are not funclet entries; they are only EH scope entries. We already dont't set `isEHFuncletEntry` for catchpads. This patch does the same thing for cleanuppads. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D50654 llvm-svn: 340330	2018-08-21 20:04:42 +00:00
Heejin Ahn	20c9c4438e	[WebAssembly] Change writeSPToMemory to writeSPToGlobal (NFC) Summary: SP is now a __stack_pointer global and not a memory address anymore. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51046 llvm-svn: 340328	2018-08-21 19:52:19 +00:00
Bjorn Pettersson	e06321382b	[RegisterCoalescer] Use substPhysReg in reMaterializeTrivialDef Summary: When RegisterCoalescer::reMaterializeTrivialDef is substituting a register use in a DBG_VALUE instruction, and the old register is a subreg, and the new register is a physical register, then we need to use substPhysReg in order to extract the correct subreg. Reviewers: wmi, aprantl Reviewed By: wmi Subscribers: hiraditya, MatzeB, qcolombet, tpr, llvm-commits Differential Revision: https://reviews.llvm.org/D50844 llvm-svn: 340326	2018-08-21 19:47:32 +00:00
Heejin Ahn	ed5e06b0a7	[WebAssembly] Add isEHScopeReturn instruction property Summary: So far, `isReturn` property is used to mean both a return instruction from a functon and the end of an EH scope, a scope that starts with a EH scope entry BB and ends with a catchret or a cleanupret instruction. Because WinEH uses funclets, all EH-scope-ending instructions are also real return instruction from a function. But for wasm, they only serve as the end marker of an EH scope but not a return instruction that exits a function. This mismatch caused incorrect prolog and epilog generation in wasm EH scopes. This patch fixes this. This patch is in the same vein with rL333045, which splits `MachineBasicBlock::isEHFuncletEntry` into `isEHFuncletEntry` and `isEHScopeEntry`. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D50653 llvm-svn: 340325	2018-08-21 19:44:11 +00:00
Simon Pilgrim	4a76d3e568	Fix Wdocumentation warning. NFCI. llvm-svn: 340324	2018-08-21 19:29:39 +00:00
Craig Topper	3d8fe39ca7	[InstCombine] Pull simple checks above a more complicated one. NFCI I'm assuming its easier to make sure the RHS of an XOR is all ones than it is to check for the many select patterns we have. So lets check that first. Same with the one use check. llvm-svn: 340321	2018-08-21 19:17:00 +00:00
Florian Hahn	9583d4fa03	[GVN] Assign new value number to calls reading memory, if there is no MemDep info. Currently we assign the same value number to two calls reading the same memory location if we do not have MemoryDependence info. Without MemDep Info we cannot guarantee that there is no store between the two calls, so we have to assign a new number to the second call. It also adds a new option EnableMemDep to enable/disable running MemoryDependenceAnalysis and also renamed NoLoads to NoMemDepAnalysis to be more explicit what it does. As it also impacts calls that read memory, NoLoads is a bit confusing. Reviewers: efriedma, sebpop, john.brawn, wmi Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D50893 llvm-svn: 340319	2018-08-21 19:11:27 +00:00
Krzysztof Parzyszek	b211434a78	[RegisterCoalscer] Manually remove leftover segments when commuting def In removeCopyByCommutingDef, segments from the source live range are copied into (and merged with) the segments of the target live range. This is performed for all subranges of the source interval. It can happen that there will be subranges of the target interval that had no corresponding subranges in the source interval, and in such cases these subrages will not be updated. Since the copy being coalesced is about to be removed, these ranges need to be updated by removing the segments that are started by the copy. llvm-svn: 340318	2018-08-21 19:01:26 +00:00
Benjamin Kramer	d66dde5a98	[NVPTX] Remove ftz variants of cvt with rounding mode These do not exist in ptxas, it refuses to compile them. Differential Revision: https://reviews.llvm.org/D51042 llvm-svn: 340317	2018-08-21 18:44:25 +00:00
Matt Davis	8e3f093baa	[llvm-mca] Remove unused decl. NFC. llvm-svn: 340316	2018-08-21 18:39:20 +00:00
Eric Christopher	3dc594c1e6	Temporarily Revert "[PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction" due to it causing a compiler crash on valid. This reverts commit r340016, testcase forthcoming. llvm-svn: 340315	2018-08-21 18:35:08 +00:00
Andrea Di Biagio	f3374f04ad	[llvm-mca] Add the ability to customize the instruction selection strategy in the Scheduler. The constructor of Scheduler now accepts a SchedulerStrategy object, which is used internally by method Scheduler::select() to drive the instruction selection process. The goal of this patch is to enable the definition of custom selection strategies while reusing the same algorithms implemented by class Scheduler. The motivation is that, on some targets, the default strategy may not well approximate the selection logic in the hardware schedulers. This patch also adds the ability to pass a ResourceManager object to the constructor of Scheduler. This gives a bit more flexibility to the design, and potentially it allows to expose processor resources to SchedulerStrategy objects. Differential Revision: https://reviews.llvm.org/D51051 llvm-svn: 340314	2018-08-21 18:20:16 +00:00
Simon Pilgrim	9848e0c9ac	[X86][SSE] Add non-uniform udiv test that is mostly divide by 1. The test demonstrates over-complicated codegen for a udiv that only has one divisor that doesn't equal 1. This should have allowed the codegen to be a lot simpler (uniform shifts etc.) but only the SSE2 manages to make use of this...... llvm-svn: 340313	2018-08-21 18:02:28 +00:00
Philip Reames	c3c23e8cf2	[AST] Remove notion of volatile from alias sets [NFCI] Volatility is not an aliasing property. We used to model volatile as if it had extremely conservative aliasing implications, but that hasn't been true for several years now. So, it doesn't make sense to be in AliasSet. It also turns out the code is entirely a noop. Outside of the AST code to update it, there was only one user: load store promotion in LICM. L/S promotion doesn't need the check since it walks all the users of the address anyway. It already checks each load or store via !isUnordered which causes us to bail for volatile accesses. (Look at the lines immediately following the two remove asserts.) There is the possibility of some small compile time impact here, but the only case which will get noticeably slower is a loop with a large number of loads and stores to the same address where only the last one we inspect is volatile. This is sufficiently rare it's not worth optimizing for.. llvm-svn: 340312	2018-08-21 17:59:11 +00:00
Yury Delendik	132fc5a861	Update DBG_VALUE register operand during LiveInterval operations Summary: Handling of DBG_VALUE in ConnectedVNInfoEqClasses::Distribute() was fixed in PR16110. However DBG_VALUE register operands are not getting updated. This patch properly resolves the value location. Reviewers: MatzeB, vsk Reviewed By: MatzeB Subscribers: kparzysz, thegameg, vsk, MatzeB, dschuff, sbc100, jgravelle-google, aheejin, sunfish, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D48994 llvm-svn: 340310	2018-08-21 17:48:28 +00:00
Aditya Nandakumar	c0333f7184	Revert "Revert rr340111 "[GISel]: Add Legalization/lowering code for bit counting operations"" This reverts commit d1341152d91398e9a882ba2ee924147ea2f9b589. This patch originally made use of Nested MachineIRBuilder buildInstr calls, and since order of argument processing is not well defined, the instructions were built slightly in a different order (still correct). I've removed the nested buildInstr calls to have a defined order now. Patch was tested by Mikael. llvm-svn: 340309	2018-08-21 17:30:31 +00:00
Simon Pilgrim	50eba6b380	[X86][SSE] Lower vXi8 general shifts to SSE shifts directly. NFCI. Most of these shifts are extended to vXi16 so we don't gain anything from forcing another round of generic shift lowering - we know these extended cases are legal constant splat shifts. llvm-svn: 340307	2018-08-21 17:27:03 +00:00
Peter Collingbourne	7d1790868f	llvm-readobj: Simplify. NFCI. llvm-svn: 340305	2018-08-21 17:18:18 +00:00
Craig Topper	b172b8884a	[BypassSlowDivision] Teach bypass slow division not to interfere with div by constant where constants have been constant hoisted, but not moved from their basic block DAGCombiner doesn't pay attention to whether constants are opaque before doing the div by constant optimization. So BypassSlowDivision shouldn't introduce control flow that would make DAGCombiner unable to see an opaque constant. This can occur when a div and rem of the same constant are used in the same basic block. it will be hoisted, but not leave the block. Longer term we probably need to look into the X86 immediate cost model used by constant hoisting and maybe not mark div/rem immediates for hoisting at all. This fixes the case from PR38649. Differential Revision: https://reviews.llvm.org/D51000 llvm-svn: 340303	2018-08-21 17:15:33 +00:00
Simon Pilgrim	98eb4ae499	[X86][SSE] Lower v8i16 general shifts to SSE shifts directly. NFCI. We don't gain anything from forcing another round of generic shift lowering - we know these are legal constant splat shifts. llvm-svn: 340302	2018-08-21 17:05:07 +00:00
Simon Pilgrim	dbe4e9e3ff	[X86][SSE] Lower directly to SSE shifts in the BLEND(SHIFT, SHIFT) combine. NFCI. We don't gain anything from forcing another round of generic shift lowering - we know these are legal constant splat shifts. llvm-svn: 340300	2018-08-21 16:46:48 +00:00
Matt Arsenault	182bab8d1e	Try to fix bot build failure llvm-svn: 340296	2018-08-21 16:24:56 +00:00
Farhana Aleen	3528c80378	[AMDGPU] Support idot2 pattern. Summary: Transform add (mul ((i32)S0.x, (i32)S1.x), add( mul ((i32)S0.y, (i32)S1.y), (i32)S3) => i/udot2((v2i16)S0, (v2i16)S1, (i32)S3) Author: FarhanaAleen Reviewed By: arsenm Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D50024 llvm-svn: 340295	2018-08-21 16:21:15 +00:00
Matt Arsenault	7dd9d58c66	AMDGPU: Partially move target handling code from clang to TargetParser A future change in clang necessitates access of this information from the driver, so move this into a common place. Try to mimic something resembling the API the other targets are using here. One thing I'm uncertain about is how to split amdgcn and r600 handling. Here I've mostly duplicated the functions for each, while keeping the same enums. I think this is a bit awkward for the features which don't matter for amdgcn. It's also a bit messy that this isn't a complete set of subtarget features. This is just the minimum set needed for the driver code. For example building the list of subtarget feature names is still in clang. llvm-svn: 340291	2018-08-21 16:13:01 +00:00
Simon Pilgrim	5a83a1fd13	[X86][SSE] Add helper function to convert to/between the SSE vector shift opcodes. NFCI. Also remove some more getOpcode calls from LowerShift when we already have Opc. llvm-svn: 340290	2018-08-21 15:57:33 +00:00
Daniel Sanders	6a943fb16a	[aarch64][mc] Don't lookup symbols when there is no symbol lookup callback Summary: When run under llvm-mc-disassemble-fuzzer, there is no symbol lookup callback so tryAddingSymbolicOperand() must fail gracefully instead of crashing Reviewers: aemerson, javed.absar Reviewed By: aemerson Subscribers: lhames, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D51005 llvm-svn: 340287	2018-08-21 15:47:25 +00:00
Nicola Zaghen	8a012cbabf	[InstCombine] Add new tests for icmp ugt/ult (add nuw X, C2), C Differential Revision: https://reviews.llvm.org/D51040 llvm-svn: 340284	2018-08-21 15:27:32 +00:00
Simon Pilgrim	43cf2c20ab	[X86] Add SSE2 and XOP udiv combine tests llvm-svn: 340282	2018-08-21 15:21:45 +00:00
Sanjay Patel	f3ae9cc33e	[InstSimplify] use isKnownNeverNaN to fold more fcmp ord/uno Remove duplicate tests from InstCombine that were added with D50582. I left negative tests there to verify that nothing in InstCombine tries to go overboard. If isKnownNeverNaN is improved to handle the FP binops or other cases, we should have coverage under InstSimplify, so we could remove more duplicate tests from InstCombine at that time. llvm-svn: 340279	2018-08-21 14:45:13 +00:00
Anna Thomas	b02b0ad8c7	[LV] Vectorize loops where non-phi instructions used outside loop Summary: Follow up change to rL339703, where we now vectorize loops with non-phi instructions used outside the loop. Note that the cyclic dependency identification occurs when identifying reduction/induction vars. We also need to identify that we do not allow users where the PSCEV information within and outside the loop are different. This was the fix added in rL307837 for PR33706. Reviewers: Ayal, mkuper, fhahn Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D50778 llvm-svn: 340278	2018-08-21 14:40:27 +00:00
Sanjay Patel	6bb09a4291	[InstSimplify] add tests for FP uno/ord with nnan; NFC This is a slight modification of the tests from D50582; change half of the predicates to 'uno' so we have coverage for that side too. All of the positive tests can fold to a constant (true/false), so that should happen in instsimplify. llvm-svn: 340276	2018-08-21 13:33:13 +00:00
Anna Thomas	2d33ce7701	NFC: Add loop vectorizer tests showing various control flow within loop that skip iterations llvm-svn: 340275	2018-08-21 13:02:09 +00:00
Andrea Di Biagio	5001f90b01	[llvm-mca] Replace use of llvm::any_of with std::any_of. This should unbreak the buildbots. llvm-svn: 340274	2018-08-21 13:00:44 +00:00
Andrea Di Biagio	5184995f9b	[llvm-mca] Add method cycleEvent() to class Scheduler. NFCI The goal of this patch is to simplify the Scheduler's interface in preparation for D50929. Some methods in the Scheduler's interface should not be exposed to external users, since their presence makes it hard to both understand, and extend the Scheduler's interface. This patch removes the following two methods from the public Scheduler's API: - reclaimSimulatedResources() - updatePendingQueue() Their logic has been migrated to a new method named 'cycleEvent()'. Methods 'updateIssuedSet()' and 'promoteToReadySet()' still exist. However, they are now private members of class Scheduler. This simplifies the interaction with the Scheduler from the ExecuteStage. llvm-svn: 340273	2018-08-21 12:40:15 +00:00
Tim Renouf	bb5ee41ab4	[AMDGPU] Allow int types for MUBUF vdata Summary: Previously the new llvm.amdgcn.raw/struct.buffer.load/store intrinsics only allowed float types for the data to be loaded or stored, which sometimes meant the frontend needed to generate a bitcast. In this, the new intrinsics copied the old buffer intrinsics. This commit extends the new intrinsics to allow int types as well. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D50315 Change-Id: I8202af2d036455553681dcbb3d7d32ae273f8f85 llvm-svn: 340270	2018-08-21 11:08:12 +00:00
Tim Renouf	4f703f5e11	[AMDGPU] New buffer intrinsics Summary: This commit adds new intrinsics llvm.amdgcn.raw.buffer.load llvm.amdgcn.raw.buffer.load.format llvm.amdgcn.raw.buffer.load.format.d16 llvm.amdgcn.struct.buffer.load llvm.amdgcn.struct.buffer.load.format llvm.amdgcn.struct.buffer.load.format.d16 llvm.amdgcn.raw.buffer.store llvm.amdgcn.raw.buffer.store.format llvm.amdgcn.raw.buffer.store.format.d16 llvm.amdgcn.struct.buffer.store llvm.amdgcn.struct.buffer.store.format llvm.amdgcn.struct.buffer.store.format.d16 llvm.amdgcn.raw.buffer.atomic.* llvm.amdgcn.struct.buffer.atomic.* with the following changes from the llvm.amdgcn.buffer.* intrinsics: * there are separate raw and struct versions: raw does not have an index arg and sets idxen=0 in the instruction, and struct always sets idxen=1 in the instruction even if the index is 0, to allow for the fact that gfx9 does bounds checking differently depending on whether idxen is set; * there is a combined cachepolicy arg (glc+slc) * there are now only two offset args: one for the offset that is included in bounds checking and swizzling, to be split between the instruction's voffset and immoffset fields, and one for the offset that is excluded from bounds checking and swizzling, to go into the instruction's soffset field. The AMDISD::BUFFER_* SD nodes always have an index operand, all three offset operands, combined cachepolicy operand, and an extra idxen operand. The obsolescent llvm.amdgcn.buffer.* intrinsics continue to work. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D50306 Change-Id: If897ea7dc34fcbf4d5496e98cc99a934f62fc205 llvm-svn: 340269	2018-08-21 11:07:10 +00:00
Tim Renouf	35484c9d50	[AMDGPU] New tbuffer intrinsics Summary: This commit adds new intrinsics llvm.amdgcn.raw.tbuffer.load llvm.amdgcn.struct.tbuffer.load llvm.amdgcn.raw.tbuffer.store llvm.amdgcn.struct.tbuffer.store with the following changes from the llvm.amdgcn.tbuffer.* intrinsics: * there are separate raw and struct versions: raw does not have an index arg and sets idxen=0 in the instruction, and struct always sets idxen=1 in the instruction even if the index is 0, to allow for the fact that gfx9 does bounds checking differently depending on whether idxen is set; * there is a combined format arg (dfmt+nfmt) * there is a combined cachepolicy arg (glc+slc) * there are now only two offset args: one for the offset that is included in bounds checking and swizzling, to be split between the instruction's voffset and immoffset fields, and one for the offset that is excluded from bounds checking and swizzling, to go into the instruction's soffset field. The AMDISD::TBUFFER_* SD nodes always have an index operand, all three offset operands, combined format operand, combined cachepolicy operand, and an extra idxen operand. The tbuffer pseudo- and real instructions now also have a combined format operand. The obsolescent llvm.amdgcn.tbuffer.* and llvm.SI.tbuffer.store intrinsics continue to work. V2: Separate raw and struct intrinsics. V3: Moved extract_glc and extract_slc defs to a more sensible place. V4: Rebased on D49995. V5: Only two separate offset args instead of three. V6: Pseudo- and real instructions have joint format operand. V7: Restored optionality of dfmt and nfmt in assembler. V8: Addressed minor review comments. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D49026 Change-Id: If22ad77e349fac3a5d2f72dda53c010377d470d4 llvm-svn: 340268	2018-08-21 11:06:05 +00:00
Bjorn Pettersson	d378a39603	Change how finalizeBundle selects debug location for the BUNDLE instruction Summary: Previously a BUNDLE instruction inherited the DebugLoc from the first instruction in the bundle, even if that DebugLoc had no DILocation. With this commit this is changed into selecting the first DebugLoc that has a DILocation, by searching among the bundled instructions. The idea is to reduce amount of bundles that are lacking debug locations. Reviewers: #debug-info, JDevlieghere Reviewed By: JDevlieghere Subscribers: JDevlieghere, mattd, llvm-commits Differential Revision: https://reviews.llvm.org/D50639 llvm-svn: 340267	2018-08-21 10:59:50 +00:00
Simon Pilgrim	8e15b43092	[X86] Add SSE2 sdiv combine tests llvm-svn: 340264	2018-08-21 10:44:06 +00:00
Sam Parker	597811e7a7	[DAGCombiner] Reduce load widths of shifted masks During combining, ReduceLoadWdith is used to combine AND nodes that mask loads into narrow loads. This patch allows the mask to be a shifted constant. This results in a narrow load which is then left shifted to compensate for the new offset. Differential Revision: https://reviews.llvm.org/D50432 llvm-svn: 340261	2018-08-21 10:26:59 +00:00
Simon Pilgrim	72b324de4d	[TargetLowering] Add BuildSDiv support for division by one or negone. This reduces most of the sdiv stages (the MULHS, shifts etc.) to just zero/identity values and use the numerator scale factor to multiply by +1/-1. llvm-svn: 340260	2018-08-21 10:20:36 +00:00
Petar Jovanovic	3b953c37f8	[MIPS GlobalISel] Select bitwise instructions Select bitwise instructions for i32. Patch by Petar Avramovic. Differential Revision: https://reviews.llvm.org/D50183 llvm-svn: 340258	2018-08-21 08:15:56 +00:00
Max Kazantsev	097ef69182	[LICM] Hoist guards with invariant conditions This patch teaches LICM to hoist guards from the loop if they are guaranteed to execute and if there are no side effects that could prevent that. Differential Revision: https://reviews.llvm.org/D50501 Reviewed By: reames llvm-svn: 340256	2018-08-21 08:11:31 +00:00
Bjorn Pettersson	880f291577	[RegisterCoalescer] Do not assert when trying to remat dead values Summary: RegisterCoalescer::reMaterializeTrivialDef used to assert that the input register was live in. But as shown by the new coalesce-dead-lanes.mir test case that seems to be a valid scenario. We now return false instead of the assert, simply avoiding to remat the dead def. Normally a COPY of an undef value is eliminated by eliminateUndefCopy(). Although we only do that when the destination isn't a physical register. So the situation above should be limited to the case when we copy an undef value to a physical register. Reviewers: kparzysz, wmi, tpr Reviewed By: kparzysz Subscribers: MatzeB, qcolombet, tpr, llvm-commits Differential Revision: https://reviews.llvm.org/D50842 llvm-svn: 340255	2018-08-21 07:49:05 +00:00
Max Kazantsev	f1dc867396	[NFC] Add some LICM tests llvm-svn: 340254	2018-08-21 07:37:02 +00:00
Kirill Bobyrev	e92a4b88d1	[llvm] NFC: Fix assert condition and suppress warning As mentioned by andreadb, assert condition is wrong and causes GCC warning. Related Revision: https://reviews.llvm.org/D50839 llvm-svn: 340252	2018-08-21 07:23:45 +00:00
Max Kazantsev	bfbd4d1fb6	[NFC] Factor out predecessors collection into a separate method It may be reused in a different piece of logic. Differential Revision: https://reviews.llvm.org/D50890 Reviewed By: reames llvm-svn: 340250	2018-08-21 07:15:06 +00:00
Serguei Katkov	09ab506798	[IR Verifier] Do not allow bitcast of pointer to vector of pointers and vice versa. LangRef for BitCast requires that "The bit sizes of value and the destination type, ty2, must be identical". Currently verifier allows BitCast of pointer to vector of pointers so that the sizes are different. This change fixes that. Reviewers: arsenm Reviewed By: arsenm Subscribers: llvm-commits, wdng Differential Revision: https://reviews.llvm.org/D50886 llvm-svn: 340249	2018-08-21 04:27:07 +00:00
Alex Langford	f37700506f	[docs] Fix a small typo in a debug info example llvm-svn: 340246	2018-08-21 01:43:03 +00:00
Philip Reames	a5a8546ac6	[AST] Mark invariant.starts as being readonly These intrinsics are modelled as writing for control flow purposes, but they don't actually write to any location. Marking these - as we did for guards - allows LICM to hoist loads out of loops containing invariant.starts. Differential Revision: https://reviews.llvm.org/D50861 llvm-svn: 340245	2018-08-21 00:55:35 +00:00
Philip Reames	578c64da0c	[LICM] Add tests from D50786 [NFC] Exercise more use of volatiles to illustrate that nothing changes as we tweak how we detect them. llvm-svn: 340244	2018-08-21 00:42:07 +00:00
Philip Reames	efdd0a426a	[LICM][NFC] Add tests from D50730 Landing tests so corresponding change can show effects clearly. see D50730 [AST] Generalize argument specific aliasing llvm-svn: 340243	2018-08-21 00:37:09 +00:00
Philip Reames	4009487e5c	[LICM] More tests for D50925 [NFC] This time, the corresponding cases where we can hoist (store-like) calls out of loops. llvm-svn: 340242	2018-08-21 00:14:14 +00:00
Fangrui Song	0e49ef9540	[llvm-objcopy] Simplify find(X,Y) != X.end() with is_contained() llvm-svn: 340241	2018-08-21 00:13:52 +00:00
Reid Kleckner	1d432ae284	Fix global_metadata_external_comdat.ll test llvm-svn: 340240	2018-08-21 00:03:21 +00:00
Zachary Turner	c175310a09	[MS Demangler] Demangle special operator 'dynamic initializer'. This is encoded as __E and should print something like "dynamic initializer for 'Foo'(void)" This also adds support for dynamic atexit destructor, which is basically identical but encoded as __F with slightly different description. llvm-svn: 340239	2018-08-20 23:59:21 +00:00
Zachary Turner	0002dd467d	[MS Demangler] Anonymous namespace hashes can be backreferenced. Previously we were not remembering the key values of anonymous namespaces, but we need to do this. llvm-svn: 340238	2018-08-20 23:58:58 +00:00
Zachary Turner	91c98a858c	[MS Demangler] Properly demangle anonymous namespaces. llvm-svn: 340237	2018-08-20 23:58:35 +00:00
Heejin Ahn	487992cc09	[WebAssembly] Revert type of wake count in atomic.wake to i32 Summary: We decided to revert this from i64 to i32 in Nov 28 CG meeting. Fixes PR38632. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D51010 llvm-svn: 340234	2018-08-20 23:49:29 +00:00
Philip Reames	529a590bce	[LICM][Tests] Add tests for store hoisting [NFC] https://reviews.llvm.org/D50925 will be rebased on top of this. llvm-svn: 340233	2018-08-20 23:37:59 +00:00
Reid Kleckner	85a8c12db8	Re-land r334313 "[asan] Instrument comdat globals on COFF targets" If we can use comdats, then we can make it so that the global metadata is thrown away if the prevailing definition of the global was uninstrumented. I have only tested this on COFF targets, but in theory, there is no reason that we cannot also do this for ELF. This will allow us to re-enable string merging with ASan on Windows, reducing the binary size cost of ASan on Windows. I tested this change with ASan+PGO, and I fixed an issue with the __llvm_profile_raw_version symbol. With the old version of my patch, we would attempt to instrument that symbol on ELF because it had a comdat with external linkage. If we had been using the linker GC-friendly metadata scheme, everything would have worked, but clang does not enable it by default. llvm-svn: 340232	2018-08-20 23:35:45 +00:00
Craig Topper	bee74793a3	[InstCombine] Add splat vector constant support to foldICmpAddOpConst. Differential Revision: https://reviews.llvm.org/D50946 llvm-svn: 340231	2018-08-20 23:04:25 +00:00
Heejin Ahn	c2c33c8e64	[WebAssembly] Remove an unused argument from writeSPToMemory (NFC) Reviewers: dschuff Subscribers: dschuff, sbc100, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D50933 llvm-svn: 340230	2018-08-20 23:02:15 +00:00
Fangrui Song	ffbc3e2576	[llvm-strip] Allow only one input Summary: Before, llvm-strip accepted a second argument but it would just be ignored. Reviewers: alexshap, jhenderson, paulsemel Reviewed By: alexshap Subscribers: jakehehrlich, rupprecht, llvm-commits Differential Revision: https://reviews.llvm.org/D51004 llvm-svn: 340229	2018-08-20 23:01:57 +00:00
Matt Davis	accb51152c	[llvm-mca] Remove unused formal parameter. NFC. llvm-svn: 340227	2018-08-20 22:41:27 +00:00
Michael Berg	0b838deddc	extend binop folds for selects to include true and false binops flag intersection Summary: This change address bug 38641 Reviewers: spatel, wristow Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D50996 llvm-svn: 340222	2018-08-20 22:26:58 +00:00
Zachary Turner	030ad37ef4	[llvm-objdump] Add ability to demangle COFF symbols. llvm-svn: 340221	2018-08-20 22:18:21 +00:00
Craig Topper	9c57ba0dc3	[X86] Add test command line to expose PR38649. Bypass slow division and constant hoisting are conspiring to break div+rem of large constants. llvm-svn: 340217	2018-08-20 21:51:35 +00:00
Craig Topper	210ccfe3db	[X86] Prevent lowerVectorShuffleByMerging128BitLanes from creating cycles Due to some splat handling code in getVectorShuffle, its possible for NewV1/NewV2 to have their mask modified from what is requested. This can lead to cycles being created in the DAG. This patch examines the returned mask and makes sure its different. Long term we may need to look closer at that splat code in getVectorShuffle, or add more splat awareness to getVectorShuffle. Fixes PR38639 Differential Revision: https://reviews.llvm.org/D50981 llvm-svn: 340214	2018-08-20 21:08:35 +00:00
Craig Topper	7dcb2c4b0a	[X86] Teach combineTruncatedArithmetic to handle some cases of ISD::SUB We can safely avoid interfering with the subus combine if both inputs are freely truncatable. Either both extends, or an extend and a constant vector. Differential Revision: https://reviews.llvm.org/D50878 llvm-svn: 340212	2018-08-20 20:57:35 +00:00
Craig Topper	08e7e04998	[X86] Pre-commit test cases for D50878. llvm-svn: 340211	2018-08-20 20:57:32 +00:00
Craig Topper	4ee28412a5	[LegacyPassManager] Remove analysis P from AnUsageMap before deleting it in schedulePass. If we deem the analysis pass useless and delete it, we need to make sure we remove it from AnUsageMap. Otherwise we might allocate another pass in the freed memory. This will cause us to reuse the AnalysisUsage from the original pass instead of the new one. Fixes PR38511 Differential Revision: https://reviews.llvm.org/D50573 llvm-svn: 340210	2018-08-20 20:57:30 +00:00
Krzysztof Parzyszek	cc3f630252	Consistently use MemoryLocation::UnknownSize to indicate unknown access size 1. Change the software pipeliner to use unknown size instead of dropping memory operands. It used to do it before, but MachineInstr::mayAlias did not handle it correctly. 2. Recognize UnknownSize in MachineInstr::mayAlias. 3. Print and parse UnknownSize in MIR. Differential Revision: https://reviews.llvm.org/D50339 llvm-svn: 340208	2018-08-20 20:37:57 +00:00
David Blaikie	a25e206973	Add missing include (<functional> for std::ref) llvm-svn: 340205	2018-08-20 20:02:29 +00:00
Richard Smith	8a57f2e012	Move Itanium demangler implementation into a header file and add visitation support. Summary: This transforms the Itanium demangler into a generic reusable library that can be used to build, traverse, and transform Itanium mangled name trees. This is in preparation for adding a canonicalizing demangler, which cannot live in the Demangle library for layering reasons. In order to keep the diffs simpler, this patch moves more code to the new header than is strictly necessary: in particular, all of the printLeft / printRight implementations can be moved to the implementation file. (And indeed we could make them non-virtual now if we wished, and remove the vptr from Node.) All nodes are now included in the Kind enumeration, rather than omitting some of the Expr nodes, and the three different floating-point literal node types now have distinct Kind values. As a proof of concept for the visitation / matching mechanism, this patch implements a Node dumping facility on top of it, replacing the prior mechanism that produced the pretty-printed output rather than a tree dump. Sample dump output: FunctionEncoding( NameType("int"), NameWithTemplateArgs( NestedName( NameWithTemplateArgs( NameType("A"), TemplateArgs( {NameType("B")})), NameType("f")), TemplateArgs( {NameType("int")})), {}, <null>, QualConst, FunctionRefQual::FrefQualLValue) As a next step, it would make sense to move the LLVM high-level interface to the demangler (the itaniumDemangler function and ItaniumPartialDemangler class) into the Support library, and implement them in terms of the Demangle library. This would allow the libc++abi demangler implementation to be an identical copy of the llvm Demangle library, and would allow the LLVM implementation to reuse LLVM components such as llvm::BumpPtrAllocator, but we'll need to decide how to coordinate that with the MS ABI demangler, so I'm not doing that in this patch. No functionality change intended other than the behavior of dump(). Reviewers: erik.pilkington, zturner, chandlerc, dlj Subscribers: aheejin, llvm-commits Differential Revision: https://reviews.llvm.org/D50930 llvm-svn: 340203	2018-08-20 19:44:01 +00:00
Vitaly Buka	30b5ed3eb7	Revert "AMDGPU: bump AS.MAX_COMMON_ADDRESS to 6 since 32-bit addr space" As it introduces out of bound access. This reverts commit r340172 and r340171 llvm-svn: 340202	2018-08-20 19:31:03 +00:00
Cameron McInally	94b9029be9	[FPEnv] Support constrained FREM intrinsic Differential Revision: https://reviews.llvm.org/D50975 llvm-svn: 340201	2018-08-20 19:28:56 +00:00
Marcello Maggioni	5ca4128b45	[PSV] Update API to be able to use TargetCustom without UB. getTargetCustom() requires values for "Kind" in the constructor that are not in the PSVKind enum. Passing a value that is not inside an enum as an argument to a constructor of the type of the enum is UB. Changing to the underlying type of the enum would solve the UB Differential Revision: https://reviews.llvm.org/D50909 llvm-svn: 340200	2018-08-20 19:23:45 +00:00
Zachary Turner	66555a7bed	[MS Demangler] Demangle member pointer template parameters. llvm-svn: 340199	2018-08-20 19:15:35 +00:00
Aditya Nandakumar	2a08285cf3	Revert "Revert r339977: [GISel]: Add Opcodes for a few LLVM Intrinsics" This reverts commit 7debc334e6421bb5251ef8f18e97166dfc7dd787. I missed updating legalizer-info-validation.mir as I had assertions turned off in my build and that specific test requires asserts. Fixed it now. llvm-svn: 340197	2018-08-20 18:43:19 +00:00
Simon Pilgrim	6ac905926f	[TargetLowering] Disable BuildSDiv division by one or negone. Fuzz tests have detected an issue, currently working on a fix. llvm-svn: 340195	2018-08-20 18:23:54 +00:00
Sanjay Patel	3ce999fa41	[ConstantFolding] improve folding of binops with vector undef operand A non-undef operand may still have undef constant elements, so we should always propagate the vector results per-lane. llvm-svn: 340194	2018-08-20 18:19:02 +00:00
Alina Sbirlea	b35af157c1	[MemorySSA] Update comment to better describe cfg change (NFC). llvm-svn: 340192	2018-08-20 18:15:02 +00:00
Sanjay Patel	7ff7bd9b3c	[ConstantFolding] add tests for binops on vectors with undef elements; NFC llvm-svn: 340190	2018-08-20 17:31:34 +00:00
Matt Arsenault	450fcc77a7	ValueTracking: Handle more instructions in isKnownNeverNaN llvm-svn: 340187	2018-08-20 16:51:00 +00:00
Reid Kleckner	918930adf9	Revert rr340111 "[GISel]: Add Legalization/lowering code for bit counting operations" It causes LegalizerHelperTest.LowerBitCountingCTTZ1 to fail. llvm-svn: 340186	2018-08-20 16:50:19 +00:00
Reid Kleckner	531319388d	Add cmake option to disable minidumps, default it to off Since crash dumping landed in r268519, May 2016, I have not once seen anyone use an uploaded minidump to debug a compiler crash. Therefore, I'm turning this off by default. The dumps clutter up user and buildbot temp directories. Each file is only about 56KB, but it adds up. In the context of clang, the extra line about the minidump confuses users, when what we really want from them is the pre-processed source code. llvm-svn: 340185	2018-08-20 16:49:54 +00:00
Sanjay Patel	5ae83a21b5	[InstCombine] add tests for insertelement+binop; NFC llvm-svn: 340184	2018-08-20 16:49:08 +00:00
Andrea Di Biagio	0875e759f0	[llvm-mca] Make the LSUnit a HardwareUnit, and allow derived classes to implement a different memory consistency model. The LSUnit is now a HardwareUnit, and it is owned by the mca::Context. Derived classes can now implement a different consistency model by overriding method `LSUnit::isReady()`. This patch also slightly refactors the Scheduler interface in the attempt to simplifying the interaction between ExecuteStage and the underlying Scheduler. llvm-svn: 340176	2018-08-20 14:41:36 +00:00
Simon Pilgrim	1a00042270	[SelectionDAG] Reuse the Op's VT. NFCI. llvm-svn: 340173	2018-08-20 13:44:03 +00:00
Samuel Pitoiset	216a2da577	AMDGPU: fix compilation errors since r340171 Some buildbot slaves reports compilation errors, but it compiled fine on my side, sorry for the breakage. llvm-svn: 340172	2018-08-20 13:31:41 +00:00
Samuel Pitoiset	c95ef77d37	AMDGPU: bump AS.MAX_COMMON_ADDRESS to 6 since 32-bit addr space 32-bit constant address space is declared as 6, so the maximum number of address spaces is 6, not 5. Fixes "LLVM ERROR: Pointer address space out of range". v3: use static_assert() v2: add a very simple test for 32-bit addr space Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106630 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> llvm-svn: 340171	2018-08-20 13:18:59 +00:00
Haojian Wu	54829bb3ff	Fix an undefined behavior when storing an empty StringRef. Summary: Passing a nullptr to memcpy is UB. Reviewers: ioeric Subscribers: llvm-commits, cfe-commits Differential Revision: https://reviews.llvm.org/D50966 llvm-svn: 340170	2018-08-20 13:12:54 +00:00
Simon Pilgrim	5b78c9d58d	[SelectionDAG] Add partial sign-bit support to ComputeNumSignBits for BITCAST nodes Only adds support to the existing 'large element' scalar/vector to 'small element' vector bitcasts. Handle the case where the sign bit extends to only part of the small elements. llvm-svn: 340169	2018-08-20 13:05:48 +00:00
Simon Pilgrim	11bec5b80c	[X86][SSE] Fix PACKSS bitcast test from rL340166 We need the signbits to extends to lower 16-bits of the even elements llvm-svn: 340167	2018-08-20 11:47:15 +00:00
Simon Pilgrim	cee9c64838	[X86][SSE] Add PACKSS test showing ComputeNumSignBits failure to handle a partial sign bits extension through a bitcast llvm-svn: 340166	2018-08-20 11:10:12 +00:00
Simon Pilgrim	686090a45f	[X86] Drop unnecessary exact qualifier from packss test llvm-svn: 340165	2018-08-20 11:01:51 +00:00
Victor Leschuk	cba595da82	[DWARF] Refactor DWARF classes to use unified error reporting. NFC. DWARF-related classes in lib/DebugInfo/DWARF contained duplicating code for creating StringError instances, like: template <typename... Ts> static Error createError(char const *Fmt, const Ts &... Vals) { std::string Buffer; raw_string_ostream Stream(Buffer); Stream << format(Fmt, Vals...); return make_error<StringError>(Stream.str(), inconvertibleErrorCode()); } Similar function was placed in Support lib in https://reviews.llvm.org/D49824 This revision makes DWARF classes use this function instead of their local implementation of it. Reviewers: aprantl, dblaikie, probinson, wolfgangp, JDevlieghere, jhenderson Reviewed By: JDevlieghere, jhenderson Differential Revision: https://reviews.llvm.org/D49964 llvm-svn: 340163	2018-08-20 09:59:08 +00:00
Simon Pilgrim	bbd2d15d45	Use LLVM_BUILTIN_TRAP not __builtin_trap to appease windows builds. NFCI. llvm-svn: 340162	2018-08-20 09:49:20 +00:00
Sander de Smalen	07db432265	[AArch64][SVE] Asm: Add SVE System registers This patch adds system registers for controlling aspects of SVE: - ZCR_EL1 (r/w) visible at EL1 and EL0. - ZCR_EL2 (r/w) visible at EL2 and Non-secure EL1 and EL0. - ZCR_EL3 (r/w) visible at all exception levels. and a system register identifying SVE: - ID_AA64ZFR0_EL1 (r) SVE Feature identifier. Reviewers: SjoerdMeijer, samparker, pbarrio, fhahn, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D50885 llvm-svn: 340158	2018-08-20 09:16:59 +00:00
Kirill Bobyrev	5f26a642e6	[llvm] Make YAML serialization up to 2.5 times faster This patch significantly improves performance of the YAML serializer by optimizing `YAML::isNumeric` function. This function is called on the most strings and is highly inefficient for two reasons: * It uses `Regex`, which is parsed and compiled each time this function is called * It uses multiple passes which are not necessary This patch introduces stateful ad hoc YAML number parser which does not rely on `Regex`. It also fixes YAML number format inconsistency: current implementation supports C-stile octal number format (`01234567`) which was present in YAML 1.0 specialization (http://yaml.org/spec/1.0/), [Section 2.4. Tags, Example 2.19] but was deprecated and is no longer present in latest YAML 1.2 specification (http://yaml.org/spec/1.2/spec.html), see [Section 10.3.2. Tag Resolution]. Since the rest of the rest of the implementation does not support other deprecated YAML 1.0 numeric features such as sexagecimal numbers, commas as delimiters it is treated as inconsistency and not longer supported. This patch also adds unit tests to ensure the validity of proposed implementation. This performance bottleneck was identified while profiling Clangd's global-symbol-builder tool with my colleague @ilya-biryukov. The substantial part of the runtime was spent during a single-thread Reduce phase, which concludes with YAML serialization of collected symbol collection. Regex matching was accountable for approximately 45% of the whole runtime (which involves sharded Map phase), now it is reduced to 18% (which is spent in `clang::clangd::CanonicalIncludes` and can be also optimized because all used regexes are in fact either suffix matches or exact matches). `llvm-yaml-numeric-parser-fuzzer` was used to ensure the validity of the proposed regex replacement. Fuzzing for ~60 hours using 10 threads did not expose any bugs. Benchmarking `global-symbol-builder` (using `hyperfine --warmup 2 --min-runs 5 'command 1' 'command 2'`) tool by processing a reasonable amount of code (26 source files matched by `clang-tools-extra/clangd/*.cpp` with all transitive includes) confirmed our understanding of the performance bottleneck nature as it speeds up the command by the factor of 1.6x: \| Command \| Mean [s] \| Min…Max [s] \| \| this patch (D50839) \| 84.7 ± 0.6 \| 83.3…84.7 \| \| master (rL339849) \| 133.1 ± 0.8 \| 132.4…134.6 \| Using smaller samples (e.g. by collecting symbols from `clang-tools-extra/clangd/AST.cpp` only) yields even better performance improvement, which is expected because Map phase takes less time compared to Reduce and is 2.05x faster and therefore would significantly improve the performance of standalone YAML serializations. \| Command \| Mean [ms] \| Min…Max [ms] \| \| this patch (D50839) \| 3702.2 ± 48.7 \| 3635.1…3752.3 \| \| master (rL339849) \| 7607.6 ± 109.5 \| 7533.3…7796.4 \| Reviewed by: zturner, ilya-biryukov Differential revision: https://reviews.llvm.org/D50839 llvm-svn: 340154	2018-08-20 07:00:36 +00:00
Justin Bogner	6f1740d52f	[SimplifyCFG] Replace some uses of bitwise or with logical or It's clearer to use logical or for boolean values. Thanks to Steven Zhang for noticing! llvm-svn: 340153	2018-08-20 06:37:11 +00:00
Craig Topper	24674ca773	[InstCombine] Move some variable declarations into a more appropriate scope. NFC llvm-svn: 340150	2018-08-20 05:35:12 +00:00
QingShan Zhang	f8f9af7ba5	[PowerPC] Add a peephole post RA to transform the inst that fed by add If the arch is P8, we will select XFLOAD to load the floating point, and then, expand it to vsx and non-vsx X-form instruction post RA. This patch is trying to convert the X-form to D-form if it meets the requirement that one operand of the x-form inst is the special Zero register, and another operand fed by add inst. i.e. y = add imm, reg LFDX. 0, y --> LFD imm(reg) Reviewers: Nemanjai Differential Revision: https://reviews.llvm.org/D49007 llvm-svn: 340149	2018-08-20 02:52:55 +00:00
whitequark	fdca0c6d2e	[bindings/go] Add coroutine passes Add Go bindings for CoroEarly, CoroSplit, CoroElide and CoroCleanup. Differential Revision: https://reviews.llvm.org/D50951 llvm-svn: 340148	2018-08-19 23:40:05 +00:00
whitequark	c438ac2352	[LLVM-C] Add coroutine passes Differential Revision: https://reviews.llvm.org/D50950 llvm-svn: 340147	2018-08-19 23:39:57 +00:00
whitequark	b56a4d3149	[C-API][DIBuilder] Added DIFlags in LLVMDIBuilderCreateBasicType Added DIFlags in LLVMDIBuilderCreateBasicType to add optional DWARF attributes, such as DW_AT_endianity. Patch by Chirag Patel. Differential Revision: https://reviews.llvm.org/D50832 llvm-svn: 340146	2018-08-19 23:39:47 +00:00
Craig Topper	5f695cc1e9	[InstCombine] Add test cases for an icmp combine that is missing support for splat vector constants. llvm-svn: 340144	2018-08-19 18:03:34 +00:00
Simon Pilgrim	5b936ec89e	[SelectionDAG] Add basic demanded elements support to ComputeNumSignBits for BITCAST nodes Only adds support to the existing 'large element' scalar/vector to 'small element' vector bitcasts. The next step would be to support cases where the large elements aren't all sign bits, and determine the small element equivalent based on the demanded elements. llvm-svn: 340143	2018-08-19 17:47:50 +00:00
Simon Pilgrim	0fd72ab44f	[X86][SSE] Add PACKSS test showing ComputeNumSignBits failure to handle demanded elts through a bitcast llvm-svn: 340139	2018-08-19 16:01:47 +00:00
Craig Topper	803912ea57	[X86] Fix an issue in the matching for ADDUS. We were basically assuming only one operand of the compare could be an ADD node and using that to swap operands. But we can have a normal add followed by a saturing add. This rewrites the canonicalization to just be based on the condition code. llvm-svn: 340134	2018-08-19 04:26:31 +00:00
Craig Topper	a85d7e927b	[X86] Add a test case showing an issue in our addusw pattern matching. We are unable to handle a normal add followed by a saturing add with certain operand orders on the icmp. llvm-svn: 340133	2018-08-19 04:26:29 +00:00
Aditya Kumar	6373f5ddee	Updating MergeFunctions.rst Improving readability, removing redundant contents. Reviewers: hiraditya Differential Revision: https://reviews.llvm.org/D50686 llvm-svn: 340131	2018-08-18 20:17:19 +00:00
Craig Topper	2b03df9b05	[X86] Use SDValue::operator== instead of DAG.isEqualTo in strictly integer matching. isEqualTo is more useful for floating point. operator== is sufficient for integer. llvm-svn: 340130	2018-08-18 19:16:56 +00:00
Craig Topper	3e299d896f	[X86] Simplify the PADDUS legality check in combineSelect to match PSUBUS. NFC While there remove some trailing whitespace. llvm-svn: 340129	2018-08-18 18:51:04 +00:00
Craig Topper	40c9559b74	[X86] Add support for using 512-bit PSUBUS to combineSelect. The code already support 128 and 256 and even knows to split 256 for AVX1. So we really just needed to stop looking for specific VTs and subtarget features and just look for legal VTs with i8/i16 elements. While there, add some curly braces around outer if statement bodies that contain only another if. It makes all the closing curly braces look more regular. llvm-svn: 340128	2018-08-18 18:51:03 +00:00
Craig Topper	b40a1d5f84	[X86] Add test cases to show missed opportunities to use 512-bit PSUBUS. llvm-svn: 340127	2018-08-18 18:50:59 +00:00
Zachary Turner	d9e925fca4	[MS Demangler] Resolve backreferences eagerly, not lazily. A while back I submitted a patch to resolve backreferences lazily, thinking this that it was not always possible to know in advance what type you were looking at until you had completed a full pass over the input, and therefore it would be impossible to resolve backreferences eagerly. This was mistaken though, and turned out to be an unrelated problem. In fact, the reverse is true. You must resolve backreferences eagerly. This is because certain types of nested mangled symbols do not share a backreference context with their parent symbol, and as such, if you try to resolve them lazily their backreference context will have been lost by the time you finish demangling the entire input. On the other hand, resolving them eagerly appears to always work, and enables us to port many more tests over. llvm-svn: 340126	2018-08-18 18:49:48 +00:00
Lang Hames	8e296229b5	[RuntimeDyld] Fix a bug in RuntimeDyld::loadObjectImpl that was over-allocating space for common symbols. Patch by Dmitry Sidorov. Thanks Dmitry! Differential revision: https://reviews.llvm.org/D50240 llvm-svn: 340125	2018-08-18 18:38:37 +00:00
Simon Pilgrim	9c1761a6fd	[X86] Replace all single match schedule class instregexs with instrs entries Helps reduce cost of instrw collection llvm-svn: 340124	2018-08-18 18:04:29 +00:00
Simon Pilgrim	ebfd6ebba7	[X86] Merge shift/rotate schedule class instregexs Helps reduce cost of instrw collection llvm-svn: 340123	2018-08-18 15:58:19 +00:00
Hsiangkai Wang	68c706ceb7	[DebugInfo] In FastISel, convert llvm.dbg.label to DBG_LABEL MI. Convert llvm.dbg.label(!label_metadata) to DBG_LABEL !label_metadata. Differential Revision: https://reviews.llvm.org/D50622 llvm-svn: 340122	2018-08-18 14:55:34 +00:00
Craig Topper	911efbb926	[X86] Add a signed test case for PR38622. Use nounwind to reduce the output on the unsigned test case. llvm-svn: 340121	2018-08-18 06:00:16 +00:00
Craig Topper	cc5dbbf759	[DAGCombiner] Allow divide by constant optimization on opaque constants. Summary: I believe this restores the behavior we had before r339147. Fixes PR38622. Reviewers: RKSimon, chandlerc, spatel Reviewed By: chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50936 llvm-svn: 340120	2018-08-18 05:52:42 +00:00
Zachary Turner	bc94ae437f	Add the extended XMM registers mappings for AVX-512. After this we should have the entire AVX-512 register set mapping in place. llvm-svn: 340118	2018-08-18 03:54:16 +00:00
Lang Hames	6ac2be0a98	[ORC] Fix some parameter names. NFC. llvm-svn: 340116	2018-08-18 02:48:02 +00:00
Lang Hames	76e21c9792	[ORC] Rename 'finalize' to 'emit' to avoid potential confusion. An emitted symbol has had its contents written and its memory protections applied, but it is not automatically ready to execute. Prior to ORC supporting concurrent compilation, the term "finalized" could be interpreted two different (but effectively equivalent) ways: (1) The finalized symbol's contents have been written and its memory protections applied, and (2) the symbol is ready to run. Now that ORC supports concurrent compilation, sense (1) no longer implies sense (2). We have already introduced a new term, 'ready', to capture sense (2), so rename sense (1) to 'emitted' to avoid any lingering confusion. llvm-svn: 340115	2018-08-18 02:06:18 +00:00
Peter Collingbourne	5b4f8e10b5	MC: Remove dead code from WinCOFFObjectWriter.cpp. NFCI. Remove code for writing auxiliary symbols of type function definition and begin function. These types of symbols are associated with pre-CodeView debug info and we never emit them. llvm-svn: 340113	2018-08-18 00:54:46 +00:00
Aditya Nandakumar	59b2485ba2	[GISel]: Add Legalization/lowering code for bit counting operations https://reviews.llvm.org/D48847#inline-448257 Ported legalization expansions for CTLZ/CTTZ from DAG to GISel. Reviewed by rtereshin. llvm-svn: 340111	2018-08-18 00:01:54 +00:00
Philip Reames	96bc076c3a	[AST] Clarify printing of unknown size locations [NFC] Printing "unknown" is much more clear than an arbitrary large integer llvm-svn: 340108	2018-08-17 23:17:31 +00:00
Jordan Rupprecht	be8ebccaed	[llvm-objcopy] Implement -G/--keep-global-symbol(s). Summary: Port GNU Objcopy -G/--keep-global-symbol(s). This is slightly different than the already-implemented --globalize-symbol, which marks a symbol as global when copying. When --keep-global-symbol (alias -G) is used, only those symbols marked will stay global, and all other globals are demoted to local. (Also note that it doesn't promote a symbol to global). Additionally, there is a pluralized version of the flag --keep-global-symbols, which effectively applies --keep-global-symbol for every non-comment in a file. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50589 llvm-svn: 340105	2018-08-17 22:34:48 +00:00
George Burgess IV	04df67e268	[DebugCounters] don't do redundant map lookups; NFC llvm-svn: 340104	2018-08-17 22:34:04 +00:00
Philip Reames	26f6176f38	[AST][Tests] Clarify what each test is doing llvm-svn: 340100	2018-08-17 21:58:26 +00:00
Philip Reames	9e313167cf	[AST[Tests] Shorten tests using noalias params llvm-svn: 340099	2018-08-17 21:45:57 +00:00
Philip Reames	079c92e201	[AST] Add tests for argmemonly calls [NFC] First step towards building a test set to rebase D50730 on top of. Starting with clone of memtransfer tests, more to come. llvm-svn: 340095	2018-08-17 21:42:18 +00:00
Matt Arsenault	ea4b476a30	ValueTracking: Add tests for isKnownNeverNaN llvm-svn: 340090	2018-08-17 21:39:52 +00:00
Reid Kleckner	aa56bac652	[MC] Improve error message when a codeview register is unknown This is in MCRegisterInfo, we can print the actual register name easily. llvm-svn: 340089	2018-08-17 21:35:14 +00:00
Zachary Turner	4746aa7b8f	[MS Demangler] Properly print all thunk types. We were only printing the vtordisp thunk before as the previous patch was more aimed at getting special operators working, one of which was a thunk. This patch gets all thunk types to print properly, and adds a test for each one. llvm-svn: 340088	2018-08-17 21:32:07 +00:00
Craig Topper	62dd1b1b4f	[X86] Remove detectAddSubSatPattern. This was added very recently in r339650, but appears to be completely untested and has at least one bug in it. llvm-svn: 340086	2018-08-17 21:19:28 +00:00
Matt Arsenault	25e51540e1	DAG: Fix isKnownNeverNaN for basic non-sNaN cases fadd/fsub/fmul need to worry about infinities as well as fdiv. llvm-svn: 340085	2018-08-17 21:19:22 +00:00
Lang Hames	d5f56c5979	[ORC] Rename VSO to JITDylib. VSO was a little close to VDSO (an acronym on Linux for Virtual Dynamic Shared Object) for comfort. It also risks giving the impression that instances of this class could be shared between ExecutionSessions, which they can not. JITDylib seems moderately less confusing, while still hinting at how this class is intended to be used, i.e. as a JIT-compiled stand-in for a dynamic library (code that would have been a dynamic library if you had wanted to compile it ahead of time). llvm-svn: 340084	2018-08-17 21:18:18 +00:00
Zachary Turner	469f076356	[MS Demangler] Demangle all remaining types of operators. This demangles all remaining special operators including thunks, RTTI Descriptors, and local static guard variables. llvm-svn: 340083	2018-08-17 21:18:05 +00:00
Krzysztof Parzyszek	9937e205e8	[Hexagon] Remove unused functions from HexagonInstPrinter, NFC llvm-svn: 340081	2018-08-17 21:12:37 +00:00
Michael Kruse	b67e5d3f27	[AST] Adapt Polly to AnalysisSetTracker changes. NFC. The method AliasSetTracker::getAliasSetForPointer was removed and replaced by AliasSetTracker::getAliasSetFor for the restructuring in r339930. Since Polly uses AliasSetTracker::getAliasSetForPointer, a temporary fix has been committed in r339937 with a comment: Can someone from polly please migrate usage and then delete the wrapper? This commit is doing exactly that. llvm-svn: 340072	2018-08-17 19:31:41 +00:00
Jordan Rupprecht	bb179a197c	Fix windows buildbots by removing : from filenames llvm-svn: 340071	2018-08-17 19:18:20 +00:00
Jordan Rupprecht	cf67633e66	[llvm-objcopy] Add support for -I binary -B <arch>. Summary: The -I (--input-target) and -B (--binary-architecture) flags exist but are currently silently ignored. This adds support for -I binary for architectures i386, x86-64 (and alias i386:x86-64), arm, aarch64, sparc, and ppc (powerpc:common64). This is largely based on D41687. This is done by implementing an additional subclass of Reader, BinaryReader, which works by interpreting the input file as contents for .data field, sets up a synthetic header, and adds additional sections/symbols (e.g. _binary__tmp_data_txt_start). Reviewers: jakehehrlich, alexshap, jhenderson, javed.absar Reviewed By: jhenderson Subscribers: jyknight, nemanjai, kbarton, fedor.sergeev, jrtc27, kristof.beyls, paulsemel, llvm-commits Differential Revision: https://reviews.llvm.org/D50343 llvm-svn: 340070	2018-08-17 18:51:11 +00:00
Jun Lim	da5864c73c	Test commit I just removed a blank space. llvm-svn: 340069	2018-08-17 18:40:41 +00:00
Vedant Kumar	8cd64580b7	Remove a hardcoded address in test/DebugInfo/X86/vla-multi.ll This relaxes a test to make it less brittle. llvm-svn: 340068	2018-08-17 18:39:19 +00:00
Alina Sbirlea	b8ff3fff08	[IDF] Make GD const. llvm-svn: 340067	2018-08-17 18:37:15 +00:00
Matt Davis	06ac6af297	[llvm-mca] Reformat a few lines (fix spacing). NFC. llvm-svn: 340065	2018-08-17 18:06:01 +00:00
Reka Kovacs	5bce7f8b8f	[Support] NFC: Fix docstring in FileSystem.h. llvm-svn: 340063	2018-08-17 18:05:38 +00:00
Simon Pilgrim	2f48122cc9	[X86][SSE] Lower constant vXi8 ISD::SRL/ISD::SRA using PMULLW Extending the concept introduced in D49562, this patch lowers constant vXi8 ISD::SRL/ISD::SRA by zero/sign extending to vXi16 and using PMULLW and then truncating the high 8 bits of the result. Differential Revision: https://reviews.llvm.org/D50781 llvm-svn: 340062	2018-08-17 18:03:11 +00:00
Evandro Menezes	4b39010afb	[InstCombine] Refactor the simplification of pow() (NFC) Refactor all cases dealing with `exp{,2,10}()` into one function in preparation for D49273. Otherwise, NFC. llvm-svn: 340061	2018-08-17 17:59:53 +00:00
Evandro Menezes	e219d384f9	[NFC] Expand test cases for simplifying pow() In prepatration for the improvements that D49273 enables. llvm-svn: 340060	2018-08-17 17:59:38 +00:00
Craig Topper	730890dbdb	[X86] Use hasOneUse instead of isOnlyUserOf. NFCI isOnlyUserOf is a little heavier because it allows the node to be used multiple times by the other node. In this case we are looking at a truncate which only has one operand so we know it can only use it once. Thus hasOneUse is better. llvm-svn: 340059	2018-08-17 17:57:25 +00:00
Simon Pilgrim	2784a339ab	[TableGen] Don't separately search for DefaultMode when we're going to iterate the set anyway. NFCI. llvm-svn: 340055	2018-08-17 17:45:15 +00:00
Alina Sbirlea	0dfe830318	[IDF] Teach Iterated Dominance Frontier to use a snapshot CFG based on a GraphDiff. Summary: Create the ability to compute IDF using a CFG View. For this, we'll need a new DT created using a list of Updates (to be refactored later to a GraphDiff), and the GraphTraits based on the same GraphDiff. Reviewers: kuhar, george.burgess.iv, mzolotukhin Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D50675 llvm-svn: 340052	2018-08-17 17:39:15 +00:00
Teresa Johnson	cb9a82fc7b	[ThinLTO] Add option for printing import failure reasons Summary: Adds the option for the printing of summary information about functions considered but rejected for importing during the thin link. Reviewers: davidxl Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D50881 llvm-svn: 340047	2018-08-17 16:53:47 +00:00
Zachary Turner	3461bfaa9c	[MS Demangler] Rework the way operators are demangled. Previously, some of the code for actually parsing mangled operator names was more like formatting code in nature, and was interspersed with the demangling code which builds the AST. This means that by the time we got to the printing code, we had lost all information about what type of operator we had, and all we were left with was a string that we just had to print. However, not all operators are actually even operators. it's basically just a catch-all mangling for "special names", and for some of the other types it helps to know when we're actually doing the printing what it is. This patch changes the way things work by introducing an OperatorInfo structure and corresponding enumeration. When we demangle we store the enumeration value and demangled components separately. This gives more flexibility during printing. In doing so, some demanglings of special names which we didn't previously support come out of this for free, so we now demangle those. A few are more complex and are better left for a followup patch though. An exhaustive test of every possible operator code is included, with the ones that don't yet work commented out. llvm-svn: 340046	2018-08-17 16:14:05 +00:00
Simon Pilgrim	45e61c5f99	[TableGen] TypeInfer - Cache the legal types as TypeSetByHwMode We were just caching the MVT set of legal types, then every call creating a new TypeSetByHwMode with it and passing it back on the stack. There's no need to do this - we can create and cache the whole TypeSetByHwMode once and return a const reference to it each time. Additionally, TypeInfer::expandOverloads wasn't making use of the fact that the cache just contains a default mode containing all the types. Saves up to 30secs in debug builds of x86 -gen-dag-isel. Differential Revision: https://reviews.llvm.org/D50903 llvm-svn: 340042	2018-08-17 15:54:07 +00:00
Hsiangkai Wang	2532ac880a	[DebugInfo] Generate DWARF debug information for labels. (Fix leak problems) There are two forms for label debug information in DWARF format. 1. Labels in a non-inlined function: DW_TAG_label DW_AT_name DW_AT_decl_file DW_AT_decl_line DW_AT_low_pc 2. Labels in an inlined function: DW_TAG_label DW_AT_abstract_origin DW_AT_low_pc We will collect label information from DBG_LABEL. Before every DBG_LABEL, we will generate a temporary symbol to denote the location of the label. The symbol could be used to get DW_AT_low_pc afterwards. So, we create a mapping between 'inlined label' and DBG_LABEL MachineInstr in DebugHandlerBase. The DBG_LABEL in the mapping is used to query the symbol before it. The AbstractLabels in DwarfCompileUnit is used to process labels in inlined functions. We also keep a mapping between scope and labels in DwarfFile to help to generate correct tree structure of DIEs. It also generates label debug information under global isel. Differential Revision: https://reviews.llvm.org/D45556 llvm-svn: 340039	2018-08-17 15:22:04 +00:00
Stefan Pintilie	39869ccf51	[PowerPC] Generate lxsd instead of the ld->mtvsrd sequence for vector loads This patch addresses: - Implementation within PPCISelLowering.cpp to check if we should use direct load into vector instructions (such as lxsd/lfd ) when the scalar_to_vector function is used; which will allow us to catch as many cases of the scalar_to_vector uses as possible to translate the ld->mtvsrd sequence into lxsd. - Test cases to exhibit the behaviour of emitting lxsd/lfd. Patch by amyk Differential revision: https://reviews.llvm.org/D49698 llvm-svn: 340037	2018-08-17 15:15:26 +00:00
Andrea Di Biagio	163419f976	[llvm-mca] Removed references to HWStallEvent in Scheduler.h. NFCI class Scheduler should not know anything of hardware event listeners and hardware stall events (HWStallEvent). HWStallEvent objects should only be constructed by pipeline stages to notify listeners of hardware events. No functional change intended. llvm-svn: 340036	2018-08-17 15:01:37 +00:00
Francis Visoiu Mistrih	f006b491bd	[x86] Fix test breaking on Darwin after r339962 * -march=x86-64 -> -mtriple=x86_64-unknown-linux to avoid _ prefixes to symbols * add -start-before to avoid running the whole codegen on the IR. I assumed it is meant to be running after X86SpeculativeLoadHardening. llvm-svn: 340034	2018-08-17 14:47:01 +00:00
Francis Visoiu Mistrih	8bff832534	[X86] Fix liveness information when expanding X86::EH_SjLj_LongJmp64 test/CodeGen/X86/shadow-stack.ll has the following machine verifier errors: ``` * Bad machine code: Using a killed virtual register * - function: bar - basic block: %bb.6 entry (0x7fdc81857818) - instruction: %3:gr64 = MOV64rm killed %2:gr64, 1, $noreg, 8, $noreg - operand 1: killed %2:gr64 * Bad machine code: Using a killed virtual register * - function: bar - basic block: %bb.6 entry (0x7fdc81857818) - instruction: $rsp = MOV64rm killed %2:gr64, 1, $noreg, 16, $noreg - operand 1: killed %2:gr64 * Bad machine code: Virtual register killed in block, but needed live out. * - function: bar - basic block: %bb.2 entry (0x7fdc818574f8) Virtual register %2 is used after the block. ``` The fix here is to only copy the machine operand's register without the kill flags for all the instructions except the very last one of the sequence. I had to insert dummy PHIs in the test case to force the NoPHI function property to be set to false. More on this here: https://llvm.org/PR38439 Differential Revision: https://reviews.llvm.org/D50260 llvm-svn: 340033	2018-08-17 14:46:56 +00:00
Florian Hahn	9e50e915fa	[NewGVN] Add tests for r340031. llvm-svn: 340032	2018-08-17 14:39:53 +00:00
Florian Hahn	19f9e32f07	[InstrSimplify,NewGVN] Add option to ignore additional instr info when simplifying. NewGVN uses InstructionSimplify for simplifications of leaders of congruence classes. It is not guaranteed that the metadata or other flags/keywords (like nsw or exact) of the leader is available for all members in a congruence class, so we cannot use it for simplification. This patch adds a InstrInfoQuery struct with a boolean field UseInstrInfo (which defaults to true to keep the current behavior as default) and a set of helper methods to get metadata/keywords for a given instruction, if UseInstrInfo is true. The whole thing might need a better name, to avoid confusion with TargetInstrInfo but I am not sure what a better name would be. The current patch threads through InstrInfoQuery to the required places, which is messier then it would need to be, if InstructionSimplify and ValueTracking would share the same Query struct. The reason I added it as a separate struct is that it can be shared between InstructionSimplify and ValueTracking's query objects. Also, some places do not need a full query object, just the InstrInfoQuery. It also updates some interfaces that do not take a Query object, but a set of optional parameters to take an additional boolean UseInstrInfo. See https://bugs.llvm.org/show_bug.cgi?id=37540. Reviewers: dberlin, davide, efriedma, sebpop, hiraditya Reviewed By: hiraditya Differential Revision: https://reviews.llvm.org/D47143 llvm-svn: 340031	2018-08-17 14:39:04 +00:00
Krzysztof Parzyszek	39a979c838	[Hexagon] Expand vgather pseudos during packetization This will allow packetizing the vgather expansion with other instructions. llvm-svn: 340028	2018-08-17 14:24:24 +00:00
Alex Bradbury	3291f9aa81	[AtomicExpandPass] Widen partword atomicrmw or/xor/and before tryExpandAtomicRMW This patch performs a widening transformation of bitwise atomicrmw {or,xor,and} and applies it prior to tryExpandAtomicRMW. This operates similarly to convertCmpXchgToIntegerType. For these operations, the i8/i16 atomicrmw can be implemented in terms of the 32-bit atomicrmw by appropriately manipulating the operands. There is no functional change for the handling of partword or/xor, but the transformation for partword 'and' is new. The advantage of performing this transformation early is that the same code-path can be used regardless of the approach used to expand the atomicrmw (AtomicExpansionKind). i.e. the same logic is used for AtomicExpansionKind::CmpXchg and can also be used by the intrinsic-based expansion in D47882. Differential Revision: https://reviews.llvm.org/D48129 llvm-svn: 340027	2018-08-17 14:03:37 +00:00
Anna Thomas	1962621a7e	[LICM] Add a diagnostic analysis for identifying alias information Summary: Currently, in LICM, we use the alias set tracker to identify if the instruction (we're interested in hoisting) aliases with instruction that modifies that memory location. This patch adds an LICM alias analysis diagnostic tool that checks the mod ref info of the instruction we are interested in hoisting/sinking, with every instruction in the loop. Because of O(N^2) complexity this is now only a diagnostic tool to show the limitation we have with the alias set tracker and is OFF by default. Test cases show the difference with the diagnostic analysis tool, where we're able to hoist out loads and readonly + argmemonly calls from the loop, where the alias set tracker analysis is not able to hoist these instructions out. Reviewers: reames, mkazantsev, fedor.sergeev, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50854 llvm-svn: 340026	2018-08-17 13:44:00 +00:00
Roger Ferrer Ibanez	734a04ea33	[RISCV] Remove unused function This function is not virtual, it is private and it is not called anywhere. No regression is introduced by removing it. I think we can safely remove it. Differential Revision: https://reviews.llvm.org/D50836 llvm-svn: 340024	2018-08-17 13:40:03 +00:00
Sanjay Patel	411b86081e	[ConstantFolding] add simplifications for funnel shift intrinsics This is another step towards being able to canonicalize to the funnel shift intrinsics in IR (see D49242 for the initial patch). We should not have any loss of simplification power in IR between these and the equivalent IR constructs. Differential Revision: https://reviews.llvm.org/D50848 llvm-svn: 340022	2018-08-17 13:23:44 +00:00
Simon Pilgrim	16a2f54eee	[TableGen] TypeSetByHwMode::insert - cache the default MVT. NFCI. Avoids repeated count()/find() calls that we've already have the default values for. llvm-svn: 340020	2018-08-17 13:03:17 +00:00
Luke Cheeseman	64dcdec60c	[AArch64] - Generate pointer authentication instructions - Generate pointer authentication instructions - The functions instrumented depend on function attribtues: all (all functions instrumentent) non-leaf (only those that spill LR) none - Function epilogues sign the LR before spilling to the stack and authenticate the LR once restored - If the target is v8.3a or greater than can use the combined authenticate and return instruction Differential revision: https://reviews.llvm.org/D49793 llvm-svn: 340018	2018-08-17 12:53:22 +00:00
Nemanja Ivanovic	7d27251323	Revert extraneous directory added by accident in rL340016 It appears that the way this patch was produced ended up creating an extra 'llvm' directory where the test was placed. When I committed the patch, that directory ended up being created upstream. This commit should revert that. Sorry for the noise. llvm-svn: 340017	2018-08-17 12:41:49 +00:00
Nemanja Ivanovic	39751276b0	[PowerPC] Generate Power9 extswsli extend sign and shift immediate instruction Add a DAG combine for the PowerPC code generator to generate the Power9 extswsli extend sign and shift immediate instruction. Patch by RolandF. Differential revision: https://reviews.llvm.org/D49879 llvm-svn: 340016	2018-08-17 12:35:44 +00:00
Simon Pilgrim	03e57521c0	[DAGCombiner] extractShiftForRotate - fix out of range shift issue Don't just check for negative shift amounts. Fixes OSS Fuzz #9935 https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=9935 llvm-svn: 340015	2018-08-17 12:25:18 +00:00
Andrea Di Biagio	f874607f32	[InstCombine] Remove unused method FAddCombine::createFDiv(). NFC This commit fixes a (gcc 7.3.0) [-Wunused-function] warning caused by the presence of unused method FaddCombine::createFDiv(). The last use of that method was removed at r339519. llvm-svn: 340014	2018-08-17 11:33:48 +00:00
Bernard Ogden	b828bb2a15	[ARM/AArch64] Support FP16 +fp16fml instructions Add +fp16fml feature for new FP16 instructions, which are a mandatory part of FP16 from v8.4-A and an optional part of FP16 from v8.2-A. It doesn't seem to be possible to model this in LLVM, but the relationship between the options is handled by the related clang patch. In keeping with what I think is the usual practice, the fp16fml extension is accepted regardless of base architecture version. Builds on/replaces Sjoerd Meijer's patch to add these instructions at https://reviews.llvm.org/D49839. Differential Revision: https://reviews.llvm.org/D50228 llvm-svn: 340013	2018-08-17 11:29:49 +00:00
Bernard Ogden	6cb07d2bed	[ARM/AArch64] TargetParserTest fixes Adds some missing tests for the FP16 extension, fixes an existing test that misnames it. Differential Revision: https://reviews.llvm.org/D50227 llvm-svn: 340012	2018-08-17 11:26:57 +00:00
Simon Pilgrim	5113b48798	[DAGCombine] Improve (sra (sra x, c1), c2) -> (sra x, (add c1, c2)) folding Add support for cases where only some c1+c2 results exceed the max bitshift, clamping accordingly. Differential Revision: https://reviews.llvm.org/D35722 llvm-svn: 340010	2018-08-17 10:52:49 +00:00
Daniel Cederman	0c597ca223	[Sparc] Get sret arg size from CallLoweringInfo.getArgs() Summary: Looking at the callee argument list, as is done now, might not work if the function has been typecasted into one that is expected to return a struct. This change also simplifies the code. The isFP128ABICall() function can be removed as it is no longer needed. The test in fp128.ll has been updated to verify this. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D48117 llvm-svn: 340008	2018-08-17 10:40:00 +00:00
Simon Pilgrim	22d580f2ca	Fix "control reaches end of non-void function" -Wreturn-type warning. NFCI. llvm-svn: 340006	2018-08-17 09:47:52 +00:00
Daniel Cederman	7d3e08ff8d	[Sparc] Flush register windows for @llvm.returnaddress(1) Summary: When @llvm.returnaddress is called with a value higher than 0 it needs to read from the call stack to get the return address. This means that the register windows needs to be flushed to the stack to guarantee that the data read is valid. For values higher than 1 this is done indirectly by the call to getFRAMEADDR(), but not for the value 1. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D48636 llvm-svn: 340003	2018-08-17 09:18:31 +00:00
Chen Zheng	e2d47dd1bb	[MISC]Fix wrong usage of std::equal() Differential Revision: https://reviews.llvm.org/D49958 llvm-svn: 340000	2018-08-17 07:51:01 +00:00
Sjoerd Meijer	31239a4c6a	[ARM][NFC] ARMCodeGenPrepare: some refactoring and algorithm description Differential Revision: https://reviews.llvm.org/D50846 llvm-svn: 339997	2018-08-17 07:34:01 +00:00
Max Kazantsev	7b78d3920c	[MustExecute] Fix algorithmic bug in isGuaranteedToExecute. PR38514 The description of `isGuaranteedToExecute` does not correspond to its implementation. According to description, it should return `true` if an instruction is executed under the assumption that its loop is entered. However there is a sophisticated alrogithm inside that tries to prove that the instruction is executed if the loop is exited, which is not the same thing for infinite loops. There is an attempt to protect from dealing with infinite loops by prohibiting loops without exit blocks, however an infinite loop can have exit blocks. As result of that, MustExecute can falsely consider some blocks that are never entered as mustexec, and LICM can hoist dangerous instructions out of them basing on this fact. This may introduce UB to programs which did not contain it initially. This patch removes the problematic algorithm and replaced it with a one which tries to prove what is required in description. Differential Revision: https://reviews.llvm.org/D50558 Reviewed By: reames llvm-svn: 339984	2018-08-17 06:19:17 +00:00
Max Kazantsev	cfa3e66b8e	[NFC] Add tests to ensure that improvement of MustThrow analysis will not lead to problems in future llvm-svn: 339983	2018-08-17 05:20:25 +00:00
Chandler Carruth	b898b86f49	Revert r339977: [GISel]: Add Opcodes for a few LLVM Intrinsics This is breaking ~all the bots. llvm-svn: 339982	2018-08-17 04:47:16 +00:00
Brian Cain	f72611b4d2	[llvm-mc-assemble-fuzzer] Update API - Pass MCObjectWriter instead of a stream Fixes build breakage of llvm-mc-assemble-fuzzer introduced by r332749. Fix provided by pbhatu (Pratik Bhatu) llvm-svn: 339981	2018-08-17 04:38:41 +00:00
Graydon Hoare	eac6e87118	[Support] Add a public API to allow clearing all (static) timer groups. Summary: Formerly, all timer groups were automatically cleared when printed out. In https://reviews.llvm.org/rL324788 this behaviour was changed to not-clearing timers on printout, to allow printing timers more than once, but as a result clients (specifically Swift) that relied on the clear-on-print behaviour to inhibit duplicate timer printing on shutdown were broken. Rather than revert that change, this change adds a new API that enables clients that _want_ to clear all timers to do so explicitly. Reviewers: george.karpenkov, thegameg Reviewed By: george.karpenkov Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50874 llvm-svn: 339980	2018-08-17 04:13:19 +00:00
Aditya Nandakumar	973a557338	[GISel]: Add Opcodes for a few LLVM Intrinsics https://reviews.llvm.org/D50401 Add opcodes for llvm.intrinsic.trunc, round, and update the IRTranslator for the same. Reviewed by: dsanders. llvm-svn: 339977	2018-08-17 01:41:56 +00:00
Chandler Carruth	9e86844d54	[ADT] Replace a member initializer of a union with an explicit constructor. This breaking an old/weird host compiler is my best bet for the current crashes I'm getting from bots since this functionality was added to this ADT. llvm-svn: 339975	2018-08-17 01:10:33 +00:00
Heejin Ahn	a93e726170	[WebAssembly] Modify LateEHPrepare one-line description (NFC) llvm-svn: 339972	2018-08-17 00:12:04 +00:00
David Blaikie	0e03047e85	DebugInfo: Remove command line (& target-based) disabling of pubnames in favor of metadata Now that Clang disables NVPTX pubnames via metadata there's no need for this fallback to target detection in the backend. llvm-svn: 339970	2018-08-16 23:57:15 +00:00
Heejin Ahn	e76fa9ecca	[WebAssembly] CFG stackify support for exception handling Summary: This adds support for exception handling to CFGStackify pass. This only adds TRY / END_TRY markers and DOES NOT yet fix unwind mismatches that can be created by the linearization of the CFG into the structural wasm format. The mismatch fix will be added by following patches. In detail, this patch - Added support for TRY / END_TRY markers to support EH - Changed many static functions into class member functions as they take too many arguments now - Added several more bookeeping data structures - Refactored routines that decide where to insert markers, because without refactoring this got too complicated as we added support for new kinds of markers (TRY/END_TRY). - Rewrote rethrow instructions' BB arguments to relative depths in EH pad stack. Reviewers: dschuff, sunfish Subscribers: sbc100, jgravelle-google, llvm-commits Differential Revision: https://reviews.llvm.org/D48273 llvm-svn: 339967	2018-08-16 23:50:59 +00:00
Chandler Carruth	75ca6be1c1	[x86/MIR] Implement support for pre- and post-instruction symbols, as well as MIR parsing support for `MCSymbol` `MachineOperand`s. The only real way to test pre- and post-instruction symbol support is to use them in operands, so I ended up implementing that within the patch as well. I can split out the operand support if folks really want but it doesn't really seem worth it. The functional implementation of pre- and post-instruction symbols is now completely trivial. Two tiny bits of code in the (misnamed) AsmPrinter. It should be completely target independent as well. We emit these exactly the same way as we emit basic block labels. Most of the code here is to give full dumping, MIR printing, and MIR parsing support so that we can write useful tests. The MIR parsing of MC symbol operands still isn't 100%, as it forces the symbols to be non-temporary and non-local symbols with names. However, those names often can encode most (if not all) of the special semantics desired, and unnamed symbols seem especially annoying to serialize and de-serialize. While this isn't perfect or full support, it seems plenty to write tests that exercise usage of these kinds of operands. The MIR support for pre-and post-instruction symbols was quite straightforward. I chose to print them out in an as-if-operand syntax similar to debug locations as this seemed the cleanest way and let me use nice introducer tokens rather than inventing more magic punctuation like we use for memoperands. However, supporting MIR-based parsing of these symbols caused me to change the design of the symbol support to allow setting arbitrary symbols. Without this, I don't see any reasonable way to test things with MIR. Differential Revision: https://reviews.llvm.org/D50833 llvm-svn: 339962	2018-08-16 23:11:05 +00:00
Sanjay Patel	8ba631d9c8	[InstCombine] add reflection fold for tan(-x) This is a follow-up suggested with rL339604. For tan(), we don't have a corresponding LLVM intrinsic -- unlike sin/cos -- so this is the only way/place that we can do this fold currently. llvm-svn: 339958	2018-08-16 22:46:20 +00:00
Vedant Kumar	ee6c233ae0	[InstrProf] Use atomic profile counter updates for TSan Thread sanitizer instrumentation fails to skip all loads and stores to profile counters. This can happen if profile counter updates are merged: %.sink = phi i64* ... %pgocount5 = load i64, i64* %.sink %27 = add i64 %pgocount5, 1 %28 = bitcast i64* %.sink to i8* call void @__tsan_write8(i8* %28) store i64 %27, i64* %.sink To suppress TSan diagnostics about racy counter updates, make the counter updates atomic when TSan is enabled. If there's general interest in this mode it can be surfaced as a clang/swift driver option. Testing: check-{llvm,clang,profile} rdar://40477803 Differential Revision: https://reviews.llvm.org/D50867 llvm-svn: 339955	2018-08-16 22:24:47 +00:00
Sanjay Patel	75714b598d	[InstCombine] add tests for tan with negated arg; NFC llvm-svn: 339953	2018-08-16 22:05:51 +00:00
Alina Sbirlea	2ab544bcf5	Update MemorySSA in Local utils removing blocks. Summary: Extend Local utils to update MemorySSA. Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D48790 llvm-svn: 339951	2018-08-16 21:58:44 +00:00
Justin Bogner	b9fb2aec92	[docs] Try to clarify the FuzzingLLVM docs Try to improve these docs based on some recent questions that were sent to llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2018-August/125329.html llvm-svn: 339949	2018-08-16 21:55:09 +00:00
Alina Sbirlea	d4b3f19ba6	[DomTree] Add constructor to create a new DT based on current DT/CFG and a set of Updates. Summary: Add the posibility of creating a new DT using a set of Updates. This will essentially create a DT based on a CFG snapshot/view. Additional refactoring for either this patch or follow-ups: - create an utility for building BUI. - replace BUI with a GraphDiff. Reviewers: kuhar Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D50671 llvm-svn: 339947	2018-08-16 21:54:33 +00:00
Craig Topper	883ff69c93	[DAGCombiner] Don't reassociate operations that have the vector reduction flag set. When nodes are reassociated the vector-reduction flag gets lost. The test case is here is what would happen if you had a sum of absolute differences loop that started with a non-zero but contant sum and that loop was unrolled. The vectorizer will generate a constant vector for the initial value. And DAGCombiner reassociate tries to move it down the addition tree erasing the vector-reduction flag. Interestingly this moves constants the opposite direction of the reassociate IR pass. I've chosen to just punt on the reassociate, but I suppose we could maybe preserve the flag if both nodes have it set. Differential Revision: https://reviews.llvm.org/D50827 llvm-svn: 339946	2018-08-16 21:54:05 +00:00
Craig Topper	bde2b43cb3	[X86] In EFLAGS copy pass, don't emit EXTRACT_SUBREG instructions since we're after peephole Normally the peephole pass converts EXTRACT_SUBREG to COPY instructions. But we're after peephole so we can't rely on it to clean these up. To fix this, the eflags pass now emits a COPY with a subreg input. I also noticed that in 32-bit mode we need to constrain the input to the copy to ensure the subreg is valid. Otherwise we'll fail verify-machineinstrs Differential Revision: https://reviews.llvm.org/D50656 llvm-svn: 339945	2018-08-16 21:54:02 +00:00
Richard Smith	a6c34887f7	Factor Node creation out of the demangler. No functionality change intended. llvm-svn: 339944	2018-08-16 21:40:57 +00:00
Reid Kleckner	602c0dafdd	[MC] Improve COFF associative section lookup Handle the case when the symbol is private. Private symbols are not in the COFF object file symbol table, so they aren't inserted into SymbolMap. We can't look up the section of the symbol that way. Instead, get the MCSection from the MCSymbol and map that to the object file section. Print a better error message when the symbol has no section, like when the symbol is undefined. Fixes PR38607 llvm-svn: 339942	2018-08-16 21:34:41 +00:00
Chandler Carruth	c73c0307fe	[MI] Change the array of `MachineMemOperand` pointers to be a generically extensible collection of extra info attached to a `MachineInstr`. The primary change here is cleaning up the APIs used for setting and manipulating the `MachineMemOperand` pointer arrays so chat we can change how they are allocated. Then we introduce an extra info object that using the trailing object pattern to attach some number of MMOs but also other extra info. The design of this is specifically so that this extra info has a fixed necessary cost (the header tracking what extra info is included) and everything else can be tail allocated. This pattern works especially well with a `BumpPtrAllocator` which we use here. I've also added the basic scaffolding for putting interesting pointers into this, namely pre- and post-instruction symbols. These aren't used anywhere yet, they're just there to ensure I've actually gotten the data structure types correct. I'll flesh out support for these in a subsequent patch (MIR dumping, parsing, the works). Finally, I've included an optimization where we store any single pointer inline in the `MachineInstr` to avoid the allocation overhead. This is expected to be the overwhelmingly most common case and so should avoid any memory usage growth due to slightly less clever / dense allocation when dealing with >1 MMO. This did require several ergonomic improvements to the `PointerSumType` to reasonably support the various usage models. This also has a side effect of freeing up 8 bits within the `MachineInstr` which could be repurposed for something else. The suggested direction here came largely from Hal Finkel. I hope it was worth it. ;] It does hopefully clear a path for subsequent extensions w/o nearly as much leg work. Lots of thanks to Reid and Justin for careful reviews and ideas about how to do all of this. Differential Revision: https://reviews.llvm.org/D50701 llvm-svn: 339940	2018-08-16 21:30:05 +00:00
David Blaikie	66cf14d06b	DebugInfo: Add metadata support for disabling DWARF pub sections In cases where the debugger load time is a worthwhile tradeoff (or less costly - such as loading from a DWP instead of a variety of DWOs (possibly over a high-latency/distributed filesystem)) against object file size, it can be reasonable to disable pubnames and corresponding gdb-index creation in the linker. A backend-flag version of this was implemented for NVPTX in D44385/r327994 - which was fine for NVPTX which wouldn't mix-and-match CUs. Now that it's going to be a user-facing option (likely powered by "-gno-pubnames", the same as GCC) it should be encoded in the DICompileUnit so it can vary per-CU. After this, likely the NVPTX support should be migrated to the metadata & the previous flag implementation should be removed. Reviewers: aprantl Differential Revision: https://reviews.llvm.org/D50213 llvm-svn: 339939	2018-08-16 21:29:55 +00:00
Michael Berg	ed89d069f4	add a missed case for binary op FMF propagation under select folds llvm-svn: 339938	2018-08-16 20:59:45 +00:00
Philip Reames	5f50ffe83b	[AST] Speculative build fix for a polly buildbot I don't have polly setup to bulld locally and don't plan to. This should let the old API adapt to the new one. Can someone from polly please migrate usage and then delete the wrapper? llvm-svn: 339937	2018-08-16 20:58:48 +00:00
Philip Reames	684fa57ef7	[MemLoc] Fix a bug causing any use of invariant.end to crash in LICM The fix is fairly simple, but is says something unpleasant about the usage and testing of invariant.start/end scopes that this went undetected. To put this in perspective, any invariant.end in a loop flowing through LICM crashed. I haven't bothered to figure out just how far back this goes, but it's not caused by any of the recent changes. We're probably talking months if not years. llvm-svn: 339936	2018-08-16 20:48:55 +00:00
Krzysztof Parzyszek	bb1aede865	[SystemZ] Require asserts in subregliveness-06.mir The option -misched=shuffle is only available with !NDEBUG builds. llvm-svn: 339931	2018-08-16 20:12:15 +00:00
Philip Reames	0e2f9b9e30	[LICM][NFC] Restructure pointer invalidation API in terms of MemoryLocation Main value is just simplifying code. I'll further simply the argument handling case in a bit, but that involved a slightly orthogonal change so I went with the mildy ugly intermediate for this patch. Note that the isSized check in the old LICM code was not carried across. It turns out that check was dead. a) no test exercised it, and b) langref and verifier had been updated to disallow unsized types used in loads. llvm-svn: 339930	2018-08-16 20:11:15 +00:00
Andrea Di Biagio	998373c059	[llvm-mca] Fix -Wpessimizing-move warnings introduced by r339923. Reported by buildbot `clang-with-lto-ubuntu` ( build #9858 ). llvm-svn: 339928	2018-08-16 19:45:13 +00:00
Peter Collingbourne	3da2ffb826	Add missing test file from r339799. llvm-svn: 339927	2018-08-16 19:29:01 +00:00
Craig Topper	3dfc5af178	[X86] Pre-commit test case for D50827. llvm-svn: 339926	2018-08-16 19:27:43 +00:00
Jacob Gravelle	3d668d3928	[WebAssembly] Remove temporary workaround for function bitcasts Summary: EM_ASM no longer is lowered as varargs in C, so this workaround is obsolete. Reviewers: dschuff, sunfish Subscribers: sbc100, aheejin, llvm-commits Differential Revision: https://reviews.llvm.org/D50859 llvm-svn: 339925	2018-08-16 19:24:31 +00:00
Krzysztof Parzyszek	9af86a5e01	[MachineVerifier] Check if predecessor is jointly dominated by undefs Each use of a value should be jointly dominated by the union of defs and undefs. It can happen that it will only be jointly dominated by undefs, and that is still legal. Make sure that the verifier is aware of that. llvm-svn: 339924	2018-08-16 19:13:28 +00:00
Andrea Di Biagio	db63088ea7	[llvm-mca] Refactor how execution is orchestrated by the Pipeline. This patch changes how instruction execution is orchestrated by the Pipeline. In particular, this patch makes it more explicit how instructions transition through the various pipeline stages during execution. The main goal is to simplify both the stage API and the Pipeline execution. At the same time, this patch fixes some design issues which are currently latent, but that are likely to cause problems in future if people start defining custom pipelines. The new design assumes that each pipeline stage knows the "next-in-sequence". The Stage API has gained three new methods: - isAvailable(IR) - checkNextStage(IR) - moveToTheNextStage(IR). An instruction IR can be executed by a Stage if method `Stage::isAvailable(IR)` returns true. Instructions can move to next stages using method moveToTheNextStage(IR). An instruction cannot be moved to the next stage if method checkNextStage(IR) (called on the current stage) returns false. Stages are now responsible for moving instructions to the next stage in sequence if necessary. Instructions are allowed to transition through multiple stages during a single cycle (as long as stages are available, and as long as all the calls to `checkNextStage(IR)` returns true). Methods `Stage::preExecute()` and `Stage::postExecute()` have now become redundant, and those are removed by this patch. Method Pipeline::runCycle() is now simpler, and it correctly visits stages on every begin/end of cycle. Other changes: - DispatchStage no longer requires a reference to the Scheduler. - ExecuteStage no longer needs to directly interact with the RetireControlUnit. Instead, executed instructions are now directly moved to the next stage (i.e. the retire stage). - RetireStage gained an execute method. This allowed us to remove the dependency with the RCU in ExecuteStage. - FecthStage now updates the "program counter" during cycleBegin() (i.e. before we start executing new instructions). - We no longer need Stage::Status to be returned by method execute(). It has been dropped in favor of a more lightweight llvm::Error. Overally, I measured a ~11% performance gain w.r.t. the previous design. I also think that the Stage interface is probably easier to read now. That being said, code comments have to be improved, and I plan to do it in a follow-up patch. Differential revision: https://reviews.llvm.org/D50849 llvm-svn: 339923	2018-08-16 19:00:48 +00:00
Eli Friedman	73e8a784e6	[SelectionDAG] Improve the legalisation lowering of UMULO. There is no way in the universe, that doing a full-width division in software will be faster than doing overflowing multiplication in software in the first place, especially given that this same full-width multiplication needs to be done anyway. This patch replaces the previous implementation with a direct lowering into an overflowing multiplication algorithm based on half-width operations. Correctness of the algorithm was verified by exhaustively checking the output of this algorithm for overflowing multiplication of 16 bit integers against an obviously correct widening multiplication. Baring any oversights introduced by porting the algorithm to DAG, confidence in correctness of this algorithm is extremely high. Following table shows the change in both t = runtime and s = space. The change is expressed as a multiplier of original, so anything under 1 is “better” and anything above 1 is worse. +-------+-----------+-----------+-------------+-------------+ \| Arch \| u64u64 t \| u64u64 s \| u128u128 t \| u128u128 s \| +-------+-----------+-----------+-------------+-------------+ \| X64 \| - \| - \| ~0.5 \| ~0.64 \| \| i686 \| ~0.5 \| ~0.6666 \| ~0.05 \| ~0.9 \| \| armv7 \| - \| ~0.75 \| - \| ~1.4 \| +-------+-----------+-----------+-------------+-------------+ Performance numbers have been collected by running overflowing multiplication in a loop under `perf` on two x86_64 (one Intel Haswell, other AMD Ryzen) based machines. Size numbers have been collected by looking at the size of function containing an overflowing multiply in a loop. All in all, it can be seen that both performance and size has improved except in the case of armv7 where code size has regressed for 128-bit multiply. u128*u128 overflowing multiply on 32-bit platforms seem to benefit from this change a lot, taking only 5% of the time compared to original algorithm to calculate the same thing. The final benefit of this change is that LLVM is now capable of lowering the overflowing unsigned multiply for integers of any bit-width as long as the target is capable of lowering regular multiplication for the same bit-width. Previously, 128-bit overflowing multiply was the widest possible. Patch by Simonas Kazlauskas! Differential Revision: https://reviews.llvm.org/D50310 llvm-svn: 339922	2018-08-16 18:39:39 +00:00
Jordan Rupprecht	d1767dc56f	[llvm-strip] Add support for -p/--preserve-dates Summary: [llvm-strip] Preserve access/modification timestamps when -p is used. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50744 llvm-svn: 339921	2018-08-16 18:29:40 +00:00
Krzysztof Parzyszek	17143f6111	[RegisterCoalescer] Shrink to uses if needed after removeCopyByCommutingDef llvm-svn: 339912	2018-08-16 18:02:59 +00:00
Zachary Turner	af738f7277	Fix memory leak in demangling of string literals. llvm-svn: 339909	2018-08-16 17:48:32 +00:00
Simon Pilgrim	87d0039a45	[TargetLowering] Add support for non-uniform vectors to BuildSDIV This patch refactors the existing TargetLowering::BuildSDIV base implementation to support non-uniform constant vector denominators. This is the last patch necessary to close PR36545 Differential Revision: https://reviews.llvm.org/D50765 llvm-svn: 339908	2018-08-16 17:44:33 +00:00
Reid Kleckner	bd5d71229d	[codeview] Use push_macro to avoid conflicts instead of a prefix Summary: This prefix was added in r333421, and it changed our dumper output to say things like "CVRegEAX" instead of just "EAX". That's a functional change that I'd rather avoid. I tested GCC, Clang, and MSVC, and all of them support #pragma push_macro. They don't issue warnings whem the macro is not defined either. I don't have a Mac so I can't test the real termios.h header, but I looked at the termios.h sources online and looked for other conflicts. I saw only the CR* macros, so those are the ones we work around. Reviewers: zturner, JDevlieghere Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D50851 llvm-svn: 339907	2018-08-16 17:34:31 +00:00
Nirav Dave	eb189a0ef7	[MC] Cleanup noop default case spelling. NFC. llvm-svn: 339906	2018-08-16 17:22:31 +00:00
Tom Stellard	8f669aadca	Revert "unittests: Don't install TestPlugin.so" This reverts commit r339897. This breaks the build on Windows and platforms where loadable modules aren't supported. llvm-svn: 339903	2018-08-16 17:15:03 +00:00
Matt Arsenault	7121bed210	AMDGPU: Custom lower fexp This will allow the library to just use __builtin_expf directly without expanding this itself. Note f64 still won't work because there is no exp instruction for it. llvm-svn: 339902	2018-08-16 17:07:52 +00:00
Simon Pilgrim	8b9e545477	[X86][SSE] Add sdiv by nonuniform constant vector test containing -1/+1 and all-bits style constants llvm-svn: 339901	2018-08-16 17:07:41 +00:00
Evandro Menezes	42422b33cf	[NFC] Fix typo in test cases llvm-svn: 339900	2018-08-16 17:03:22 +00:00
Simon Pilgrim	ede4905375	[TargetLowering] Refactor BuildSDIV in preparation for D50765. NFCI. Pull out magic factor calculators into a helper function, use 0/+1/-1 multiplication factor to (optionally) add/sub the numerator. llvm-svn: 339898	2018-08-16 16:54:06 +00:00
Tom Stellard	b25e645ef1	unittests: Don't install TestPlugin.so Summary: add_llvm_loadable_module adds an install target by default, but this module is only used for a unit test, so we don't need to install it. Reviewers: philip.pfaffe, thakis Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D50668 llvm-svn: 339897	2018-08-16 16:53:06 +00:00
Benjamin Kramer	0ce64c81e7	[MC] Remove unused variable llvm-svn: 339896	2018-08-16 16:50:23 +00:00
Nirav Dave	7fd992a755	[MC][X86] Enhance X86 Register expression handling to more closely match GCC. Allow the comparison of x86 registers in the evaluation of assembler directives. This generalizes and simplifies the extension from r334022 to catch another case found in the Linux kernel. Reviewers: rnk, void Reviewed By: rnk Subscribers: hiraditya, nickdesaulniers, llvm-commits Differential Revision: https://reviews.llvm.org/D50795 llvm-svn: 339895	2018-08-16 16:31:14 +00:00
Zachary Turner	d78fe2f46d	Fix -Wmicrosoft-goto warnings. llvm-svn: 339894	2018-08-16 16:30:27 +00:00
Zachary Turner	2838b59121	Add support for AVX-512 CodeView registers. When compiling with /arch:AVX512 and optimizations turned on, we could crash while emitting debug info because we did not have CodeView register constants for the AVX 512 register set defined. This patch defines them. Differential Revision: https://reviews.llvm.org/D50819 llvm-svn: 339893	2018-08-16 16:17:55 +00:00
Zachary Turner	970fdc3236	[MS Demangler] Demangle string literals. When demangling string literals, Microsoft's undname simply prints 'string'. This patch implements string literal demangling while doing a bit better than this by decoding as much of the string as possible and trying to faithfully reproduce the original string literal definition. This is a bit tricky because the different character types char, char16_t, and char32_t are not uniquely identified by the mangling, so we have to use a heuristic to try to guess the character type. But it works pretty well, and many tests are added to illustrate the behavior. Differential Revision: https://reviews.llvm.org/D50806 llvm-svn: 339892	2018-08-16 16:17:36 +00:00
Zachary Turner	83313f8f54	[MS Demangler] Don't fail on MD5-mangled names. When we have an MD5 mangled name, we shouldn't choke and say that it's an invalid name. Even though it's impossible to demangle, we should just output the original name. llvm-svn: 339891	2018-08-16 16:17:17 +00:00
Simon Pilgrim	0e18133905	[TableGen] TypeSetByHwMode::operator== optimization This operator is called a great deal, by checking for the cheap isSimple equality cases first (a common occurrence) we can improve performance as we avoid a lot of std::map find/iteration in hasDefault. isSimple also means that a default value is present, so we can avoid some hasDefault calls. This also avoids a rather dodgy piece of logic that was checking for isSimple() && !VTS.isSimple() but not the inverse - it now uses the general hasDefault mode comparison test instead. Saves around 15secs in debug builds of x86 -gen-dag-isel. Differential Revision: https://reviews.llvm.org/D50841 llvm-svn: 339890	2018-08-16 16:16:28 +00:00
Sanjay Patel	0ea8d8b951	[ConstantFolding] add tests for funnel shift intrinsics; NFC No functionality for this yet. llvm-svn: 339889	2018-08-16 16:10:42 +00:00

... 4 5 6 7 8 ...

168491 Commits