llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	8c8b10389d	[X86][SSE] Use SDValue::getConstantOperandVal helper. NFCI. Also reordered an if statement to test low cost comparisons first llvm-svn: 294748	2017-02-10 14:27:59 +00:00
Simon Pilgrim	c371159aac	[X86][SSE] Add support for extracting target constants from BUILD_VECTOR In some cases we call getTargetConstantBitsFromNode for nodes that haven't been lowered from BUILD_VECTOR yet Note: We're getting very close to being able to move most of the constant extraction code from getTargetShuffleMaskIndices into getTargetConstantBitsFromNode llvm-svn: 294746	2017-02-10 14:04:11 +00:00
Simon Pilgrim	1140281413	[X86][SSE] Add missing comment describing combing to SHUFPS. NFCI llvm-svn: 294745	2017-02-10 13:16:01 +00:00
Chandler Carruth	7bc6028d7d	[PM] Relax the patterns used in the new test I added because some compilers don't print the typedef name. llvm-svn: 294729	2017-02-10 08:48:50 +00:00
Chandler Carruth	f425292721	[PM] Fix a bug in the new loop PM when handling functions with no loops. Without any loops, we don't even bother to build the standard analyses used by loop passes. Without these, we can't run loop analyses or invalidate them properly. Unfortunately, we did these things in the wrong order which would allow a loop analysis manager's proxy to be built but then not have the standard analyses built. When we went to do the invalidation in the proxy thing would fall apart. In the test case provided, it would actually crash. The fix is to carefully check for loops first, and to in fact build the standard analyses before building the proxy. This allows it to correctly trigger invalidation for those standard analyses. An alternative might seem to be to look at whether there are any loops when doing invalidation, but this doesn't work when during the loop pipeline run we delete the last loop. I've even included that as a test case. It is both simpler and more robust to defer building the proxy until there are definitely the standard set of analyses and indeed loops. This bug was uncovered by enabling GlobalsAA in the pipeline. llvm-svn: 294728	2017-02-10 08:26:58 +00:00
Igor Breger	6677999e17	add #ifdef, fix compilation error in case LLVM_BUILD_GLOBAL_ISEL=OFF llvm-svn: 294726	2017-02-10 07:33:14 +00:00
Mehdi Amini	a826244bb1	Fix doc for `-opt-bisect-limit`: the LTO option is linker specific llvm-svn: 294725	2017-02-10 07:21:06 +00:00
Igor Breger	b4442f34cd	[X86][GlobalISel] Add general-purpose Register Bank Summary: [X86][GlobalISel] Add general-purpose Register Bank. Add trivial handling of G_ADD legalization . Add Regestry Bank selection for COPY and G_ADD instructions Reviewers: rovka, zvi, ab, t.p.northover, qcolombet Reviewed By: qcolombet Subscribers: qcolombet, mgorny, dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D29771 llvm-svn: 294723	2017-02-10 07:05:56 +00:00
Dean Michael Berris	b5600b58a0	[XRay][graph] Disambiguate name of type from member name Follow-up to D29005. Differential Revision: https://reviews.llvm.org/D29005 llvm-svn: 294722	2017-02-10 06:59:25 +00:00
Dean Michael Berris	6c97b3acda	[XRay] A graph Class for the llvm-xray graph Summary: In preparation for graph comparison and filtering, this is a library for representing graphs in LLVM. This will enable easier encapsulation and reuse of graphs in llvm-xray. Depends on D28999, D28225 Reviewers: dblaikie, dberris Reviewed By: dberris Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D29005 llvm-svn: 294717	2017-02-10 06:36:08 +00:00
Philip Reames	578dafbd8b	[LoopUnswitch] Remove BFI usage (dead code) Chandler mentioned at the last social that the need for BFI in the new pass manager was causing a slight hiccup for this pass. Given this code has been checked in, but off for over a year, it makes sense to just remove it for now. Note that there's nothing wrong with the general idea - it's actually a quite good one - and once we have the infrastructure in place to implement this without the full recompuation on every loop, we absolutely should. llvm-svn: 294715	2017-02-10 06:12:06 +00:00
Dean Michael Berris	79f5746f41	Revert "[XRay] A graph Class for the llvm-xray graph" Broke tests, reverting. llvm-svn: 294714	2017-02-10 06:05:46 +00:00
Dean Michael Berris	2957c25a5e	[XRay] A graph Class for the llvm-xray graph Summary: In preparation for graph comparison and filtering, this is a library for representing graphs in LLVM. This will enable easier encapsulation and reuse of graphs in llvm-xray. Depends on D28999, D28225 Reviewers: dblaikie, dberris Reviewed By: dberris Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D29005 llvm-svn: 294713	2017-02-10 05:40:37 +00:00
Craig Topper	a9f1121896	[SelectionDAG] Dump the DAG after legalizing vector ops and after the second type legalization Summary: With -debug, we aren't dumping the DAG after legalizing vector ops. In particular, on X86 with AVX1 only, we don't dump the DAG after we split 256-bit integer ops into pairs of 128-bit ADDs since this occurs during vector legalization. I'm only dumping if the legalize vector ops changes something since we don't print anything during legalize vector ops. So this dump shows up right after the first type-legalization dump happens. So if nothing changed this second dump is unnecessary. Having said that though, I think we should probably fix legalize vector ops to log what its doing. Reviewers: RKSimon, eli.friedman, spatel, arsenm, chandlerc Reviewed By: RKSimon Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D29554 llvm-svn: 294711	2017-02-10 05:05:57 +00:00
Adam Nemet	386cd3dd6b	opt-viewer: fix HtmlFormatter encoding Summary: Small fix to HtmlFormatter, defaults to ascii encoding, so utf-8 output may get `UnicodeEncodeError: 'ascii' codec can't encode character ... ordinal not in range(128)` during write. Patch by Brian Cain! Reviewers: anemet, fhahn Reviewed By: anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29802 llvm-svn: 294710	2017-02-10 04:50:18 +00:00
Eric Christopher	0824096cc0	Temporarily revert "For X86-64 linux and PPC64 linux align int128 to 16 bytes." until we can get better TargetMachine::isCompatibleDataLayout to compare - otherwise we can't code generate existing bitcode without a string equality data layout. This reverts commit r294702. llvm-svn: 294709	2017-02-10 04:35:32 +00:00
Ahmed Bougacha	982c5eb396	[GlobalISel] Return an Expected<RuleMatcher> for each SDAG pattern. NFC. Instead of emitting the matcher code directly, return the rule matcher and the skip reason as an Expected<RuleMatcher>. This will let us record all matchers and process them before emission. It's a somewhat unconventional use of Error, but it's nicer than, say, std::pair, because of the bool conversions. Differential Revision: https://reviews.llvm.org/D29743 llvm-svn: 294706	2017-02-10 04:00:17 +00:00
Matthias Braun	ef21cb2d95	SubtargetFeature: Increase MAX_SUBTARGET_FEATURES The ARM target is getting really close to the current limit of 128 subtarget features already breaking out of tree enhancements. Increase the size once more to 196. I filed http://llvm.org/PR31926 to request a proper solution. llvm-svn: 294704	2017-02-10 03:48:50 +00:00
Eric Christopher	42b9248803	For X86-64 linux and PPC64 linux align int128 to 16 bytes. For other platforms we should find out what they need and likely make the same change, however, a smaller additional change is easier for platforms we know have it specified in the ABI. As part of this rewrite some of the handling in the backends for data layout and update a bunch of testcases. Based on a patch by Simonas Kazlauskas! llvm-svn: 294702	2017-02-10 03:32:21 +00:00
Quentin Colombet	21136c0273	[TableGen][AsmWriterEmitter] Use a deterministic order to sort InstrAliases Inside an alias group, when ordering instruction aliases, we rely on the priority field to sort them. When the priority is not set or more generally when there is a tie between two aliases, we used to rely on the lexicographic order. However, this order can change for the anonymous records when more instruction, intrinsic, etc. are inserted. For instance, given two anonymous records r1 and r2 with respective name A_999 and A_1000, their lexicography order will be r2 then r1. Now, if an instruction is added before them, their name will become respectively A_1000 and A_1001, thus the lexicography order will be r1 then r2, i.e., it changed. If that happens in an alias group, the assembly output would prefer a different alias for no apparent good reasons. A way to fix that is to use proper priority for all aliases, but we can also make the tie breaker comparison smarter and use a deterministic ordering. This is what this patch does. llvm-svn: 294695	2017-02-10 02:43:09 +00:00
Matt Arsenault	b4493e909f	AMDGPU: Fix trailing whitespace llvm-svn: 294694	2017-02-10 02:42:31 +00:00
Wei Ding	205bfdb3e9	AMDGPU : Add trap handler support. Differential Revision: http://reviews.llvm.org/D26010 llvm-svn: 294692	2017-02-10 02:15:29 +00:00
Stanislav Mekhanoshin	6dec24316b	[AMDGPU] Override PSet for M0 This change returns empty PSet list for M0 register. Otherwise its PSet as defined by tablegen is SReg_32. This results in incorrect register pressure calculation every time an instruction uses M0. Such uses count as SReg_32 PSet and inadequately increase pressure on SGPRs. Differential Revision: https://reviews.llvm.org/D29798 llvm-svn: 294691	2017-02-10 02:07:58 +00:00
Eric Fiselier	87c87f4c30	[CMake] Fix pthread handling for out-of-tree builds LLVM defines `PTHREAD_LIB` which is used by AddLLVM.cmake and various projects to correctly link the threading library when needed. Unfortunately `PTHREAD_LIB` is defined by LLVM's `config-ix.cmake` file which isn't installed and therefore can't be used when configuring out-of-tree builds. This causes such builds to fail since `pthread` isn't being correctly linked. This patch attempts to fix that problem by renaming and exporting `LLVM_PTHREAD_LIB` as part of`LLVMConfig.cmake`. I renamed `PTHREAD_LIB` because It seemed likely to cause collisions with downstream users of `LLVMConfig.cmake`. llvm-svn: 294690	2017-02-10 01:59:20 +00:00
Marcos Pividori	a0b23b8e63	[libFuzzer] Export external functions on tests. We need to export external functions so they are found when calling GetProcAddress() on Windows. But we can't use `__declspec(dllexport)` because we want the targets to be completely independent from the fuzz engines and don't depend on other header files. Also, we don't want to include platform specific code managed with conditional macros. So, the solution is to add the exported symbols with linker flags in cmake. Differential revision: https://reviews.llvm.org/D29752 llvm-svn: 294688	2017-02-10 01:40:28 +00:00
Marcos Pividori	0ae27e80b0	[libFuzzer] Use dynamic loading for External Functions on Windows. Replace weak aliases with dynamic loading. Weak aliases were generating some problems when linking for MT on Windows. For MT, compiler-rt's libraries are statically linked to the main executable the same than libFuzzer, so if we use weak aliases, we are providing two different default implementations for the same weak function and the linker fails. In this diff I re implement ExternalFunctions() using dynamic loading, so it works in both cases (MD and MT). Also, dynamic loading is simpler, since we are not defining any auxiliary external function, and we don't need to deal with weak aliases. This is equivalent to the implementation using dlsym(RTLD_DEFAULT, FnName) for Posix. Differential revision: https://reviews.llvm.org/D29751 llvm-svn: 294687	2017-02-10 01:35:46 +00:00
David L. Jones	e072cf51da	Update test/CodeGen/X86/sse-align-10.ll to use FileCheck instead of grep Patch by Jorge Gorbe (lethalantidote). Differential Revision: https://reviews.llvm.org/D29797 llvm-svn: 294686	2017-02-10 01:35:31 +00:00
Eugene Zelenko	4b6ff6b86e	[MC] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 294685	2017-02-10 01:33:54 +00:00
Michael J. Spencer	788b10ecbc	[LoadCombine] Change test to not use instcombine. llvm-svn: 294682	2017-02-10 00:44:08 +00:00
Dan Gohman	df4f4d45f0	[WebAssembly] Pass an MCContext to WebAssemblyMCCodeEmitter. NFC. llvm-svn: 294679	2017-02-10 00:14:42 +00:00
Matthias Braun	2bef2a08c0	Fix syntax error llvm-svn: 294678	2017-02-10 00:09:20 +00:00
Matthias Braun	62e1e8531b	ARMSubtarget.h: Change to one line per enum element; NFC Change syntax to have enum elements sorted alphabetically and one per line as that is more merge/cherry pick friendly. llvm-svn: 294677	2017-02-10 00:06:44 +00:00
Dan Gohman	23a543971e	[Support] Extend SLEB128 encoding support. Add support for padded SLEB128 values, and support for writing SLEB128 values to buffers rather than to ostreams, similar to the existing ULEB128 support. llvm-svn: 294675	2017-02-10 00:02:58 +00:00
Eric Christopher	e4b10f5d37	Add an additional set of braces to deal with subobject initialization. llvm-svn: 294674	2017-02-10 00:02:09 +00:00
Matthias Braun	f0cb2fdd74	docs/conf.py: Suppress sphinx highlighting failure warnings The pygments syntax highlighting package used by sphinx fails to parse newer LLVM constructs or valid (at least to me) gas constructs like `.secrel32 _function_name + 0`. Disable this particular warning so the build doesn't abort as fixing pygments doesn't seem a workable option here. Differential Revision: https://reviews.llvm.org/D29794 llvm-svn: 294672	2017-02-10 00:00:22 +00:00
Chandler Carruth	0ede22e1c0	[PM] Add Argument Promotion to the pass pipeline. This needs explicit requires of the optimization remark emission before loop pass pipelines containing LICM as we no longer get it from the inliner -- Argument Promotion may invalidate it. Technically the inliner could also have broken this, but it never came up in testing. Differential Revision: https://reviews.llvm.org/D29595 llvm-svn: 294670	2017-02-09 23:54:57 +00:00
Davide Italiano	fc0d442cf1	[NewGVN] Fix test so that it doesn't rely on InstCombine anymore. llvm-svn: 294668	2017-02-09 23:48:10 +00:00
Chandler Carruth	addcda483e	[PM] Port ArgumentPromotion to the new pass manager. Now that the call graph supports efficient replacement of a function and spurious reference edges, we can port ArgumentPromotion to the new pass manager very easily. The old PM-specific bits are sunk into callbacks that the new PM simply doesn't use. Unlike the old PM, the new PM simply does argument promotion and afterward does the update to LCG reflecting the promoted function. Differential Revision: https://reviews.llvm.org/D29580 llvm-svn: 294667	2017-02-09 23:46:27 +00:00
Peter Collingbourne	17febdbb25	WholeProgramDevirt: Check that VCP candidate functions are defined before evaluating them. This was crashing before. llvm-svn: 294666	2017-02-09 23:46:26 +00:00
Matthias Braun	d0d8daa37c	LowerMemIntrinsics: Fix include guard I hope this fixes the clang-stage2-cmake-modules jenkins build. llvm-svn: 294665	2017-02-09 23:43:28 +00:00
Chandler Carruth	1f8fcfeac5	[PM/LCG] Teach LCG to support spurious reference edges. Somewhat amazingly, this only requires teaching it to clean them up when deleting a dead function from the graph. And we already have exactly the necessary data structures to do that in the parent RefSCCs. This allows ArgPromote to work in a much simpler way be merely letting reference edges linger in the graph after the causing IR is deleted. We will clean up these edges when we run any function pass over the IR, but don't remove them eagerly. This avoids all of the quadratic update issues both in the current pass manager and in my previous attempt with the new pass manager. Differential Revision: https://reviews.llvm.org/D29579 llvm-svn: 294663	2017-02-09 23:30:14 +00:00
George Burgess IV	ccf11c2f9f	[ARM] Add support for armv7ve triple in llvm (PR31358). Gcc supports target armv7ve which is armv7-a with virtualization extensions. This change adds support for this in llvm for gcc compatibility. Also remove redundant FeatureHWDiv, FeatureHWDivARM for a few models as this is specified automatically by FeatureVirtualization. Patch by Manoj Gupta. Differential Revision: https://reviews.llvm.org/D29472 llvm-svn: 294661	2017-02-09 23:29:14 +00:00
Chandler Carruth	aaad9f84be	[PM/LCG] Teach the LazyCallGraph how to replace a function without disturbing the graph or having to update edges. This is motivated by porting argument promotion to the new pass manager. Because of how LLVM IR Function objects work, in order to change their signature a new object needs to be created. This is efficient and straight forward in the IR but previously was very hard to implement in LCG. We could easily replace the function a node in the graph represents. The challenging part is how to handle updating the edges in the graph. LCG previously used an edge to a raw function to represent a node that had not yet been scanned for calls and references. This was the core of its laziness. However, that model causes this kind of update to be very hard: 1) The keys to lookup an edge need to be `Function`s that would all need to be updated when we update the node. 2) There will be some unknown number of edges that haven't transitioned from `Function` edges to `Node` edges. All of this complexity isn't necessary. Instead, we can always build a node around any function, always pointing edges at it and always using it as the key to lookup an edge. To maintain the laziness, we need to sink the edges* of a node into a secondary object and explicitly model transitioning a node from empty to populated by scanning the function. This design seems much cleaner in a number of ways, but importantly there is now exactly one place where the `Function` has to be updated! Some other cleanups that fall out of this include having something to model the entry* edges more accurately. Rather than hand rolling parts of the node in the graph itself, we have an explicit `EdgeSequence` object that gives us exactly the functionality needed. We also have a consistent place to define the edge iterators and can use them for both the entry edges and the internal edges of the graph. The API used to model the separation between a node and its edges is intentionally very thin as most clients are expected to deal with nodes that have populated edges. We model this exactly as an optional does with an additional method to populate the edges when that is a reasonable thing for a client to do. This is based on API design suggestions from Richard Smith and David Blaikie, credit goes to them for helping pick how to model this without it being either too explicit or too implicit. The patch is somewhat noisy due to shifting around iterator types and new syntax for walking the edges of a node, but most of the functionality change is in the `Edge`, `EdgeSequence`, and `Node` types. Differential Revision: https://reviews.llvm.org/D29577 llvm-svn: 294653	2017-02-09 23:24:13 +00:00
Dan Gohman	b6afd2070a	[WebAssembly] Refactor void return peephole using MaybeRewriteToFallthrough. NFC. llvm-svn: 294652	2017-02-09 23:19:03 +00:00
Sanjay Patel	f38bab73aa	[InstCombine] allow (X * C2) << C1 --> X * (C2 << C1) for vectors This fold already existed for vectors but only when 'C1' was a splat constant (but 'C2' could be any constant). There were no tests for any vector constants, so I'm adding a test that shows non-splat constants for both operands. llvm-svn: 294650	2017-02-09 23:13:04 +00:00
Peter Collingbourne	cea1e4e79a	De-duplicate some code for creating an AARGetter suitable for the legacy PM. I'm about to use this in a couple more places. Differential Revision: https://reviews.llvm.org/D29793 llvm-svn: 294648	2017-02-09 23:11:52 +00:00
Hans Wennborg	f1e773cab5	Don't try to link to the 4.0 release notes llvm-svn: 294647	2017-02-09 23:03:34 +00:00
Matthias Braun	6717a0ba03	lit.rst: Fix sphinx complaint about multiple option definitions llvm-svn: 294646	2017-02-09 23:03:22 +00:00
Jonathan Roelofs	ebba0507da	[docs] Fix typo llvm-svn: 294645	2017-02-09 23:02:37 +00:00
Adrian McCarthy	d6e091dcc5	Fix build break from r294633. llvm-svn: 294642	2017-02-09 22:49:35 +00:00

1 2 3 4 5 ...

144651 Commits