llvm-project

Commit Graph

Author	SHA1	Message	Date
Ulrich Weigand	758399131a	[SystemZ] Add remaining branch instructions This patch adds assembler support for the remaining branch instructions: the non-relative branch on count variants, and all variants of branch on index. The only one of those that can be readily exploited for code generation is BRCTH (branch on count using a high 32-bit register as count). Do use it, however, it is necessary to also introduce a hew CHIMux pseudo to allow comparisons of a 32-bit value agains a short immediate to go into a high register as well (implemented via CHI/CIH). This causes a bit of codegen changes overall, but those have proven to be neutral (or even beneficial) in performance measurements. llvm-svn: 288029	2016-11-28 13:40:08 +00:00
Ulrich Weigand	524f276c74	[SystemZ] Improve use of conditional instructions This patch moves formation of LOC-type instructions from (late) IfConversion to the early if-conversion pass, and in some cases additionally creates them directly from select instructions during DAG instruction selection. To make early if-conversion work, the patch implements the canInsertSelect / insertSelect callbacks. It also implements the commuteInstructionImpl and FoldImmediate callbacks to enable generation of the full range of LOC instructions. Finally, the patch adds support for all instructions of the load-store-on-condition-2 facility, which allows using LOC instructions also for high registers. Due to the use of the GRX32 register class to enable high registers, we now also have to handle the cases where there are still no single hardware instructions (conditional move from a low register to a high register or vice versa). These are converted back to a branch sequence after register allocation. Since the expandRAPseudos callback is not allowed to create new basic blocks, this requires a simple new pass, modelled after the ARM/AArch64 ExpandPseudos pass. Overall, this patch causes significantly more LOC-type instructions to be used, and results in a measurable performance improvement. llvm-svn: 288028	2016-11-28 13:34:08 +00:00
Pavel Labath	79724fc0ae	skip android in @skipIfHostIncompatibleWithRemote The current implementation of the decorator does not skip if the android target arch is the same as host arch (as in both cases the platform comes out as linux). Nonetheless android x86_64 binaries are not compatible with linux ones. Technically this should be "skip if target is android and host is not android", but currently nobody runs lldb test suite on an android host, so we don't even have a way of specifying that the host is android. llvm-svn: 288027	2016-11-28 12:15:19 +00:00
Pavel Labath	4fd5754234	Fix a crash in ProcessPOSIXLog We are getting a null pointer for the list of categories here (presumably due to the args refactor). llvm-svn: 288026	2016-11-28 11:47:14 +00:00
Malcolm Parsons	57ae857548	[Sema] Set range end of constructors and destructors in template instantiations Summary: clang-tidy checks frequently use source ranges of functions. The source range of constructors and destructors in template instantiations is currently a single token. The factory method for constructors and destructors does not allow the end source location to be specified. Set end location manually after creating instantiation. Reviewers: aaron.ballman, rsmith, arphaman Subscribers: arphaman, cfe-commits Differential Revision: https://reviews.llvm.org/D26849 llvm-svn: 288025	2016-11-28 11:11:34 +00:00
James Molloy	6bed13c551	[InlineCost] Reduce inline thresholds to compensate for cost changes In r286814, the algorithm for calculating inline costs changed. This caused more inlining to take place which is especially apparent in optsize and minsize modes. As the cost calculation removed a skewed behaviour (we were inconsistent about the cost of calls) it isn't possible to update the thresholds to get exactly the same behaviour as before. However, this threshold change accounts for the very common case where an inline candidate has no calls within it. In this case, r286814 would inline around 5-6 more (IR) instructions. The changes to -Oz have been heavily benchmarked. The "obvious" value for the inline threshold at -Oz is zero, but due to inaccuracies in the inline heuristics this can actually cause code size increases due to not inlining key thunk functions (that then disappear). Experimentally, 5 was the sweet spot for code size over the test-suite. For -Os, this change removes the outlier results shown up by green dragon (http://104.154.54.203/db_default/v4/nts/13248). Fixes D26848. llvm-svn: 288024	2016-11-28 11:07:37 +00:00
Chandler Carruth	0c6efff178	[PM] Remove weird marking of invalidated analyses as "preserved". This never made a lot of sense. They've been invalidated for one IR unit but they aren't really preserved in any normal sense. It seemed like it would be an elegant way of communicating to outer IR units that pass managers and adaptors had already handled invalidation, but we've since ended up adding sets that model this more clearly: we're now using the 'AllAnalysesOn<IRUnitT>' set to handle cases where the trick of "preserving" invalidated analyses didn't work. This patch moves to rely on that technique exclusively and removes the cumbersome API aspect of updating the preserved set when doing invalidation. This in turn will simplify a number of upcoming patches. This has a side benefit of exposing a number of places where we were failing to mark the 'AllAnalysesOn<IRUnitT>' set as preserved. This patch fixes those, and with those fixes shouldn't change any observable behavior. llvm-svn: 288023	2016-11-28 10:42:21 +00:00
George Rimar	1642c5d871	[ELF] - Do not put non exec sections first when -no-rosegment That unifies handling cases when we have SECTIONS and when -no-rosegment is given in compareSectionsNonScript() Now Config->SingleRoRx is used for check, testcase is provided. llvm-svn: 288022	2016-11-28 10:26:21 +00:00
George Rimar	18a3096282	[ELF] - Set Config->SingleRoRx differently. NFC. Previously Config->SingleRoRx was set in createFiles() and used HasSections. This change moves it to readConfigs at place of common flags handling, and adds logic that sets this flag separatelly from ScriptParser if SECTIONS present. llvm-svn: 288021	2016-11-28 10:11:10 +00:00
George Rimar	63bf011003	[ELF] - Implemented -no-rosegment. --no-rosegment: Do not put read-only non-executable sections in their own segment Differential revision: https://reviews.llvm.org/D26889 llvm-svn: 288020	2016-11-28 10:05:20 +00:00
Eugene Leviant	ed30ce7ae4	[ELF] Print file:line for 'undefined section' errors Differential revision: https://reviews.llvm.org/D27108 llvm-svn: 288019	2016-11-28 09:58:04 +00:00
Davide Italiano	0f0d5d8f8d	[ThreadPool] Rollback recent changes until I figure out the breakage. llvm-svn: 288018	2016-11-28 09:17:12 +00:00
Davide Italiano	3dd87dad64	[ThreadPool] Remove outdated comment after r288016. llvm-svn: 288017	2016-11-28 08:57:05 +00:00
Davide Italiano	3ea0bfa7e0	[ThreadPool] Simplify the interface. NFCI. The callers don't use the return value. Found by Michael Spencer. llvm-svn: 288016	2016-11-28 08:53:41 +00:00
Mehdi Amini	43c2428203	Revert "Improve error handling in YAML parsing" This reverts commit r288014, the unittest isn't passing llvm-svn: 288015	2016-11-28 04:57:04 +00:00
Mehdi Amini	c54281be4f	Improve error handling in YAML parsing Some scanner errors were not checked and reported by the parser. Fix PR30934 Patch by: Serge Guelton <serge.guelton@telecom-bretagne.eu> Differential Revision: https://reviews.llvm.org/D26419 llvm-svn: 288014	2016-11-28 04:44:13 +00:00
Chandler Carruth	4cf2c89883	[PM] Add an ASCII-art diagram for the call graph in the CGSCC unit test. No functionality changed. llvm-svn: 288013	2016-11-28 03:40:33 +00:00
Rafael Espindola	8e67000f1a	Always create a PT_ARM_EXIDX if needed. Unfortunatelly PT_ARM_EXIDX is special. There is no way to create it from linker scripts, so we have to create it even if PHDRS is used. This matches bfd and is required for the lld output to survive bfd's strip. llvm-svn: 288012	2016-11-28 00:40:21 +00:00
Craig Topper	17786f77f0	[X86][FMA4] Remove isCommutable from FMA4 scalar intrinsics. They aren't commutable as operand 0 should pass its upper bits through to the output. llvm-svn: 288011	2016-11-27 21:37:04 +00:00
Craig Topper	13b27a2748	[X86][FMA] Add missing Predicates qualifier around scalar FMA intrinsic patterns. llvm-svn: 288010	2016-11-27 21:37:02 +00:00
Craig Topper	ff9d45875a	[X86][FMA4] Add load folding support for FMA4 scalar intrinsic instructions. llvm-svn: 288009	2016-11-27 21:37:00 +00:00
Craig Topper	b00872b983	[X86][FMA4] Add test cases to demonstrate missed folding opportunities for FMA4 scalar intrinsics. llvm-svn: 288008	2016-11-27 21:36:58 +00:00
Craig Topper	3674f44e40	[X86] Add SHL by 1 to the load folding tables. I don't think isel selects these today, favoring adding the register to itself instead. But the load folding tables shouldn't be so concerned with what isel will use and just represent the relationships. llvm-svn: 288007	2016-11-27 21:36:54 +00:00
Simon Pilgrim	91d6f5fbc1	[X86][SSE] Add support for combining target shuffles to 128/256-bit PSLL/PSRL bit shifts llvm-svn: 288006	2016-11-27 21:08:19 +00:00
Sanjay Patel	8ca30ab0c5	[InstSimplify] allow integer vector types to use computeKnownBits Note that the non-splat lshr+lshr test folded, but that does not work in general. Something is missing or wrong in computeKnownBits as the non-splat shl+shl test still shows. llvm-svn: 288005	2016-11-27 21:07:28 +00:00
Craig Topper	4fab487265	[AVX-512] Add integer and fp unpck instructions to load folding tables. llvm-svn: 288004	2016-11-27 19:51:41 +00:00
Simon Pilgrim	cdb2ce661d	[X86][SSE] Split lowerVectorShuffleAsShift ready for combines. NFCI. Moved most of matching code into matchVectorShuffleAsShift to share with target shuffle combines (in a future commit). llvm-svn: 288003	2016-11-27 19:28:39 +00:00
Rui Ueyama	1dd86a664f	Add paralell_for and use it where appropriate. When we iterate over numbers as opposed to iterable elements, parallel_for fits better than parallel_for_each. llvm-svn: 288002	2016-11-27 19:28:32 +00:00
Craig Topper	7ad961cc70	[X86] Add TB_NO_REVERSE to entries in the load folding table where the instruction's load size is smaller than the register size. If we were to unfold these, the load size would be increased to the register size. This is not safe to do since the enlarged load can do things like cross a page boundary into a page that doesn't exist. I probably missed some instructions, but this should be a large portion of them. llvm-svn: 288001	2016-11-27 18:51:13 +00:00
Simon Pilgrim	4571157d2d	[X86][SSE] Added tests showing missed combines for shuffle to shifts. llvm-svn: 288000	2016-11-27 18:25:02 +00:00
Hal Finkel	fec8345108	Adjust type-trait evaluation to properly handle Using(Shadow)Decls Since r274049, for an inheriting constructor declaration, the name of the using declaration (and using shadow declaration comes from the using declaration) is the name of a derived class, not the base class (line 8225-8232 of lib/Sema/SemaDeclCXX.cpp in https://reviews.llvm.org/rL274049). Because of this, name-based lookup performed inside Sema::LookupConstructors returns not only CXXConstructorDecls but also Using(Shadow)Decls, which results assertion failure reported in PR29087. Patch by Taewook Oh, thanks! Differential Revision: https://reviews.llvm.org/D23765 llvm-svn: 287999	2016-11-27 16:26:14 +00:00
Sanjay Patel	dc2917b969	add tests to show missing analysis; NFC llvm-svn: 287998	2016-11-27 15:54:45 +00:00
Sanjay Patel	da9f7bf0fc	fix formatting; NFC llvm-svn: 287997	2016-11-27 15:53:48 +00:00
Rafael Espindola	5fcc99c27d	Also skip regular symbol assignment at the start of a script. Unfortunatelly some scripts look like kernphys = ... . = .... and the expectation in that every orphan section is after the assignment. llvm-svn: 287996	2016-11-27 09:44:45 +00:00
Craig Topper	c3b3926f8b	[AVX-512] Add masked EVEX vpmovzx/sx instructions to load folding tables. llvm-svn: 287995	2016-11-27 08:55:31 +00:00
Rafael Espindola	7fe4ec9b3a	Don't put an orphan before the first . assignment. This is an horrible special case, but seems to match bfd's behaviour and is important for avoiding placing an orphan section before the expected start of the file. llvm-svn: 287994	2016-11-27 07:39:45 +00:00
Mohammad Shahid	2f5cb60b07	[SLP] Add new and update existing lit testfor providing more context to incoming patch for vectorization of jumbled load Change-Id: Ifb9091bb0f84c1937c2c8bd2fc345734f250d2f9 llvm-svn: 287992	2016-11-27 03:35:31 +00:00
Craig Topper	fb64a25ba1	[X86] Remove alignment restrictions from load folding table for some instructions that don't have a restriction. Most of these are the SSE4.1 PMOVZX/PMOVSX instructions which all read less than 128-bits. The only other was PMOVUPD which by definition is an unaligned load. llvm-svn: 287991	2016-11-27 01:52:51 +00:00
Ekaterina Romanova	4c77e8940e	[DOXYGEN] Updated instruction names corresponding to avxintrin.h intrinsics. Documentation for some of the avxintrin.h's intrinsics errorneously said that non VEX-prefixed instructions could be generated. This was fixed. I tried several different solutions to achieve pretty printing of unordered lists (nested and non-nested) in param sections in doxygen. llvm-svn: 287990	2016-11-26 19:38:19 +00:00
Kuba Mracek	3a481cf0bd	[tsan] Fix the lit expansion of %deflake not to eat a space The lit expansion of "%deflake " (notice the space after) expands in a way that the space is removed, this fixes that. Differential Revision: https://reviews.llvm.org/D27139 llvm-svn: 287989	2016-11-26 19:09:32 +00:00
Marshall Clow	13320a50e5	Implement conjuntion/disjuntion/negation for LFTS v2. Same code and tests for the ones in std:: llvm-svn: 287988	2016-11-26 18:45:03 +00:00
Craig Topper	837ff25da1	[X86] Remove hasOneUse check that is redundant with the one in IsProfitableToFold. llvm-svn: 287987	2016-11-26 18:43:26 +00:00
Craig Topper	e266e126ff	[X86] Fix the zero extending load detection in X86DAGToDAGISel::selectScalarSSELoad to pass the load node to IsProfitableToFold and IsLegalToFold. Previously we were passing the SCALAR_TO_VECTOR node. llvm-svn: 287986	2016-11-26 18:43:24 +00:00
Craig Topper	d3ab1a3905	[X86] Simplify control flow. NFCI llvm-svn: 287985	2016-11-26 18:43:21 +00:00
Tobias Grosser	278f9e7d27	[ScopInfo] Use SCEVRewriteVisitor to simplify SCEVSensitiveParameterRewriter [NFC] llvm-svn: 287984	2016-11-26 17:58:40 +00:00
Craig Topper	991d1ca3ba	[X86] Add a hasOneUse check to selectScalarSSELoad to keep the same load from being folded multiple times. Summary: When selectScalarSSELoad is looking for a scalar_to_vector of a scalar load, it makes sure the load is only used by the scalar_to_vector. But it doesn't make sure the scalar_to_vector is only used once. This can cause the same load to be folded multiple times. This can be bad for performance. This also causes the chain output to be duplicated, but not connected to anything so chain dependencies will not be satisfied. Reviewers: RKSimon, zvi, delena, spatel Subscribers: andreadb, llvm-commits Differential Revision: https://reviews.llvm.org/D26790 llvm-svn: 287983	2016-11-26 17:29:25 +00:00
Sanjay Patel	12a2af447b	[InstCombine] add test to show missing vector optimization; NFC llvm-svn: 287982	2016-11-26 16:13:23 +00:00
Marshall Clow	3b3352dead	Implement the 'detection idiom' from LFTS v2 llvm-svn: 287981	2016-11-26 15:49:40 +00:00
Sanjay Patel	8bd69b7ed9	[InstCombine] don't drop metadata in FoldOpIntoSelect() llvm-svn: 287980	2016-11-26 15:23:20 +00:00
Rui Ueyama	e8a077badf	Change return types of split{Non,}Strings. They return new vectors, but at the same time they mutate other vectors, so returning values doesn't make much sense. We should just mutate two vectors. llvm-svn: 287979	2016-11-26 15:15:11 +00:00

1 2 3 4 5 ...

248281 Commits All Branches Search

248281 Commits

All Branches