llvm-project

Commit Graph

Author	SHA1	Message	Date
stefan	0ee47cc92f	[Attributor] Split the Attributor::run() into multiple functions. Summary: This patch splits the Attributor::run() function into multiple functions. Simple Logic changes to make this possible: # Moved iteration count verification earlier. # NumFinalAAs get set a little bit later. Reviewers: jdoerfert, sstefan1, uenoku Reviewed By: jdoerfert Subscribers: hiraditya, uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81022	2020-06-10 09:48:58 +00:00
Vitaly Buka	4666953ce2	[StackSafety] Add info into function summary Summary: This patch adds optional field into function summary, implements asm and bitcode serialization. YAML serialization is omitted and can be added later if needed. This patch includes this information into summary only if module contains at least one sanitize_memtag function. In a near future MTE is the user of the analysis. Later if needed we can provede more direct control on when information is included into summary. Reviewers: eugenis Subscribers: hiraditya, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80908	2020-06-10 02:43:28 -07:00
Paul Walker	8fd2270370	[FileCheck] Add function call support to numerical expressions. This patch extends numerical expressions to allow calls to predefined functions. These calls can be combined with the existing numerical operators, which includes nesting calls. The call syntax is: <func>(<args>) Where <func> is a predefined string literal, currently limited to one of add, max, min and sub. <arg> is a comma seperated list of numerical expressions. Subscribers: arichardson, hiraditya, thopre, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79936	2020-06-10 09:42:00 +00:00
Florian Hahn	67671024c8	[DSE,MSSA] Relax post-dom restriction for objs visible after return. This patch relaxes the post-dominance requirement for accesses to objects visible after the function returns. Instead of requiring the killing def to post-dominate the access to eliminate, the set of 'killing blocks' (= blocks that completely overwrite the original access) is collected. If all paths from the access to eliminate and an exit block go through a killing block, the access can be removed. To check this property, we first get the common post-dominator block for the killing blocks. If this block does not post-dominate the access block, there may be a path from DomAccess to an exit block not involving any killing block. Otherwise we have to check if there is a path from the DomAccess to the common post-dominator, that does not contain a killing block. If there is no such path, we can remove DomAccess. For this check, we start at the common post-dominator and then traverse the CFG backwards. Paths are terminated when we hit a killing block or a block that is not executed between DomAccess and a killing block according to the post-order numbering (if the post order number of a block is greater than the one of DomAccess, the block cannot be in in a path starting at DomAccess). This gives the following improvements on the total number of stores after DSE for MultiSource, SPEC2K, SPEC2006: Tests: 237 Same hash: 206 (filtered out) Remaining: 31 Metric: dse.NumRemainingStores Program base new100 diff test-suite...CFP2000/188.ammp/188.ammp.test 3624.00 3544.00 -2.2% test-suite...ch/g721/g721encode/encode.test 128.00 126.00 -1.6% test-suite.../Benchmarks/Olden/mst/mst.test 73.00 72.00 -1.4% test-suite...CFP2006/433.milc/433.milc.test 3202.00 3163.00 -1.2% test-suite...000/186.crafty/186.crafty.test 5062.00 5010.00 -1.0% test-suite...-typeset/consumer-typeset.test 40460.00 40248.00 -0.5% test-suite...Source/Benchmarks/sim/sim.test 642.00 639.00 -0.5% test-suite...nchmarks/McCat/09-vor/vor.test 642.00 644.00 0.3% test-suite...lications/sqlite3/sqlite3.test 35664.00 35563.00 -0.3% test-suite...T2000/300.twolf/300.twolf.test 7202.00 7184.00 -0.2% test-suite...lications/ClamAV/clamscan.test 19475.00 19444.00 -0.2% test-suite...INT2000/164.gzip/164.gzip.test 2199.00 2196.00 -0.1% test-suite...peg2/mpeg2dec/mpeg2decode.test 2380.00 2378.00 -0.1% test-suite.../Benchmarks/Bullet/bullet.test 39335.00 39309.00 -0.1% test-suite...:: External/Povray/povray.test 36951.00 36927.00 -0.1% test-suite...marks/7zip/7zip-benchmark.test 67396.00 67356.00 -0.1% test-suite...6/464.h264ref/464.h264ref.test 31497.00 31481.00 -0.1% test-suite...006/453.povray/453.povray.test 51441.00 51416.00 -0.0% test-suite...T2006/401.bzip2/401.bzip2.test 4450.00 4448.00 -0.0% test-suite...Applications/kimwitu++/kc.test 23481.00 23471.00 -0.0% test-suite...chmarks/MallocBench/gs/gs.test 6286.00 6284.00 -0.0% test-suite.../CINT2000/254.gap/254.gap.test 13719.00 13715.00 -0.0% test-suite.../Applications/SPASS/SPASS.test 30345.00 30338.00 -0.0% test-suite...006/450.soplex/450.soplex.test 15018.00 15016.00 -0.0% test-suite...ications/JM/lencod/lencod.test 27780.00 27777.00 -0.0% test-suite.../CINT2006/403.gcc/403.gcc.test 105285.00 105276.00 -0.0% There might be potential to pre-compute some of the information of which blocks are on the path to an exit for each block, but the overall benefit might be comparatively small. On the set of benchmarks, 15738 times out of 20322 we reach the CFG check, the CFG check is successful. The total number of iterations in the CFG check is 187810, so on average we need less than 10 steps in the check loop. Bumping the threshold in the loop from 50 to 150 gives a few small improvements, but I don't think they warrant such a big bump at the moment. This is all pending further tuning in the future. Reviewers: dmgreen, bryant, asbirlea, Tyker, efriedma, george.burgess.iv Reviewed By: george.burgess.iv Differential Revision: https://reviews.llvm.org/D78932	2020-06-10 10:39:25 +01:00
Vitaly Buka	5a3b380f49	Revert "[InstrProfiling] Use !associated metadata for counters, data and values" This reverts commit `69c5ff4668`. This reverts commit `603d58b5e4`. This reverts commit `ba10bedf56`. This reverts commit `39b3c41b65`.	2020-06-10 02:32:50 -07:00
Alex Bradbury	d9bc8bd54a	[RISCV] Make visibility of overridden methods in RISCVISelLowering match the parent Currently, some fairly arbitrary subset of overriden methods in RISCVISelLowering are private rather than public (which is the visibility they have in TargetLowering). I suspect this is a holdover from too closely copying another backend. D78545 pointed out this can be difficult for some downstream patches, and nobody has come forward to suggest a reason for keeping the visibility as-is. This commit simply makes all overridden methods match the public visiblity of the parent. Differential Revision: https://reviews.llvm.org/D79928	2020-06-10 09:16:09 +01:00
Sam Parker	09d30cb977	[CostModel] Unify Shuffle and InsertElement Costs Extract the existing code from getInstructionThroughput into TTImpl::getUserCost. The duplicated code in the AMDGPU backend has also been removed. Differential Revision: https://reviews.llvm.org/D81448	2020-06-10 09:13:34 +01:00
Sam Parker	fa8bff0cd1	[CostModel] Unify getArithmeticInstrCost Add the remaining arithmetic opcodes into the generic implementation of getUserCost and then call this from getInstructionThroughput. Most of the backends have been modified to return the base implementation for cost kinds other RecipThroughput. The outlier here is AMDGPU which already uses getArithmeticInstrCost for all the cost kinds. This change means that most of the opcodes can be removed from that backends implementation of getUserCost. Differential Revision: https://reviews.llvm.org/D80992	2020-06-10 09:08:45 +01:00
Kazushi (Jam) Marukawa	49e4faa010	[VE] Support host memory access instructions in MC layer Summary: Add LHM/SHM instructions. Add regression tests for them of asmparser, mccodeemitter, and disassembler. In order to add those instructions, add new decode functions to disassembler, and add new print functions to instprinter. Differential Revision: https://reviews.llvm.org/D81535	2020-06-10 10:02:14 +02:00
Wang, Pengfei	6eb9eae010	[MS] Copy the symbols assigned to the former instruction when memory folding. The memory folding raplaced the old instruction without copying the symbols assigned. Which will resulted in built fail due to the lost symbols. Reviewed by craig.topper Differential Revision: https://reviews.llvm.org/D78471	2020-06-10 15:38:32 +08:00
Eli Friedman	a92dcffcd3	Revert "[SPARC] Lower fp16 ops to libcalls" This reverts commit `28415e588f`. It's causing buildbot failures. (Probably just need to fix the triple for the test, but I'll look more tomorrow.)	2020-06-10 00:27:29 -07:00
LLVM GN Syncbot	801d1235c8	[gn build] Port `4f03c0b806`	2020-06-10 06:34:38 +00:00
LLVM GN Syncbot	c4e3e81786	[gn build] Port `075890ca55`	2020-06-10 06:34:37 +00:00
Amara Emerson	075890ca55	[AArch64] Move RegisterBankInfo.cpp/h to GISel. Missed this file in the recent reorg.	2020-06-09 23:26:25 -07:00
Shawn Landden	9ec57cce62	[AArch64] custom lowering for i128 popcount halves the number of CNT instructions generated	2020-06-10 09:44:16 +04:00
LemonBoy	28415e588f	[SPARC] Lower fp16 ops to libcalls The fp16 ops are legalized by extending/chopping them as needed. The tests are shamelessly stolen from the RISC-V backend. Differential Revision: https://reviews.llvm.org/D77569	2020-06-09 19:29:42 -07:00
Fangrui Song	ceaee253f4	[Support][unittest] Fix asan failure after D81156	2020-06-09 17:48:00 -07:00
Amara Emerson	938cc573ee	[AArch64][GlobalISel] Select G_ADD_LOW into a MOVaddr pseudo. This ensures that we match SelectionDAG behaviour by waiting until the expand pseudos pass to generate ADRP + ADD pairs. Doing this at selection time for the G_ADD_LOW is fine because by the time we get to selecting the G_ADD_LOW, previous attempts to fold it into loads/stores must have failed. Differential Revision: https://reviews.llvm.org/D81512	2020-06-09 16:47:58 -07:00
Craig Topper	641d5ac4d1	[X86] Assign a feature to tremont, goldmont, goldmont-plus, icelake-client, and icelake for target multiversioning priority. Without this these CPUs all caused the compiler to assert when used for multiversioning.	2020-06-09 16:39:41 -07:00
Whitney Tsang	01e64c9712	[LoopFusion] Update second loop guard non loop successor phis incoming blocks. Summary: The current LoopFusion forget to update the incoming block of the phis in second loop guard non loop successor from second loop guard block to first loop guard block. A test case is provided to better understand the problem. Reviewed By: jdoerfert Subscribers: hiraditya, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D81421	2020-06-09 21:14:51 +00:00
Christopher Tetreault	765ac39db2	[SVE] Eliminate calls to default-false VectorType::get() from Scalar Reviewers: efriedma, kmclaughlin, sdesmalen, fhahn, bkramer, anna, gchatelet, c-rhodes, david-arm, fpetrogalli Reviewed By: david-arm Subscribers: tschuett, hiraditya, rkruppe, psnobl, dantrushin, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80336	2020-06-09 14:09:02 -07:00
Christopher Tetreault	e8f815a494	[SVE] Eliminate calls to default-false VectorType::get() from FuzzMutate Reviewers: efriedma, kmclaughlin, sdesmalen, bogner, chandlerc, c-rhodes, david-arm, fpetrogalli Reviewed By: c-rhodes Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80325	2020-06-09 13:57:36 -07:00
Thomas Lively	a96414527c	[NFC][WebAssembly] Add tests for alignment on new SIMD loads Summary: The natural alignments for extending and splatting loads had not previously been tested. It is good to have them tested because they are non-obvious details in the SIMD spec proposal. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81303	2020-06-09 13:46:12 -07:00
diggerlin	2a3f5021f5	Added test case for the patch D75866 "supporting the visibility attribute for aix assembly" The test case has been reviewed in the patch D75866 Reviewers: Jason Liu ,hubert.reinterpretcast,James Henderson Differential Revision: https://reviews.llvm.org/D75866	2020-06-09 16:29:28 -04:00
diggerlin	edd819c757	[AIX] supporting the visibility attribute for aix assembly SUMMARY: in the aix assembly , it do not have .hidden and .protected directive. in current llvm. if a function or a variable which has visibility attribute, it will generate something like the .hidden or .protected , it can not recognize by aix as. in aix assembly, the visibility attribute are support in the pseudo-op like .extern Name [ , Visibility ] .globl Name [, Visibility ] .weak Name [, Visibility ] in this patch, we implement the visibility attribute for the global variable, function or extern function . for example. extern __attribute__ ((visibility ("hidden"))) int bar(int* ip); __attribute__ ((visibility ("hidden"))) int b = 0; __attribute__ ((visibility ("hidden"))) int foo(int* ip){ return (*ip)++; } the visibility of .comm linkage do not support , we will have a separate patch for it. we have the unsupported cases ("default" and "internal") , we will implement them in a a separate patch for it. Reviewers: Jason Liu ,hubert.reinterpretcast,James Henderson Differential Revision: https://reviews.llvm.org/D75866	2020-06-09 16:15:06 -04:00
Mitch Phillips	e26b25f8b1	[HWASan] Add sizeof(global) in report even if symbols missing. Summary: Refactor the current global header iteration to be callback-based, and add a feature that reports the size of the global variable during reporting. This allows binaries without symbols to still report the size of the global variable, which is always available in the HWASan globals PT_NOTE metadata. Reviewers: eugenis, pcc Reviewed By: pcc Subscribers: mgorny, llvm-commits, #sanitizers Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D80599	2020-06-09 13:02:13 -07:00
Mitch Phillips	9bca45bd45	Rebase.	2020-06-09 13:01:40 -07:00
Mitch Phillips	2ecf32fb35	remove redundant comment about Android.	2020-06-09 13:01:40 -07:00
Mitch Phillips	1bfb5b8e36	Address Peter's comments.	2020-06-09 13:01:40 -07:00
Mitch Phillips	184b437699	Move DSO dependencies inside the group.	2020-06-09 13:01:40 -07:00
Mitch Phillips	9e9142cbb9	Patch up issues with GN builds (pthread / libz) Summary: Fixes up two small issues with the gn build. 1 - Ensures that the correct ldflag `-pthread` is provided, not just linking the library. 2 - Ensures that libraries are linked in the same group as the dependencies. This fixes a problem where system libraries (libc) are involved in a link-order dependency that's not being fulfilled. Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80591	2020-06-09 13:01:40 -07:00
LLVM GN Syncbot	a7e0d55de0	[gn build] Port `d5c28c4094`	2020-06-09 19:53:21 +00:00
Craig Topper	d5c28c4094	[X86] Move CPUKind enum from clang to llvm/lib/Support. NFCI Similar to what some other targets have done. This information could be reused by other frontends so doesn't make sense to live in clang. -Rename CK_Generic to CK_None to better reflect its illegalness. -Move function for translating from string to enum into llvm. -Call checkCPUKind directly from the string to enum translation and update CPU kind to CK_None accordinly. Caller will use CK_None as sentinel for bad CPU. I'm planning to move all the CPU to feature mapping out next. As part of that I want to devise a better way to express CPUs inheriting features from an earlier CPU. Allowing this to be expressed in a less rigid way than just falling through a switch. Or using gotos as we've had to do lately. Differential Revision: https://reviews.llvm.org/D81439	2020-06-09 12:52:41 -07:00
Matt Arsenault	44b355f34b	AMDGPU/GlobalISel: Add new baseline tests for bitcast legalization	2020-06-09 15:46:53 -04:00
Sanjay Patel	6f6d2d2383	[x86] refine conditions for immediate hoisting to save code-size As shown in PR46237: https://bugs.llvm.org/show_bug.cgi?id=46237 The size-savings win for hoisting an 8-bit ALU immediate (intentionally excluding store constants) requires extreme conditions; it may not even be possible when including REX prefix bytes on x86-64. I did draft a version of this patch that included use counts after the loop, but I suspect that accounting is not working as expected. I think that is because the number of constant uses are changing as we select instructions (for example as we transform shl/add into LEA). Differential Revision: https://reviews.llvm.org/D81468	2020-06-09 15:44:55 -04:00
Matt Arsenault	32823091c3	GlobalISel: Set instr/debugloc before any legalizer action It was annoying enough that every custom lowering needed to set the insert point, but this was made worse since now these all needed to be updated to setInstrAndDebugLoc. Consolidate these so every legalization action has the right insert position by default. This should fix dropping debug info in every custom AMDGPU legalization.	2020-06-09 15:37:02 -04:00
Sanjay Patel	f71a3b54f0	[InstCombine] add tests for diff-of-sums; NFC	2020-06-09 15:33:38 -04:00
Matt Arsenault	b94c9e3b55	GlobalISel: Improve MachineIRBuilder construction The current relationship between LegalizerHelper and MachineIRBuilder confuses me, because the LegalizerHelper modifies the MachineIRBuilder which it does not own. Constructing a LegalizerHelper destroys the insert point, since the constructor calls setMF, which clears all the fields. Try to separate these functions, so it's possible to construct a LegalizerHelper from an existing MachineIRBuilder without losing the insert point/debug loc.	2020-06-09 15:05:04 -04:00
Matt Arsenault	babbf4441b	GlobalISel: Move some trivial MIRBuilder methods into the header The construction APIs for MachineIRBuilder don't make much sense, and it's been annoying to sort through it with these trivial functions separate from the declaration.	2020-06-09 15:04:48 -04:00
Matt Arsenault	bb6cb6bfe4	GlobalISel: Remove redundant check in verifier This was already checked earlier for all instructions.	2020-06-09 15:04:27 -04:00
Matt Arsenault	6eeac6ae33	GlobalISel: Fix double printing new instructions in legalizer New instructions were getting printed both in createdInstr, and in the final printNewInstrs, so it made it look like the same instructions were created twice. This overall made reading the debug output harder. Stop printing the initial construction and only print new instructions in the summary at the end. This avoids printing the less useful case where instructions are sometimes initially created with no operands. I'm not sure this is the correct instance to remove; now the visible ordering is different. Now you will typically see the one erased instruction message before all the new instructions in order. I think this is the more logical view of typical legalization changes, although it's mechanically backwards from the normal insert-new-erase-old pattern.	2020-06-09 15:02:31 -04:00
Mehdi Amini	d31c9e5a46	Change filecheck default to dump input on failure Having the input dumped on failure seems like a better default: I debugged FileCheck tests for a while without knowing about this option, which really helps to understand failures. Remove `-dump-input-on-failure` and the environment variable FILECHECK_DUMP_INPUT_ON_FAILURE which are now obsolete. Differential Revision: https://reviews.llvm.org/D81422	2020-06-09 18:57:46 +00:00
Anh Tuyen Tran	e7c5412b37	[NFC][LV][TEST]: extend pr45679-fold-tail-by-masking.ll with -force-vector-width=1 -force-vector-interleave=4 Summary: Add -force-vector-width=1 -force-vector-interleave=4 to pr45679-fold-tail-by-masking.ll Author: anhtuyen (Anh Tuyen Tran) Reviewers: Ayal (Ayal Zaks) Reviewed By: Ayal (Ayal Zaks) Subscribers: rkruppe (Hanna Kruppe), llvm-commits, LLVM Tag: LLVM Differential Revision: https://reviews.llvm.org/D80446	2020-06-09 18:30:56 +00:00
David Green	2fea3fe41c	[MachineScheduler] Update available queue on the first mop of a new cycle If a resource can be held for multiple cycles in the schedule model then an instruction can be placed into the available queue, another instruction can be scheduled, but the first will not be taken back out if the two instructions hazard. To fix this make sure that we update the available queue even on the first MOp of a cycle, pushing available instructions back into the pending queue if they now conflict. This happens with some downstream schedules we have around MVE instruction scheduling where we use ResourceCycles=[2] to show the instruction executing over two beats. Apparently the test changes here are OK too. Differential Revision: https://reviews.llvm.org/D76909	2020-06-09 19:13:53 +01:00
Fangrui Song	6bb93e3dd0	[gcov][test] Add mkdir -p %t && cd %t This allows an alternative lit runner (which does not chdir to %T) to run within a read-only source tree.	2020-06-09 11:09:50 -07:00
Simon Pilgrim	5dc4e7c2b9	[VectorCombine] scalarizeBinop - support an all-constant src vector operand scalarizeBinop currently folds vec_bo((inselt VecC0, V0, Index), (inselt VecC1, V1, Index)) -> inselt(vec_bo(VecC0, VecC1), scl_bo(V0,V1), Index) This patch extends this to account for cases where one of the vec_bo operands is already all-constant and performs similar cost checks to determine if the scalar binop with a constant still makes sense: vec_bo((inselt VecC0, V0, Index), VecC1) -> inselt(vec_bo(VecC0, VecC1), scl_bo(V0,extractelt(V1,Index)), Index) Fixes PR42174 Differential Revision: https://reviews.llvm.org/D80885	2020-06-09 19:02:05 +01:00
Daniel Kiss	7a38618a20	[AArch64] Allow BTI mnemonics in the HINT space with BTI disabled Summary: It is important to emit HINT instructions instead of BTI ones when BTI is disabled. This allows compatibility with other assemblers (e.g. GAS). Still, developers of assembly code will want to write code that is compatible with both pre- and post-BTI CPUs. They could use HINT mnemonics, but the new mnemonics are a lot more readable (e.g. bti c instead of hint #34), and they will result in the same encodings. So, while LLVM should not emit the new mnemonics when BTI is disabled, this patch will at least make LLVM accept assembly code that uses them. Reviewers: pbarrio, tamas.petz, ostannard Reviewed By: pbarrio, ostannard Subscribers: ostannard, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81257	2020-06-09 19:57:02 +02:00
Jessica Paquette	cb2d8b30ad	[AArch64][GlobalISel] Select trn1 and trn2 Same idea as for zip, uzp, etc. Teach the post-legalizer combiner to recognize G_SHUFFLE_VECTORs that are trn1/trn2 instructions. - Add G_TRN1 and G_TRN2 - Port mask matching code from AArch64ISelLowering - Produce G_TRN1 and G_TRN2 in the post-legalizer combiner - Select via importer Add select-trn.mir to test selection. Add postlegalizer-combiner-trn.mir to test the combine. This is similar to the existing arm64-trn test. Note that both of these tests contain things we currently don't legalize. I figured it would be easier to test these now rather than later, since once we legalize the G_SHUFFLE_VECTORs, it's not guaranteed that someone will update the tests. Differential Revision: https://reviews.llvm.org/D81182	2020-06-09 10:55:19 -07:00
Thomas Lively	b7d369280b	[WebAssembly] Implement prototype SIMD rounding instructions Summary: As specified in https://github.com/WebAssembly/simd/pull/232. These instructions are implemented as LLVM intrinsics for now rather than normal ISel patterns to make these instructions opt-in. Once the instructions are merged to the spec proposal, the intrinsics will be replaced with proper ISel patterns. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D81222	2020-06-09 10:14:14 -07:00
Fangrui Song	81cca98768	[DebugInfo] Drop unneeded format() calls (fix -Wformat-security) after `3b7ec64d59`	2020-06-09 09:56:13 -07:00

1 2 3 4 5 ...

198170 Commits