llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Liao	81db614411	Fix `-Wunused-variable` warnings. NFC.	2020-07-11 10:09:44 -04:00
Michael Liao	0b4cf802fa	[fix-irreducible] Skip unreachable predecessors. Summary: - Skip unreachable predecessors during header detection in SCC. Those unreachable blocks would be generated in the switch lowering pass in the corner cases or other frontends. Even though they could be removed through the CFG simplification, we should skip them during header detection. Reviewers: sameerds Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83562	2020-07-11 10:08:44 -04:00
sstefan1	850b150cff	[Attributor][NFC] Add more debug output for deleted functions	2020-07-11 14:26:08 +02:00
Christudasan Devadasan	d7a05698ef	[AMDGPU] Move LowerSwitch pass to CodeGenPrepare. It is possible that LowerSwitch pass leaves certain blocks unreachable from the entry. If not removed, these dead blocks can cause undefined behavior in the subsequent passes. It caused a crash in the AMDGPU backend after the instruction selection when a PHI node has its incoming values coming from these unreachable blocks. In the AMDGPU pass flow, the last invocation of UnreachableBlockElim precedes where LowerSwitch is currently placed and eventually missed out on the opportunity to get these blocks eliminated. This patch ensures that LowerSwitch pass get inserted earlier to make use of the existing unreachable block elimination pass. Reviewed By: sameerds, arsenm Differential Revision: https://reviews.llvm.org/D83584	2020-07-11 16:33:38 +05:30
Alexey Lapshin	f7907e9d22	[TRE] allow TRE for non-capturing calls. The current implementation of Tail Recursion Elimination has a very restricted pre-requisite: AllCallsAreTailCalls. i.e. it requires that no function call receives a pointer to local stack. Generally, function calls that receive a pointer to local stack but do not capture it - should not break TRE. This fix allows us to do TRE if it is proved that no pointer to the local stack is escaped. Reviewed by: efriedma Differential Revision: https://reviews.llvm.org/D82085	2020-07-11 14:01:48 +03:00
Roman Lebedev	4500db8c59	Revert "Reland "[InstCombine] Lower infinite combine loop detection thresholds""" And there's a new hit: https://bugs.llvm.org/show_bug.cgi?id=46680 This reverts commit `7103c87596`.	2020-07-11 13:53:24 +03:00
Nico Weber	09a95f51fb	[gn build] (manually) merge `943660fd15`	2020-07-11 06:44:28 -04:00
Johannes Doerfert	dce6bc18c4	[OpenMP][FIX] remove unused variable and long if-else chain MSVC throws an error if you use "too many" if-else in a row: `Frontend/OpenMP/OMPKinds.def(570): fatal error C1061: compiler limit: blocks nested too deeply` We work around it now...	2020-07-11 02:37:57 -05:00
Mehdi Amini	c44702bcdf	Remove unused variable `KMPC_KERNEL_PARALLEL_WORK_FN_PTR_ARG_NO` (NFC) This fixes a compiler warning.	2020-07-11 07:17:28 +00:00
Johannes Doerfert	5b0581aedc	[OpenMP] Replace function pointer uses in GPU state machine In non-SPMD mode we create a state machine like code to identify the parallel region the GPU worker threads should execute next. The identification uses the parallel region function pointer as that allows it to work even if the kernel (=target region) and the parallel region are in separate TUs. However, taking the address of a function comes with various downsides. With this patch we will identify the most common situation and replace the function pointer use with a dummy global symbol (for identification purposes only). That means, if the parallel region is only called from a single target region (or kernel), we do not use the function pointer of the parallel region to identify it but a new global symbol. Fixes PR46450. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D83271	2020-07-11 01:44:00 -05:00
Johannes Doerfert	624d34afff	[OpenMP] Compute a proper module slice for the CGSCCC pass The module slice describes which functions we can analyze and transform while working on an SCC as part of the CGSCC OpenMPOpt pass. So far, we simply restricted it to the SCC. In a follow up we will need to have a bigger scope which is why this patch introduces a proper identification of the module slice. In short, everything that has a transitive reference to a function in the SCC or is transitively referenced by one is fair game. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D83270	2020-07-11 01:44:00 -05:00
Johannes Doerfert	e8039ad4de	[OpenMP] Identify GPU kernels (aka. OpenMP target regions) We now identify GPU kernels, that is entry points into the GPU code. These kernels (can) correspond to OpenMP target regions. With this patch we identify and on request print them via remarks. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D83269	2020-07-11 01:44:00 -05:00
Johannes Doerfert	54bd3751ce	[OpenMP][NFC] Add convenient helper and early exit check	2020-07-11 00:51:51 -05:00
Johannes Doerfert	b726c55709	[OpenMP][NFC] Fix some typos	2020-07-11 00:51:51 -05:00
Wang, Pengfei	e628092524	[X86][MMX] Optimize MMX shift intrinsics. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D83534	2020-07-11 11:16:23 +08:00
Jinsong Ji	3e3acc1cc7	[PowerPC][MachinePipeliner] Enable pipeliner if hasInstrSchedModel P9 is the only one with InstrSchedModel, but we may have more in the future, we should not hardcoded it to P9, check hasInstrSchedModel instead. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D83590	2020-07-11 02:24:12 +00:00
Thomas Lively	b59c6fcaf3	[WebAssembly] Prefer v128.const for constant splats In BUILD_VECTOR lowering, we used to generally prefer using splats over v128.const instructions because v128.const has a very large encoding. However, in `d5b7a4e2e8` we switched to preferring consts because they are expected to be more efficient in engines. This patch updates the ISel patterns to match this current preference. Differential Revision: https://reviews.llvm.org/D83581	2020-07-10 18:27:52 -07:00
sstefan1	b8235d2bd8	Reland "[OpenMPOpt] ICV Tracking" This reverts commit `1d542f0ca8`. `recollectUses()` is added to prevent looking at dead uses after Attributor run. This is the first and most basic ICV Tracking implementation. For this first version, we only support deduplication within the same BB. Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6, uenoku, baziotis, lebedev.ri Differential Revision: https://reviews.llvm.org/D81788	2020-07-11 02:25:57 +02:00
Valentin Clement	943660fd15	[openmp] Remove OMPConstants.cpp and replace it by OMP.cpp generated by tablegen Summary: Diff D83176 moved the last piece of code from OMPConstants.cpp and now this file was only useful to include the tablegen generated file. This patch replace OMPConstants.cpp with OMP.cpp generated by tablegen. Reviewers: sstefan1, jdoerfert, jdenny Reviewed By: sstefan1 Subscribers: mgorny, yaxunl, hiraditya, guansong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83583	2020-07-10 20:11:57 -04:00
Zequan Wu	0f0c5af3db	[COFF] Add cg_profile directive and .llvm.call-graph-profile section Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83597	2020-07-10 17:07:30 -07:00
Teresa Johnson	3e5173dbc3	[BPI] Compile time improvement when erasing blocks (NFC) Summary: eraseBlock is trying to erase all probability info for the given BB. This info is stored in a DenseMap organized like so: using Edge = std::pair<const BasicBlock *, unsigned>; DenseMap<Edge, BranchProbability> Probs; where the unsigned in the Edge key is the successor id. It was walking through every single map entry, checking if the BB in the key's pair matched the given BB. Much more efficient is to do what another method (getEdgeProbability) was already doing, which is to walk the successors of the BB, and simply do a map lookup on the key formed from each <BB, successor id> pair. Doing this dropped the overall compile time for a file containing a very large function by around 32%. Reviewers: davidxl, xur Subscribers: llvm-commits, hiraditya Tags: #llvm Differential Revision: https://reviews.llvm.org/D83596	2020-07-10 16:55:54 -07:00
Alexandre Ganea	b71499ac9e	Revert "Re-land [CodeView] Add full repro to LF_BUILDINFO record" This reverts commit `add59ecb34` and `41d2813a5f`.	2020-07-10 19:46:16 -04:00
Eric Christopher	cc28058c13	Temporarily revert "[NFC] Separate bitcode reading for FUNC_CODE_INST_CMPXCHG(_OLD)" as it wasn't NFC and is causing issues with thinlto bitcode reading. I've followed up offline with reproduction instructions and testcases. This reverts commit `30582457b4`.	2020-07-10 15:21:00 -07:00
Matt Arsenault	31f4e43f3f	AMDGPU: Remove .value_type from kernel metadata This doesn't appear used for anything, and is emitted incorrectly based on the description. This also depends on the IR type, and pointee element type.	2020-07-10 18:16:31 -04:00
Craig Topper	122a45fbac	[X86] Add isel patterns for matching broadcast vpternlog if the ternlog and the broadcast have different types.	2020-07-10 15:15:02 -07:00
Arthur Eubanks	21b4cc1db9	Reland [NFC] Derive from PassInfoMixin for no-op/printing passes PassInfoMixin should be used for all NPM passes, rater than a custom `name()`. This caused ambiguous references in LegacyPassManager.cpp, so had to remove "using namespace llvm::legacy" and move some things around. Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D83498	2020-07-10 12:51:28 -07:00
Matt Arsenault	9ff310d5bf	AArch64: Fix unused variables	2020-07-10 15:12:25 -04:00
Sidharth Baveja	e541e1b757	[NFC] Separate Peeling Properties into its own struct (re-land after minor fix) Summary: This patch separates the peeling specific parameters from the UnrollingPreferences, and creates a new struct called PeelingPreferences. Functions which used the UnrollingPreferences struct for peeling have been updated to use the PeelingPreferences struct. Author: sidbav (Sidharth Baveja) Reviewers: Whitney (Whitney Tsang), Meinersbur (Michael Kruse), skatkov (Serguei Katkov), ashlykov (Arkady Shlykov), bogner (Justin Bogner), hfinkel (Hal Finkel), anhtuyen (Anh Tuyen Tran), nikic (Nikita Popov) Reviewed By: Meinersbur (Michael Kruse) Subscribers: fhahn (Florian Hahn), hiraditya (Aditya Kumar), llvm-commits, LLVM Tag: LLVM Differential Revision: https://reviews.llvm.org/D80580	2020-07-10 18:39:30 +00:00
Lei Huang	90b1a710ae	[PowerPC] Enable default support of quad precision operations Summary: Remove option guarding support of quad precision operations. Reviewers: nemanjai, #powerpc, steven.zhang Reviewed By: nemanjai, #powerpc, steven.zhang Subscribers: qiucf, wuzish, nemanjai, hiraditya, kbarton, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83437	2020-07-10 13:27:48 -05:00
Davide Italiano	fdb7856d54	Revert "[NFC] Derive from PassInfoMixin for no-op/printing passes" This reverts commit `8039d2c3bf` as it breaks the modules build on macOS.	2020-07-10 11:19:13 -07:00
Alexandre Ganea	add59ecb34	Re-land [CodeView] Add full repro to LF_BUILDINFO record This patch adds some missing information to the LF_BUILDINFO which allows for rebuilding an .OBJ without any external dependency but the .OBJ itself (other than the compiler executable). Some tools need this information to reproduce a build without any knowledge of the build system. The LF_BUILDINFO therefore stores a full path to the compiler, the PWD (which is the CWD at program startup), a relative or absolute path to the TU, and the full CC1 command line. The command line needs to be freestanding (not depend on any environment variable). In the same way, MSVC doesn't store the provided command-line, but an expanded version (somehow their equivalent of CC1) which is also freestanding. For more information see PR36198 and D43002. Differential Revision: https://reviews.llvm.org/D80833	2020-07-10 13:59:28 -04:00
Craig Topper	1cf6f210a2	[IR] Disable select ? C : undef -> C fold in ConstantFoldSelectInstruction unless we know C isn't poison. This matches the recent change to InstSimplify from D83440. Differential Revision: https://reviews.llvm.org/D83535	2020-07-10 10:42:25 -07:00
Luke Geeson	954db63cd1	[ARM] Add Cortex-A78 and Cortex-X1 Support for Clang and LLVM This patch upstreams support for the Arm-v8 Cortex-A78 and Cortex-X1 processors for AArch64 and ARM. In detail: - Adding cortex-a78 and cortex-x1 as cpu options for aarch64 and arm targets in clang - Adding Cortex-A78 and Cortex-X1 CPU names and ProcessorModels in llvm details of the CPU can be found here: https://www.arm.com/products/cortex-x https://www.arm.com/products/silicon-ip-cpu/cortex-a/cortex-a78 The following people contributed to this patch: - Luke Geeson - Mikhail Maltsev Reviewers: t.p.northover, dmgreen Reviewed By: dmgreen Subscribers: dmgreen, kristof.beyls, hiraditya, danielkiss, cfe-commits, llvm-commits, miyuki Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D83206	2020-07-10 18:24:11 +01:00
Benjamin Kramer	b887da81cc	[CGProfile] Fix layering, IPO depends in Instrumentation.	2020-07-10 19:13:47 +02:00
Zequan Wu	1fbb719470	[LPM] Port CGProfilePass from NPM to LPM Reviewers: hans, chandlerc!, asbirlea, nikic Reviewed By: hans, nikic Subscribers: steven_wu, dexonsmith, nikic, echristo, void, zhizhouy, cfe-commits, aeubanks, MaskRay, jvesely, nhaehnle, hiraditya, kerbowa, llvm-commits Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D83013	2020-07-10 09:04:51 -07:00
Roman Lebedev	1d542f0ca8	Revert "[OpenMPOpt] ICV Tracking" There appears to be some kind of memory corruption/use-after-free/etc going on here. In particular, in `OpenMPOpt::deleteParallelRegions()`, in `DeleteCallCB()`, `CI` is garbage. WIll post reproducer in the original review. This reverts commit `6c4a5e9257`.	2020-07-10 19:00:15 +03:00
Florian Hahn	ec00aa99dd	[DomTreeUpdater] Use const auto * when iterating over pointers (NFC). This silences the warning below: llvm-project/llvm/lib/Analysis/DomTreeUpdater.cpp:510:20: warning: loop variable 'BB' is always a copy because the range of type 'const SmallPtrSet<llvm::BasicBlock , 8>' does not return a reference [-Wrange-loop-analysis] for (const auto &BB : DeletedBBs) { ^ llvm-project/llvm/lib/Analysis/DomTreeUpdater.cpp:510:8: note: use non-reference type 'llvm::BasicBlock ' for (const auto &BB : DeletedBBs) { ^~~~~~~~~~~~~~~~ 1 warning generated.	2020-07-10 16:39:15 +01:00
Sanjay Patel	02fec9d2a5	[DAGCombiner] move/rename variables for readability; NFC	2020-07-10 11:28:51 -04:00
Alexandre Ganea	23cd70d71c	[PDB] Fix out-of-bounds acces when sorting GSI buckets When building in Debug on Windows-MSVC after `b7402edce3`, a lot of tests were failing because we were dereferencing an element past the end of HashRecords. This happened towards the end of the table, in unused slots.	2020-07-10 10:55:27 -04:00
Roman Lebedev	7103c87596	Reland "[InstCombine] Lower infinite combine loop detection thresholds"" This relands commit `cd7f8051ac` that was reverted since lower threshold have successfully found an issue. Now that the issue is fixed, let's wait until the next one is reported. This reverts commit `caa423eef0`.	2020-07-10 17:49:16 +03:00
Roman Lebedev	2655a70a04	[InstCombine] After merging store into successor, queue prev. store to be visited (PR46661) We can happen to have a situation with many stores eligible for transform, but due to our visitation order (top to bottom), when we have processed the first eligible instruction, we would not try to reprocess the previous instructions that are now also eligible. So after we've successfully merged a store that was second-to-last instruction into successor, if the now-second-to-last instruction is also a such store that is eligible, add it to worklist to be revisited. Fixes https://bugs.llvm.org/show_bug.cgi?id=46661	2020-07-10 17:49:16 +03:00
David Green	e1135b486a	Revert "[BasicAA] Enable -basic-aa-recphi by default" This reverts commit `af839a9618`. Some issues appear to be being caused by this. Reverting whilst we investigate.	2020-07-10 13:43:54 +01:00
Simon Pilgrim	b69e0f674f	DomTreeUpdater::dump() - use const auto& iterator in for-range-loop. Avoids unnecessary copies and silences clang tidy warning.	2020-07-10 12:47:15 +01:00
Simon Pilgrim	9ce9831289	StackSafetyAnalysis.cpp - pass ConstantRange arg as const reference. Avoids unnecessary copies and silences clang tidy warning - we do this in most places, there are just a few that were missed.	2020-07-10 12:13:34 +01:00
Simon Pilgrim	4cc26a44ca	[X86][SSE] Use shouldUseHorizontalOp helper to determine whether to use (F)HADD. NFCI.	2020-07-10 12:13:34 +01:00
dstuttar	69a89b54c6	[NFC] Change isFPPredicate comparison to ignore lower bound Summary: Since changing the Predicate to be an unsigned enum, the lower bound check for isFPPredicate no longer needs to check the lower bound, since it will always evaluate to true. Also fixed a similar issue in SIISelLowering.cpp by removing the need for comparing to FIRST and LAST predicates Added an assert to the isFPPredicate comparison to flag if the FIRST_FCMP_PREDICATE is ever changed to anything other than 0, in which case the logic will break. Without this change warnings are generated in VS. Change-Id: I358f0daf28c0628c7bda8ad4cab4e1757b761bab Subscribers: arsenm, jvesely, nhaehnle, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83540	2020-07-10 11:57:20 +01:00
Paul Walker	f78e6a3095	[SVE] Code generation for fixed length vector truncates. Lower fixed length vector truncates to a sequence of SVE UZP1 instructions. Differential Revision: https://reviews.llvm.org/D83395	2020-07-10 10:37:19 +00:00
Florian Hahn	264ab1e2c8	[LV] Pick vector loop body as insert point for SCEV expansion. Currently the DomTree is not kept up to date for additional blocks generated in the vector loop, for example when vectorizing with predication. SCEVExpander relies on dominance checks when looking for existing instructions to re-use and in some cases that can lead to the expander picking instructions that do not actually dominate their insert point (e.g. as in PR46525). Unfortunately keeping the DT up-to-date is a bit tricky, because the CFG is only patched up after generating code for a block. For now, we can just use the vector loop header, as this ensures the inserted instructions dominate all uses in the vector loop. There should be no noticeable impact on the generated code, as other passes should sink those instructions, if profitable. Fixes PR46525. Reviewers: Ayal, gilr, mkazantsev, dmgreen Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D83288	2020-07-10 10:37:12 +01:00
Mirko Brkusanin	cf40db21af	[AMDGPU][GlobalISel] Fix G_AMDGPU_TBUFFER_STORE_FORMAT mapping Add missing mappings and tablegen definitions for TBUFFER_STORE_FORMAT. Differential Revision: https://reviews.llvm.org/D83240	2020-07-10 11:32:32 +02:00
Simon Pilgrim	9a3e8b11a8	extractConstantWithoutWrapping - use const APInt& returned by SCEVConstant::getAPInt() Avoids unnecessary APInt copies and silences clang tidy warning.	2020-07-10 10:24:29 +01:00

1 2 3 4 5 ...

136604 Commits