llvm-project

Commit Graph

Author	SHA1	Message	Date
Igor Kudrin	2ba4df6c11	[DebugInfo] Fix dumping CIE ID in .eh_frame sections. We do not keep the actual value of the CIE ID field, because it is predefined, and use a constant when dumping a CIE record. The issue was that the predefined value is different for .debug_frame and .eh_frame sections, but we always printed the one which corresponds to .debug_frame. The patch fixes that by choosing an appropriate constant to print. See the following for more information about .eh_frame sections: https://refspecs.linuxfoundation.org/LSB_5.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html Differential Revision: https://reviews.llvm.org/D73627	2020-02-13 15:42:14 +07:00
Abdurrahman Akkas	2e8c112ecf	[mlir] Add elementAttr to TypedArrayAttrBase. In code generators, one can automate the translation of typed ArrayAttrs if element attribute translators are already implemented. However, the type of the element attribute is lost at the construction of TypedArrayAttrBase. With this change one can inspect the element type and generate the translation logic automatically, which will reduce the code repetition. Differential Revision: https://reviews.llvm.org/D73579	2020-02-13 09:25:27 +01:00
Kern Handa	005b720373	[NFC][mlir] Adding some helpful EDSC intrinsics Differential Revision: https://reviews.llvm.org/D74119	2020-02-13 09:21:17 +01:00
Pavel Labath	cb6c9f731b	[lldb] Make gdbremote.py utility py2and3 compatible	2020-02-13 09:18:55 +01:00
River Riddle	a134ccbbeb	[mlir][DeclarativeParser] Move operand type resolution into a functor to share code. This reduces the duplication for the two different cases.	2020-02-12 23:56:07 -08:00
River Riddle	c74150e75f	[mlir][ODS][NFC] Mark OpaqueType as a buildable type. This allows for using it in the declarative assembly form, among other things.	2020-02-12 23:51:38 -08:00
Johannes Doerfert	3f3ec9c40b	[OpenMP][FIX] Collect blocks to be outlined after finalization Finalization can introduce new blocks we need to outline as well so it makes sense to identify the blocks that need to be outlined after finalization happened. There was also a minor unit test adjustment to account for the fact that we have a single outlined exit block now.	2020-02-13 00:42:22 -06:00
Fangrui Song	81cebfd008	[ELF][test] Change -o %t to -o /dev/null if the output is not needed	2020-02-12 21:54:50 -08:00
Sterling Augustine	a7ecf4c324	Explicitly state the output file. Summary: Even though this test is a check for failure, lld still attempts to open the final output file, which fails when the default "a.out" file is used and the current directory is read-only. Specifying an output file works around this problem. Reviewers: espindola Subscribers: emaste, MaskRay, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74523	2020-02-12 21:25:30 -08:00
Vladimir Vereschaka	637a24bc0c	Revert "Replace std::foo with std::foo_t in LLVM." This reverts commit `a4384c756b`. These changes break LLVM build on Windows builders. See https://reviews.llvm.org/rGa4384c756bd8a819051009b5b273b2a34be8261b for details.	2020-02-12 20:54:21 -08:00
Craig Topper	af15082af4	[X86] Add test RUN lines to show cases where we use 512-bit vcmppd/ps with garbage upper bits for 128/256-bit strict_fsetcc On KNL targets, we widen 128/256-bit strict_fsetcc nodes to 512-bits without forcing the upper bits to zero. This can cause spurious exceptions due to garbage upper bits. This behavior was inherited from the non-strict case where the spurious exception isn't a problem.	2020-02-12 20:51:52 -08:00
Yonghong Song	61bd33e37b	[BPF] explicit warning of not supporting dynamic stack allocation Currently, BPF does not support dynamic static allocation. For a program like below: extern void bar(int *); void foo(int n) { int a[n]; bar(a); } The current error message looks like: unimplemented operand UNREACHABLE executed at /.../llvm/lib/Target/BPF/BPFISelLowering.cpp:199! Let us make error message explicit so it will be clear to the user what is the problem. With this patch, the error message looks like: fatal error: error in backend: Unsupported dynamic stack allocation ... Differential Revision: https://reviews.llvm.org/D74521	2020-02-12 20:43:06 -08:00
Johannes Doerfert	70cac41a2b	Reapply "[OpenMP][IRBuilder] Perform finalization (incl. outlining) late" Reapply `8a56d64d76` with minor fixes. The problem was that cancellation can cause new edges to the parallel region exit block which is not outlined. The CodeExtractor will encode the information which "exit" was taken as a return value. The fix is to ensure we do not return any value from the outlined function, to prevent control to value conversion we ensure a single exit block for the outlined region. This reverts commit `3aac953afa`.	2020-02-12 22:29:07 -06:00
Serguei Katkov	a6f38b4697	[Statepoint] Remove redundant clear of call target on register Patchable statepoint is lowered into sequence of nops, so zeroed call target should not be on register. It is better to use getTargetConstant instead of getConstant to select zero constant for call target. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D74465	2020-02-13 10:25:50 +07:00
Austin Kerbow	5db0b2521c	[AMDGPU][GlobalISel] Handle 64byte EltSIze in getRegSplitParts Reviewers: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74518	2020-02-12 19:11:52 -08:00
Melanie Blower	a0d913a1ac	Fix regression due to reviews.llvm.org/D74436 by adding option ffp-contract=off to RUN line	2020-02-12 19:05:18 -08:00
Nico Weber	528bd04f84	Fix ReST syntax on link to "Bisecting LLVM code" page Patch from nicolas17 (Nicolás Alvarez)! Differential Revision: https://reviews.llvm.org/D74422	2020-02-12 21:18:25 -05:00
Frank Laub	fdc7a16a82	[MLIR][Affine] Add affine.parallel op Summary: As discussed in https://llvm.discourse.group/t/rfc-add-affine-parallel/350, this is the first in a series of patches to bring in support for the `affine.parallel` operation. This first patch adds the IR representation along with custom printer/parser implementations. Reviewers: bondhugula, herhut, mehdi_amini, nicolasvasilache, rriddle, earhart, jbruestle Reviewed By: bondhugula, nicolasvasilache, rriddle, earhart, jbruestle Subscribers: jpienaar, burmako, shauheen, antiagainst, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74288	2020-02-12 18:00:24 -08:00
Fangrui Song	c662795b07	[AsmPrinter][ELF] Emit local alias for ExternalLinkage dso_local GlobalAlias	2020-02-12 17:08:22 -08:00
Amy Huang	de1d90299b	Revert "[X86][SSE] lowerShuffleAsBitRotate - lower to vXi8 shuffles to ROTL on pre-SSSE3 targets" This reverts commit `11c16e7159` because it causes a crash in chromium code. See https://reviews.llvm.org/rG11c16e71598d51f15b4cfd0f719c4dabcc0bebf7.	2020-02-12 17:00:37 -08:00
Johannes Doerfert	3aac953afa	Revert "[OpenMP][IRBuilder] Perform finalization (incl. outlining) late" This reverts commit `8a56d64d76`. Will be recommitted once the clang test problem is addressed.	2020-02-12 18:50:43 -06:00
Matt Arsenault	d1b393d92c	AMDGPU/GlobalISel: Select G_CTTZ_ZERO_UNDEF Directly select this rather than going through the intermediate instruction, which may provide some combine value in the future.	2020-02-12 16:19:46 -08:00
Matt Arsenault	045a8921d7	AMDGPU/GlobalISel: Select G_CTLZ_ZERO_UNDEF Directly select this rather than going through the intermediate instruction, which may provide some combine value in the future.	2020-02-12 16:19:45 -08:00
Matt Arsenault	e174c278ca	AMDGPU/GlobalISel: Fix mapping G_ICMP with constrained result When SI_IF is inserted, it constrains the source register with a register class, which was quite likely a G_ICMP. This was incorrectly treating it as a scalar, and then applyMappingImpl would end up producing invalid MIR since this was unexpected. Also fix not using all VGPR sources for vcc outputs.	2020-02-12 16:19:45 -08:00
Matt Arsenault	de71617335	PPC: Prepare tests for switch of default denormal-fp-math These tests fail when the default is switched to assume IEEE denormal handling. I'm not sure if PPC really has a way to control the denormal input handling.	2020-02-12 16:19:45 -08:00
Caroline Lebar	a4384c756b	Replace std::foo with std::foo_t in LLVM. This patch is replacements missed in my last change doing this across LLVM. No functional change, although I think there was a missing typename in struct conjunction that is now fixed.	2020-02-12 16:14:36 -08:00
Mitch Phillips	91e194d1ff	[GWP-ASan] [NFC] Change enum from ANDROID->BIONIC.	2020-02-12 16:06:59 -08:00
Yuanfang Chen	4caeb62e51	[Fuzzer] Rename ExecuteCommandWithPopen to ExecuteCommandNon-Fushsia target will keep using popen/pclose implementation. OnFuchsia, Two-args version of `ExecuteCommand` is a simple wrapper of theone-arg version. (Hopefully) Fix D73329 build on Fuchsia.	2020-02-12 16:03:55 -08:00
Johannes Doerfert	8a56d64d76	[OpenMP][IRBuilder] Perform finalization (incl. outlining) late In order to fix PR44560 and to prepare for loop transformations we now finalize a function late, which will also do the outlining late. The logic is as before but the actual outlining step happens now after the function was fully constructed. Once we have loop transformations we can apply them in the finalize step before the outlining. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D74372	2020-02-12 17:55:01 -06:00
John McCall	77b2ffc498	Fix a reentrance bug with deserializing ObjC type parameters. This is a longstanding bug that seems to have been hidden by a combination of (1) the normal flow being to deserialize the interface before deserializing its parameter and (2) a precise ordering of work that was apparently recently disturbed, perhaps by my abstract-serialization work or Bruno's ObjC module merging work. Fixes rdar://59153545.	2020-02-12 18:44:19 -05:00
Johannes Doerfert	23f41f16d4	[Attributor] Use fine-grained liveness in all helpers We used coarse-grained liveness before, thus we looked if the instruction was executed, but we did not use fine-grained liveness, hence if the instruction was needed or could be deleted even if the surrounding ones are live. This patches introduces this level of liveness checks together with other liveness queries, e.g., for uses. For more control we enforce that all liveness queries go through the Attributor. Test have been adjusted to reflect the changes or augmented to prevent deletion of the parts we want to check. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D73313	2020-02-12 17:36:38 -06:00
Johannes Doerfert	b2c76002ca	[Attributor] Ignore uses if a value is simplified If we have a replacement for a value, via AAValueSimplify, the original value will lose all its uses. Thus, as long as a value is simplified we can skip the uses in checkForAllUses, given that these uses are transitive uses for the simplified version and will therefore affect the simplified version as necessary. Since this allowed us to remove calls without side-effects and a known return value, we need to make sure not to eliminate `musttail` calls. Those we keep around, or later remove the entire `musttail` call chain.	2020-02-12 17:36:38 -06:00
Johannes Doerfert	86509e8c3b	[Attributor] Use assumed information to determine side-effects We relied on wouldInstructionBeTriviallyDead before but that functions does not take assumed information, especially for calls, into account. The replacement, AAIsDead::isAssumeSideEffectFree, does. This change makes AAIsDeadCallSiteReturn more complex as we can have a dead call or only dead users. The test have been modified to include a side effect where there was none in order to keep the coverage. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D73311	2020-02-12 17:36:38 -06:00
Ethan Stewart	190a11148b	Changed omp_get_max_threads() implementation to more closely match spec description. Summary: The 5.0 spec states, "The omp_get_max_threads routine returns an upper bound on the number of threads that could be used to form a new team if a parallel construct without a num_threads clause were encountered after execution returns from this routine." The attached test shows Max Threads: 96, Num Threads: 128 without the proposed change. The number of threads should not exceed the (max) nthreads ICV, hence we should return the higher SPMD thread number even when omp_get_max_threads() is called in a generic kernel. This change does fail the api test, max_threads.c, because now it would return 64 instead of 32. Reviewers: jdoerfert, ABataev, grokos, JonChesterfield Reviewed By: jdoerfert Subscribers: openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D74092	2020-02-12 23:29:34 +00:00
JonChesterfield	c2ce9ea4e3	[libomptarget][nfc] Change enum values to match those in cuda/rtl Summary: [libomptarget][nfc] Change enum values to match those in cuda/rtl support.h and cuda/rtl.cpp (and downsteam hsa/rtl.cpp) have enums for execution mode. These are actually independent - the numbers that used within support, or within the plugin, are never passed across the boundary. Nevertheless, trying to work out why the values are different between the two has generated a reasonable amount of confusion. This patch changes support to match the values in plugin, on the basis that the plugin also has some comments which I'd have to update if I changed that one instead. Credit to Ron for working through this in our own fork. See rocm-developer-tools/aomp/issues/7 for that earlier diagnostic write up. Also happy with generic = 0, spmd = 1 - provided it's the same in both places. Reviewers: jdoerfert, grokos, ABataev, ronlieb Reviewed By: grokos Subscribers: openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D74503	2020-02-12 23:27:08 +00:00
Mitch Phillips	5f2a74c87a	[GWP-ASan] Update alignment on Android. Summary: Android has different alignment requirements. You can read more about them here (https://cs.android.com/android/platform/superproject/+/master:bionic/tests/malloc_test.cpp;l=808), but the general gist is that for malloc(x <= 8), we do malloc(8), and for everything else, we do 16-byte alignment. Reviewers: eugenis, morehouse, cferris Reviewed By: eugenis, morehouse Subscribers: #sanitizers, llvm-commits, pcc Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D74364	2020-02-12 15:24:58 -08:00
Guozhi Wei	369d086d78	[MBP] Partial tail duplication into hot predecessors Current tail duplication embedded in MBP duplicates a BB into all or none of its predecessors without too much cost analysis. So sometimes it is duplicated into cold predecessors, and in other cases it may miss the duplication into hot predecessors. This patch improves tail duplication in 3 aspects: A successor can be duplicated into part of its predecessors. A more fine-grained benefit analysis, combined with 1, now a successor is duplicated into hot predecessors only. If a successor can't be duplicated into one predecessor, it doesn't impact the duplication into other predecessors. Differential Revision: https://reviews.llvm.org/D73387	2020-02-12 15:22:33 -08:00
Petr Hosek	67f4e0011d	[CMake][Fuchsia] Enable in-process cc1 This is now supported by Goma so we can re-enable it.	2020-02-12 14:05:24 -08:00
Alexandre Ganea	20f1abe306	[Clang] Limit -fintegrated-cc1 to only one TU As discussed in https://reviews.llvm.org/D74447, this patch disables integrated-cc1 behavior if there's more than one job to be executed. This is meant to limit memory bloating, given that currently jobs don't clean up after execution (-disable-free is always active in cc1 mode). I see this behavior as temporary until release 10.0 ships (to ease merging of this patch), then we'll reevaluate the situation, see if D74447 makes more sense on the long term. Differential Revision: https://reviews.llvm.org/D74490	2020-02-12 17:02:57 -05:00
Alexandre Ganea	60cba345ca	[Clang] When -ftime-trace is used, clean CompilerInstance::OutputFiles before exiting cc_main() This fixes cc1 execution when '-disable-free' is not used (currently not the case, that flag is always used for cc1).	2020-02-12 17:02:57 -05:00
Nicolas Vasilache	10382ebe8f	[mlir][Linalg] Fix build warnings	2020-02-12 16:50:40 -05:00
Jonas Devlieghere	6e30fd05c9	[lldb/Plugins] Move DynamicLoaderMacOS into DynamicLoaderMacOSXDYLD (NFCI) Move the logic for initialization and termination for DynamicLoaderMacOS into DynamicLoaderMacOSXDYLD so that there's one initializer for the DynamicLoaderMacOSXDYLD plugin.	2020-02-12 13:44:20 -08:00
Tobias Gysi	4f865b7794	[mlir] support creating memref descriptors from static shape with non-zero offset This patch adapts the method MemRefDescriptor::fromStaticShape to support static non-zero offsets. The updated method uses the getStridesAndOffset method to extract strides and offset. The patch also adapts the test cases since sizes and strides are now set in forward instead of reverse order. Differential Revision: https://reviews.llvm.org/D74474	2020-02-12 22:40:49 +01:00
Valentin Clement	56aba9699d	[MLIR] Fix wrong header for mlir-cuda-runner Just updated the wrong header probably copied from the mlir-cpu-runner Differential Revision: https://reviews.llvm.org/D74497	2020-02-12 22:35:46 +01:00
Elizabeth Andrews	a58017e5ca	Fix type-dependency of bitfields in templates This patch is a follow up to `878a24ee24`. Name of bitfields with value-dependent width should be set as type-dependent. This patch adds the required value-dependency check and sets the type-dependency accordingly. Patch fixes PR44886 Differential revision: https://reviews.llvm.org/D72242	2020-02-12 13:31:41 -08:00
Stanislav Mekhanoshin	f8d044bbcf	[TBLGEN] Fix subreg value overflow in DAGISelMatcher Tablegen's DAGISelMatcher emits integers in a VBR format, so if an integer is below 128 it can fit into a single byte, otherwise high bit is set, next byte is used etc. MatcherTable is essentially an unsigned char table. When SelectionDAGISel parses the table it does a reverse translation. In a situation when numeric value of an integer to emit is unknown it can be emitted not as OPC_EmitInteger but as OPC_EmitStringInteger using a symbolic name of the value. In this situation the value should not exceed 127. One of the situations when OPC_EmitStringInteger is used is if we need to emit a subreg into a matcher table. However, number of subregs can exceed 127. Currently last defined subreg for AMDGPU is 192. That results in a silent bug in the ISel with matcher reading from an invalid offset. Fixed this bug to emit actual VBR encoded value for a subregs which value exceeds 127. Differential Revision: https://reviews.llvm.org/D74368	2020-02-12 13:29:57 -08:00
Jinsong Ji	baf3a53b57	[docs] Minor updates to DeveloperPolicy due to svn to git Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D73971	2020-02-12 21:08:15 +00:00
Evandro Menezes	905ccf8b2f	[README] Add note on using cmake to perform the build Also, some spelling fixes. Test commit.	2020-02-12 14:51:24 -06:00
Kelvin Li	4f1f2b7a5b	[OpenMP] update strings output of libomp.so [NFC] Change the string from "Intel(R) OMP" to "LLVM OMP" in libomp.so Differential Revision: https://reviews.llvm.org/D74462	2020-02-12 15:45:55 -05:00
Ehud Katz	d8a2ea9fd5	[LoopExtractor] Fix legacy pass dependencies Fixes a memory leak of allocating `LoopInfoWrapperPass` and `DominatorTreeWrapperPass`.	2020-02-12 22:39:21 +02:00

1 2 3 4 5 ...

342550 Commits All Branches Search

342550 Commits

All Branches