llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Atanasyan	b00f0d4238	[mips] Support 64-bit relative relocations MIPS 64-bit ABI does not provide special PC-relative relocation like R_MIPS_PC32 in 32-bit case. But we can use a "chain of relocation" defined by N64 ABIs. In that case one relocation record might contain up to three relocations which applied sequentially. Width of a final relocation mask applied to the result of relocation depends on the last relocation in the chain. In case of 64-bit PC-relative relocation we need the following chain: `R_MIPS_PC32 \| R_MIPS_64`. The first relocation calculates an offset, but does not truncate the result. The second relocation just apply calculated result as a 64-bit value. The 64-bit PC-relative relocation might be useful in generation of `.eh_frame` sections to escape passing `-Wl,-z,notext` flags to linker. Differential Revision: https://reviews.llvm.org/D80390	2020-06-02 11:44:11 +03:00
Dmitri Gribenko	44f989e780	Run syntax tree tests in many language modes Reviewers: hlopko, eduucaldas Reviewed By: hlopko, eduucaldas Subscribers: gribozavr2, mgorny, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80822	2020-06-02 10:30:01 +02:00
Kazushi (Jam) Marukawa	ec2e9ce73e	[VE] Support I32/F32 registers in assembler parser Summary: Support I32/F32 registers in assembler parser and add regression tests of LD/ST instructions. Differential Revision: https://reviews.llvm.org/D80777	2020-06-02 10:22:45 +02:00
Clement Courbet	5b8c1ed2c8	[llvm-exegesis] Fix D80610. Summary: Using a .data() member on a StringRef was discarding the StringRef size, breaking llvm-exegesis on machines with counter sums (e.g. Zen2). Reviewers: oontvoo Subscribers: mstojanovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80982	2020-06-02 10:10:01 +02:00
Sam Parker	e70cf280f8	[NFC][ARM][AArch64] Test runs Add code size tests runs for memory ops for both architectures.	2020-06-02 09:05:30 +01:00
Joachim Protze	10995c77b4	[OpenMP][OMPT] Fix and add event callbacks for detached tasks The OpenMP spec has the task-fulfill event for a call to omp_fulfill_event. If the task did not yet finish execution, ompt_task_early_fulfill is used, otherwise ompt_task_late_fulfill. If a task does not complete, when the execution finishes (i.e., the task goes in detached mode), ompt_task_detach instead of ompt_task_complete must be used, when the next task is scheduled. A test for both cases is included, which only work with clang-11+ Reviewed By: hbae Differential revision: https://reviews.llvm.org/D80843	2020-06-02 09:52:40 +02:00
Sriraman Tallam	e0bca46b08	Options for Basic Block Sections, enabled in D68063 and D73674. This patch adds clang options: -fbasic-block-sections={all,<filename>,labels,none} and -funique-basic-block-section-names. LLVM Support for basic block sections is already enabled. + -fbasic-block-sections={all, <file>, labels, none} : Enables/Disables basic block sections for all or a subset of basic blocks. "labels" only enables basic block symbols. + -funique-basic-block-section-names: Enables unique section names for basic block sections, disabled by default. Differential Revision: https://reviews.llvm.org/D68049	2020-06-02 00:23:32 -07:00
Denis Antrushin	fa818ded24	[StatepointLowering] Handle UNDEF gc values. Do not spill UNDEF GC values. Instead, replace corresponding gc.relocate intrinsic with an (arbitrary, but recognizable) constant. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D80714	2020-06-02 10:18:33 +03:00
Dominik Montada	052c962ced	[GlobalISel] Combine scalar unmerge(trunc) Summary: Combine unmerge(trunc) to enable other merge combines. Without this combine, the scalar unmerge(trunc(merge)) pattern cannot be combined and easily lead to hard-to-legalize merge/unmerge artifacts. Reviewed By: arsenm Tags: #llvm Differential Revision: https://reviews.llvm.org/D79567	2020-06-02 08:56:18 +02:00
Dominik Montada	b3c6a36dba	[NFC] Move vector unmerge(trunc) combine to function In preparation of D79567, move arsenm's vector unmerge(trunc) combine to a new function `tryFoldUnmergeCast`	2020-06-02 08:56:17 +02:00
Xing GUO	d3f49b8d37	[ObjectYAML][DWARF] Let `dumpPubSection` return `DWARFYAML::PubSection`. Summary: This patch addresses comments in [D80722](https://reviews.llvm.org/D80722#inline-742353) Reviewers: grimar, jhenderson Reviewed By: grimar, jhenderson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80861	2020-06-02 14:38:26 +08:00
MaheshRavishankar	2bcd1927dd	[mlir][SCFToGPU] Remove conversions from scf.for to gpu.launch. Keeping in the affine.for to gpu.launch conversions, which should probably be the affine.parallel to gpu.launch conversion as well. Differential Revision: https://reviews.llvm.org/D80747	2020-06-01 23:06:20 -07:00
Fangrui Song	a6ae333a0c	[ELF] --wrap: don't error `undefined reference to __real_foo` (--no-allow-shlib-undefined) if foo is a wrapped definition This is a regression after D51283. Also, export `foo` if `__real_foo` is referenced by a shared object.	2020-06-01 23:00:51 -07:00
Yevgeny Rouban	07239c736a	[BrachProbablityInfo] Proportional distribution of reachable probabilities When fixing probability of unreachable edges in BranchProbabilityInfo::calcMetadataWeights() proportionally distribute remainder probability over the reachable edges. The old implementation distributes the remainder probability evenly. See examples in the fixed tests. Reviewers: yamauchi, ebrevnov Tags: #llvm Differential Revision: https://reviews.llvm.org/D80611	2020-06-02 12:06:52 +07:00
Richard Smith	4ccb6c36a9	Fix violations of [basic.class.scope]p2. These cases all follow the same pattern: struct A { friend class X; //... class X {}; }; But 'friend class X;' injects 'X' into the surrounding namespace scope, rather than introducing a class member. So the second 'class X {}' is a completely different type, which changes the meaning of the earlier name 'X' from '::X' to 'A::X'. Additionally, the friend declaration is pointless -- members of a class don't need to be befriended to be able to access private members.	2020-06-01 22:03:05 -07:00
Craig Topper	e51d5bc7a4	[X86] Fix a few recursivelyDeleteUnusedNodes calls that were trying to delete nodes before their user was really gone. We looked through a truncate to get to the load. So we should be deleting the truncate first. There is a check that the node is really unused before deleting so this didn't cause a functional issue.	2020-06-01 21:55:13 -07:00
Yevgeny Rouban	3bb0d95fdc	[BrachProbablityInfo] Rename loop variables. NFC	2020-06-02 10:55:27 +07:00
Kostya Serebryany	801d823bde	[asan] fix a comment typo	2020-06-01 19:14:56 -07:00
Kostya Serebryany	2e6c3e3e7b	add debug code to chase down a rare crash in asan/lsan https://github.com/google/sanitizers/issues/1193 Summary: add debug code to chase down a rare crash in asan/lsan https://github.com/google/sanitizers/issues/1193 Reviewers: vitalybuka Subscribers: #sanitizers, llvm-commits Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D80967	2020-06-01 19:14:56 -07:00
John McCall	8a8d703be0	Fix how cc1 command line options are mapped into FP options. Canonicalize on storing FP options in LangOptions instead of redundantly in CodeGenOptions. Incorporate -ffast-math directly into the values of those LangOptions rather than considering it separately when building FPOptions. Build IR attributes from those options rather than a mix of sources. We should really simplify the driver/cc1 interaction here and have the driver pass down options that cc1 directly honors. That can happen in a follow-up, though. Patch by Michele Scandale! https://reviews.llvm.org/D80315	2020-06-01 22:00:30 -04:00
Reid Kleckner	11d1aa0bcc	[COFF] Free some memory used for chunks First, do not reserve numSections in the Chunks array. In cases where there are many non-prevailing sections, this will overallocate memory which will not be used. Second, free the memory for sparseChunks after initializeSymbols. After that, it is never used. This saves 50MB of 627MB for my use case without affecting performance.	2020-06-01 18:51:47 -07:00
Adrian Prantl	a0b674fd7f	Fix UB in EmulateInstructionARM64.cpp This fixes an unhandled signed integer overflow in AddWithCarry() by using the llvm::checkedAdd() function. Thats to Vedant Kumar for the suggestion! <rdar://problem/60926115> Differential Revision: https://reviews.llvm.org/D80955	2020-06-01 18:11:50 -07:00
Vedant Kumar	a66e1d2aa9	[os_log][test] Remove -O1 from a test, NFC	2020-06-01 16:54:16 -07:00
Vedant Kumar	b429a0fef0	[docs] Sketch outline for HowToUpdateDebugInfo.rst Summary: Sketch the outline for a new document that explains how to update debug info in various kinds of code transformations. Some of the guidelines that belong in HowToUpdateDebugInfo.rst were in SourceLevelDebugging.rst already under the debugify section. It seems like the distinction between the two docs ought to be that the former is more prescriptive, while the latter is more descriptive. To that end I've consolidated the "how to update debug info" guidelines which were in SourceLevelDebugging.rst into the new doc, along with the information about using "debugify" to test transformations. Since we've added a mir-debugify pass, I've described that as well. Reviewers: aprantl, jmorse, chrisjackson, dsanders Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80052	2020-06-01 16:45:18 -07:00
Amara Emerson	f573d489b6	[AArch64][GlobalISel] Split G_GLOBAL_VALUE into ADRP + G_ADD_LOW and optimize. The concept of G_GLOBAL_VALUE is nice and simple, but always using it as the representation for global var addressing until selection time creates some problems in optimizing accesses in certain code/relocation models. The problem comes from trying to optimize adrp -> add -> load/store sequences in the most common "small" code model. These accesses can be optimized into an adrp -> load with the add offset being folded into the load's immediate field. If we try to keep all global var references as a single generic instruction then by the time we get to the complex operand trying to match these, we end up generating an adrp at the point of use. The real issue here is that we don't have any form of CSE during selection, so the code size will bloat from many redundant adrp's. This patch custom legalizes small code mode non-GOT G_GLOBALs into target ADRP and a new "target specific generic opcode" G_ADD_LOW. We also teach the localizer to localize these instructions via the custom hook that was added recently. Finally, the complex pattern for indexed loads/stores is extended to try to fold these G_ADD_LOW instructions into the load immediate. On -O0 CTMark, we see a 0.8% geomean code size improvement. We should also see some minor performance improvements too. Differential Revision: https://reviews.llvm.org/D78465	2020-06-01 16:00:56 -07:00
Amara Emerson	19ff00dab8	[AArch64] Fix CollectLOH creating an AdrpAdd LOH when there's a live used reg between the two instructions. If there's a pattern like: $xA = ADRP foo @PAGE [some killing use of reg Xb] $Xb = ADDXri $Xa, 0, @PAGEOFF CollectLOH would create an AdrpAdd LOH that resulted in the linker optimizing this sequence into: $xB = ADR foo [some killing use of reg $Xb] ... and therefore clobbers the live $Xb register that was used by the instruction in between. This was discovered by a GlobalISel patch D78465 which broke up global variable accesses into two pseudos, which in some cases could be moved apart. Differential Revision: https://reviews.llvm.org/D80834	2020-06-01 16:00:55 -07:00
Vedant Kumar	776708b00b	[LiveDebugValues] Remove early-exit when testing regmasks, NFC In transferRegisterDef, if the instruction has a regmask attached, we'll check if any currently used register is clobbered by the regmask. The early exit in this scan isn't necessary, costs a set lookup, and is almost never taken [1]. Delete it. [1] http://lab.llvm.org:8080/coverage/coverage-reports/coverage/Users/buildslave/jenkins/workspace/coverage/llvm-project/llvm/lib/CodeGen/LiveDebugValues.cpp.html#L1136	2020-06-01 15:16:10 -07:00
Matt Arsenault	a8f7209255	AMDGPU: Change internal tracking of wave size Store the log2 wave size instead of forcing division and log2 operations when querying either.	2020-06-01 17:55:08 -04:00
Olivier Giroux	06aaf0b343	Updated synopsis of <atomic> to match what is implemented	2020-06-01 14:30:13 -07:00
Akira Hatanaka	959517ace1	Clean up clang/test/CodeGenObjC/os_log.m Don't run optimization passes at -O2 and remove unneeded #ifdef and test cases.	2020-06-01 13:47:20 -07:00
Kirstóf Umann	6bedfaf520	[analyzer][MallocChecker] Fix the incorrect retrieval of the from argument in realloc() In the added testfile, the from argument was recognized as &Element{SymRegion{reg_$0<long * global_a>},-1 S64b,long} instead of reg_$0<long * global_a>.	2020-06-01 22:38:29 +02:00
Louis Dionne	23776a178f	[libc++] Add assertions on OOB accesses in std::array when the debug mode is enabled Like we do for empty std::array, make sure we have assertions in place for obvious out-of-bounds issues in std::array when the debug mode is enabled (which isn't by default).	2020-06-01 16:37:39 -04:00
Lei Huang	7cfded350a	[PowerPC] Add clang option -m[no-]pcrel Summary: Add user-facing front end option to turn off pc-relative memops. This will be compatible with gcc. Reviewers: stefanp, nemanjai, hfinkel, power-llvm-team, #powerpc, NeHuang, saghir Reviewed By: stefanp, NeHuang, saghir Subscribers: saghir, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc Differential Revision: https://reviews.llvm.org/D80757	2020-06-01 15:34:59 -05:00
Louis Dionne	66a14d151e	[libc++] NFC: Minor refactoring in std::array	2020-06-01 16:28:44 -04:00
Joseph Huber	1a4fb2edcb	[OpenMP] Replace Clang's OpenMP RTL Definitions with OMPKinds.def Summary: This changes Clang's generation of OpenMP runtime functions to use the types and functions defined in OpenMPKinds and OpenMPConstants. New OpenMP runtime function information should now be added to OMPKinds.def. This patch also changed the definitions of __kmpc_push_num_teams and __kmpc_copyprivate to match those found in the runtime. Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: jfb, AndreyChurbanov, openmp-commits, fghanim, hiraditya, sstefan1, cfe-commits, llvm-commits Tags: #openmp, #clang, #llvm Differential Revision: https://reviews.llvm.org/D80222	2020-06-01 16:23:10 -04:00
Reid Kleckner	45fd3e4688	[PDB] Share code to relocate .debug$[SF] sections, NFC Sink relocateDebugChunk near the only call site.	2020-06-01 13:16:57 -07:00
Sterling Augustine	f027cfa37e	For --relativenames, ignore directory 0, which is the comp_dir. Update for upstream comments. Improve test by writing all the debug info by hand. Reviewers: dblaikie, jhenderson Subscribers: hiraditya, MaskRay, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80168	2020-06-01 13:13:37 -07:00
Jonas Devlieghere	382f6d37a1	[lldb/Test] Add test for man page and lldb --help output	2020-06-01 13:04:45 -07:00
Mircea Trofin	999ea25a9e	[llvm][NFC] Cache FAM in InlineAdvisor Summary: This simplifies the interface by storing the function analysis manager with the InlineAdvisor, and, thus, not requiring it be passed each time we inquire for an advice. Reviewers: davidxl, asbirlea Subscribers: eraman, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80405	2020-06-01 13:02:34 -07:00
Daniel Grumberg	a05f1e5ae4	Add DIAError.h to list of headers excluded from the LLVM_DebugInfo_PDB module Differential Revision: https://reviews.llvm.org/D80808	2020-06-01 21:01:05 +01:00
Paula Toth	1ab092b758	[libc] Expose APIGenerator. Summary: This is split off from D79192 and exposes APIGenerator (renames to APIIndexer) for use in generating the integrations tests. Reviewers: sivachandra Reviewed By: sivachandra Subscribers: tschuett, ecnelises, libc-commits Tags: #libc-project Differential Revision: https://reviews.llvm.org/D80832	2020-06-01 12:30:35 -07:00
Reid Kleckner	8f0a660030	[PDB] Use inlinee file checksum offsets directly The inlinees section contains references to the file checksum table. The file checksum table in the PDB must have the same layout as the file checksum table in the object file, so all the existing file id references should stay valid. Previously, we would do this: for all inlined functions: - lookup filename from checksum and string table - make that filename absolute - look up the new file id for that filename up in the new checksum table This lead to pdbMakeAbsolute and remove_dots ending up in the hot path. We should only need to absolutify the source path once, not once every time we process an inline function from that source file. This speeds up linking chrome PGO stage 1 net_unittests.exe from 9.203s to 8.500s (-7.6%). Looking just at time to process symbol records, it goes from ~2000ms to ~1300ms, which is consistent with the overall speedup of about 700ms. This will be less noticeable in debug builds, which have fewer inlined functions records.	2020-06-01 12:28:32 -07:00
Florian Hahn	8f3f88d2f5	[Matrix] Implement matrix index expressions ([][]). This patch implements matrix index expressions (matrix[RowIdx][ColumnIdx]). It does so by introducing a new MatrixSubscriptExpr(Base, RowIdx, ColumnIdx). MatrixSubscriptExprs are built in 2 steps in ActOnMatrixSubscriptExpr. First, if the base of a subscript is of matrix type, we create a incomplete MatrixSubscriptExpr(base, idx, nullptr). Second, if the base is an incomplete MatrixSubscriptExpr, we create a complete MatrixSubscriptExpr(base->getBase(), base->getRowIdx(), idx) Similar to vector elements, it is not possible to take the address of a MatrixSubscriptExpr. For CodeGen, a new MatrixElt type is added to LValue, which is very similar to VectorElt. The only difference is that we may need to cast the type of the base from an array to a vector type when accessing it. Reviewers: rjmccall, anemet, Bigcheese, rsmith, martong Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D76791	2020-06-01 20:08:49 +01:00
Martin Liska	b638b63b99	Move internal_uname to #if SANITIZER_LINUX scope. Remove it from target-specific scope which corresponds to sanitizer_linux.cpp where it lives in the same macro scope. Differential Revision: https://reviews.llvm.org/D80864	2020-06-01 21:04:51 +02:00
Fangrui Song	751f18e7d4	[ELF] Refine --export-dynamic-symbol semantics to be compatible GNU ld 2.35 GNU ld from binutils 2.35 onwards will likely support --export-dynamic-symbol but with different semantics. https://sourceware.org/pipermail/binutils/2020-May/111302.html Differences: 1. -export-dynamic-symbol is not supported 2. --export-dynamic-symbol takes a glob argument 3. --export-dynamic-symbol can suppress binding the references to the definition within the shared object if (-Bsymbolic or -Bsymbolic-functions) 4. --export-dynamic-symbol does not imply -u I don't think the first three points can affect any user. For the fourth point, Not implying -u can lead to some archive members unfetched. Add -u foo to restore the previous behavior. Exact semantics: * -no-pie or -pie: matched non-local defined symbols will be added to the dynamic symbol table. * -shared: matched non-local STV_DEFAULT symbols will not be bound to definitions within the shared object even if they would otherwise be due to -Bsymbolic, -Bsymbolic-functions, or --dynamic-list. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D80487	2020-06-01 11:30:03 -07:00
Sanjay Patel	26ebe936f3	[InstCombine] fix use of base VectorType; NFC SimplifyDemandedVectorElts() bails out on ScalableVectorType anyway, but we can exit faster with the external check. Move this to a helper function because there are likely other vector folds that we can try here.	2020-06-01 14:28:31 -04:00
Matt Arsenault	89d48ccabe	AMDGPU: Fix not emitting nofpexcept on fdiv expansion In this awkward case, we have to emit custom pseudo-constrained FP wrappers. InstrEmitter concludes that since a mayRaiseFPException instruction had a chain, it can't add nofpexcept. Test deferred until mayRaiseFPException is really set on everything.	2020-06-01 14:10:26 -04:00
Vedant Kumar	11c617c417	[LiveDebugValues] Add LocIndex::u32_{location,index}_t types for readability, NFC This is per Adrian's suggestion in https://reviews.llvm.org/D80684.	2020-06-01 11:02:36 -07:00
Vedant Kumar	2ecaf93525	[LiveDebugValues] Speed up removeEntryValue, NFC Summary: Instead of iterating over all VarLoc IDs in removeEntryValue(), just iterate over the interval reserved for entry value VarLocs. This changes the iteration order, hence the test update -- otherwise this is NFC. This appears to give an ~8.5x wall time speed-up for LiveDebugValues when compiling sqlite3.c 3.30.1 with a Release clang (on my machine): ``` ---User Time--- --System Time-- --User+System-- ---Wall Time--- --- Name --- Before: 2.5402 ( 18.8%) 0.0050 ( 0.4%) 2.5452 ( 17.3%) 2.5452 ( 17.3%) Live DEBUG_VALUE analysis After: 0.2364 ( 2.1%) 0.0034 ( 0.3%) 0.2399 ( 2.0%) 0.2398 ( 2.0%) Live DEBUG_VALUE analysis ``` The change in removeEntryValue() is the only one that appears to affect wall time, but for consistency (and to resolve a pending TODO), I made the analogous changes for iterating over SpillLocKind VarLocs. Reviewers: nikic, aprantl, jmorse, djtodoro Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80684	2020-06-01 11:02:36 -07:00
Matt Arsenault	836c7dcf12	DAG: Fix getNode dropping flags if there's a glue output The AMDGPU non-strict fdiv lowering needs to introduce an FP mode switch in some cases, and has custom nodes to provide chain/glue for the intermediate FP operations. We need to propagate nofpexcept here, but getNode was dropping the flags. Adding nofpexcept in the AMDGPU custom lowering is left to a future patch. Also fix a second case where flags were dropped, but in this case it seems it just didn't handle this number of operands. Test will be included in future AMDGPU patch.	2020-06-01 13:48:02 -04:00

1 2 3 4 5 ...

355898 Commits All Branches Search

355898 Commits

All Branches