llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	8f3f88d2f5	[Matrix] Implement matrix index expressions ([][]). This patch implements matrix index expressions (matrix[RowIdx][ColumnIdx]). It does so by introducing a new MatrixSubscriptExpr(Base, RowIdx, ColumnIdx). MatrixSubscriptExprs are built in 2 steps in ActOnMatrixSubscriptExpr. First, if the base of a subscript is of matrix type, we create a incomplete MatrixSubscriptExpr(base, idx, nullptr). Second, if the base is an incomplete MatrixSubscriptExpr, we create a complete MatrixSubscriptExpr(base->getBase(), base->getRowIdx(), idx) Similar to vector elements, it is not possible to take the address of a MatrixSubscriptExpr. For CodeGen, a new MatrixElt type is added to LValue, which is very similar to VectorElt. The only difference is that we may need to cast the type of the base from an array to a vector type when accessing it. Reviewers: rjmccall, anemet, Bigcheese, rsmith, martong Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D76791	2020-06-01 20:08:49 +01:00
Martin Liska	b638b63b99	Move internal_uname to #if SANITIZER_LINUX scope. Remove it from target-specific scope which corresponds to sanitizer_linux.cpp where it lives in the same macro scope. Differential Revision: https://reviews.llvm.org/D80864	2020-06-01 21:04:51 +02:00
Fangrui Song	751f18e7d4	[ELF] Refine --export-dynamic-symbol semantics to be compatible GNU ld 2.35 GNU ld from binutils 2.35 onwards will likely support --export-dynamic-symbol but with different semantics. https://sourceware.org/pipermail/binutils/2020-May/111302.html Differences: 1. -export-dynamic-symbol is not supported 2. --export-dynamic-symbol takes a glob argument 3. --export-dynamic-symbol can suppress binding the references to the definition within the shared object if (-Bsymbolic or -Bsymbolic-functions) 4. --export-dynamic-symbol does not imply -u I don't think the first three points can affect any user. For the fourth point, Not implying -u can lead to some archive members unfetched. Add -u foo to restore the previous behavior. Exact semantics: * -no-pie or -pie: matched non-local defined symbols will be added to the dynamic symbol table. * -shared: matched non-local STV_DEFAULT symbols will not be bound to definitions within the shared object even if they would otherwise be due to -Bsymbolic, -Bsymbolic-functions, or --dynamic-list. Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D80487	2020-06-01 11:30:03 -07:00
Sanjay Patel	26ebe936f3	[InstCombine] fix use of base VectorType; NFC SimplifyDemandedVectorElts() bails out on ScalableVectorType anyway, but we can exit faster with the external check. Move this to a helper function because there are likely other vector folds that we can try here.	2020-06-01 14:28:31 -04:00
Matt Arsenault	89d48ccabe	AMDGPU: Fix not emitting nofpexcept on fdiv expansion In this awkward case, we have to emit custom pseudo-constrained FP wrappers. InstrEmitter concludes that since a mayRaiseFPException instruction had a chain, it can't add nofpexcept. Test deferred until mayRaiseFPException is really set on everything.	2020-06-01 14:10:26 -04:00
Vedant Kumar	11c617c417	[LiveDebugValues] Add LocIndex::u32_{location,index}_t types for readability, NFC This is per Adrian's suggestion in https://reviews.llvm.org/D80684.	2020-06-01 11:02:36 -07:00
Vedant Kumar	2ecaf93525	[LiveDebugValues] Speed up removeEntryValue, NFC Summary: Instead of iterating over all VarLoc IDs in removeEntryValue(), just iterate over the interval reserved for entry value VarLocs. This changes the iteration order, hence the test update -- otherwise this is NFC. This appears to give an ~8.5x wall time speed-up for LiveDebugValues when compiling sqlite3.c 3.30.1 with a Release clang (on my machine): ``` ---User Time--- --System Time-- --User+System-- ---Wall Time--- --- Name --- Before: 2.5402 ( 18.8%) 0.0050 ( 0.4%) 2.5452 ( 17.3%) 2.5452 ( 17.3%) Live DEBUG_VALUE analysis After: 0.2364 ( 2.1%) 0.0034 ( 0.3%) 0.2399 ( 2.0%) 0.2398 ( 2.0%) Live DEBUG_VALUE analysis ``` The change in removeEntryValue() is the only one that appears to affect wall time, but for consistency (and to resolve a pending TODO), I made the analogous changes for iterating over SpillLocKind VarLocs. Reviewers: nikic, aprantl, jmorse, djtodoro Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80684	2020-06-01 11:02:36 -07:00
Matt Arsenault	836c7dcf12	DAG: Fix getNode dropping flags if there's a glue output The AMDGPU non-strict fdiv lowering needs to introduce an FP mode switch in some cases, and has custom nodes to provide chain/glue for the intermediate FP operations. We need to propagate nofpexcept here, but getNode was dropping the flags. Adding nofpexcept in the AMDGPU custom lowering is left to a future patch. Also fix a second case where flags were dropped, but in this case it seems it just didn't handle this number of operands. Test will be included in future AMDGPU patch.	2020-06-01 13:48:02 -04:00
Julian Lettner	f97a609b17	[Darwin] Add and adopt a way to query the Darwin kernel version This applies the learnings from [1]. What I intended as a simple cleanup made me realize that the compiler-rt version checks have two separate issues: 1) In some places (e.g., mmap flag setting) what matters is the kernel version, not the OS version. 2) OS version checks are implemented by querying the kernel version. This is not necessarily correct inside the simulators if the simulator runtime isn't aligned with the host macOS. This commit tackles 1) by adopting a separate query function for the Darwin kernel version. 2) (and cleanups) will be dealt with in follow-ups. [1] https://reviews.llvm.org/D78942 rdar://63031937 Reviewed By: delcypher Differential Revision: https://reviews.llvm.org/D79965	2020-06-01 10:37:03 -07:00
Hiroshi Yamauchi	6c27c61d32	[PGO] Improve the working set size heuristics under the partial sample PGO. Summary: The working set size heuristics (ProfileSummaryInfo::hasHugeWorkingSetSize) under the partial sample PGO may not be accurate because the profile is partial and the number of hot profile counters in the ProfileSummary may not reflect the actual working set size of the program being compiled. To improve this, the (approximated) ratio of the the number of profile counters of the program being compiled to the number of profile counters in the partial sample profile is computed (which is called the partial profile ratio) and the working set size of the profile is scaled by this ratio to reflect the working set size of the program being compiled and used for the working set size heuristics. The partial profile ratio is approximated based on the number of the basic blocks in the program and the NumCounts field in the ProfileSummary and computed through the thin LTO indexing. This means that there is the limitation that the scaled working set size is available to the thin LTO post link passes only. Reviewers: davidxl Subscribers: mgorny, eraman, hiraditya, steven_wu, dexonsmith, arphaman, dang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79831	2020-06-01 10:29:23 -07:00
Matt Arsenault	20793b2aef	AMDGPU: Fix test in code directory	2020-06-01 13:26:51 -04:00
Matt Arsenault	ed08c4fb2e	AMDGPU: Remove dead file	2020-06-01 13:26:51 -04:00
hsmahesha	0ed2c04636	[AMDGPU/MemOpsCluster] Let mem ops clustering logic also consider number of clustered bytes Summary: While clustering mem ops, AMDGPU target needs to consider number of clustered bytes to decide on max number of mem ops that can be clustered. This patch adds support to pass number of clustered bytes to target mem ops clustering logic. Reviewers: foad, rampitec, arsenm, vpykhtin, javedabsar Reviewed By: foad Subscribers: MatzeB, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, javed.absar, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80545	2020-06-01 22:52:34 +05:30
Fangrui Song	ee9a251caf	[ELF] Set DF_1_PIE for -pie DF_1_PIE originated from Solaris (https://docs.oracle.com/cd/E36784_01/html/E36857/chapter6-42444.html ). GNU ld since https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=5fe2850dd96483f176858fd75c098313d5b20bc2 sets the flag on non-Solaris platforms. It can help distinguish PIE from ET_DYN. eu-classify from elfutils uses this to recognize PIE (https://sourceware.org/git/?p=elfutils.git;a=commit;h=3f489b5c7c78df6d52f8982f79c36e9a220e8951 ) glibc uses this flag to reject dlopen'ing a PIE (https://sourceware.org/bugzilla/show_bug.cgi?id=24323 ) Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D80872	2020-06-01 10:19:41 -07:00
Stanislav Mekhanoshin	4e963299ee	Temporarily removed unstable test. NFC.	2020-06-01 10:18:54 -07:00
Matt Arsenault	7ad36491ca	AMDGPU: Fix alignment for dynamic allocas The alignment value also needs to be scaled by the wave size.	2020-06-01 13:06:37 -04:00
Christopher Tetreault	796898172c	[SVE] Eliminate calls to default-false VectorType::get() from Clang Reviewers: efriedma, david-arm, fpetrogalli, ddunbar, rjmccall Reviewed By: fpetrogalli, rjmccall Subscribers: tschuett, rkruppe, psnobl, dmgreen, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D80323	2020-06-01 10:02:14 -07:00
Nick Desaulniers	ef1d4bec89	[Clang][CGM] style cleanups NFC Summary: Forked from: https://reviews.llvm.org/D80242 Use the getter for access to DebugInfo consistently. Use break in switch in CodeGenModule::EmitTopLevelDecl consistently. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: cfe-commits, srhines Tags: #clang Differential Revision: https://reviews.llvm.org/D80840	2020-06-01 09:33:08 -07:00
Eric Schweitz	ae6e499d25	[flang] This adds the lowering stubs for Open MP. The lowering bridge will call these lowering hooks to process the Open MP directives that it iterates over in the PFT. This is a mock interface without an implementation in this patch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D80815	2020-06-01 09:11:53 -07:00
Stanislav Mekhanoshin	e132a9c012	Update some names in test. NFC. There seems to be some instability with IR nameing between platforms. Attempted to fix it with replacing dot-numbered names.	2020-06-01 09:11:18 -07:00
Fangrui Song	d9943e7f0c	[Object] Add DF_1_PIE This flag (and the whole field DT_FLAGS_1) originated from Solaris. I intend to use it in an LLD patch D80872. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D80871	2020-06-01 08:56:02 -07:00
Sanjay Patel	b874dc4dda	[InstCombine] add test for select-of-shuffle; NFC This is based on an example in D80658	2020-06-01 11:52:07 -04:00
Stanislav Mekhanoshin	745c6c8458	Process gep (phi ptr1, ptr2) in SROA Differential Revision: https://reviews.llvm.org/D79218	2020-06-01 08:41:05 -07:00
Siva Chandra Reddy	1caedd0c55	[libc] Add implementations of ceil[f], floor[f] and trunc[f] from math.h. Reviewers: abrachet Differential Revision: https://reviews.llvm.org/D80612	2020-06-01 08:36:59 -07:00
Sam Clegg	26c78e3095	[WebAssembly] Update test expectations simd-2.C now compiles thanks to: https://github.com/WebAssembly/wasi-libc/pull/183 Differential Revision: https://reviews.llvm.org/D80930	2020-06-01 08:35:27 -07:00
Sanjay Patel	dd54432a0f	[InstNamer] use 'i' for Instructions, not 'tmp' As discussed in https://bugs.llvm.org/show_bug.cgi?id=45951 and D80584, the name 'tmp' is almost always a bad choice, but we have a legacy of regression tests with that name because it was baked into utils/update_test_checks.py. This change makes -instnamer more consistent (already using "arg" and "bb", the common LLVM shorthand). And it avoids the conflict in telling users of the FileCheck script to run "-instnamer" to create a better regression test and having that cause a warn/fail in update_test_checks.py.	2020-06-01 11:11:14 -04:00
AndreyChurbanov	5e111c5df8	[openmp] Fixed taskloop recursive splitting so that taskloop tasks have same parent tasks. Differential Revision: https://reviews.llvm.org/D80577	2020-06-01 17:51:02 +03:00
Aaron Ballman	522934da1f	Support GCC [[gnu::attributes]] in C2x mode GCC 10.1 introduced support for the [[]] style spelling of attributes in C mode. Similar to how GCC supports __attribute__((foo)) as [[gnu::foo]] in C++ mode, it now supports the same spelling in C mode as well. This patch makes a change in Clang so that when you use the GCC attribute spelling, the attribute is automatically available in all three spellings by default. However, like Clang, GCC has some attributes it only recognizes in C++ mode (specifically, abi_tag and init_priority), which this patch also honors.	2020-06-01 10:42:42 -04:00
Ehud Katz	8a84158e5b	[StructurizeCFG] Fix an incorrect comment, NFC.	2020-06-01 17:42:09 +03:00
Sanjay Patel	c0303e5391	[CodeGen] remove instnamer dependency from test file; NFC This file was originally added without instnamer at: rL283716 / `fe2b9b4fbf` But that was reverted and the test file reappeared with instnamer at: rL285688 / `62f516f590` I'm not seeing any difference locally from checking nameless values, so trying to remove a layering violation and see if that can survive the build bots.	2020-06-01 10:21:17 -04:00
James Henderson	8d9070e040	[Support] Add more context to DataExtractor getLEB128 errors Reviewed by: clayborg, dblaikie, labath Differential Revision: https://reviews.llvm.org/D80799	2020-06-01 14:00:01 +01:00
Raphael Isemann	54422d2170	Revert "[lldb] Pass -fPIC flag even when DYLIB_ONLY is set" This reverts commit `fd0ab3b3eb`. The fix here is incorrect and the actual fault was an incorrect test Makefile. To give some more background: The original test for D80798 compiled three source files into either one executable or one executable + 2 shared libraries, each being one different test setup. If both the monolithic executable and the shared libraries where compiled in the same directory, then Make would overwrite the .o files of one test setup with the other. This caused that while -fPIC was passed correctly to the test setup with the shared libraries, the compiler invocations for the monolithic executable would later overwrite these object files (and as only the test setup with the shared library used -fPIC, it appeared as if the shared library object files didn't receive the -fPIC flag). Thanks to Pavel for figuring this out.	2020-06-01 14:41:08 +02:00
James Henderson	e8bcf4ef07	[DebugInfo] Add use of truncating data extractor to debug line parsing This will ensure that nothing can ever start parsing data from a future sequence and part-read data will be returned as 0 instead. Reviewed by: aprantl, labath Differential Revision: https://reviews.llvm.org/D80796	2020-06-01 12:33:21 +01:00
Raphael Isemann	2b37c5b560	[lldb][NFC] Make ClangExpressionSourceCode's wrapping logic more consistent Summary: ClangExpressionSourceCode has different ways to wrap the user expression based on which context the expression is executed in. For example, if we're in a C++ member function we put the expression inside a fake member function of a fake class to make the evaluation possible. Similar things are done for Objective-C instance/static methods. There is also a default wrapping where we put the expression in a normal function just to make it possible to execute it. The way we currently define which kind of wrapping the expression needs is based on the `wrapping_language` we keep passing to the ClangExpressionSourceCode instance. We repurposed the language type enum for that variable to distinguish the cases above with the following mapping: * language = C_plus_plus -> member function wrapping * language = ObjC -> instance/static method wrapping (`is_static` distinguished between those two). * language = C -> normal function wrapping * all other cases like C_plus_plus11, Haskell etc. make our class a no-op that does mostly nothing. That mapping is currently not documented and just confusing as the `language` is unrelated to the expression language (and in the ClangUserExpression we even pretend that it is the actual language, but luckily never used it for anything). Some of the code in ClangExpressionSourceCode is also obviously thinking that this is the actual language of the expression as it checks for non-existent cases such as `ObjC_plus_plus` which is not part of the mapping. This patch makes a new enum to describe the four cases above (with instance/static Objective-C methods now being their own case). It also make that enum just a member of ClangExpressionSourceCode instead of having to pass the same value to the class repeatedly. This gets also rid of all the switch-case-checks for 'unknown' language such as C_plus_plus11 as this is no longer necessary. Reviewers: labath, JDevlieghere Reviewed By: labath Subscribers: abidh Differential Revision: https://reviews.llvm.org/D80793	2020-06-01 13:24:30 +02:00
Sanjay Patel	e5b8772756	[utils] change default nameless value to "TMP" This is effectively reverting rGbfdc2552664d to avoid test churn while we figure out a better way forward. We at least salvage the warning on name conflict from that patch though. If we change the default string again, we may want to mass update tests at the same time. Alternatively, we could live with the poor naming if we change -instnamer. This also adds a test to LLVM as suggested in the post-commit review. There's a clang test that is also affected. That seems like a layering violation, but I have not looked at fixing that yet. Differential Revision: https://reviews.llvm.org/D80584	2020-06-01 06:54:45 -04:00
James Henderson	7bcde99f77	[llvm-dwarfdump][test] Use verbose output to check expected opcodes The debug_line_invalid.test test case was previously using the interpreted line table dumping to identify which opcodes have been parsed. This change moves to looking for the expected opcodes explicitly. This is probably a little clearer and also allows for testing some cases that wouldn't be easily identifiable from the interpreted table. Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D80795	2020-06-01 11:48:02 +01:00
Simon Pilgrim	014648e8f2	ARMFrameLowering.h - remove unnecessary includes. NFC. They are implicitly included in TargetFrameLowering.h and only ever used in TargetFrameLowering override methods.	2020-06-01 11:47:13 +01:00
Simon Pilgrim	de82114db8	MIPatternMatch.h - remove unused APFloat/APInt includes. NFC.	2020-06-01 11:47:13 +01:00
Igor Kudrin	cbec419b3e	[DebugInfo] Separate fields with commas in headers of type units (3/3). For most tables, we already use commas in headers. This set of patches unifies dumping the remaining ones. Differential Revision: https://reviews.llvm.org/D80806	2020-06-01 17:40:28 +07:00
Igor Kudrin	2a7af30482	[DebugInfo] Separate fields with commas in headers of compile units (2/3). For most tables, we already use commas in headers. This set of patches unifies dumping the remaining ones. Differential Revision: https://reviews.llvm.org/D80806	2020-06-01 17:40:24 +07:00
Igor Kudrin	937403d684	[DebugInfo] Separate fields with commas in headers of .debug_pub* tables (1/3). For most tables, we already use commas in headers. This set of patches unifies dumping the remaining ones. Differential Revision: https://reviews.llvm.org/D80806	2020-06-01 17:39:48 +07:00
Georgii Rymar	b21f32fcec	[llvm-readelf] - Add explicit braces again. NFC. Partially reverts `feee98645d`. Add explicit braces to a different place to fix "error: add explicit braces to avoid dangling else [-Werror,-Wdangling-else]"	2020-06-01 13:10:16 +03:00
Georgii Rymar	feee98645d	[llvm-readelf] - Add explicit braces. NFC. Should fix the BB (http://lab.llvm.org:8011/builders/clang-ppc64le-rhel/builds/3907/steps/build%20stage%201/logs/stdio): llvm-readobj/ELFDumper.cpp:4708:5: error: add explicit braces to avoid dangling else [-Werror,-Wdangling-else] else ^	2020-06-01 12:55:24 +03:00
Ehud Katz	85c3088049	[StructurizeCFG] Fix region nodes ordering This is a reimplementation of the `orderNodes` function, as the old implementation didn't take into account all cases. The new implementation uses SCCs instead of Loops to take account of irreducible loops. Fix PR41509 Differential Revision: https://reviews.llvm.org/D79037	2020-06-01 12:50:35 +03:00
Georgii Rymar	e75efcc3c1	[llvm-readobj] - Improve error reporting for hash tables. This improves the next points for broken hash tables: 1) Use reportUniqueWarning to prevent duplication when --hash-table and --elf-hash-histogram are used together. 2) Dump nbuckets and nchain fields. It is often possible to dump them even when the table itself goes past the EOF etc. Differential revision: https://reviews.llvm.org/D80373	2020-06-01 12:36:23 +03:00
Tim Northover	dace8224f3	AArch64: materialize large stack offset into xzr correctly. When a stack offset was too big to materialize in a single instruction, we were trying to do it in stages: adds xD, sp, #imm adds xD, xD, #imm Unfortunately, if xD is xzr then the second instruction doesn't exist and wouldn't do what was needed if it did. Instead we can use a temporary register for all but the last addition.	2020-06-01 09:30:05 +01:00
Djordje Todorovic	40a3fcb05c	[DebugInfo][CallSites] Remove decl subprograms from 'retainedTypes:' After the D70350, the retainedTypes: isn't being used for the purpose of call site debug info for extern calls, so it is safe to delete it from IR representation. We are also adding a test to ensure the subprogram isn't stored within the retainedTypes: from corresponding DICompileUnit. Differential Revision: https://reviews.llvm.org/D80369	2020-06-01 09:10:05 +02:00
Nathan James	b6d23f2efc	[ASTMatchers] Force c++ unittests to specify correct language standard Force the unittests on c++ code for matchers to specify the correct standard. Reviewed By: gribozavr2 Differential Revision: https://reviews.llvm.org/D80884	2020-06-01 07:52:01 +01:00
serge-sans-paille	11efb0837c	Improve SmallPtrSetImpl::count implementation Relying on the find method implies a roundtrip to the iterator world, which is not costless because iterator creation involves a few check to ensure the iterator is in a valid position (through the SmallPtrSetIteratorImpl::AdvanceIfNotValid method). It turns out that the result of SmallPtrSetImpl::find_imp is either valid or the EndPointer, so there's no need to go through that abstraction, and the compiler cannot guess it. Differential Revision: https://reviews.llvm.org/D80708	2020-06-01 07:49:19 +02:00
serge-sans-paille	af38074874	Fix strict aliasing warning in msan.cpp Use internal_memcpy instead. Differential Revision: https://reviews.llvm.org/D80732	2020-06-01 07:42:10 +02:00

... 3 4 5 6 7 ...

356056 Commits All Branches Search

356056 Commits

All Branches