llvm-project

Commit Graph

Author	SHA1	Message	Date
Siva Chandra Reddy	f5cbbb9988	[libc] Add implementation of POSIX setrlimit and getrlimit functions. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D134016	2022-09-16 20:50:28 +00:00
Teresa Johnson	c2cf93c1a9	[WPD/LTT] Lower type test feeding assumes via phi correctly This fixes https://github.com/llvm/llvm-project/issues/57616. Type test lowering in ThinLTO modules relies on having type id summaries set up for the referenced types, which provide the type test resolution. If there is no summary, the type tests are lowered to false. At the very least, a default type id summary gives the type tests a resolution of Unknown, which is handled correctly (ignored by the first invocation of LTT, and lowered to true by the second). WPD sets up the type id summaries (with a default type test resolution) as it is processing the type tests, but only does this for the patterns handled by WPD, which is a type test directly feeding an assume. In the case of type tests feeding an assume via a phi, the type id summary was not being set up, leading to the type tests being lowered to false incorrectly. Fix this by adding the default type id summary entries for all type ids used on globals during index-only WPD. This is not an issue for hybrid (split-lto-unit) LTO, as in that case the type test resolution is determined and set up during LTT, since the type definitions are in the regular LTO split module, and exported via the summary to the ThinLTO split module. Differential Revision: https://reviews.llvm.org/D134012	2022-09-16 13:50:01 -07:00
Lang Hames	47e9e58808	[ORC][ORC-RT][MachO] Reset __data and __common sections on library close. If we want to be able to close and then re-open a library then we need to reset the data section states when the library is closed. This commit updates MachOPlatform and the ORC runtime to track __data and __common sections, and reset the state in MachOPlatformRuntimeState::dlcloseDeinitialize. This is only a first step to full support -- there are other data sections that we're not capturing, and we'll probably want a more efficient representation for the sections (rather than passing their string name over IPC), but this is a reasonable first step. This commit also contains a fix to MapperJITLinkMemoryManager that prevents it from calling OnDeallocated twice in the case of an error.	2022-09-16 13:40:19 -07:00
Maksim Panchenko	1d5393526c	[BOLT] Change base class of ExecutableFileMemoryManager When we derive EFMM from SectionMemoryManager, it brings into EFMM extra functionality, such as the registry of exception handling sections, page permission management, etc. Such functionality is of no use to llvm-bolt and can even be detrimental (see https://github.com/llvm/llvm-project/issues/56726). Change the base class of ExecutableFileMemoryManager to MemoryManager, avoid registering EH sections, and skip memory finalization. Fixes #56726 Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D133994	2022-09-16 13:39:12 -07:00
Maksim Panchenko	9742c25b98	[BOLT] Fix empty function emission in non-relocation mode In non-relocation mode, every function is emitted in its own section. If a function is empty, RuntimeDyld will still allocate 1-byte section for the function and initialize it with zero. As a result, we will overwrite the first byte of the original function contents with zero. Such scenario can happen when the input function had only NOP instructions which BOLT removes by default. Even though such functions likely cause undefined behavior, it's better to preserve their contents. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D133978	2022-09-16 13:38:32 -07:00
Jessica Paquette	1076b31da8	[GlobalISel] Combine select + fcmp to fminnum/fmaxnum/fminimum/fmaximum This is a partial port of the code used by the SelectionDAGBuilder to translate selects. In particular, see matchSelectPattern in ValueTracking.cpp. This is a GISel-equivalent of the portion which handles fminnum/fmaxnum/fminimum/fmaximum. I tried to set it up so it'd be easy to add the non-FP cases. Those are simpler. On the AArch64-end, it seems like the FP cases are more important for perf right now, so I bit the bullet and went at the more complicated problem. :) I elected to do this as a post-legalize combine rather than in the IRTranslator because Deciding which fmax/fmin to use can depend on legalization rules Philosophically-speaking (TM), putting it in a combine just feels cleaner Being able to enable/disable the combine is handy Another option would be to use the ValueTracking code in the IRTranslator and match what SelectionDAGBuilder::visitSelect does. I think that may be somewhat annoying since we'd need to write lowerings back into the selects in the legalizer. I'm not strongly opposed to the approach. We'd also want to be careful with vector selects once that's implemented, which explicitly check if a vector select is legal on the target. That'd probably need a hook. From what I can tell, doing this as a combine is probably a cleaner option long-term. Differential Revision: https://reviews.llvm.org/D116702	2022-09-16 13:35:46 -07:00
Pengxuan Zheng	59365f33e2	[MachineCSE] Add a threshold to avoid spending too much time in isProfitableToCSE Currently, it can become extremely costly to compute MayIncreasePressure if the size of CSUses turns out to be very large. In that case, it's no longer cost effective to keep computing MayIncreasePressure. Therefore, to limit the amount of time spent in isProfitableToCSE, we simply conservatively assume MayIncreasePressure if the size of CSUses is too large. This can reduce overall compile time by 30% for some benchmarks. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D134003	2022-09-16 13:35:17 -07:00
Craig Topper	1121eca685	[VP][VE] Default VP_SREM/UREM to Expand and add generic expansion using VP_SDIV/UDIV+VP_MUL+VP_SUB. I want to default all VP operations to Expand. These 2 were blocking because VE doesn't support them and the tests were expecting them to fail a specific way. Using Expand caused them to fail differently. Seemed better to emulate them using operations that are supported. @simoll mentioned on Discord that VE has some expansion downstream. Not sure if its done like this or in the VE target. Reviewed By: frasercrmck, efocht Differential Revision: https://reviews.llvm.org/D133514	2022-09-16 13:19:02 -07:00
Xing Xue	8ace9ea258	[libc++][lit][AIX] Enable test case last_write_time.pass.cpp for AIX Summary: This patch enables libc++ LIT test case last_write_time.pass.cpp for AIX. Because system call utimensat() of AIX which is used in the libc++ implementation of last_write_time() does not accept the times parameter with a negative tv_sec or tv_nsec field, testing of setting file time to before epoch time is excluded for AIX. Reviewed by: ldionne, libc++ Differential Revision: https://reviews.llvm.org/D133124	2022-09-16 16:08:40 -04:00
Craig Topper	61595c45af	[RISCV] Simplify some code in vector fp<->int handling. NFC We changed the way container types are selected since this code was written. We no longer need to use the largest type.	2022-09-16 12:56:42 -07:00
Aiden Grossman	c0bc461999	[Clang] Give error message for invalid profile path when compiling IR Before this patch, when compiling an IR file (eg the .llvmbc section from an object file compiled with -Xclang -fembed-bitcode=all) and profile data was passed in using the -fprofile-instrument-use-path flag, there would be no error printed (as the previous implementation relied on the error getting caught again in the constructor of CodeGenModule which isn't called when -x ir is set). This patch moves the error checking directly to where the error is caught originally rather than failing silently in setPGOUseInstrumentor and waiting to catch it in CodeGenModule to print diagnostic information to the user. Regression test added. Reviewed By: xur, mtrofin Differential Revision: https://reviews.llvm.org/D132991	2022-09-16 19:45:57 +00:00
David Majnemer	8a868d8859	Revert "Revert "[clang, llvm] Add __declspec(safebuffers), support it in CodeView"" This reverts commit `cd20a18286` and adds a "let Heading" to NoStackProtectorDocs.	2022-09-16 19:39:48 +00:00
Kazu Hirata	e0bc76eb23	[ModuleInliner] Move InlinePriority and its derived classes to InlineOrder.cpp (NFC) These classes are referred to only from getInlineOrder in InlineOrder.cpp. This patch hides the entire class declarations and definitions in InlineOrder.cpp. Differential Revision: https://reviews.llvm.org/D134056	2022-09-16 12:32:16 -07:00
Stanislav Mekhanoshin	167826ee12	[AMDGPU] Fix runline for windows in sdag-print-divergence.ll. NFC.	2022-09-16 12:15:08 -07:00
Abhina Sreeskantharajan	ce19343114	[test] Use host platform specific error message substitution This patch modifies the testcase to use error substitution so it will pass on all platforms. Reviewed By: fanbo-meng, zibi Differential Revision: https://reviews.llvm.org/D134034	2022-09-16 15:13:45 -04:00
Jakub Kuderski	a0debaa936	[mlir][arith] Remove misleading comment in EmulateWideInt. NFC. At the request of @Mogball.	2022-09-16 15:02:47 -04:00
Amir Ayupov	e002523b65	[BOLT] Verify externally referenced blocks against jump table targets For functions with references to internal offsets from data, verify externally referenced blocks against the set of jump table targets. Mark the function as non-simple if there are any unclaimed data to code references. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D132495	2022-09-16 11:44:33 -07:00
Stanislav Mekhanoshin	a0c8f5fefa	[SDAG] Print divergence in SDNode::dump If target does not support divergence the field is set to false and not printed. Differential Revision: https://reviews.llvm.org/D133984	2022-09-16 11:43:34 -07:00
Wei Yi Tee	a4f8e3d240	Revert "[clang][dataflow] Replace `transfer(const Stmt , ...)` with `transfer(const CFGElement , ...)` in `Analysis/FlowSensitive/Models/UncheckedOptionalAccessModel`." This reverts commit `41f235d268`. Details at https://lab.llvm.org/buildbot#builders/139/builds/28171. Breakage due to API change.	2022-09-16 18:07:35 +00:00
Wei Yi Tee	41f235d268	[clang][dataflow] Replace `transfer(const Stmt , ...)` with `transfer(const CFGElement , ...)` in `Analysis/FlowSensitive/Models/UncheckedOptionalAccessModel`. Reviewed By: gribozavr2, sgatev Differential Revision: https://reviews.llvm.org/D133930	2022-09-16 17:54:12 +00:00
Justin Lebar	8cc3bfd13f	[NFC] Fix indentation in ValueTracking.h. In a separate patch I want to modify ValueTracking.h. When I touch the header, arc wants to clang-format the lines I touch (reasonable!). But then these whitespace changes get mixed into my patch.	2022-09-16 10:46:23 -07:00
Thomas Raoux	54db8cc7b1	[mlir][vector] Remove ExtractMap/InsertMap operations As discussed on discourse: https://discourse.llvm.org/t/vector-vector-distribution-large-vector-to-small-vector/1983/22 removing insert_map/extract_map op as vector distribution now uses warp_execute_on_lane_0 op. Differential Revision: https://reviews.llvm.org/D134000	2022-09-16 17:41:26 +00:00
Michał Górny	587729c3ad	[clang] [Driver] Add an option to disable default config filenames Add a `--no-default-config` option that disables the search for default set of config filenames (based on the compiler executable name). Suggested in https://discourse.llvm.org/t/rfc-adding-a-default-file-location-to-config-file-support/63606. Differential Revision: https://reviews.llvm.org/D134018	2022-09-16 19:38:11 +02:00
Brett Wilson	e191086bfc	[clang-doc] Support default args for functions. Adds support for default arguments in the internal representation and reads these values from the source. Implements writing these values to YAML but does not implement this for the HTML or markdown outputs. Reviewed By: paulkirth Differential Revision: https://reviews.llvm.org/D133732	2022-09-16 17:26:07 +00:00
Zequan Wu	d21b417025	[LLDB][NativePDB] ResolveSymbolContext should return the innermost block Before, it returns the outermost blocks if nested blocks have the same address range. That casuses lldb unable to find variables that are inside inner blocks. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D133601	2022-09-16 10:19:09 -07:00
LLVM GN Syncbot	6073e0a2f7	[gn build] Port `7061a3f3f8`	2022-09-16 16:49:26 +00:00
Jan Svoboda	4e637fcb25	[clang][deps] Make sure ScanInstance outlives collector The `ScanInstance` is a local variable in `DependencyScanningAction::runInvocation()` that is referenced by `ModuleDepCollector`. Since D132405, `ModuleDepCollector` can escape the function and can outlive its `ScanInstance`. This patch fixes that. Reviewed By: benlangmuir Differential Revision: https://reviews.llvm.org/D133988	2022-09-16 09:44:12 -07:00
Kazu Hirata	9111920af8	[ModuleInliner] clang-format ModuleInliner.cpp (NFC)	2022-09-16 09:41:42 -07:00
Michael Buch	e456d2ba8b	[clang][ASTImporter] DeclContext::localUncachedLookup: Continue lookup into decl chain when regular lookup fails The uncached lookup is mainly used in the ASTImporter/LLDB code-path where we're not allowed to load from external storage. When importing a FieldDecl with a DeclContext that had no external visible storage (but came from a Clang module or PCH) the above call to `lookup(Name)` the regular `DeclContext::lookup` fails because: 1. `DeclContext::buildLookup` doesn't set `LookupPtr` for decls that came from a module 2. LLDB doesn't use the `SharedImporterState` In such a case we would never continue with the "slow" path of iterating through the decl chain on the DeclContext. In some cases this means that ASTNodeImporter::VisitFieldDecl ends up importing a decl into the DeclContext a second time. The patch removes the short-circuit in the case where we don't find any decls via the regular lookup. Tests * Un-skip the failing LLDB API tests Differential Revision: https://reviews.llvm.org/D133945	2022-09-16 12:38:50 -04:00
Michael Buch	3004a759bc	[lldb][tests][gmodules] Test for expression evaluator crash for types referencing the same template The problem here is that the ASTImporter adds the template class member FieldDecl to the DeclContext twice. This happens because we don't construct a `LookupPtr` for decls that originate from modules and thus the ASTImporter never realizes that the FieldDecl has already been imported. These duplicate decls then break the assumption of the LayoutBuilder which expects only a single member decl to exist. The test will be fixed by a follow-up revision and is thus skipped for now. Differential Revision: https://reviews.llvm.org/D133944	2022-09-16 12:38:49 -04:00
Kazu Hirata	4475470529	[ModuleInliner] Remove a stale comment (NFC) These comments refer to the nested loop in the module inliner where the inner loop grouped call sites from the same caller. We don't group call sites anymore, so the comment has become stale.	2022-09-16 09:37:43 -07:00
Kazu Hirata	42a90e6017	[ModuleInliner] Remove a redundaunt variable (NFC) In the CGSCC inliner, DidInline was used as an indicator to update the call graph. In the module inliner, DidInline is always true at the end of the "while" loop, so can just drop it.	2022-09-16 09:32:02 -07:00
Simon Pilgrim	95c2c9c5c5	[LoopIdiom][X86] Add non-LZCNT test coverage to 'rshift until zero' idiom tests	2022-09-16 17:23:54 +01:00
Nicolas Vasilache	0422a4407f	[mlir][scf][Transform] Refactor transform.fuse_into_containing_op so it is iterative and supports output fusion. This revision revisits the implementation of `transform.fuse_into_containing_op` so that it iterates on producers one use at a time. Support is added to fuse a producer through a foreach_thread shared tensor argument, in which case we tile and fuse the op inside the containing op and update the shared tensor argument to the unique destination operand. If one cannot find such a unique destination operand the transform fails. Differential Revision: https://reviews.llvm.org/D134051	2022-09-16 09:21:46 -07:00
mbs	7061a3f3f8	[support] Prepare TimeProfiler for cross-thread support This NFC prepares the TimeProfiler to support the construction and completion of time profiling 'entries' across threads. Add ClockType alias so we can change the clock in one place. (trivial) Use c++ usings instead of typedefs Rename Entry to TimeTraceProfilerEntry since this type will eventually become public. Add an intro comment. Add some smoke unit tests. Reviewed By: russell.gallop, rriddle, lattner, jloser Differential Revision: https://reviews.llvm.org/D133153	2022-09-16 10:20:18 -06:00
Wei Yi Tee	9cbdef6103	[clang][dataflow] Replace usage of the deprecated overload of `checkDataflow`. Updated files: - `ChromiumCheckModelTest.cpp`. - `MatchSwitchTest.cpp`. - `MultiVarConstantPropagationTest.cpp`. - `SingleVarConstantPropagationTest.cpp`. - `TestingSupportTest.cpp`. - `TransferTest.cpp`. Reviewed By: gribozavr2, sgatev Differential Revision: https://reviews.llvm.org/D133865	2022-09-16 16:19:07 +00:00
Kazu Hirata	513717ddd0	[ModuleInliner] Remove a write-only variable (NFC) InlinedCallees is a remnant from the CGSCC inliner. We don't use it in the module inliner.	2022-09-16 09:15:53 -07:00
Jakub Kuderski	95c3e9c222	[mlir][arith] Support wide int shrui emulation Tested by checking all 16-bit LHS and all valid RHS when emulating i16 with i8 operations. Reviewed By: antiagainst, Mogball Differential Revision: https://reviews.llvm.org/D133722	2022-09-16 12:09:33 -04:00
Jakub Kuderski	db0d6e567d	[mlir][arith] Support wide integer multiplication emulation Emulate multiplication by splitting each input element of type i2N into 4 digits of type iN and bit width i(N/2). This is so that the intermediate multiplications and additions do not overflow. We extract these i(N/2) digits from iN vector elements by masking (low digit) and shifting right (high digit). The multiplication algorithm used is the standard (long) multiplication. Multiplying two i2N integers produces (at most) a i4N result, but because the calculation of top i2N is not necessary, we omit it. In total, this implementations performs 10 intermediate multiplications and 16 additions. The number of multiplications could be decreased by switching to a more efficient algorithm like Karatsuba. This would, however, require being able to perform (intermediate) wide additions and subtractions, so it is not clear that such implementation would be more efficient. I tested this on all 16-bit inut pairs, when emulating i16 with i8. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D133629	2022-09-16 12:02:25 -04:00
Simon Pilgrim	23cb1c42cd	[CostModel][X86] Update throughput costs for CTLZ ops This was achieved with an updated version of the 'cost-tables vs llvm-mca' script D103695 (and recent fixes to the bdver2 + alderlake models) Adding full CostKinds costs are affecting some other tests as they make assumptions about SizeLatency costs, so they need addressing first	2022-09-16 16:56:49 +01:00
Kazu Hirata	77501bfab8	[IPO] Simplify the module inliner loop (NFC) In the bottom-up inliner, we have a two-level nested "while" loop, with the inner one grouping call sites with the same caller. We need to do so to keep CGSCC up to date. Now, with the module inliner, we don't have any per-caller work. We don't update CGSCC. Plus, the caller will likely keep changing as we pop call sites in some priority order. This patch simply removes the inner "while" loop while indenting its body. Further cleanup is possible, but that's left for follow-up patches. Differential Revision: https://reviews.llvm.org/D133969	2022-09-16 08:56:18 -07:00
Jakub Kuderski	e41498dddb	[mlir][arith] Add initial files for (runtime) integration tests The goal is to have a set of runtime tests for further extercise the wide integer emulation pass and its conversion patterns. This was suggested by @Mogball in D133629. Add a minimal runtime test to demonstrate that printing and pass pipeline works as expected. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D134004	2022-09-16 11:52:48 -04:00
Juan Manuel MARTINEZ CAAMAÑO	e438f2d821	[DAGCombine] Do not fold SRA/SRL of MUL into MULH when MUL's LSB are used, and MUL_LOHI is available Folding into a sra(mul) / srl(mul) into a mulh introduces an extra multiplication to compute the high half of the multiplication, while it is more profitable to compute the high and lower halfs with a single mul_lohi. Differential Revision: https://reviews.llvm.org/D133768	2022-09-16 15:48:36 +00:00
Matheus Izvekov	f4ea3bd4b2	[clang] Fixes how we represent / emulate builtin templates We change the template specialization of builtin templates to behave like aliases. Though unlike real alias templates, these might still produce a canonical TemplateSpecializationType when some important argument is dependent. For example, we can't do anything about make_integer_seq when the count is dependent, or a type_pack_element when the index is dependent. We change type deduction to not try to deduce canonical TSTs of builtin templates. We also change those buitin templates to produce substitution sugar, just like a real instantiation would, making the resulting type correctly represent the template arguments used to specialize the underlying template. And make_integer_seq will now produce a TST for the specialization of it's first argument, which we use as the underlying type of the builtin alias. When performing member access on the resulting type, it's now possible to map from a Subst* node to the template argument as-written used in a regular fashion, without special casing. And this fixes a bunch of bugs with relation to these builtin templates factoring into deduction. Fixes GH42102 and GH51928. Depends on D133261 Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Differential Revision: https://reviews.llvm.org/D133262	2022-09-16 17:44:12 +02:00
Daniel Bertalan	a8843ec952	[lld-macho] Parallelize linker optimization hint processing This commit moves the parsing of linker optimization hints into `ARM64::applyOptimizationHints`. This lets us avoid allocating memory for holding the parsed information, and moves work out of `ObjFile::parse`, which is not parallelized at the moment. This change reduces the overhead of processing LOHs to 25-30 ms when linking Chromium Framework on my M1 machine; previously it took close to 100 ms. There's no statistically significant change in runtime for a --threads=1 link. Performance figures with all 8 cores utilized: N Min Max Median Avg Stddev x 20 3.8027232 3.8760762 3.8505335 3.8454145 0.026352574 + 20 3.7019017 3.8660538 3.7546209 3.7620371 0.032680043 Difference at 95.0% confidence -0.0833775 +/- 0.019 -2.16823% +/- 0.494094% (Student's t, pooled s = 0.0296854) Differential Revision: https://reviews.llvm.org/D133439	2022-09-16 17:38:46 +02:00
Sourabh Singh Tomar	396ed327bb	[flang][OpenMP] Lower OpenMP `taskgroup` construct Lower Fortran OpenMP `taskgroup` to FIR + OpenMP Dialect. Reviewed By: kiranchandramohan, peixin Differential Revision: https://reviews.llvm.org/D133918	2022-09-16 21:07:47 +05:30
Dave Lee	6e450d1347	[lldb] Use SWIG_fail in python-typemaps.swig (NFC) When attempting to use SWIG's `-builtin` flag, there were a few compile failures caused by a mismatch between return type and return value. In those cases, the return type was `int` but many of the type maps assume returning `NULL`/`nullptr` (only the latter caused compile failures). This fix abstracts failure paths to use the `SWIG_fail` macro, which performs `goto fail;`. Each of the generated functions contain a `fail` label, which performs any resource cleanup and returns the appropriate failure value. This change isn't strictly necessary at this point, but seems like the right thing to do, and for anyone who tries `-builtin` later, it resolves those issues. Differential Revision: https://reviews.llvm.org/D133961	2022-09-16 08:25:01 -07:00
Dmitry Preobrazhensky	ef8feb6359	[AMDGPU][MC][NFC] Correct error message Differential Revision: https://reviews.llvm.org/D134028	2022-09-16 18:22:08 +03:00
Matheus Izvekov	591452eb23	NFC: [clang] add template AST test for make_integer_seq and type_pack_element Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Differential Revision: https://reviews.llvm.org/D133261	2022-09-16 17:12:49 +02:00
Zahira Ammarguellat	1b69ce1208	Currently the options ‘ffast-math’ and ‘ffp-contract’ are connected. When ‘ffast-math’ is set, ffp-contract is altered this way: -ffast-math/ Ofast -> ffp-contract=fast -fno-fast-math -> if ffp-contract= fast then ffp-contract=on else ffp-contract unchanged This differs from gcc which doesn’t connect the two options. Connecting these two options in clang, resulted in spurious warnings when the user combines these two options -ffast-math -fno-fast-math; see issue https://github.com/llvm/llvm-project/issues/54625. The issue is that the ‘ffast-math’ option is an on/off flag, but the ‘ffp-contract’ is an on/off/fast flag. So when ‘fno-fast-math’ is used there is no obvious value for ‘ffp-contract’. What should the value of ffp-contract be for -ffp-contract=fast -fno-fast-math and -ffast-math -ffp-contract=fast -fno-fast-math? The current logic sets ffp-contract back to on in these cases. This doesn’t take into account that the value of ffp-contract is modified by an explicit ffp-contract` option. This patch is proposing a set of rules to apply when ffp-contract', ffast-math and fno-fast-math are combined. These rules would give the user the expected behavior and no diagnostic would be needed. See RFC https://discourse.llvm.org/t/rfc-making-ffast-math-option-unrelated-to-ffp-contract-option/61912	2022-09-16 11:08:41 -04:00

... 3 4 5 6 7 ...

436343 Commits All Branches Search

436343 Commits

All Branches