llvm-project

Commit Graph

Author	SHA1	Message	Date
Dan Liew	8337fb00e6	[compiler-rt] Add `DARWIN_osx_MIN_VER_FLAG` variable to match the other Apple platforms. rdar://problem/59463146	2020-02-14 13:53:53 -08:00
Dan Liew	86dceb9bcd	[compiler-rt] Use the correct minimum version flag for simulators on Apple platforms. For the simulators this changes the following compilation flags ( used both for building the runtime and lit tests). iOS simulator: `-miphoneos-version-min` -> `-mios-simulator-version-min` watchOS simulator: `-mwatchos-version-min` -> `-mwatchos-simulator-version-min` tvOS simulator: `-mtvos-version-min` -> `-mtvos-simulator-version-min` rdar://problem/59463146	2020-02-14 13:53:53 -08:00
Craig Topper	1fe6e6fd85	[llvm-exegesis] Rename range based for loop variable in a unit test so its different than the container being iterated over. NFC It seems like gcc 5.5 wants to iterate over the new variable instead of the container that lives outside the loop. But of course this new container is empty. Plus using a different variable names makes the code more readable.	2020-02-14 13:51:39 -08:00
Craig Topper	8dc659c131	[Hexagon] Add an explicit makeArrayRef to pacify gcc 5.5 The array seemed to have decayed to a pointer before the ArrayRef constructor got called so there was no size information available.	2020-02-14 13:51:39 -08:00
Diego Caballero	d7058acc14	[mlir] Add MemRef filter to affine data copy optimization This patch extends affine data copy optimization utility with an optional memref filter argument. When the memref filter is used, data copy optimization will only generate copies for such a memref. Note: this patch is just porting the memref filter feature from Uday's 'hop' branch: https://github.com/bondhugula/llvm-project/tree/hop. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D74342	2020-02-14 13:41:45 -08:00
Christopher Ferris	5f91c7b980	[scudo][standalone] Allow setting release to OS Summary: Add a method to set the release to OS value as the system runs, and allow this to be set differently in the primary and the secondary. Also, add a default value to use for primary and secondary. This allows Android to have a default that is different for primary/secondary. Update mallopt to support setting the release to OS value. Reviewers: pcc, cryptoad Reviewed By: cryptoad Subscribers: cryptoad, jfb, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D74448	2020-02-14 12:57:34 -08:00
Sean Fertile	b75692c30e	[AsmPrinter] Use the McASMInfo to determine if we need descriptors. In https://reviews.llvm.org/rG8b737688c21a9755cae14cb9343930e0882164ab I switched the condition gating the creation of the descriptor symbol from checking the MCAsmInfo if we need to support descriptors, to if the OS was AIX. Technically the 2 should be interchangeable: if we are targeting AIX then we need to emit XCOFF object files, and the MCAsmInfo must return true for needing function descriptors. This doesn't account for lit test with runsteps that only set the arch. Eg: test/CodeGen/XCore/section-name.ll which when run natively on AIX we end up with a target xcore-ibm-aix and needFunctionDescriptors is false. This patch reverts to using the MCAsmInfo and adds an assert that the target OS must be AIX since that is the only target using the descriptor hook. Differential Revision: https://reviews.llvm.org/D74622	2020-02-14 15:20:39 -05:00
Nico Weber	87e80e5e28	fix some comment typos to cycle bots	2020-02-14 15:18:50 -05:00
Nico Weber	09153ab9d2	[windows] Add /Gw to compiler flags This is like -fdata-sections, and it's not part of /O2 by default for some reason. In the cmake build, reduces the size of clang.exe from 70,358,016 bytes to 69,982,720 bytes. clang-format.exe goes from 3,703,296 bytes to 3,331,072 bytes. Differential Revision: https://reviews.llvm.org/D74573	2020-02-14 15:15:00 -05:00
Austin Kerbow	07824e65bf	[AMDGPU] Always enable XNACK feature when support is explicitly requested Differential Revision: https://reviews.llvm.org/D74630	2020-02-14 11:58:58 -08:00
Evandro Menezes	4af3be7b04	[docs] Add note on using cmake to perform the build Repeat the build instructions from the top level README in the Getting Started guide.	2020-02-14 13:44:56 -06:00
Matt Arsenault	9ec668606b	AMDGPU: Add option to disable CGP division expansion The division expansions in AMDGPUCodeGenPrepare can't be relied on for correctness, since they punt to later optimization and possibly legalization in some cases. We still need a way to be able to write tests for the legalizer versions of the expansion. This is mostly for GlobalISel, since the expected optimzations is expecting aren't implemented. The interaction with the flag to expand 64-bit division in the IR is pretty confusing, but these flags have different purposes.	2020-02-14 11:37:07 -08:00
Sanjay Patel	63ed0eceaf	[x86] remove stray test assertions; NFC I updated the prefix and forgot to manually remove the old names as part of rG6071fc57a45.f	2020-02-14 14:28:50 -05:00
Sanjay Patel	6071fc57a4	[x86] regenerate complete test checks for sqrt{est}; NFC The existing checks were trying to test both CPU-specific codegen and generic codegen with explicit attributes for the various sqrt estimate possibilities, but that was hard to decipher and update (D69989). Instead generate the complete results for various CPUs, and that makes it clear which models have slow/fast sqrt attributes along with all of the other potential diffs (FMA, AVX2, scheduling). Also, explicitly add the function attributes corresponding to whether DAZ/FTZ denorm settings are expected.	2020-02-14 14:21:28 -05:00
Matt Arsenault	34d9a16e54	AMDGPU: Add option to expand 64-bit integer division in IR I didn't realize we were already expanding 24/32-bit division here already. Use the available IntegerDivision utilities. This uses loops, so produces significantly smaller code than the inline DAG expansion. This now requires width reductions of 64-bit divisions before introducing the expanded loops. This helps work around missing legalization in GlobalISel for division, which are the only remaining core instructions that didn't work at all. I think this is plausibly a better implementation than exists in the DAG, although turning it on by default misses out on the constant value optimizations and also needs benchmarking.	2020-02-14 11:16:08 -08:00
Craig Topper	391cc4dd41	[X86] Use ZERO_EXTEND instead of SIGN_EXTEND in the fast isel handling of convert_from_fp16.	2020-02-14 10:57:12 -08:00
Craig Topper	fc0c72b2df	[X86] Add AVX512 support to the fast isel code for Intrinsic::convert_from_fp16/convert_to_fp16.	2020-02-14 10:57:11 -08:00
Alina Sbirlea	1326a5a4cf	[LoopRotate] Get and update MSSA only if available in legacy pass manager. Summary: Potential fix for: https://bugs.llvm.org/show_bug.cgi?id=44889 and https://bugs.llvm.org/show_bug.cgi?id=44408 In the legacy pass manager, loop rotate need not compute MemorySSA when not being in the same loop pass manager with other loop passes. There isn't currently a way to differentiate between the two cases, so this attempts to limit the usage in LoopRotate to only update MemorySSA when the analysis is already available. The side-effect of this is that it will split the Loop pipeline. This issue does not apply to the new pass manager, where we have a flag specifying if all loop passes in that loop pass manager preserve MemorySSA. Reviewers: dmgreen, fedor.sergeev, nikic Subscribers: Prazek, hiraditya, george.burgess.iv, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74574	2020-02-14 10:47:26 -08:00
Matt Arsenault	bfbfa18591	GlobalISel: Lower s64->s16 G_FPTRUNC This is more or less directly ported from the AMDGPU custom lowering for FP_TO_FP16. I made a few minor fixups (using G_UNMERGE_VALUES instead of creating shift/trunc to extract the two halves, and zexting an inverted compare instead of select_cc). This also does not include the fast math expansion the DAG which converts to f32 and then to f16. I think that belongs in a pre-legalize combine instead.	2020-02-14 10:46:58 -08:00
Volkan Keles	187686a22f	[GlobalISel] LegalizationArtifactCombiner: Fix a bug in tryCombineMerges Like COPY instructions explained in D70616, we don't check the constraints when combining G_UNMERGE_VALUES. Use the same logic used in D70616 to check if registers can be replaced, or a COPY instruction needs to be built. https://reviews.llvm.org/D70564	2020-02-14 10:45:58 -08:00
Brian Cain	bf3b86bc2f	[Hexagon] v67+ HVX register pairs should support either direction Assembler now permits pairs like 'v0:1', which are encoded differently from the odd-first pairs like 'v1:0'. The compiler will require more work to leverage these new register pairs.	2020-02-14 12:43:43 -06:00
Aaron Puchert	705306526b	Fix tests after previous commit We don't want to test for this warning, so we just fix it.	2020-02-14 19:41:01 +01:00
Aaron Puchert	2f26bc5542	Warn about zero-parameter K&R definitions in -Wstrict-prototypes Summary: Zero-parameter K&R definitions specify that the function has no parameters, but they are still not prototypes, so calling the function with the wrong number of parameters is just a warning, not an error. The C11 standard doesn't seem to directly define what a prototype is, but it can be inferred from 6.9.1p7: "If the declarator includes a parameter type list, the list also specifies the types of all the parameters; such a declarator also serves as a function prototype for later calls to the same function in the same translation unit." This refers to 6.7.6.3p5: "If, in the declaration “T D1”, D1 has the form D(parameter-type-list) or D(identifier-list_opt) [...]". Later in 6.11.7 it also refers only to the parameter-type-list variant as prototype: "The use of function definitions with separate parameter identifier and declaration lists (not prototype-format parameter type and identifier declarators) is an obsolescent feature." We already correctly treat an empty parameter list as non-prototype declaration, so we can just take that information. GCC also warns about this with -Wstrict-prototypes. This shouldn't affect C++, because there all FunctionType's are FunctionProtoTypes. I added a simple test for that. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D66919	2020-02-14 19:25:02 +01:00
Simon Pilgrim	f0181cc7ba	[APInt] Add some basic APInt::byteSwap unit tests As noted on D74621 we currently have no test coverage	2020-02-14 18:15:13 +00:00
Matt Arsenault	b38940dfb9	TTI: Fix vectorization cost for bswap	2020-02-14 10:14:07 -08:00
Jonas Devlieghere	bba9ba8d95	[lldb/Plugin] s/LLDB_PLUGIN/LLDB_PLUGIN_DEFINE/ (NFC) Rename LLDB_PLUGIN to LLDB_PLUGIN_DEFINE as Pavel suggested in D73067 to avoid name conflict.	2020-02-14 09:58:24 -08:00
Eric Fiselier	e8358455a2	[libc++] Add missing include for is_same in test	2020-02-14 18:55:27 +01:00
Matt Arsenault	8c2c0b3637	AMDGPU: Improve i16/v2i16 bswap	2020-02-14 09:53:22 -08:00
Craig Topper	7badb38918	[X86] Fix copy/paste mistake in comment. NFC	2020-02-14 09:47:50 -08:00
Matt Arsenault	e0fd2d6d62	AMDGPU: Add baseline tests for 16-bit bswap	2020-02-14 09:34:13 -08:00
Matt Arsenault	a257bde420	AMDGPU/GlobalISel: Handle G_BSWAP	2020-02-14 09:09:44 -08:00
Eric Fiselier	cccf1ef0c8	[libc++] Remove cycle between <type_traits> and <cstddef> This was caused by byte depending on traits. This patch moves the minimal amount of meta-programming into <cstddef> to break the cycle.	2020-02-14 17:36:27 +01:00
Alexandre Ganea	0d2ba6577d	Fix compilation breakage introduced by `8404aeb56a`. Also fix BitVector unittest failure when DLLVM_ENABLE_ASSERTIONS are OFF, introduced by `d110c3a9f5`.	2020-02-14 11:17:18 -05:00
Fangrui Song	597dfb3bd5	[Driver] Rename AddGoldPlugin to addLTOOptions. NFC AddGoldPlugin does more than adding `-plugin path/to/LLVMgold.so`. It works with lld and GNU ld, and adds other LTO options. So AddGoldPlugin is no longer a suitable name. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D74591	2020-02-14 08:06:37 -08:00
Evgeniy Brevnov	cae643d596	Reverting D73027 [DependenceAnalysis] Dependecies for loads marked with "ivnariant.load" should not be shared with general accesses(PR42151).	2020-02-14 22:57:23 +07:00
Eric Fiselier	e337fb0790	add type_traits include as required for std::integral_constant	2020-02-14 16:38:28 +01:00
Melanie Blower	9122b92f8e	Revert "Reland D74436 "Change clang option -ffp-model=precise to select ffp-contract=on"" This reverts commit `0a1123eb43`. Want to revert this because it's causing trouble for PowerPC I also fixed test fp-model.c which was looking for an incorrect error message	2020-02-14 07:32:09 -08:00
Alexandre Ganea	8404aeb56a	[Support] On Windows, ensure hardware_concurrency() extends to all CPU sockets and all NUMA groups The goal of this patch is to maximize CPU utilization on multi-socket or high core count systems, so that parallel computations such as LLD/ThinLTO can use all hardware threads in the system. Before this patch, on Windows, a maximum of 64 hardware threads could be used at most, in some cases dispatched only on one CPU socket. == Background == Windows doesn't have a flat cpu_set_t like Linux. Instead, it projects hardware CPUs (or NUMA nodes) to applications through a concept of "processor groups". A "processor" is the smallest unit of execution on a CPU, that is, an hyper-thread if SMT is active; a core otherwise. There's a limit of 32-bit processors on older 32-bit versions of Windows, which later was raised to 64-processors with 64-bit versions of Windows. This limit comes from the affinity mask, which historically is represented by the sizeof(void). Consequently, the concept of "processor groups" was introduced for dealing with systems with more than 64 hyper-threads. By default, the Windows OS assigns only one "processor group" to each starting application, in a round-robin manner. If the application wants to use more processors, it needs to programmatically enable it, by assigning threads to other "processor groups". This also means that affinity cannot cross "processor group" boundaries; one can only specify a "preferred" group on start-up, but the application is free to allocate more groups if it wants to. This creates a peculiar situation, where newer CPUs like the AMD EPYC 7702P (64-cores, 128-hyperthreads) are projected by the OS as two (2) "processor groups". This means that by default, an application can only use half of the cores. This situation could only get worse in the years to come, as dies with more cores will appear on the market. == The problem == The heavyweight_hardware_concurrency() API was introduced so that only one hardware thread per core* was used. Once that API returns, that original intention is lost, only the number of threads is retained. Consider a situation, on Windows, where the system has 2 CPU sockets, 18 cores each, each core having 2 hyper-threads, for a total of 72 hyper-threads. Both heavyweight_hardware_concurrency() and hardware_concurrency() currently return 36, because on Windows they are simply wrappers over std:🧵:hardware_concurrency() -- which can only return processors from the current "processor group". == The changes in this patch == To solve this situation, we capture (and retain) the initial intention until the point of usage, through a new ThreadPoolStrategy class. The number of threads to use is deferred as late as possible, until the moment where the std::threads are created (ThreadPool in the case of ThinLTO). When using hardware_concurrency(), setting ThreadCount to 0 now means to use all the possible hardware CPU (SMT) threads. Providing a ThreadCount above to the maximum number of threads will have no effect, the maximum will be used instead. The heavyweight_hardware_concurrency() is similar to hardware_concurrency(), except that only one thread per hardware core will be used. When LLVM_ENABLE_THREADS is OFF, the threading APIs will always return 1, to ensure any caller loops will be exercised at least once. Differential Revision: https://reviews.llvm.org/D71775	2020-02-14 10:24:22 -05:00
Alexandre Ganea	d9049e871f	[clang-scan-deps] Switch to using a ThreadPool Use a ThreadPool instead of plain std::threads in clang-scan-deps. This is needed to further support https://reviews.llvm.org/D71775. Differential Revision: https://reviews.llvm.org/D74569	2020-02-14 10:24:22 -05:00
Alexandre Ganea	d110c3a9f5	[ADT] Support BitVector as a key in DenseSet/Map This patch adds DenseMapInfo<> support for BitVector and SmallBitVector. This is part of https://reviews.llvm.org/D71775, where a BitVector is used as a thread affinity mask.	2020-02-14 10:24:22 -05:00
Alex Richardson	c29310707e	Fix line endings produced by update_cc_test_checks.py Use the same appraoch as update_llc_test_checks.py to always write \n line endings. This should fix the Windows buildbots.	2020-02-14 15:17:27 +00:00
Louis Dionne	f54e7b4e3a	[libc++] Remove unnecessary typenames from std/numerics/c.math/abs.pass.cpp There are some unnecessary typenames in std/numerics/c.math/abs.pass.cpp; e.g. they're not in a dependent context. Patch by Bryce Adelstein Lelbach Differential Revision: https://reviews.llvm.org/D72106	2020-02-14 16:04:40 +01:00
Luís Marques	13700c383f	Revert "[clang-tools-extra] fix the check for if '-latomic' is necessary" This reverts commit `1d40c41506`. This seemed to have caused build failures on ARM/AArch64.	2020-02-14 15:01:52 +00:00
Haojian Wu	5dc2314d5e	[clangd] Update the CompletionItemKind. Summary: Fix some FIXMEs. Reviewers: kadircet Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74609	2020-02-14 15:48:30 +01:00
Alex Richardson	61dd0603bd	Move update_cc_test_checks.py tests to clang Having tests that depend on clang inside llvm/ are not a good idea since it can break incremental `ninja check-llvm`. Fixes https://llvm.org/PR44798 Reviewed By: lebedev.ri, MaskRay, rsmith Differential Revision: https://reviews.llvm.org/D74051	2020-02-14 14:39:55 +00:00
Haojian Wu	74c97ca1b4	[clangd] Add tracer to the rename workflow, NFC Reviewers: kbobyrev Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D74395	2020-02-14 15:39:29 +01:00
Teresa Johnson	2102ef8aad	Reenable "Always import constants" after compile time fixes Summary: Reenables importing of constants by default, which was disabled in D73724 due to excessive thin link times. These inefficiencies were fixed in D73851. I re-measured thin link times for a number of binaries that had compile time explosions with importing of constants previously and confirmed they no longer have any notable increases with it enabled. Reviewers: wmi, evgeny777 Subscribers: hiraditya, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74512	2020-02-14 06:37:14 -08:00
Pavel Iliin	b6a9fe2099	[AArch64] Add BIT/BIF support. This patch added generation of SIMD bitwise insert BIT/BIF instructions. In the absence of GCC-like functionality for optimal constraints satisfaction during register allocation the bitwise insert and select patterns are matched by pseudo bitwise select BSP instruction with not tied def. It is expanded later after register allocation with def tied to BSL/BIT/BIF depending on operands registers. This allows to get rid of redundant moves. Reviewers: t.p.northover, samparker, dmgreen Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D74147	2020-02-14 14:19:39 +00:00
Gokturk Yuksek	1d40c41506	[clang-tools-extra] fix the check for if '-latomic' is necessary Summary: The CheckAtomic module performs two tests to determine if passing '-latomic' to the linker is required: one for 64-bit atomics, and another for non-64-bit atomics. clangd only uses the result from HAVE_CXX_ATOMICS64_WITHOUT_LIB. This is incomplete because there are uses of non-64-bit atomics in the code, such as the ReplyOnce::Replied of type std::atomic<bool> defined in clangd/ClangdLSPServer.cpp. Fix by also checking for the result of HAVE_CXX_ATOMICS_WITHOUT_LIB. See also: https://reviews.llvm.org/D68964 Reviewers: ilya-biryukov, nridge, kadircet, beanz, compnerd, luismarques Reviewed By: luismarques Tags: #clang Differential Revision: https://reviews.llvm.org/D69869	2020-02-14 14:16:10 +00:00
Luís Marques	9816e726e7	[Driver][RISCV] Add RedHat Linux RISC-V triple Summary: Adds the RedHat Linux triple to the list of 64-bit RISC-V triples. Without this the gcc libraries wouldn't be found by clang on a redhat/fedora system, as the search list included `/usr/lib/gcc/riscv64-redhat-linux-gnu` but the correct path didn't include the `-gnu` suffix. Reviewers: lenary, asb, dlj Reviewed By: lenary Tags: #clang Differential Revision: https://reviews.llvm.org/D74399	2020-02-14 13:46:26 +00:00

1 2 3 4 5 ...

342684 Commits All Branches Search

342684 Commits

All Branches