llvm-project

Commit Graph

Author	SHA1	Message	Date
Filipe Cabecinhas	8c1c8e5d7c	Revert "Compiler-rt part of D26230: Add (constant) masked load/store support (Try #2 )" This reverts commit r288504. clang-bpf-build fails with no details: ****************** TEST 'AddressSanitizer-x86_64-linux :: TestCases/masked-ops.cpp' FAILED **************** Script: -- /mnt/buildbot/slave-root/clang-bpf-build/stage1/./bin/clang --driver-mode=g++ -fsanitize=address -mno-omit-leaf-frame-pointer -fno-omit-frame-pointer -fno-optimize-sibling-calls -gline-tables-only -m64 -o /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp -mavx -O1 not /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp l1 2>&1 \| FileCheck -check-prefix=CHECK-L1 /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp l6 2>&1 \| FileCheck -check-prefix=CHECK-L6 /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp la 2>&1 \| FileCheck -check-prefix=CHECK-LA /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp not /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp s1 2>&1 \| FileCheck -check-prefix=CHECK-S1 /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp s6 2>&1 \| FileCheck -check-prefix=CHECK-S6 /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp /mnt/buildbot/slave-root/clang-bpf-build/stage1/projects/compiler-rt/test/asan/X86_64LinuxConfig/TestCases/Output/masked-ops.cpp.tmp sa 2>&1 \| FileCheck -check-prefix=CHECK-SA /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp -- Exit Code: 2 Command Output (stderr): -- FileCheck error: '-' is empty. FileCheck command line: FileCheck -check-prefix=CHECK-L6 /mnt/buildbot/slave-root/clang-bpf-build/llvm/projects/compiler-rt/test/asan/TestCases/masked-ops.cpp -- ****************** llvm-svn: 288507	2016-12-02 16:19:14 +00:00
Nicolai Haehnle	33ca182c91	[DAGCombiner] do not fold (fmul (fadd X, 1), Y) -> (fmad X, Y, Y) by default Summary: When X = 0 and Y = inf, the original code produces inf, but the transformed code produces nan. So this transform (and its relatives) should only be used when the no-infs-fp-math flag is explicitly enabled. Also disable the transform using fmad (intermediate rounding) when unsafe-math is not enabled, since it can reduce the precision of the result; consider this example with binary floating point numbers with two bits of mantissa: x = 1.01 y = 111 x * (y + 1) = 1.01 * 1000 = 1010 (this is the exact result; no rounding occurs at any step) x * y + x = 1000.11 + 1.01 =r 1000 + 1.01 = 1001.01 =r 1000 (with rounding towards zero) The example relies on rounding towards zero at least in the second step. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98578 Reviewers: RKSimon, tstellarAMD, spatel, arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D26602 llvm-svn: 288506	2016-12-02 16:06:18 +00:00
Simon Pilgrim	9cb74267ac	Tidyup code with indentation and clang-format. NFCI. llvm-svn: 288505	2016-12-02 15:44:30 +00:00
Filipe Cabecinhas	2f461bf24a	Compiler-rt part of D26230: Add (constant) masked load/store support (Try #2 ) Summary: Unfortunately, there is no way to emit an llvm masked load/store in clang without optimizations, and AVX enabled. Unsure how we should go about making sure this test only runs if it's possible to execute AVX code. Reviewers: kcc, RKSimon, pgousseau Subscribers: kubabrecka, dberris, llvm-commits Differential Revision: https://reviews.llvm.org/D26506 llvm-svn: 288504	2016-12-02 15:33:04 +00:00
Daniel Cederman	ef62c59dd6	[Sparc] Fix parsing of double-precision %f18, %f20, and %f22 Summary: They are currently being parsed as %f14, %f16, and %f18. Reviewers: venkatra, jyknight Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27342 llvm-svn: 288503	2016-12-02 15:05:26 +00:00
Felix Berger	e4ab0602f2	[clang-tidy] Do not trigger unnecessary-value-param check on methods marked as final Summary: Virtual method overrides of dependent types cannot be recognized unless they are marked as override or final. Exclude methods marked as final from check and add test. Reviewers: sbenza, hokein, alexfh Subscribers: malcolm.parsons, JDevlieghere, cfe-commits Differential Revision: https://reviews.llvm.org/D27248 llvm-svn: 288502	2016-12-02 14:44:16 +00:00
Simon Pilgrim	3a19863f1c	[X86][SSE] Renamed shuffle combine test. We're trying to combine to vpunpckhbw not vpunpckhwd llvm-svn: 288501	2016-12-02 14:43:39 +00:00
Teresa Johnson	17fc326d9a	CODE_OWNERS: Take ownership of IR Linker as discussed on llvm-dev llvm-svn: 288500	2016-12-02 14:06:53 +00:00
Simon Pilgrim	cbf5f97018	[X86][SSE] Add support for extracting constant bit data from broadcasted constants llvm-svn: 288499	2016-12-02 13:16:08 +00:00
Haojian Wu	08e402ac93	[clang-move] some tweaks. * Don't save SourceManager for each declarations. * Rename some out-dated methods. No functionality change. llvm-svn: 288498	2016-12-02 12:39:39 +00:00
Alexey Bataev	e8e94a7176	[SLP] Fix for PR6246: vectorization for scalar ops on vector elements. When trying to vectorize trees that start at insertelement instructions function tryToVectorizeList() uses vectorization factor calculated as MinVecRegSize/ScalarTypeSize. But sometimes it does not work as tree cost for this fixed vectorization factor is too high. Patch tries to improve the situation. It tries different vectorization factors from max(PowerOf2Floor(NumberOfVectorizedValues), MinVecRegSize/ScalarTypeSize) to MinVecRegSize/ScalarTypeSize and tries to choose the best one. Differential Revision: https://reviews.llvm.org/D27215 llvm-svn: 288497	2016-12-02 12:20:22 +00:00
Simon Pilgrim	b3ae416839	[X86] Refactored getTargetConstantBitsFromNode to allow for expansion. NFCI. getTargetConstantBitsFromNode currently only extracts constant pool vector data, but it will need to be generalized to support broadcast and scalar constant pool data as well. Converted Constant bit extraction and Bitset splitting to helper lambda functions. llvm-svn: 288496	2016-12-02 11:58:05 +00:00
Eric Liu	4bf67af2d5	Fix a buildbot failure in include-fixer. llvm-svn: 288495	2016-12-02 11:23:07 +00:00
Pavel Labath	e705c8b5e6	Replace __ANDROID_NDK__ with __ANDROID__ Summary: This replaces all the uses of the __ANDROID_NDK__ define with __ANDROID__. This is a preparatory step to remove our custom android toolchain file and rely on the standard android NDK one instead, which does not provide this define. Instead I rely, on __ANDROID__, which is set by the compiler. I haven't yet removed the cmake variable with the same name, as we will need to do something completely different there -- NDK toolchain defines CMAKE_SYSTEM_NAME to Android, while our current one pretends it's linux. Reviewers: tberghammer, zturner Subscribers: danalbert, srhines, mgorny, lldb-commits Differential Revision: https://reviews.llvm.org/D27305 llvm-svn: 288494	2016-12-02 11:15:15 +00:00
Eric Liu	964782adbb	[ClangFormat] Only insert #include into the #include block in the beginning of the file. Summary: This avoid inserting #include into: - raw string literals containing #include. - #if block. - Special #include among declarations (e.g. functions). Reviewers: djasper Subscribers: cfe-commits, klimek Differential Revision: https://reviews.llvm.org/D26909 llvm-svn: 288493	2016-12-02 11:01:43 +00:00
Simon Pilgrim	c70d3796fb	[SLPVectorizer][X86] Add tests for vectorization of buildvector of scalar fp-ops (PR6246) llvm-svn: 288492	2016-12-02 10:54:46 +00:00
Alex Lorenz	8c1b5c9ea9	[Frontend] Fix an issue where a quoted search path is incorrectly removed as a duplicate header search path The commit r126167 started passing the First index into RemoveDuplicates, but forgot to update 0 to First in the loop that looks for the duplicate. This resulted in a bug where an -iquoted search path was incorrectly removed if you passed in the same path into -iquote and more than one time into -isystem. rdar://23991350 Differential Revision: https://reviews.llvm.org/D27298 llvm-svn: 288491	2016-12-02 09:51:51 +00:00
NAKAMURA Takumi	5ce101a848	compiler-rt/test/profile/Linux/lit.local.cfg: [Py3] Use text mode (universal_newlines=True). llvm-svn: 288490	2016-12-02 08:17:17 +00:00
Tobias Grosser	bedef00e2c	[ScopInfo] Fold constant coefficients in array dimensions to the right This allows us to delinearize code such as the one below, where the array sizes are A[][2 * n] as there are n times two elements in the innermost dimension. Alternatively, we could try to generate another dimension for the struct in the innermost dimension, but as the struct has constant size, recovering this dimension is easy. struct com { double Real; double Img; }; void foo(long n, struct com A[][n]) { for (long i = 0; i < 100; i++) for (long j = 0; j < 1000; j++) A[i][j].Real += A[i][j].Img; } int main() { struct com A[100][1000]; foo(1000, A); llvm-svn: 288489	2016-12-02 08:10:56 +00:00
Maxim Ostapenko	6bdcb1f0e9	[sanitizer] Add a bunch of ifdefs for sparc targets to avoid build failures. Differential Revision: https://reviews.llvm.org/D27301 llvm-svn: 288488	2016-12-02 08:07:35 +00:00
Rui Ueyama	3a618e5606	Port parallel ICF to COFF. LLD used to take 11.73 seconds to link Clang. Now it is 6.94 seconds. MSVC link takes 83.02 seconds. Note that ICF is enabled by default on Windows, so a low latency ICF is more important than in ELF. llvm-svn: 288487	2016-12-02 08:03:58 +00:00
Stephan Bergmann	612d8c66f4	Don't include system header inside namespace ...causes build failure at least with GCC 6.2.1, as smmintrin.h indirectly includes cstdlib, which then runs into problems. llvm-svn: 288486	2016-12-02 08:03:57 +00:00
Rafael Espindola	5708b2f8a6	Ignore R_X86_64_NONE. It looks like the way dtrace works is * The user creates .o files that reference magical symbol names. * dtrace reads those files, collecs the info it needs and changes the relocation to R_X86_64_NONE expecting the linker to ignore them. llvm-svn: 288485	2016-12-02 08:00:09 +00:00
Craig Topper	4961fa9bba	[AVX-512] Add EVEX vpshuflw/vpshufhw/vpshufd instructions to load folding tables. llvm-svn: 288484	2016-12-02 07:57:11 +00:00
Rui Ueyama	27498b5dd5	Fix a bug in ICF involving COFF associative sections. Associative sections are sections that need to be linked if their associated sections are linked. Associative sections are used to append auxiliary data such as debug info. Previously, we compared all associative sections when comparing two comdat sections. Because usually assocative sections are not mergeable sections, we missed a lot of mergeable sections. MSVC linker doesn't seem to check the identity of associative sections. This patch makes LLD to ignore associative sections when doing ICF. llvm-svn: 288483	2016-12-02 07:46:12 +00:00
Craig Topper	17ddb521ef	[AVX-512] Add EVEX PSHUFB instructions to load folding tables. llvm-svn: 288482	2016-12-02 07:06:30 +00:00
Craig Topper	f7866fad54	[AVX-512] Add masked VINSERTF/VINSERTI instructions to load folding tables. llvm-svn: 288481	2016-12-02 06:24:38 +00:00
Rui Ueyama	1b6bab011c	Fix the worse case performance of ICF. r288228 seems to have regressed ICF performance in some cases in which a lot of sections are actually mergeable. In r288228, I made a change to create a Range object for each new color group. So every time we split a group, we allocated and added a new group to a list of groups. This patch essentially reverted r288228 with an improvement to parallelize the original algorithm. Now the ICF main loop is entirely allocation-free and lock-free. Just like pre-r288228, we search for group boundaries by linear scan instead of managing the information using Range class. r288228 was neutral in performance-wise, and so is this patch. I confirmed that this produces the exact same result as before using chromium and clang as tests. llvm-svn: 288480	2016-12-02 05:35:46 +00:00
Tobias Grosser	491b799a4d	[ScopInfo] Separate construction and finalization of memory accesses [NFC] After having built memory accesses we perform some additional transformations on them to increase the chances that our delinearization guesses the right shape. Only after these transformations, we take the assumptions that the array shape we predict is such that no out-of-bounds memory accesses arise. Before this change, the construction of the memory access, the access folding that improves the represenation for certain parametric subscripts, and taking the assumption was all done right after a memory access was created. In this change we split this now into three separate iterations over all memory accesses. This means only after all memory accesses have been built, we start to canonicalize accesses, and to take assumptions. This split prepares for future canonicalizations that must consider all memory accesses for deriving additional beneficial transformations. llvm-svn: 288479	2016-12-02 05:21:22 +00:00
NAKAMURA Takumi	91a5fa1b6f	clang/test/Driver/defsym.s: Appease targeting msc. It is incapable of external assembler in trunk. llvm-svn: 288478	2016-12-02 05:09:21 +00:00
Rafael Espindola	103fc28961	Add a test documenting how we handle addends on Elf_Rela. llvm-svn: 288477	2016-12-02 04:20:47 +00:00
Peter Collingbourne	bc0705240e	IR: Move NumElements field from {Array,Vector}Type to SequentialType. Now that PointerType is no longer a SequentialType, all SequentialTypes have an associated number of elements, so we can move that information to the base class, allowing for a number of simplifications. Differential Revision: https://reviews.llvm.org/D27122 llvm-svn: 288464	2016-12-02 03:20:58 +00:00
Dehao Chen	c3be225895	Change LoopUnrollPass cost from int to unsigned to make it consistent. (NFC) llvm-svn: 288463	2016-12-02 03:17:07 +00:00
Peter Collingbourne	4568158c4d	IR: Change PointerType to derive from Type rather than SequentialType. As proposed on llvm-dev: http://lists.llvm.org/pipermail/llvm-dev/2016-October/106640.html This is for a couple of reasons: - Values of type PointerType are unlike the other SequentialTypes (arrays and vectors) in that they do not hold values of the element type. By moving PointerType we can unify certain aspects of how the other SequentialTypes are handled. - PointerType will have no place in the SequentialType hierarchy once pointee types are removed, so this is a necessary step towards removing pointee types. Differential Revision: https://reviews.llvm.org/D26595 llvm-svn: 288462	2016-12-02 03:05:41 +00:00
Rafael Espindola	858c092daa	Allow duplicated abs symbols with the same value. This is a fairly reasonable bfd extension since there is one obvious value. dtrace depends on this feature as it creates multiple absolute symbols with the same value. llvm-svn: 288461	2016-12-02 02:58:21 +00:00
Peter Collingbourne	25a40759c1	Fix GlobalISel build. llvm-svn: 288460	2016-12-02 02:55:30 +00:00
Matt Arsenault	47a4b39646	ConstantFolding: Factor code into helper function llvm-svn: 288459	2016-12-02 02:26:02 +00:00
Peter Collingbourne	ab85225be4	IR: Change the gep_type_iterator API to avoid always exposing the "current" type. Instead, expose whether the current type is an array or a struct, if an array what the upper bound is, and if a struct the struct type itself. This is in preparation for a later change which will make PointerType derive from Type rather than SequentialType. Differential Revision: https://reviews.llvm.org/D26594 llvm-svn: 288458	2016-12-02 02:24:42 +00:00
Richard Smith	6afcab3588	Update implementation of ABI support for throwing noexcept function pointers and catching as non-noexcept to match the final design per discusson on cxx-abi-dev. llvm-svn: 288457	2016-12-02 02:06:53 +00:00
Jason Henline	13bba6966f	[CUDA] Fix faulty test from rL288448 Summary: The test introduced by rL288448 is currently failing because unimportant but unexpected errors appear as output from a test compile line. This patch looks for a more specific error message, in order to avoid false positives. Reviewers: jlebar Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D27328 Switch to more specific error llvm-svn: 288453	2016-12-02 02:04:43 +00:00
Richard Smith	6cc02c2fcd	p0012r1: define corresponding feature test macro llvm-svn: 288452	2016-12-02 02:02:23 +00:00
Rafael Espindola	f4ff80c128	Write the addent to got entries when using Elf_Rel. llvm-svn: 288451	2016-12-02 01:57:24 +00:00
Paul Robinson	dad4907bc1	[DWARF] Put linkage-name on abstract origin even when there's a declaration. In r266692, we made it possible to emit linkage names for just inlined functions, putting the attribute on the abstract origin. Make sure we don't think the linkage-name was already emitted on a declaration. Differential Revision: http://reviews.llvm.org/D27320 llvm-svn: 288450	2016-12-02 01:55:17 +00:00
Richard Smith	444e6f3d82	Recover better from an incompatible .pcm file being provided by -fmodule-file=. We try to include the headers of the module textually in this case, still enforcing the modules semantic rules. In order to make that work, we need to still track that we're entering and leaving the module. Also, if the module was also marked as unavailable (perhaps because it was missing a file), we shouldn't mark the module unavailable -- we don't need the module to be complete if we're going to enter it textually. llvm-svn: 288449	2016-12-02 01:52:28 +00:00
Jason Henline	c3e24403f0	[CUDA] "Support" ASAN arguments in CudaToolChain This fixes a bug that was introduced in rL287285. The bug made it illegal to pass -fsanitize=address during CUDA compilation because the CudaToolChain class was switched from deriving from the Linux toolchain class to deriving directly from the ToolChain toolchain class. When CudaToolChain derived from Linux, it used Linux's getSupportedSanitizers method, and that method allowed ASAN, but when it switched to deriving directly from ToolChain, it inherited a getSupportedSanitizers method that didn't allow for ASAN. This patch fixes that bug by creating a getSupportedSanitizers method for CudaToolChain that supports ASAN. This patch also fixes the test that checks that -fsanitize=address is passed correctly for CUDA builds. That test didn't used to notice if an error message was emitted, and that's why it didn't catch this bug when it was first introduced. With the fix from this patch, that test will now catch any similar bug in the future. llvm-svn: 288448	2016-12-02 01:42:54 +00:00
Dan Gohman	d0eed81dc0	[WebAssembly] Add an -mdirect flag for the direct wasm object feature. Add a target flag for enabling the new direct wasm object emission feature. llvm-svn: 288447	2016-12-02 01:12:40 +00:00
Teresa Johnson	185b4ab6d4	[ThinLTO] Stop importing constant global vars as copies in the backend Summary: We were doing an optimization in the ThinLTO backends of importing constant unnamed_addr globals unconditionally as a local copy (regardless of whether the thin link decided to import them). This should be done in the thin link instead, so that resulting exported references are marked and promoted appropriately, but will need a summary enhancement to mark these variables as constant unnamed_addr. The function import logic during the thin link was trying to handle this proactively, by conservatively marking all values referenced in the initializer lists of exported global variables as also exported. However, this only handled values referenced directly from the initializer list of an exported global variable. If the value is itself a constant unnamed_addr variable, we could end up exporting its references as well. This caused multiple issues. The first is that the transitively exported references weren't promoted. Secondly, some could not be promoted/renamed (e.g. they had a section or other constraint). recursively, instead of just adding the first level of initializer list references to the ExportList directly. Remove this optimization and the associated handling in the function import backend. SPEC measurements indicate we weren't getting much from it in any case. Fixes PR31052. Reviewers: mehdi_amini Subscribers: krasin, llvm-commits Differential Revision: https://reviews.llvm.org/D26880 llvm-svn: 288446	2016-12-02 01:02:30 +00:00
Matt Arsenault	c47701c0e9	AMDGPU: Use wider scalar spills for SGPR spilling Since the spill is for the whole wave, these don't have the swizzling problems that vector stores do and a single 4-byte allocation is enough to spill a 64 element register. This should reduce the number of spill instructions and put all the spills for a register in the same cacheline. This should save allocated private size, but for now it doesn't. The extra slots are allocated for each component, but never used because the frame layout is essentially finalized before frame indices are replaced. For always using the scalar store path, this should probably be moved into processFunctionBeforeFrameFinalized. llvm-svn: 288445	2016-12-02 00:54:45 +00:00
Jonathan Roelofs	28b9668db3	Delete tautological assertion. After r256463, both the LHS and RHS now refer to the same variable. Before, they referred to the member, the parameter respectively. Now GCC6's -Wtautological-compare complains. llvm-svn: 288444	2016-12-02 00:51:58 +00:00
Rui Ueyama	395859bdb7	Fix undefined behavior. New items can be added to Ranges here, and that invalidates an iterater that previously pointed the end of the vector. llvm-svn: 288443	2016-12-02 00:38:15 +00:00

... 3 4 5 6 7 ...

248896 Commits All Branches Search

248896 Commits

All Branches