llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	30caae6d23	[GPGPU] Fix compilation issue with latest CUDA upgrade to i128 llvm-svn: 309366	2017-07-28 06:38:49 +00:00
Jessica Paquette	4602c3437c	[MachineOutliner] NFC: Comment tidying The comment on describing the suffix tree had some pruning stuff that was out of date in it. Also fixed some typos. llvm-svn: 309365	2017-07-28 05:59:30 +00:00
Michal Gorny	a3ad81d255	Revert rL309320 - "[OCaml] Respect CMAKE_C_FLAGS for OCaml C files" This causes buildbot breakage for systems where OCaml files are built with a different compiler. llvm-svn: 309364	2017-07-28 04:29:20 +00:00
Saleem Abdulrasool	61d81ec754	test: require x86 backend Ensure that the target is registered before using it. Should fix the hexagon Bots. llvm-svn: 309363	2017-07-28 04:15:35 +00:00
Petr Hosek	a7a9ca4ad1	Support libc++abi in addition to libstdc++ This change adds sanitizer support for LLVM's libunwind and libc++abi as an alternative to libstdc++. This allows using the in tree version of libunwind and libc++abi which is useful when building a toolchain for different target. Differential Revision: https://reviews.llvm.org/D34501 llvm-svn: 309362	2017-07-28 03:39:39 +00:00
Petr Hosek	2bbb6ad217	Support compiler-rt builtins This change adds support for compiler-rt builtins as an alternative compiler runtime to libgcc. Differential Revision: https://reviews.llvm.org/D35165 llvm-svn: 309361	2017-07-28 03:39:38 +00:00
Saleem Abdulrasool	a219b3d8d1	MC: add support for cfi_return_column This adds support for the CFI pseudo-op return_column. This specifies the frame table column which contains the return address. Addresses PR33953! llvm-svn: 309360	2017-07-28 03:39:19 +00:00
Saleem Abdulrasool	b3c70c09e3	MC: clang-format enumeration (NFC) This was hard to insert elements into. clang-format it so that it is easier. NFC. llvm-svn: 309359	2017-07-28 03:39:18 +00:00
Jan Vesely	09f0a560e1	add __kernel_exec macros also consolidate macros into one file, and rename to clcmacros.h Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Aaron Watry <awatry@gmail.com> llvm-svn: 309358	2017-07-28 03:39:03 +00:00
Sanjoy Das	843ab57457	Revert "[SCEV] Cache results of computeExitLimit" This reverts commit r309080. The patch needs to clear out the ScalarEvolution::ExitLimits cache in forgetMemoizedResults. I've replied on the commit thread for the patch with more details. llvm-svn: 309357	2017-07-28 03:25:07 +00:00
Jessica Paquette	809d708b8a	[MachineOutliner] NFC: Split up getOutliningBenefit This is some more cleanup in preparation for some actual functional changes. This splits getOutliningBenefit into two cost functions: getOutliningCallOverhead and getOutliningFrameOverhead. These functions return the number of instructions that would be required to call a specific function and the number of instructions that would be required to construct a frame for a specific funtion. The actual outlining benefit logic is moved into the outliner, which calls these functions. The goal of refactoring getOutliningBenefit is to: - Get us closer to getting rid of the IsTailCall flag - Further split up "target-specific" things and "general algorithm" things llvm-svn: 309356	2017-07-28 03:21:58 +00:00
Davide Italiano	75a001ba78	[JumpThreading] Stop falsely preserving LazyValueInfo. JumpThreading claims to preserve LVI, but it doesn't preserve the analyses which LVI holds a reference to (e.g. the Dominator). In the current pass manager infrastructure, after JT runs, the PM frees these analyses (including DominatorTree) but preserves LVI. CorrelatedValuePropagation runs immediately after and queries a corrupted domtree, causing weird miscompiles. This commit disables the preservation of LVI for the time being. Eventually, we should either move LVI to a proper dependency tracking mechanism (i.e. an analyses shouldn't hold references to other analyses and compute them on demand if needed), or we should teach all the passes preserving LVI to preserve the analyses LVI depends on. The new pass manager has a mechanism to invalidate LVI in case one of the analyses it depends on becomes invalid, so this problem shouldn't exist (at least not in this immediate form), but handling of analyses holding references is still a very delicate subject. Fixes PR33917 (and rustc). llvm-svn: 309355	2017-07-28 03:10:43 +00:00
David Blaikie	89daf77a11	DebugInfo: Consider a CU containing only local imported entities to be 'empty' This can come up in ThinLTO & wastes space & makes degenerate IR. As per the added FIXME, ultimately, local imported entities should hang off the function and that way the imported entity list on the CU can be tested for emptiness like all the other CU lists. (function-attached local imported entities are probably also the best path forward for fixing how imported entities are handled both in cross-module use (currently, while ThinLTO preserves the imported entities, they would not get used at the imported inlined location - only in the abstract origin that appears in the partial CU created by the import (which isn't emitted under Fission due to cross-CU limitations there)) and to reduce the number of points where imported entities are emitted (they're currently emitted into every inlined instance, concrete instance, and abstract origin - they should only go in teh abstract origin if there is one, otherwise in the concrete instance - but this requires lots of delayed handling and wiring up, same as abstract variables & subprograms)) llvm-svn: 309354	2017-07-28 03:06:25 +00:00
Davide Italiano	01cb947abb	[JumpThreading] Add an option to dump LazyValueInfo after the run. Differential Revision: https://reviews.llvm.org/D35973 llvm-svn: 309353	2017-07-28 02:57:43 +00:00
Akira Hatanaka	7ce6ed49e8	Update comment in test case after r309308. llvm-svn: 309352	2017-07-28 01:58:14 +00:00
Kostya Serebryany	ce5e65ee93	[sanitizer-coverage] add a run-time test for -fsanitize-coverage=inline-8bit-counters,pc-table llvm-svn: 309351	2017-07-28 01:38:43 +00:00
Matthias Braun	c618a466f1	ARMFrameLowering: Only set ExtraCSSpill for actually unused registers. The code assumed that unclobbered/unspilled callee saved registers are unused in the function. This is not true for callee saved registers that are also used to pass parameters such as swiftself. rdar://33401922 llvm-svn: 309350	2017-07-28 01:36:32 +00:00
Erik Pilkington	fe33551051	[demangler] Fix some overzealous -Wreturn-type errors llvm-svn: 309349	2017-07-28 01:35:14 +00:00
Reid Kleckner	4ca8d21ef3	[lit] Port googletest lit tests to Windows Summary: The technique of directly calling subprocess.Popen on a python script doesn't work on Windows. The executable path of the command must refer to a valid win32 executable. Instead, rename all the python scripts masquerading as gtest executables to have .py extensions, so we can easily detect then and call the python executable for them. Do this on Linux as well as Windows for consistency. The test suite directory names also come out in lower-case on Windows. We can consider removing that in a later patch. This change just updates the FileCheck lines to match on Windows. Fixes PR33933 Reviewers: modocache, mgorny Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35909 llvm-svn: 309347	2017-07-28 01:05:55 +00:00
Dehao Chen	e70a472bad	Changing the default MaxNumPromotions from 2 to 3. Summary: In performance tuning, we see performance benefits when enlarge the maximum num promotion targets to 3. This is safe as soon as we have total percentage threshold properly setup (https://reviews.llvm.org/D35962) Reviewers: davidxl, tejohnson Reviewed By: tejohnson Subscribers: llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D35966 llvm-svn: 309346	2017-07-28 01:03:10 +00:00
Dehao Chen	f4240b5b91	Separate the ICP total threshold and remaining threshold. Summary: In the current implementation, isPromotionProfitable only checks if the call count to a direct target is no less than a certain percentage threshold of the remaining call counts that have not been promoted. This causes code size problems when the target count is small but greater than a large portion of remaining counts. E.g. target1 takes 99.9%, while target2 takes 0.1%. Both targets will be promoted and inlined, makes the function size too large, which potentially prevents it from further inlining into its callers. This patch adds another percentage threshold against the total indirect call count. If the target count needs to be no less than both thresholds in order to be promoted speculatively. Reviewers: davidxl, tejohnson Reviewed By: tejohnson Subscribers: sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D35962 llvm-svn: 309345	2017-07-28 01:02:54 +00:00
Dehao Chen	8260d66556	Increase the ImportHotMultiplier to 10.0 Summary: The original 3.0 hot mupltiplier is too small, and would prevent hot callsites from being inline. This patch increases the hot multilier to 10.0 Reviewers: davidxl, tejohnson Reviewed By: tejohnson Subscribers: llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D35969 llvm-svn: 309344	2017-07-28 01:02:34 +00:00
Reid Kleckner	07a5d4372e	[X86] Fix latent bug in sibcall eligibility logic The X86 tail call eligibility logic was correct when it was written, but the addition of inalloca and argument copy elision broke its assumptions. It was assuming that fixed stack objects were immutable. Currently, we aim to emit a tail call if no arguments have to be re-arranged in memory. This code would trace the outgoing argument values back to check if they are loads from an incoming stack object. If the stack argument is immutable, then we won't need to store it back to the stack when we tail call. Fortunately, stack objects track their mutability, so we can just make the obvious check to fix the bug. This was http://crbug.com/749826 llvm-svn: 309343	2017-07-28 00:58:35 +00:00
Erik Pilkington	bdfd1228f6	[demangler] Attempt to fix linux bots, include <cstdio> llvm-svn: 309342	2017-07-28 00:53:30 +00:00
George Karpenkov	50dd3fe903	[sanitizers] Sanitizer tests CMake clean up: try #2 This patch addresses two issues: Most of the time, hacks with `if/else` in order to get support for multi-configuration builds are superfluous. The variable `CMAKE_CFG_INTDIR` was created precisely for this purpose: it expands to `.` on all single-configuration builds, and to a configuration name otherwise. The `if/else` hacks for the library name generation should also not be done, as CMake has `TARGET_FILE` generator expression precisely for this purpose, as it expands to the exact filename of the resulting target. Differential Revision: https://reviews.llvm.org/D35952 llvm-svn: 309341	2017-07-28 00:50:56 +00:00
Erik Pilkington	94d2ac7160	[demangler] Use an AST to represent demangled names The demangler now demangles by producing an AST, then traverses that AST to produce a demangled name. This is done for performance reasons, now the demangler doesn't manuiplate std::strings, which hurt performance and caused string operations to be inlined into the parser, leading to large code size and stack usage. Differential revision: https://reviews.llvm.org/D35159 llvm-svn: 309340	2017-07-28 00:43:49 +00:00
Evgeniy Stepanov	fea50c286a	[sancov] Add missing line breaks in test. NFC. llvm-svn: 309339	2017-07-28 00:22:42 +00:00
Kostya Serebryany	6145776720	[sanitizer-coverage] clang flags pumbing for -fsanitize-coverage=pc-table llvm-svn: 309338	2017-07-28 00:10:10 +00:00
Kostya Serebryany	063b652096	[sanitizer-coverage] rename sanitizer-coverage-create-pc-table into sanitizer-coverage-pc-table and add plumbing for a clang flag llvm-svn: 309337	2017-07-28 00:09:29 +00:00
Adrian Prantl	8f4b353ee1	Remove unused function from AArch64 backend (NFC) llvm-svn: 309336	2017-07-27 23:52:06 +00:00
Kostya Serebryany	b75d002f15	[sanitizer-coverage] add a feature sanitizer-coverage-create-pc-table=1 (works with trace-pc-guard and inline-8bit-counters) that adds a static table of instrumented PCs to be used at run-time llvm-svn: 309335	2017-07-27 23:36:49 +00:00
Jessica Paquette	78681be2a4	[MachineOutliner] Cleanup: move findCandidates out of suffix tree Doing some cleanup in preparation for some functional changes. This commit moves findCandidates out of the suffix tree and into the MachineOutliner class. This is much easier to follow, and removes the burden of candidate choice from the suffix tree. It also adds a couple FIXMEs and simplifies building outlined function names. llvm-svn: 309334	2017-07-27 23:24:43 +00:00
Evgeniy Stepanov	59b5e9c00c	[sancov] Implement __sanitizer_cov_reset. Summary: Clears all collected coverage. Reviewers: kcc Subscribers: srhines, kubamracek, llvm-commits Differential Revision: https://reviews.llvm.org/D35958 llvm-svn: 309333	2017-07-27 23:22:37 +00:00
Reid Kleckner	dd853e5406	[llvm-pdbutil] Clean up ExitOnError usage to add ": " to our errors The banner parameter is supposed to end in a separator, like ": ". Otherwise, we get ugly errors like: Error while reading publics streamNative error: blah blah llvm-svn: 309332	2017-07-27 23:13:18 +00:00
Reid Kleckner	ef443296a4	[PDB] Initialize the std::array<ulittle32_t> used for the gsi bitmap With ASan, we would write about 512 bytes of malloc fill value to the PDB, with some random bits ORed in here and there. Dumping the PDB would always fail reliably. llvm-svn: 309331	2017-07-27 23:13:05 +00:00
Davide Italiano	1a26f24f35	[ConstantFolder] Don't try to fold gep when the idx is a vector. The code in ConstantFoldGetElementPtr() assumes integers, and therefore it crashes trying to get the integer bidwith of a vector type (in this case <4 x i32>. I just changed the code to prevent the folding in case of vectors and I didn't bother to generalize as this doesn't seem to me something that really happens in practice, but I'm willing to change the patch if you think it's worth it. This is hard to trigger from -instsimplify or -instcombine only as the second instruction is dead, so the test uses loop-unroll. Differential Revision: https://reviews.llvm.org/D35956 llvm-svn: 309330	2017-07-27 22:20:44 +00:00
Rafael Espindola	06ad7ed70c	Add a test. This would have found the error in r309252. llvm-svn: 309329	2017-07-27 22:08:11 +00:00
Saleem Abdulrasool	b5eca2f9a2	Headers: fix _Unwind_{G,S}etGR for non-EHABI targets The EHABI definition was being inlined into the users even when EHABI was not in use. Adjust the condition to ensure that the right version is defined. llvm-svn: 309327	2017-07-27 21:56:25 +00:00
Rafael Espindola	a2df2f09f4	Avoid warnings when asserts are disabled. llvm-svn: 309326	2017-07-27 21:42:42 +00:00
Ahmed Bougacha	c890993726	[X86] Don't lie about legality to TLI's demanded bits. Like r309323, X86 had a typo where it passed the wrong flags to TLO. Found by inspection; I haven't been able to tickle this into having observable behavior. I don't think it does, given that X86 doesn't have custom demanded bits logic, and the generic logic doesn't have a lot of exposure to illegal constructs. llvm-svn: 309325	2017-07-27 21:28:59 +00:00
Ahmed Bougacha	52cecb1f27	[AArch64] Remove outdated comment. NFC. There hasn't been a ternary since r231987. llvm-svn: 309324	2017-07-27 21:27:58 +00:00
Ahmed Bougacha	87807c5a86	[AArch64] Fix legality info passed to demanded bits for TBI opt. The (seldom-used) TBI-aware optimization had a typo lying dormant since it was first introduced, in r252573: when asking for demanded bits, it told TLI that it was running after legalize, where the opposite was true. This is an important piece of information, that the demanded bits analysis uses to make assumptions about the node. r301019 added such an assumption, which was broken by the TBI combine. Instead, pass the correct flags to TLO. llvm-svn: 309323	2017-07-27 21:27:25 +00:00
Stephan T. Lavavej	f85e2e218a	[libcxx] [test] Remove an unused local typedef in enable_shared_from_this.pass.cpp. Trivial change, committed without review. llvm-svn: 309322	2017-07-27 21:16:37 +00:00
Michal Gorny	83012b4721	[OCaml] Fix undefined reference to LLVMDumpType() with NDEBUG Account for the possibility of LLVMDumpType() not being available with NDEBUG in the OCaml bindings. If it is not built into LLVM, make the dump function raise an exception. Since rL293359, the dump functions are built only if either NDEBUG is not defined, or LLVM_ENABLE_DUMP is defined. As a result, if the dump functions are not built in LLVM, the dynamic OCaml libraries fail to load due to undefined LLVMDumpType symbol. Differential Revision: https://reviews.llvm.org/D35899 llvm-svn: 309321	2017-07-27 21:13:25 +00:00
Michal Gorny	3073ff9fd3	[OCaml] Respect CMAKE_C_FLAGS for OCaml C files Pass the values of CMAKE_C_FLAGS and CMAKE_C_FLAGS_${CMAKE_BUILD_TYPE} as -ccopt to ocamlc. This enforces the specific flags used for the LLVM build to be used for OCaml bindings as well, notably -O and -march flags. This also solves the issue of the user being unable to force specific flags for OCaml bindings builds. Gentoo needs this to enforce -DNDEBUG consistently between the LLVM build and the split OCaml bindings build. Differential Revision: https://reviews.llvm.org/D35898 llvm-svn: 309320	2017-07-27 21:13:19 +00:00
Jonathan Peyton	f439246328	Fix implementation of OMP_THREAD_LIMIT This change fixes the implementation of OMP_THREAD_LIMIT. The implementation of this previously was not restricted to a contention group (but it should be, according to the spec), and this is fixed here. A field is added to root thread to store a counter of the threads in the contention group. An extra check is added when reserving threads for a parallel region that checks this variable and compares to threadlimit-var, which is implemented as a new global variable, kmp_cg_max_nth. Associated settings changes were also made, and clean up of comments that referred to OMP_THREAD_LIMIT, but should refer to the new KMP_DEVICE_THREAD_LIMIT (added in an earlier patch). Patch by Terry Wilmarth Differential Revision: https://reviews.llvm.org/D35912 llvm-svn: 309319	2017-07-27 20:58:41 +00:00
George Karpenkov	1bf535daae	Revert "[sanitizers] Sanitizer tests CMake clean up" This reverts commit 0ab44db2aa1cd3710355ad79b04f954ce68c0b3a. Fails on some bots, reverting until I can fix it. llvm-svn: 309318	2017-07-27 20:44:33 +00:00
Eric Beckmann	d8bac0fa3f	Add test to reject merging of empty manifest. Reviewers: ruiu, rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35954 llvm-svn: 309317	2017-07-27 19:58:12 +00:00
Florian Hahn	e3583bdf91	[ARM] Add use-misched feature, to enable the MachineScheduler. Summary: This change makes it easier to experiment with the MachineScheduler in the ARM backend and also makes it very explicit which CPUs use the MachineScheduler (currently only swift and cyclone). Reviewers: MatzeB, t.p.northover, javed.absar Reviewed By: MatzeB Subscribers: aemerson, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D35935 llvm-svn: 309316	2017-07-27 19:56:44 +00:00
Dinar Temirbulatov	636ac1b6da	Change prefix in vector-shuffle-combining-avx.patch to reduce test size. llvm-svn: 309315	2017-07-27 19:47:35 +00:00

... 3 4 5 6 7 ...

268319 Commits All Branches Search

268319 Commits

All Branches