llvm-project

Author	SHA1	Message	Date
Alexey Bataev	e397812dcb	[OPENMP] Removed loop statement as its body executes at most once, NFC. Removed not required loop statement, addressing comments from Richard Smith. llvm-svn: 275947	2016-07-19 05:06:39 +00:00
Derek Bruening	9419737fa8	[esan] Fix sideline thread flaky assert Fixes an esan sideline thread CHECK that failed to account for the sideline thread reaching its code before the internal_clone() return value was assigned in the parent. llvm-svn: 275946	2016-07-19 05:03:38 +00:00
Alexey Bataev	5140e748b5	[OPENMP] Improved processing of 'priority' clause, NFC. Removed some old comments + improved handling of 'priority' clause value during codegen after comments from Richard Smith. llvm-svn: 275945	2016-07-19 04:21:09 +00:00
Jason Molenda	68715551f8	Ignore clang-module-cache directories that may be created in the testsuite directory while it runs. llvm-svn: 275944	2016-07-19 02:37:07 +00:00
Saleem Abdulrasool	b12d27bdfa	clang-rename: fix referenced variable in vim-script llvm-svn: 275943	2016-07-19 02:13:08 +00:00
Craig Topper	d6ca1dc45e	[AVX512] Give priority to EVEX encoded PSHUFB over the VEX versions. llvm-svn: 275942	2016-07-19 02:00:38 +00:00
Craig Topper	592dc30708	[X86] Remove superfluous parameter from a multiclass. All instantiations passed the same value. llvm-svn: 275941	2016-07-19 02:00:35 +00:00
George Burgess IV	5f30897b7b	[MemorySSA] Update to the new shiny walker. This patch updates MemorySSA's use-optimizing walker to be more accurate and, in some cases, faster. Essentially, this changed our core walking algorithm from a cache-as-you-go DFS to an iteratively expanded DFS, with all of the caching happening at the end. Said expansion happens when we hit a Phi, P; we'll try to do the smallest amount of work possible to see if optimizing above that Phi is legal in the first place. If so, we'll expand the search to see if we can optimize to the next phi, etc. An iteratively expanded DFS lets us potentially quit earlier (because we don't assume that we can optimize above all phis) than our old walker. Additionally, because we don't cache as we go, we can now optimize above loops. As an added bonus, this patch adds a ton of verification (if EXPENSIVE_CHECKS are enabled), so finding bugs is easier. Differential Revision: https://reviews.llvm.org/D21777 llvm-svn: 275940	2016-07-19 01:29:15 +00:00
Craig Topper	6189d3ecd4	[X86] Rename VINSERTzrr to use a capital Z to match other instructions. NFC llvm-svn: 275939	2016-07-19 01:26:19 +00:00
Vedant Kumar	e3a0bf5048	Retry: [llvm-profdata] Speed up merging by using a thread pool Add a "-j" option to llvm-profdata to control the number of threads used. Auto-detect NumThreads when it isn't specified, and avoid spawning threads when they wouldn't be beneficial. I tested this patch using a raw profile produced by clang (147MB). Here is the time taken to merge 4 copies together on my laptop: No thread pool: 112.87s user 5.92s system 97% cpu 2:01.08 total With 2 threads: 134.99s user 26.54s system 164% cpu 1:33.31 total Changes since the initial commit: - When handling odd-length inputs, call ThreadPool::wait() before merging the last profile. Should fix a race/off-by-one (see r275937). Differential Revision: https://reviews.llvm.org/D22438 llvm-svn: 275938	2016-07-19 01:17:20 +00:00
Vedant Kumar	21ab20e005	Revert "[llvm-profdata] Speed up merging by using a thread pool" This reverts commit r275921. It broke the ppc64be bot: http://lab.llvm.org:8011/builders/clang-ppc64be-linux-multistage/builds/3537 I'm not sure why it broke, but based on the output, it looks like an off-by-one (one profile left un-merged). llvm-svn: 275937	2016-07-19 00:57:09 +00:00
Wei Mi	79997a24d7	Recommit the patch "Use uniforms set to populate VecValuesToIgnore". For instructions in uniform set, they will not have vector versions so add them to VecValuesToIgnore. For induction vars, those only used in uniform instructions or consecutive ptrs instructions have already been added to VecValuesToIgnore above. For those induction vars which are only used in uniform instructions or non-consecutive/non-gather scatter ptr instructions, the related phi and update will also be added into VecValuesToIgnore set. The change will make the vector RegUsages estimation less conservative. Differential Revision: https://reviews.llvm.org/D20474 The recommit fixed the testcase global_alias.ll. llvm-svn: 275936	2016-07-19 00:50:43 +00:00
Matt Arsenault	fe358066ea	AMDGPU/SI: Fix SI scheduler refcount issue Without this fix, releaseSuccessors when InOrOutBlock is false could release SUs outside the schedule BasicBlock. Patch by Axel Davy llvm-svn: 275935	2016-07-19 00:35:22 +00:00
Matt Arsenault	cb540bc03c	AMDGPU: Expand register indexing pseudos in custom inserter This is to help moveSILowerControlFlow to before regalloc. There are a couple of tradeoffs with this. The complete CFG is visible to more passes, the loop body avoids an extra copy of m0, vcc isn't required, and immediate offsets can be shrunk into s_movk_i32. The disadvantage is the register allocator doesn't understand that the single lane's vector is dead within the loop body, so an extra register is used to outlive the loop block when expanding the VGPR -> m0 loop. This also now results in worse waitcnt insertion before the loop instead of after for pending operations at the point of the indexing, but that should be fixed by future improvements to cross block waitcnt insertion. v_movreld_b32's operands are now modeled more correctly since vdst is not a true output. This is kind of a hack to treat vdst as a use operand. Extra checking is required in the verifier since I can't seem to get tablegen to emit an implicit operand for a virtual register. llvm-svn: 275934	2016-07-19 00:35:03 +00:00
Lang Hames	0de9b91a71	[Kaleidoscope][BuildingAJIT] More work on the text for Chapter 3. Add an overview of stubs and compile callbacks before the discussion of the source changes. -- This line, and those below, will be ignored-- M docs/tutorial/BuildingAJIT3.rst llvm-svn: 275933	2016-07-19 00:25:52 +00:00
Sanjoy Das	ab73c9d88e	[LoopReroll] Reroll loops with unordered atomic memory accesses Reviewers: hfinkel, jfb, reames Subscribers: mcrosier, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D22385 llvm-svn: 275932	2016-07-19 00:23:54 +00:00
Samuel Antao	b101838244	Append clang system include path for offloading tool chains. Summary: This patch adds clang system include path when offloading tool chains, e.g. CUDA, are used in the current compilation. This fixes an issue detected by @rsmith in response to r275645. Reviewers: rsmith, tra Subscribers: rsmith, cfe-commits Differential Revision: https://reviews.llvm.org/D22490 llvm-svn: 275931	2016-07-19 00:01:12 +00:00
Samuel Antao	dab51bbd48	[OpenMP] Remove dead code in conditional of mappable expressions SEMA. llvm-svn: 275930	2016-07-18 23:22:11 +00:00
Matt Arsenault	4cb438b93c	TableGen: Allow custom register operand decoder method This is for a situation where the encoding for a register may be different depending on the specific operand. For some instructions, we want to apply additional restrictions beyond the encoding's constraints. In AMDGPU some operands are VSrc_32, using the VS_32 pseudo register class which accept VGPRs, SGPRs, or immediates in the encoding. Some specific instructions with the same encoding operand do not want to allow immediates or SGPRs, but the encoding format is different in this case than a regular VGPR_32 operand. This allows specifying the encoding should be treated the same without introducing yet another dummy register class. llvm-svn: 275929	2016-07-18 23:20:46 +00:00
Matt Arsenault	50b76399ed	AMDGPU: Fix test name and broken CHECK-LABEL llvm-svn: 275928	2016-07-18 23:09:51 +00:00
Vedant Kumar	05ee94f1b5	[utils] Generate html reports with the code coverage utility script Instead of extracting raw coverage mappings into an artifact directory, actually generate useful html reports for a given list of binaries with symbol demangling turned on. No tests, but this is actively being used to drive the (still nascent) coverage bot. llvm-svn: 275927	2016-07-18 22:50:10 +00:00
Kelvin Li	9f645ae63b	[OpenMP] Fix incorrect diagnostics in map clause Having the following code pattern will result in incorrect diagnostic int main() { int arr[10]; #pragma omp target data map(arr[:]) #pragma omp target map(arr) {} } t.cpp:4:24: error: original storage of expression in data environment is shared but data environment do not fully contain mapped expression storage #pragma omp target map(arr) ^~~ t.cpp:3:29: note: used here #pragma omp target data map(arr[:]) ^~~~~~ 1 error generated. Patch by David S. Differential Revision: https://reviews.llvm.org/D22075 llvm-svn: 275926	2016-07-18 22:49:16 +00:00
Richard Smith	cb2ba5a5a7	Fix some minor issues found by Coverity. llvm-svn: 275925	2016-07-18 22:37:35 +00:00
Vedant Kumar	d04929d875	[Coverage] Remove '..' from filenames after getting an absolute path Failure to do this breaks relative paths which begin with '..'. This issue was caught by the (still nascent) coverage bot. llvm-svn: 275924	2016-07-18 22:32:02 +00:00
Matt Arsenault	4ced16dd2e	Fix -Wreturn-type with gcc 4.8 and libc++ llvm-svn: 275922	2016-07-18 22:12:46 +00:00
Vedant Kumar	0bd9907581	[llvm-profdata] Speed up merging by using a thread pool Add a "-j" option to llvm-profdata to control the number of threads used. Auto-detect NumThreads when it isn't specified, and avoid spawning threads when they wouldn't be beneficial. I tested this patch using a raw profile produced by clang (147MB). Here is the time taken to merge 4 copies together on my laptop: No thread pool: 112.87s user 5.92s system 97% cpu 2:01.08 total With 2 threads: 134.99s user 26.54s system 164% cpu 1:33.31 total Differential Revision: https://reviews.llvm.org/D22438 llvm-svn: 275921	2016-07-18 22:02:39 +00:00
Artem Belevich	9f97dcb018	[NVPTX] Make sure we adjust alignment at all call sites .. including calls from kernel functions that were ignored by mistake before. llvm-svn: 275920	2016-07-18 21:58:48 +00:00
Dehao Chen	6132ee8502	[PM] Convert Loop Strength Reduce pass to new PM Summary: Convert Loop Strength Reduce pass to new PM Reviewers: davidxl, silvas Subscribers: junbuml, sanjoy, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D22468 llvm-svn: 275919	2016-07-18 21:41:50 +00:00
Mehdi Amini	4d74631ea4	Update doxygen description for `WriteBitcodeToFile()` API (NFC) llvm-svn: 275917	2016-07-18 21:29:24 +00:00
Teresa Johnson	2124157102	[PM] Port FunctionImport Pass to new PM Summary: Port FunctionImport Pass to new PM. Reviewers: mehdi_amini, davide Subscribers: davidxl, llvm-commits Differential Revision: https://reviews.llvm.org/D22475 llvm-svn: 275916	2016-07-18 21:22:24 +00:00
Wei Mi	f9afff71a2	Revert rL275912. llvm-svn: 275915	2016-07-18 21:14:43 +00:00
Chaoren Lin	926db72a4d	Add missing headers after header cleanup in r275882. llvm-svn: 275914	2016-07-18 21:11:43 +00:00
Vedant Kumar	14f8fb6864	[Coverage] Normalize '..' out of filename strings This fixes the issue of having duplicate entries for the same file in a coverage report s.t none of the entries actually displayed the correct coverage information. llvm-svn: 275913	2016-07-18 21:01:27 +00:00
Wei Mi	1fd25726af	Use uniforms set to populate VecValuesToIgnore. For instructions in uniform set, they will not have vector versions so add them to VecValuesToIgnore. For induction vars, those only used in uniform instructions or consecutive ptrs instructions have already been added to VecValuesToIgnore above. For those induction vars which are only used in uniform instructions or non-consecutive/non-gather scatter ptr instructions, the related phi and update will also be added into VecValuesToIgnore set. The change will make the vector RegUsages estimation less conservative. Differential Revision: https://reviews.llvm.org/D20474 llvm-svn: 275912	2016-07-18 20:59:53 +00:00
Sanjay Patel	5f5eb58eb5	refactor SimplifySelectInst; NFCI llvm-svn: 275911	2016-07-18 20:56:53 +00:00
Justin Lebar	4133584504	Write isUInt using template specializations to work around an incorrect MSVC warning. Summary: Per D22441, MSVC warns on our old implementation of isUInt<64>. It sees uint64_t(1) << 64 and doesn't realize that it's not going to be executed. Writing as a template specialization is ugly, but prevents the warning. Reviewers: RKSimon Subscribers: majnemer, llvm-commits Differential Revision: https://reviews.llvm.org/D22472 llvm-svn: 275909	2016-07-18 20:40:35 +00:00
Sanjay Patel	dbf44f5016	add tests for missed sext transform llvm-svn: 275908	2016-07-18 20:37:51 +00:00
Bruno Cardoso Lopes	d9b7dfe4a4	[Sema] Create a separate group for incompatible function pointer warning Give incompatible function pointer warning its own diagnostic group but still leave it as a subgroup of incompatible-pointer-types. This is in preparation to promote -Wincompatible-function-pointer-types to error on darwin. Differential Revision: https://reviews.llvm.org/D22248 rdar://problem/12907612 llvm-svn: 275907	2016-07-18 20:37:06 +00:00
Mehdi Amini	320a5a615b	Add missing header in ClangFuzzer (after r275882 cleanup) llvm-svn: 275906	2016-07-18 20:33:09 +00:00
Bob Wilson	4cf27c4d6f	Allow iOS and tvOS version numbers with 2-digit major version numbers. rdar://problem/26921601 llvm-svn: 275905	2016-07-18 20:29:14 +00:00
Marshall Clow	258a651f3a	Bump version # to 4.0.0 llvm-svn: 275904	2016-07-18 20:27:19 +00:00
Hans Wennborg	4ba35d1f4f	build_llvm_package.bat: update version to 4.0.0 llvm-svn: 275903	2016-07-18 20:26:46 +00:00
Vedant Kumar	55ecc100b0	[interception] Remove extra whitespace to appease linters (NFC) Attempt to fix: http://lab.llvm.org:8011/builders/clang-s390x-linux/builds/7774 llvm-svn: 275901	2016-07-18 20:07:27 +00:00
Sanjay Patel	8a2bf3099f	auto-generate checks llvm-svn: 275899	2016-07-18 20:06:51 +00:00
Hans Wennborg	39eff98d38	Revert r273099 "If the revision number starts with r, drop it. It will get added back" This doesn't seem to work with Bash: $ /work/llvm/utils/release/merge.sh --proj llvm --rev r275870 /work/llvm/utils/release/merge.sh: line 34: ${$1#r}: bad substitution I get the same error with and without a leading 'r'. llvm-svn: 275898	2016-07-18 20:06:27 +00:00
Vedant Kumar	bf51e703cf	[Driver] Compute effective target triples once per job (NFCI) Compute an effective target triple exactly once in ConstructJob(), and then simply pass around references to it. This eliminates wasteful re-computation of effective triples (e.g in getARMFloatABI()). Differential Revision: https://reviews.llvm.org/D22290 llvm-svn: 275895	2016-07-18 19:56:38 +00:00
Vedant Kumar	f2030b931c	[Driver] Make Driver::DefaultTargetTriple private (NFCI) No in-tree targets access this `DefaultTargetTriple` directly, and usage of default triples is generally discouraged. Make the field private. This is part of an effort to make the clang driver use effective triples more pervasively. Differential Revision: https://reviews.llvm.org/D22289 llvm-svn: 275894	2016-07-18 19:56:33 +00:00
Artem Belevich	052b1ed2fd	[NVPTX] Force minimum alignment of 4 for byval arguments of device-side functions. Taking address of a byval variable in PTX is legal, but currently runs into miscompilation by ptxas on sm_50+ (NVIDIA issue 1789042). Work around the issue by enforcing minimum alignment on byval arguments of device functions. The change is a no-op on SASS level for sm_3x where ptxas already aligns local copy by at least 4. Differential Revision: https://reviews.llvm.org/D22428 llvm-svn: 275893	2016-07-18 19:54:56 +00:00
Etienne Bergeron	a81a44ffd4	[compiler-rt] Fix incorrect handling of indirect load. Summary: Indirect loads are relative offsets from RIP. The current trampoline implementation incorrectly copies these instructions, which makes some unittests crash. This patch does not fix the unittests, but it fixes the crashes. The functions are no longer hooked. Patches will come soon to fix these unittests. Reviewers: rnk Subscribers: llvm-commits, wang0109, chrisha Differential Revision: https://reviews.llvm.org/D22410 llvm-svn: 275892	2016-07-18 19:50:55 +00:00
Michael Zolotukhin	ea5b72825b	[LoopSimplify] Update LCSSA after separating nested loops. Summary: Usually LCSSA survives this transformation, but in some cases (see attached test) it doesn't: values from the original loop after separating might be used from the outer loop. Before the transformation it was the same loop, so LCSSA phis were not required. This fixes PR28272. Reviewers: sanjoy, hfinkel, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21665 llvm-svn: 275891	2016-07-18 19:44:19 +00:00
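
Note on 4133584504 (llvm-svn: 275909): the message describes moving the 64-bit case of isUInt into a template specialization so that MSVC never sees `uint64_t(1) << 64`. The block below is a minimal sketch of that idea, not the code from the commit. The name `fitsUnsigned` is hypothetical, and the form LLVM actually uses may differ.

```cpp
#include <cstdint>

// Hypothetical helper (an assumption for illustration, not LLVM's isUInt):
// returns true if X is representable in N unsigned bits.
// The general template only covers 0 < N < 64, so the shift is always in range.
template <unsigned N> inline bool fitsUnsigned(uint64_t X) {
  static_assert(N > 0 && N < 64, "general case only covers 1..63 bits");
  return X < (UINT64_C(1) << N);
}

// Full specialization for N == 64: every uint64_t fits, and no shift by 64
// is ever written, so there is no out-of-range shift for a compiler to flag.
template <> inline bool fitsUnsigned<64>(uint64_t) { return true; }
```

With this sketch, `fitsUnsigned<8>(255)` is true and `fitsUnsigned<8>(256)` is false. The cost of the approach, as the commit message itself says, is that the specialization is uglier than a single expression.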

236903 Commits

All Branches