llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	f2898d73a5	Convert test to FileCheck. llvm-svn: 273609	2016-06-23 20:37:49 +00:00
Anna Thomas	31a0b2088f	InstCombine rule to fold trunc when value available Summary: This instcombine rule folds away trunc operations that have value available from a prior load or store. This kind of code can be generated as a result of GVN widening the load or from source code as well. Reviewers: reames, majnemer, sanjoy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21246 llvm-svn: 273608	2016-06-23 20:22:22 +00:00
Tobias Grosser	fb780bfc35	Drop unnecessary ';' This addresses warnings produced by clang's -Wextra-semi. This cleanup was suggested by Eugene Zelenko <eugene.zelenko@gmail.com> in http://reviews.llvm.org/D21488 and was split out to increase readability. llvm-svn: 273607	2016-06-23 20:21:47 +00:00
Dehao Chen	bd3ed3c55b	Invoke simplifycfg and sroa before instcombine. Summary: InstCombine needs to be performed after simplifycfg and sroa, otherwise it may make bad optimization decisions. Reviewers: davidxl, wmi, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21568 llvm-svn: 273606	2016-06-23 20:13:10 +00:00
Matt Arsenault	8d4b0eddd6	AMDGPU: Add option to disable spilling SGPRs to VGPRs. This can help debug spilling problems. llvm-svn: 273605	2016-06-23 20:00:34 +00:00
Greg Clayton	fe9d1ee9e4	Added a new python example which installs a command called "shadow". This shows how to grab individual blocks from stack frames and get only the variables from those blocks. It then will iterate over all of the parent blocks and look for shadowed variables. llvm-svn: 273604	2016-06-23 19:54:32 +00:00
Martin Probst	1b7f98042d	clang-format: [JS] recognize more type locations. Summary: Includes parenthesized type expressions and type aliases. Reviewers: djasper Subscribers: klimek, cfe-commits Differential Revision: http://reviews.llvm.org/D21597 llvm-svn: 273603	2016-06-23 19:52:32 +00:00
Richard Smith	b130fe7d31	Implement p0292r2 (constexpr if), a likely C++1z feature. llvm-svn: 273602	2016-06-23 19:16:49 +00:00
George Burgess IV	fe1397b977	Attempt to fix breakage caused by r273596. llvm-svn: 273601	2016-06-23 19:16:04 +00:00
Richard Smith	03a4aa3d00	Re-commit r273548, reverted in r273589, with a fix to not produce -Wfor-loop-analysis warnings for a for-loop with a condition variable. In such a case, the loop condition variable is modified on each iteration of the loop by definition. Original commit message: Rearrange condition handling so that semantic checks on a condition variable are performed before the other substatements of the construct are parsed, rather than deferring them until the end. This allows better error recovery from semantic errors in the condition, improves diagnostic order, and is a prerequisite for C++17 constexpr if. llvm-svn: 273600	2016-06-23 19:02:52 +00:00
Aaron Ballman	2cd2a18a9f	Default to using the Unicode version of Win32 APIs instead of the ANSI version. This helps to catch instances where a developer accidentally forgets to explicitly specify which version of the API to use and accidentally winds up failing to support non-ASCII characters properly. llvm-svn: 273599	2016-06-23 19:02:09 +00:00
Tobias Grosser	8a12bd9035	Update isl to isl-0.17.1-84-g72ffe88 This is a regular maintenance update to ensure we are testing with a recent version of isl. llvm-svn: 273597	2016-06-23 18:59:30 +00:00
George Burgess IV	1f99da54c2	[CFLAA] Use better interprocedural function summaries. Previously, we just unified any arguments that seemed to be related to each other. With this patch, we now respect dereference levels, etc. which should make us substantially more accurate. Proper handling of StratifiedAttrs will be done in a later patch. Patch by Jia Chen. Differential Revision: http://reviews.llvm.org/D21536 llvm-svn: 273596	2016-06-23 18:55:23 +00:00
Rafael Espindola	53fd425e06	Refactor duplicated code. NFC. llvm-svn: 273595	2016-06-23 18:43:06 +00:00
Hans Wennborg	a21a263101	[codeview] Fix letter casing in FileCheck regexes We print those hex numbers with uppercase letters. llvm-svn: 273594	2016-06-23 18:23:28 +00:00
Michael Kuperstein	0194d30e09	[X86] Extract HiPE prologue constants into metadata X86FrameLowering::adjustForHiPEPrologue() contains a hard-coded offset into an Erlang Runtime System-internal data structure (the PCB). As the layout of this data structure is prone to change, this poses problems for maintaining compatibility. To address this problem, the compiler can produce this information as module-level named metadata. For example (where P_NSP_LIMIT is the offending offset): !hipe.literals = !{ !2, !3, !4 } !2 = !{ !"P_NSP_LIMIT", i32 152 } !3 = !{ !"X86_LEAF_WORDS", i32 24 } !4 = !{ !"AMD64_LEAF_WORDS", i32 24 } Patch by Magnus Lang Differential Revision: http://reviews.llvm.org/D20363 llvm-svn: 273593	2016-06-23 18:17:25 +00:00
Vassil Vassilev	e3ffbc38d9	Typo. llvm-svn: 273592	2016-06-23 18:13:46 +00:00
Reid Kleckner	8f4bd1fdf2	Fix the wasm build by including EndianStream.h llvm-svn: 273591	2016-06-23 18:12:31 +00:00
Peter Collingbourne	ae72fa2f97	Add a test case for the regression in -Wfor-loop-analysis caused by r273548. llvm-svn: 273590	2016-06-23 18:11:19 +00:00
Peter Collingbourne	b77ebd749a	Revert r273548, "Rearrange condition handling so that semantic checks on a condition variable" as it caused a regression in -Wfor-loop-analysis. llvm-svn: 273589	2016-06-23 18:11:15 +00:00
Nirav Dave	38bb1c15fd	Prevent generation of temp file in test from r273585. llvm-svn: 273588	2016-06-23 18:06:35 +00:00
Sanjoy Das	2951e6b314	[SCEV] Don't unnecessarily namespace; NFC llvm-svn: 273587	2016-06-23 18:03:32 +00:00
Sanjoy Das	81c00fe022	[IRCE] Use getTerminator instead of rbegin; NFC llvm-svn: 273586	2016-06-23 18:03:26 +00:00
Nirav Dave	bfdb483755	Preserve DebugInfo when replacing values in DAGCombiner Recommitting after correcting over-eager Debug Value transfer fixing PR28270. [DAG] Previously debug values would transfer debuginfo for the selected start node for a replacement which allows for debug to be dropped. Push debug value transfer to occur with node/value replacement in SelectionDAG, remove now extraneous transfers of debug values. This refixes PR9817 which was being incompletely checked in the testsuite. Reviewers: jyknight Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D21037 llvm-svn: 273585	2016-06-23 17:52:57 +00:00
Sanjay Patel	e053621071	[ValueTracking] simplify logic in ComputeNumSignBits (NFCI) This was noted in http://reviews.llvm.org/D21610 . The previous code predated the use of APInt ( http://reviews.llvm.org/rL47654 ), so it had to account for the fixed width of uint64_t. Now that we're using the variable width APInt, we can remove some complexity. llvm-svn: 273584	2016-06-23 17:41:59 +00:00
Ahmed Bougacha	ef3358d579	[TableGen] Use StringRef::compare instead of != and <. NFC. The previous code would always do 1 or 2 prefix compares; explicitly only do one. This speeds up debug -gen-asm-matcher by ~10% (e.g. X86: 40s -> 35s). llvm-svn: 273583	2016-06-23 17:09:49 +00:00
Todd Fiala	31ae3c5ade	fix Xcode build for r273547 llvm-svn: 273582	2016-06-23 16:54:39 +00:00
Pablo Barrio	7a64346533	[ARM] Lower (select_cc k k (select_cc ~k ~k x)) into (SSAT l_k x) Summary: SSAT saturates an integer, making sure that its value lies within an interval [-k, k]. Since the constant is given to SSAT as the number of bits set to one, k + 1 must be a power of 2, otherwise the optimization is not possible. Also, the select_cc must use < and > respectively so that they define an interval. Reviewers: mcrosier, jmolloy, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D21372 llvm-svn: 273581	2016-06-23 16:53:49 +00:00
Artur Pilipenko	80771b9ad9	Upgrade other old memset/memcpy signatures in tests causing buildbot failures with rL273568. llvm-svn: 273580	2016-06-23 16:34:52 +00:00
Hans Wennborg	b510b458b9	[codeview] Emit retained types Differential Revision: http://reviews.llvm.org/D21630 llvm-svn: 273579	2016-06-23 16:33:53 +00:00
Chris Lattner	a9d4085fee	Change the email address for commit access requests to my llvm address. llvm-svn: 273578	2016-06-23 16:29:22 +00:00
Vedant Kumar	c298341566	NFC, add an "end namespace" comment for consistency llvm-svn: 273577	2016-06-23 16:27:08 +00:00
Jonathan Peyton	e119e8e5b5	Remove redundant %libomp-compile step from test/lock/omp_lock.c llvm-svn: 273576	2016-06-23 16:18:59 +00:00
Hans Wennborg	a8b7d4f73f	Revert r273567 "[SystemZ] Let z13 also support FeatureMiscellaneousExtensions." It broke test/CodeGen/SystemZ/vec-extract-02.ll llvm-svn: 273575	2016-06-23 16:13:26 +00:00
Hans Wennborg	a63b50afb8	Revert r273568 "Remangle intrinsics names when types are renamed" It broke 2008-07-15-Bswap.ll and 2009-09-01-PostRAProlog.ll llvm-svn: 273574	2016-06-23 16:13:23 +00:00
Artur Pilipenko	4fec7b7131	Fix an old memset signature in 2009-09-01-PostRAProlog.ll test causing a buildbot failure llvm-svn: 273573	2016-06-23 16:07:10 +00:00
Ben Craig	4067e35fae	[Analyzer] Don't cache report generation ExplodedNodes During the core analysis, ExplodedNodes are added to the ExplodedGraph, and those nodes are cached for deduplication purposes. After core analysis, reports are generated. Here, trimmed copies of the ExplodedGraph are made. Since the ExplodedGraph has already been deduplicated, there is no need to deduplicate again. This change makes it possible to add ExplodedNodes to an ExplodedGraph without the overhead of deduplication. "Uncached" nodes also cannot be iterated over, but none of the report generation code attempts to iterate over all nodes. This change reduces the analysis time of a large .C file from 3m43.941s to 3m40.256s (~1.6% speedup). It should slightly reduce memory consumption. Gains should be roughly proportional to the number (and path length) of static analysis warnings. This patch enables future work that should remove the need for an InterExplodedGraphMap inverse map. I plan on using the (now unused) ExplodedNode link to connect new nodes to the original nodes. http://reviews.llvm.org/D21229 llvm-svn: 273572	2016-06-23 15:47:12 +00:00
Reid Kleckner	02d5315237	Use CreateFileA and add a FIXME to switch to the wide variant No functional change. Required to build with -DUNICODE, as is done in http://reviews.llvm.org/D21643 llvm-svn: 273571	2016-06-23 15:40:42 +00:00
Renato Golin	c1bd489028	[docs] Bump minimum version of CMake in its own doc llvm-svn: 273570	2016-06-23 15:28:00 +00:00
Simon Atanasyan	002e244717	[ELF][MIPS] Support MIPS TLS relocations The patch adds one more partition to the MIPS GOT. This time it is for TLS related GOT entries. Such entries are located after 'local' and 'global' ones. We cannot get a final offset for these entries at the time of creation because we do not know the size of the 'local' and 'global' partitions. So we have to adjust the offset later using the `getMipsTlsOffset()` method. All MIPS TLS relocations which need GOT entries operate on a MIPS-style GOT offset, i.e. 'offset from the GOT's beginning' - MipsGPOffset constant. That is why I add new types of relocation expressions. One more difference from other ABIs is that the MIPS ABI does not support any TLS relocation relaxations. I decided to make a separate function `handleMipsTlsRelocation` and put MIPS TLS relocation handling code there. It is similar to the `handleTlsRelocation` routine and duplicates its code. But it makes the code cleaner and prevents pollution of `handleTlsRelocation` by MIPS 'if' statements. Differential Revision: http://reviews.llvm.org/D21606 llvm-svn: 273569	2016-06-23 15:26:31 +00:00
Artur Pilipenko	f0c9f81379	Remangle intrinsics names when types are renamed This is a fix for the problem mentioned in "LTO and intrinsics mangling" llvm-dev mail thread: http://lists.llvm.org/pipermail/llvm-dev/2016-April/098387.html Reviewers: mehdi_amini, reames Differential Revision: http://reviews.llvm.org/D19373 llvm-svn: 273568	2016-06-23 15:25:09 +00:00
Jonas Paulsson	b1a2b5a708	[SystemZ] Let z13 also support FeatureMiscellaneousExtensions. This processor feature had been left out by mistake from the z13 ProcessorModel. Reviewed by Ulrich Weigand. llvm-svn: 273567	2016-06-23 15:12:06 +00:00
Rafael Espindola	c9d336e549	Restructure the propagation of -fPIC/-fPIE. The PIC and PIE levels are not independent. In fact, if PIE is defined it is always the same as PIC. This is clear in the driver where ParsePICArgs returns a PIC level and a IsPIE boolean. Unfortunately that is currently lost and we pass two redundant levels down the pipeline. This patch keeps a bool and a PIC level all the way down to codegen. llvm-svn: 273566	2016-06-23 15:07:32 +00:00
Simon Dardis	fcc7f6fad2	Revert "[misched] Extend scheduler to handle unsupported features" This reverts commit r273551. Patch contained a wrong check for isUnsupported. llvm-svn: 273565	2016-06-23 14:54:47 +00:00
Aaron Ballman	0da8b2ec09	Explicitly specify the ANSI version of these Win32 APIs. While these are seemingly unrelated changes, they are all NFC because we currently default to the ANSI versions of the APIs when building for Windows. This simply makes the ANSI usage explicit. llvm-svn: 273564	2016-06-23 14:45:54 +00:00
Aaron Ballman	b06a359beb	Fixing a FIXME related to Unicode support on Windows. Converted the Win32 APIs to explicitly use the W version when it involves strings that can hold non-ASCII characters (like file paths). Now explicitly using the A version for strings that will always be ASCII (like registry key paths). No extra tests required as this is currently covered by existing testing, and this is basically impossible to write Unicode-specific tests for. llvm-svn: 273563	2016-06-23 14:33:53 +00:00
Michael Zolotukhin	2d3592d481	[LoopUnrollAnalyzer] Fix a bug in UnrolledInstAnalyzer::visitLoad. When simplifying a load we need to make sure that the type of the simplified value matches the type of the instruction we're processing. In theory, we can handle casts here as we deal with constant data, but since it's not implemented at the moment, we at least need to bail out. This fixes PR28262. llvm-svn: 273562	2016-06-23 14:31:31 +00:00
Valery Pykhtin	a852d695b8	[AMDGPU] Enable absolute expression initializer for amd_kernel_code_t fields. Differential Revision: http://reviews.llvm.org/D21380 llvm-svn: 273561	2016-06-23 14:13:06 +00:00
Simon Pilgrim	595dddb103	[X86][AVX512] Added AVX512F vector sign extend tests Now that Elena has confirmed that PR26474 has been fixed llvm-svn: 273560	2016-06-23 14:01:45 +00:00
Hal Finkel	a1271036c5	Allow DeadStoreElimination to track combinations of partial later writes DeadStoreElimination can currently remove a small store rendered unnecessary by a later larger one, but could not remove a larger store rendered unnecessary by a series of later smaller ones. This adds that capability. It works by keeping a map, which is used as an effective interval map, for each store later overwritten only partially, and filling in that interval map as more such stores are discovered. No additional walking or aliasing queries are used. If the map forms an interval covering the entire earlier store, then it is dead and can be removed. The map is used as an interval map by storing a mapping between the ending offset and the beginning offset of each interval. I discovered this problem when investigating a performance issue with code like this on PowerPC: #include <complex> using namespace std; complex<float> bar(complex<float> C); complex<float> foo(complex<float> C) { return bar(C)*C; } which produces this: define void @_Z4testSt7complexIfE(%"struct.std::complex"* noalias nocapture sret %agg.result, i64 %c.coerce) { entry: %ref.tmp = alloca i64, align 8 %tmpcast = bitcast i64* %ref.tmp to %"struct.std::complex"* %c.sroa.0.0.extract.shift = lshr i64 %c.coerce, 32 %c.sroa.0.0.extract.trunc = trunc i64 %c.sroa.0.0.extract.shift to i32 %0 = bitcast i32 %c.sroa.0.0.extract.trunc to float %c.sroa.2.0.extract.trunc = trunc i64 %c.coerce to i32 %1 = bitcast i32 %c.sroa.2.0.extract.trunc to float call void @_Z3barSt7complexIfE(%"struct.std::complex"* nonnull sret %tmpcast, i64 %c.coerce) %2 = bitcast %"struct.std::complex"* %agg.result to i64* %3 = load i64, i64* %ref.tmp, align 8 store i64 %3, i64* %2, align 4 ; <--- *** THIS SHOULD NOT BE HERE ** %_M_value.realp.i.i = getelementptr inbounds %"struct.std::complex", %"struct.std::complex"* %agg.result, i64 0, i32 0, i32 0 %4 = lshr i64 %3, 32 %5 = trunc i64 %4 to i32 %6 = bitcast i32 %5 to float %_M_value.imagp.i.i = getelementptr inbounds %"struct.std::complex", %"struct.std::complex"* %agg.result, i64 0, i32 0, i32 1 %7 = trunc i64 %3 to i32 %8 = bitcast i32 %7 to float %mul_ad.i.i = fmul fast float %6, %1 %mul_bc.i.i = fmul fast float %8, %0 %mul_i.i.i = fadd fast float %mul_ad.i.i, %mul_bc.i.i %mul_ac.i.i = fmul fast float %6, %0 %mul_bd.i.i = fmul fast float %8, %1 %mul_r.i.i = fsub fast float %mul_ac.i.i, %mul_bd.i.i store float %mul_r.i.i, float* %_M_value.realp.i.i, align 4 store float %mul_i.i.i, float* %_M_value.imagp.i.i, align 4 ret void } The problem here is not just that the i64 store is unnecessary, but also that it blocks further backend optimizations of the other uses of that i64 value. In the future, we might want to add a special case for handling smaller accesses (e.g. using a bit vector) if the map mechanism turns out to be noticeably inefficient. A sorted vector is also a possible replacement for the map for small numbers of tracked intervals. Differential Revision: http://reviews.llvm.org/D18586 llvm-svn: 273559	2016-06-23 13:46:39 +00:00
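
The DeadStoreElimination entry (r273559) describes tracking later partial overwrites in a map from each interval's ending offset to its beginning offset. The following is a minimal standalone sketch of that idea, not the code from the patch: the class `PartialOverwriteTracker` and its member names are hypothetical, offsets are plain byte offsets, and no alias analysis is modeled.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <map>

// Hypothetical illustration only; this is not LLVM's API.
// Tracks which bytes of one earlier store have been overwritten by later,
// smaller stores. Intervals are half-open: [start, end).
class PartialOverwriteTracker {
public:
  PartialOverwriteTracker(int64_t EarlierOff, int64_t EarlierSize)
      : EarlierStart(EarlierOff), EarlierEnd(EarlierOff + EarlierSize) {
    assert(EarlierSize > 0 && "earlier store must write at least one byte");
  }

  // Records a later store to [Off, Off + Size) and returns true once the
  // recorded intervals cover the whole earlier store.
  bool addLaterStore(int64_t Off, int64_t Size) {
    assert(Size > 0 && "later store must write at least one byte");
    int64_t Start = Off;
    int64_t End = Off + Size;

    // The map is keyed by interval end, so lower_bound(Start) yields the
    // first interval that ends at or after the new interval's start.
    auto It = Intervals.lower_bound(Start);

    // Merge every existing interval that overlaps or touches the new one.
    while (It != Intervals.end() && It->second <= End) {
      Start = std::min(Start, It->second);
      End = std::max(End, It->first);
      It = Intervals.erase(It);
    }

    Intervals[End] = Start;
    return isDead();
  }

  // The earlier store is dead when a single merged interval spans it.
  bool isDead() const {
    auto It = Intervals.lower_bound(EarlierEnd);
    return It != Intervals.end() && It->second <= EarlierStart;
  }

private:
  int64_t EarlierStart;
  int64_t EarlierEnd;
  std::map<int64_t, int64_t> Intervals; // interval end -> interval start
};
```

Under these assumptions, the `complex<float>` case from the commit message corresponds to an 8-byte earlier store followed by two 4-byte stores at offsets 0 and 4: the first `addLaterStore` call reports the earlier store as still live, and the second reports it as dead. Keying by ending offset lets one `lower_bound` lookup find the first interval that could merge with a new store.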

234819 Commits