llvm-project

Commit Graph

Author	SHA1	Message	Date
Rui Ueyama	72b1ee2533	Make getColorDiagnostics return a boolean value instead of an enum. Config->ColorDiagnostics was of type enum before. Now it is just a boolean flag. Thanks Rafael for suggestion. llvm-svn: 287978	2016-11-26 15:10:01 +00:00
Rui Ueyama	1880bbed39	Split MergeOutputSection::finalize. llvm-svn: 287977	2016-11-26 15:09:58 +00:00
Sanjay Patel	91e73a7bfa	add optional param to copy metadata when creating selects; NFC There are other spots where we can use this; we're currently dropping metadata in some places, and there are proposed changes where we will want to propagate metadata. IRBuilder's CreateSelect() already has a parameter like this, so this change makes the regular 'Create' API line up with that. llvm-svn: 287976	2016-11-26 15:01:59 +00:00
Craig Topper	10d5eec1a1	[AVX-512] Add unmasked EVEX vpmovzx/sx instructions to load folding tables. llvm-svn: 287975	2016-11-26 08:21:52 +00:00
Craig Topper	97169ea5f9	[AVX-512] Add masked 128/256-bit integer add/sub instructions to load folding tables. llvm-svn: 287974	2016-11-26 08:21:48 +00:00
Tobias Grosser	b45ae5601b	[ScopDetect] Expand statistics of the detected scops We now collect: Number of total loops Number of loops in scops Number of scops Number of scops with maximal loop depth 1 Number of scops with maximal loop depth 2 Number of scops with maximal loop depth 3 Number of scops with maximal loop depth 4 Number of scops with maximal loop depth 5 Number of scops with maximal loop depth 6 and larger Number of loops in scops (profitable scops only) Number of scops (profitable scops only) Number of scops with maximal loop depth 1 (profitable scops only) Number of scops with maximal loop depth 2 (profitable scops only) Number of scops with maximal loop depth 3 (profitable scops only) Number of scops with maximal loop depth 4 (profitable scops only) Number of scops with maximal loop depth 5 (profitable scops only) Number of scops with maximal loop depth 6 and larger (profitable scops only) These statistics are certainly completely accurate as we might drop scops when building up their polyhedral representation, but they should give a good indication of the number of scops we detect. llvm-svn: 287973	2016-11-26 07:37:46 +00:00
Craig Topper	53b33de1e3	[AVX-512] Add masked 512-bit integer add/sub instructions to load folding tables. llvm-svn: 287972	2016-11-26 07:21:00 +00:00
Craig Topper	6677bb4e50	[AVX-512] Teach LowerFormalArguments to use the extended register class when available. Fix the avx512vl stack folding tests to clobber more registers or otherwise they use xmm16 after this change. llvm-svn: 287971	2016-11-26 07:20:57 +00:00
Craig Topper	39265bb1ce	[AVX-512] Add VLX versions of VDIVPD/PS and VMULPD/PS to load folding tables. llvm-svn: 287970	2016-11-26 07:20:53 +00:00
Rafael Espindola	f93b8c29c8	Create sections with just assignments as STT_NOBITS. This matches the behaviour of bfd ld. Using 0 was causing problems with strip, which would remove these sections. llvm-svn: 287969	2016-11-26 06:55:35 +00:00
Tobias Grosser	5c00b0dc74	[ScopDetectionDiagnostic] Collect statistics for each diagnostic type Our original statistics were added before we introduced a more fine-grained diagnostic system, but the granularity of our statistics has never been increased accordingly. This change introduces now one statistic counter per diagnostic to enable us to collect fine-grained statistics about who certain scops are not detected. In case coarser grained statistics are needed, the user is expected to combine counters manually. llvm-svn: 287968	2016-11-26 05:53:09 +00:00
Davide Italiano	3bfa081aa9	[ELF] Be compliant with LLVM and rename Lto into LTO. NFCI. llvm-svn: 287967	2016-11-26 05:37:04 +00:00
Alexander Shaposhnikov	696bd63550	[lldb] Fix typos in file headers This diff fixes typos in file headers (incorrect file names). Test plan: Under llvm/tools/lldb/source: find ./* -type f \| grep -e '$cpp\\|h$$' \| while read F; do B=$(basename $F); echo $F head -n 1 $F \| grep -v $B \| wc -l ; done Differential revision: https://reviews.llvm.org/D27115 llvm-svn: 287966	2016-11-26 05:23:44 +00:00
Tobias Grosser	0dcbcaa98b	[ScopDetectionDiagnostic] IrreducibleRegion is a subclasses of CFG Reflect this correctly in the RejectReasonKind enum. The definition of RejectReasonKind::IrreducibleRegion was introduced in r258497, when we started to refuse regions containing irreducible loops. llvm-svn: 287965	2016-11-26 05:08:27 +00:00
Tobias Grosser	8c21b1a50f	[ScopDetectionDiagnostic] Remove leftover RejectReasonKind for Conditions [NFC] In r248118 some diagnostics for unstructured control flow have been removed, but the corresponding RejectReasonKind was accidentally not removed. This change removes it, as it is not needed any more. llvm-svn: 287964	2016-11-26 05:08:24 +00:00
Tobias Grosser	c64269ea1b	[ScopDectionDiagnostic] Use scoped enums instead three letter prefix [NFC] This improves readability of the code. llvm-svn: 287963	2016-11-26 03:44:31 +00:00
Tom Stellard	1473f07ceb	AMDGPU/SI: Use float as the operand type for amdgcn.interp intrinsics Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D26724 llvm-svn: 287962	2016-11-26 02:26:04 +00:00
Craig Topper	7f76c23781	[X86][XOP] Add a reversed reg/reg form for VPROT instructions. The W bit distinquishes which operand is the memory operand. But if the mod bits are 3 then the memory operand is a register and there are two possible encodings. We already did this correctly for several other XOP instructions. llvm-svn: 287961	2016-11-26 02:14:00 +00:00
Craig Topper	516fd7abfe	[X86] Add SSE, AVX, and AVX2 version of MOVDQU to the load/store folding tables for consistency. Not sure this is truly needed but we had the floating point equivalents, the aligned equivalents, and the EVEX equivalents. So this just makes it complete. llvm-svn: 287960	2016-11-26 02:13:58 +00:00
Kuba Mracek	23551fa811	[asan] Support handle_sigill on Darwin Handling SIGILL on Darwin works fine, so let's just make this feature work and re-enable the ill.cc testcase. Differential Revision: https://reviews.llvm.org/D27141 llvm-svn: 287959	2016-11-26 01:30:31 +00:00
Dylan McKay	656c1fa544	Un-XFAIL an AVR CodeGen test llvm-svn: 287958	2016-11-26 01:07:32 +00:00
Kuba Mracek	073cea6128	[asan] Add a "dump_registers" flag to print out CPU registers after a SIGSEGV This patch prints out all CPU registers after a SIGSEGV. These are available in the signal handler context. Only implemented for Darwin. Can be turned off with the dump_registers flag. Differential Revision: https://reviews.llvm.org/D11365 llvm-svn: 287957	2016-11-26 00:50:08 +00:00
Craig Topper	a363d42973	[AVX-512] Put the AVX-512 sections of the load folding tables into mostly alphabetical order. This is consistent with the older sections of the table. NFC llvm-svn: 287956	2016-11-25 23:21:34 +00:00
David Majnemer	d5648c7a7d	Replace some callers of setTailCall with setTailCallKind We were a little sloppy with adding tailcall markers. Be more consistent by using setTailCallKind instead of setTailCall. llvm-svn: 287955	2016-11-25 22:35:09 +00:00
Sanjay Patel	534e270ae5	[SimplifyCFG] auto-generate better checks; NFC llvm-svn: 287954	2016-11-25 21:12:39 +00:00
Sanjay Patel	d1a147f9f4	[SimplifyCFG] auto-generate better checks; NFC llvm-svn: 287953	2016-11-25 21:07:13 +00:00
Rui Ueyama	d873e3a694	Fix buildbots. llvm-svn: 287952	2016-11-25 20:42:39 +00:00
Rui Ueyama	1df9316922	Fix typo. llvm-svn: 287951	2016-11-25 20:41:45 +00:00
Rui Ueyama	c01321c6b8	Do not print out ARGV0 in white because it's unreadable on white background. llvm-svn: 287950	2016-11-25 20:37:16 +00:00
Rui Ueyama	8c8818a58c	Support -color-diagnostics={auto,always,never}. -color-diagnostics=auto is default because that's the same as Clang's default. When color is enabled, error or warning messages are colored like this. error: <bold>ld.lld</bold> <red>error:</red> foo.o: no such file warning: <bold>ld.lld</bold> <magenta>warning:</magenta> foo.o: no such file Differential Revision: https://reviews.llvm.org/D27117 llvm-svn: 287949	2016-11-25 20:27:32 +00:00
Rui Ueyama	6066641423	We shouldn't call parallle_for_each if -no-thread is given. llvm-svn: 287948	2016-11-25 20:20:57 +00:00
Joerg Sonnenberger	92d91569a1	Typo. llvm-svn: 287947	2016-11-25 20:15:57 +00:00
Rui Ueyama	2555952ba8	Parallelize uncompress() and splitIntoPieces(). Uncompressing section contents and spliting mergeable section contents into smaller chunks are heavy tasks. They scan entire section contents and do CPU-intensive tasks such as uncompressing zlib-compressed data or computing a hash value for each section piece. Luckily, these tasks are independent to each other, so we can do that in parallel_for_each. The number of input sections is large (as opposed to the number of output sections), so there's a large parallelism here. Actually the current design to call uncompress() and splitIntoPieces() in batch was chosen with doing this in mind. Basically what we need to do here is to replace `for` with `parallel_for_each`. It seems this patch improves latency significantly if linked programs contain debug info (which in turn contain lots of mergeable strings.) For example, the latency to link Clang (debug build) improved by 20% on my machine as shown below. Note that ld.gold took 19.2 seconds to do the same thing. Before: 30801.782712 task-clock (msec) # 3.652 CPUs utilized ( +- 2.59% ) 104,084 context-switches # 0.003 M/sec ( +- 1.02% ) 5,063 cpu-migrations # 0.164 K/sec ( +- 13.66% ) 2,528,130 page-faults # 0.082 M/sec ( +- 0.47% ) 85,317,809,130 cycles # 2.770 GHz ( +- 2.62% ) 67,352,463,373 stalled-cycles-frontend # 78.94% frontend cycles idle ( +- 3.06% ) <not supported> stalled-cycles-backend 44,295,945,493 instructions # 0.52 insns per cycle # 1.52 stalled cycles per insn ( +- 0.44% ) 8,572,384,877 branches # 278.308 M/sec ( +- 0.66% ) 141,806,726 branch-misses # 1.65% of all branches ( +- 0.13% ) 8.433424003 seconds time elapsed ( +- 1.20% ) After: 35523.764575 task-clock (msec) # 5.265 CPUs utilized ( +- 2.67% ) 159,107 context-switches # 0.004 M/sec ( +- 0.48% ) 8,123 cpu-migrations # 0.229 K/sec ( +- 23.34% ) 2,372,483 page-faults # 0.067 M/sec ( +- 0.36% ) 98,395,342,152 cycles # 2.770 GHz ( +- 2.62% ) 79,294,670,125 stalled-cycles-frontend # 80.59% frontend cycles idle ( +- 3.03% ) <not supported> stalled-cycles-backend 46,274,151,813 instructions # 0.47 insns per cycle # 1.71 stalled cycles per insn ( +- 0.47% ) 8,987,621,670 branches # 253.003 M/sec ( +- 0.60% ) 148,900,624 branch-misses # 1.66% of all branches ( +- 0.27% ) 6.747548004 seconds time elapsed ( +- 0.40% ) llvm-svn: 287946	2016-11-25 20:05:08 +00:00
Rui Ueyama	623b36e358	Move typedefs inside a class definition. llvm-svn: 287945	2016-11-25 18:51:56 +00:00
Rui Ueyama	22375f2406	Remove a parameter from ScriptParser. llvm-svn: 287944	2016-11-25 18:51:54 +00:00
Rui Ueyama	da06bfb794	Move getLocation from Relocations.cpp to InputSection.cpp. The function was used only within Relocations.cpp, but now we are using it in many places, so this patch moves it to a file that fits to the functionality. llvm-svn: 287943	2016-11-25 18:51:53 +00:00
Marek Olsak	79c05871a2	AMDGPU/SI: Add back reverted SGPR spilling code, but disable it suggested as a better solution by Matt llvm-svn: 287942	2016-11-25 17:37:09 +00:00
Simon Pilgrim	c5fb167df0	Use SDValue helpers instead of explicitly going via SDValue::getNode(). NFCI llvm-svn: 287941	2016-11-25 17:25:21 +00:00
Simon Pilgrim	8e8ae7219f	Use SDValue helper instead of explicitly going via SDValue::getNode(). NFCI llvm-svn: 287940	2016-11-25 17:19:53 +00:00
Craig Topper	88071b37ab	[AVX-512] Add support for changing VSHUFF64x2 to VSHUFF32x4 when its feeding a vselect with 32-bit element size. Summary: Shuffle lowering may have widened the element size of a i32 shuffle to i64 before selecting X86ISD::SHUF128. If this shuffle was used by a vselect this can prevent us from selecting masked operations. This patch detects this and changes the element size to match the vselect. I don't handle changing integer to floating point or vice versa as its not clear if its better to push such a bitcast to the inputs of the shuffle or to the user of the vselect. So I'm ignoring that case for now. Reviewers: delena, zvi, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27087 llvm-svn: 287939	2016-11-25 16:48:05 +00:00
Eugene Leviant	f04777527e	[ELF] Add explicit template instantiations for toString llvm-svn: 287938	2016-11-25 16:42:04 +00:00
Craig Topper	1e48829747	[AVX-512] Add VPERMT2* and VPERMI2* instructions to load folding tables. llvm-svn: 287937	2016-11-25 16:33:53 +00:00
Marek Olsak	e3895bfb47	Revert "AMDGPU: Implement SGPR spilling with scalar stores" This reverts commit 4404d0d6e354e80dd7f8f0a0e12d8ad809cf007e. llvm-svn: 287936	2016-11-25 16:03:34 +00:00
Marek Olsak	dad553a5cf	Revert "AMDGPU: Fix MMO when splitting spill" This reverts commit 79d4f8b8b1ce430c3d5dac4fc72a9eebaed24fe1. llvm-svn: 287935	2016-11-25 16:03:27 +00:00
Marek Olsak	8cbbf65361	Revert "AMDGPU: Fix adding extra implicit def of register" This reverts commit e834ce5976567575621901fb967b8018b9916d71. llvm-svn: 287934	2016-11-25 16:03:22 +00:00
Marek Olsak	713e6fc531	Revert "AMDGPU: Fix not setting kill flag on temp reg when spilling" This reverts commit 057bbbe4ae170247ba37f08f2e70ef185267d1bb. llvm-svn: 287933	2016-11-25 16:03:19 +00:00
Marek Olsak	a45dae458d	Revert "AMDGPU: Make m0 unallocatable" This reverts commit 124ad83dae04514f943902446520c859adee0e96. llvm-svn: 287932	2016-11-25 16:03:15 +00:00
Marek Olsak	ea848df84c	Revert "AMDGPU: Remove m0 spilling code" This reverts commit f18de36554eb22416f8ba58e094e0272523a4301. llvm-svn: 287931	2016-11-25 16:03:06 +00:00
Marek Olsak	18a95bcb3c	Revert "AMDGPU: Preserve m0 value when spilling" This reverts commit a5a179ffd94fd4136df461ec76fb30f04afa87ce. llvm-svn: 287930	2016-11-25 16:03:02 +00:00
Eric Liu	6135581cdf	Do not do raw name replacement when FromDecl is a class forward-declaration. Summary: If the `FromDecl` is a class forward declaration, the reference is still considered as referring to the original definition given the nature of forward-declarations, so we can't do a raw name replacement in this case. Reviewers: bkramer Subscribers: cfe-commits, klimek Differential Revision: https://reviews.llvm.org/D27132 llvm-svn: 287929	2016-11-25 16:02:49 +00:00

1 2 3 4 5 ...

248281 Commits All Branches Search

248281 Commits

All Branches