llvm-project

Commit Graph

Author	SHA1	Message	Date
Xinliang David Li	2a27fc40a8	Test cleanup -- remove duplicate run lines llvm-svn: 255673	2015-12-15 21:15:06 +00:00
Tom Stellard	a6f24c6565	AMDGPU/SI: Select constant loads with non-uniform addresses to MUBUF instructions Summary: We were previously selecting all constant loads to SMRD instructions and legalizing the SMRDs with non-uniform addresses during the SIFixSGPRCopesPass. This new solution is more simple and also generates much better code, because the instruction selector is able to take advantage of all the MUBUF addressing modes that are legalization pass wasn't able to. We also no longer need to generate v_add_* instructions when we have a uniform pointer and a non-uniform offset, as this is now folded into the MUBUF instruction during instruction selection. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15425 llvm-svn: 255672	2015-12-15 20:55:55 +00:00
Alex Denisov	596e97924a	LLVM tutorial: fix broken links/anchors llvm-svn: 255671	2015-12-15 20:50:29 +00:00
Xinliang David Li	4ec401406e	Coverage code refactoring /NFC llvm-svn: 255670	2015-12-15 19:44:45 +00:00
Justin Bogner	843fb204b7	LPM: Stop threading `Pass ` through all of the loop utility APIs. NFC A large number of loop utility functions take a `Pass ` and reach into it to find out which analyses to preserve. There are a number of problems with this: - The APIs have access to pretty well any Pass state they want, so it's hard to tell what they may or may not do. - Other APIs have copied these and pass around a `Pass *` even though they don't even use it. Some of these just hand a nullptr to the API since the callers don't even have a pass available. - Passes in the new pass manager don't work like the current ones, so the APIs can't be used as is there. Instead, we should explicitly thread the analysis results that we actually care about through these APIs. This is both simpler and more reusable. llvm-svn: 255669	2015-12-15 19:40:57 +00:00
James Y Knight	33beb24318	[Sparc] Fix handling of double incoming arguments on sparc little-endian. On SparcV8, doubles get passed in two 32-bit integer registers. The call code was already handling endianness correctly, but the incoming argument code was not -- it got the two halves in opposite order. Also remove some dead code in LowerFormalArguments_32 to handle less-than-32bit values, which can't actually happen. Finally, add some test cases for the 32-bit calling convention, cribbed from the 64abi.ll test, and run for both big and little-endian. llvm-svn: 255668	2015-12-15 19:23:12 +00:00
Krzysztof Parzyszek	6c3b837452	Unsupport test that should not be run on Hexagon llvm-svn: 255667	2015-12-15 19:14:24 +00:00
Akira Hatanaka	a84428e687	[Docs] Fix Unexpected indentation errors. llvm-svn: 255665	2015-12-15 19:11:48 +00:00
Michael Kuperstein	53946bf8c6	[X86] MOVPC32r should only emit CFI adjustments when needed We only want to emit CFI adjustments when actually using DWARF. This fixes PR25828. Differential Revision: http://reviews.llvm.org/D15522 llvm-svn: 255664	2015-12-15 18:50:32 +00:00
Tom Stellard	0abd28140b	AMDGPU: Add aliases for all VI targets llvm-svn: 255663	2015-12-15 18:37:04 +00:00
Tom Stellard	655680a22b	AMDGPU: Add alias for tonga Patch by: Vedran Mileti llvm-svn: 255662	2015-12-15 18:37:02 +00:00
Tom Stellard	dbe374b2c5	AMDGPU/SI: Implement AMDGPUTargetTransformInfo::isSourceOfDivergence() Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15476 llvm-svn: 255661	2015-12-15 18:04:38 +00:00
Sanjay Patel	38a022623a	[SimplifyCFG] allow speculation of exactly one expensive instruction (PR24818) This is the last general step to allow more IR-level speculation with a safety harness in place in CodeGenPrepare. The intent is to restore the behavior enabled by: http://reviews.llvm.org/rL228826 but prevent bad performance such as: https://llvm.org/bugs/show_bug.cgi?id=24818 Earlier patches in this sequence: D12882 (disable SimplifyCFG speculation for expensive instructions) D13297 (have CGP despeculate expensive ops) D14630 (have CGP despeculate special versions of cttz/ctlz) As shown in the test cases, we only have two instructions currently affected: ctz for some x86 and fdiv generally. Allowing exactly one expensive instruction is a bit of a hack, but it lines up with what is currently implemented in CGP. If we make the despeculation more general in CGP, we can make the speculation here more liberal. A follow-up patch will adjust the cost for sqrt and possibly other typically expensive math intrinsics (currently everything is cheap by default). GPU targets would likely want to override those expensive default costs (just as they probably should already override the cost of div/rem) because just about any math is cheaper than control-flow on those targets. Differential Revision: http://reviews.llvm.org/D15213 llvm-svn: 255660	2015-12-15 17:38:29 +00:00
Nathan Slingerland	7f5b47ddd4	[llvm-profdata] Add support for weighted merge of profile data (2nd try) Summary: This change adds support for specifying a weight when merging profile data with the llvm-profdata tool. Weights are specified by using the --weighted-input=<weight>,<filename> option. Input files not specified with this option (normal positional list after options) are given a default weight of 1. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: davidxl, dnovillo, bogner, silvas Subscribers: silvas, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D15306 llvm-svn: 255659	2015-12-15 17:37:09 +00:00
Nicolai Hahnle	78fd4f087b	AMDGPU: mark ldexp LibCalls as unavailable Summary: The LibCallSimplifier will turn llvm.exp2.* intrinsics into ldexp* libcalls which do not make sense with the AMDGPU backend. In the long run, we'll want an llvm.ldexp.* intrinsic to properly make use of this optimization, but this works around the problem for now. See also: http://reviews.llvm.org/D14327 (suggested llvm.ldexp.* implementation) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92709 Reviewers: arsenm, tstellarAMD Differential Revision: http://reviews.llvm.org/D14990 llvm-svn: 255658	2015-12-15 17:24:15 +00:00
Tom Stellard	8f307217c3	AMDGPU/SI: Fix bitcast between v2f32 and f64 The radeonsi fp64 support can hit these now that some redundant bitcasts are folded. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 255657	2015-12-15 17:11:17 +00:00
Hans Wennborg	08d5905bac	[X86] Smaller code for materializing 32-bit 1 and -1 constants "movl $-1, %eax" is 5 bytes, "xorl %eax, %eax; decl %eax" is 3 bytes. This commit makes LLVM use the latter when optimizing for size. Differential Revision: http://reviews.llvm.org/D14971 llvm-svn: 255656	2015-12-15 17:10:28 +00:00
Nico Weber	0d10b2cf3c	clang-cl: Add an alias for /wd4100 llvm-svn: 255655	2015-12-15 17:07:16 +00:00
JF Bastien	dac806c783	WebAssembly: update expected torture test failures We now have 252 expected failures. llvm-svn: 255654	2015-12-15 17:07:07 +00:00
Krzysztof Parzyszek	372bd80834	[Hexagon] Preprocess mapped instructions before lowering to MC llvm-svn: 255653	2015-12-15 17:05:45 +00:00
Tom Stellard	43f52df0b5	AMDGPU/SI: Add llvm.amdgcn.mbcnt.* intrinsics Summary: These are meant to be used instead of the llvm.SI.tid intrinsic which will be deprecated at some point. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15475 llvm-svn: 255652	2015-12-15 17:02:52 +00:00
Tom Stellard	ad7d03daa6	AMDGPU/SI: Add llvm.amdgcn.v.interp.p[12] intrinsics Summary: These are meant to be used instead of the llvm.SI.fs.interp intrinsic which will be deprecated at some point. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15474 llvm-svn: 255651	2015-12-15 17:02:49 +00:00
Tom Stellard	ac00eb5470	AMDGPU/SI: Add getShaderType() function to Utils/ Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15424 llvm-svn: 255650	2015-12-15 16:26:16 +00:00
Nemanja Ivanovic	8922476bcb	Bitcasts between FP and INT values using direct moves This patch corresponds to review: http://reviews.llvm.org/D15286 This patch was meant to land in revision 255246, but I accidentally uploaded the patch that corresponds to http://reviews.llvm.org/D15372 in that revision accidentally. Thereby, this patch is the actual Bitcasts using direct moves patch, whereas http://reviews.llvm.org/rL255246 actually corresponds to http://reviews.llvm.org/D15372. llvm-svn: 255649	2015-12-15 14:50:34 +00:00
Michael Zuckerman	724d02a21e	[Microsoft][C++] Clang doesn't support a use of "this" pointer inside inline asm add triple to test Differential Revision: http://reviews.llvm.org/D15115 llvm-svn: 255647	2015-12-15 14:35:51 +00:00
George Rimar	5be170ed8b	Fixed mistype in comment. NFC. llvm-svn: 255646	2015-12-15 14:20:57 +00:00
Michael Zuckerman	229158c491	[Microsoft][C++] Clang doesn't support a use of "this" pointer inside inline asm Clang doesn’t support a use of “this” pointer inside inline asm. When I tried to compile a class or a struct (see example) with an inline asm that contains "this" pointer. Clang returns with an error. This patch fixes that. error: expected unqualified-id For example: ''' struct A { void f() { __asm mov eax, this // error: expected unqualified-id } }; ''' Differential Revision: http://reviews.llvm.org/D15115 llvm-svn: 255645	2015-12-15 14:04:18 +00:00
Asaf Badouh	5acf66ff97	[x86] adding PKU feature flag the feature flag is essential for RDPKRU and WRPKRU instruction more about the instruction can be found in the SDM rev 56, vol 2 from http://www.intel.com/sdm Differential Revision: http://reviews.llvm.org/D15491 llvm-svn: 255644	2015-12-15 13:35:29 +00:00
Michael Kuperstein	801ee74167	Do not try to use i8 and i16 versions of FP_TO_U/SINT soft float library calls It appears that neither compiler-rt nor the gnu soft-float libraries actually implement these conversions. Instead of emitting calls to library functions that don't exist, handle it similarly to the way we handle i8 -> float and i16 -> float conversions: call the i32 library function, and adjust the type. Differential Revision: http://reviews.llvm.org/D15151 llvm-svn: 255643	2015-12-15 12:55:50 +00:00
Nemanja Ivanovic	b033f67df0	Define a feature for __float128 support in the PPC back end This patch corresponds to review: http://reviews.llvm.org/D15117 In preparation for supporting IEEE Quad precision floating point, this patch simply defines a feature to specify the target supports this. For now, nothing is done with the target feature, we just don't want warnings from the Clang FE when a user specifies -mfloat128. Calling convention and other related work will add to this patch in the near future. llvm-svn: 255642	2015-12-15 12:19:34 +00:00
Tamas Berghammer	0ecdae1bdc	Merge ENABLE_THREADS and ENABLE_STD_THREADS markers Both of these markers are used in the test suit for annotating when a test needs multi threaded support. Previously they had slightly different meening but they converged to the point where they are used interchangably. This CL removes the ENABLE_STD_THREADS one to simplify the test suite and avoid some confusion. Differential revision: http://reviews.llvm.org/D15498 llvm-svn: 255641	2015-12-15 12:11:00 +00:00
Alexey Bataev	d60e2a3ebf	[OPENMP 4.5] Fix test compatibility with 32 bit mode. llvm-svn: 255640	2015-12-15 11:38:29 +00:00
Alexey Bataev	fc57d1601d	[OPENMP 4.5] Codegen for 'hint' clause of 'critical' directive OpenMP 4.5 defines 'hint' clause for 'critical' directive. Patch adds codegen for this clause. llvm-svn: 255639	2015-12-15 10:55:09 +00:00
Cong Hou	3ba9cf6020	Improve the successor list update in TailDuplication.cpp. This patch improves a temporary fix in r255530 so that we can normalize successor list without trigger assertion failures in tail duplication pass. llvm-svn: 255638	2015-12-15 10:10:40 +00:00
NAKAMURA Takumi	ec6b1fcf63	InstCombineLoadStoreAlloca.cpp: Avoid instantiating Twine. llvm-svn: 255637	2015-12-15 09:37:31 +00:00
NAKAMURA Takumi	b4a6884844	clang/test/Analysis/padding_c.c: Suppress a test incompatible to i686-linux. error: 'warning' diagnostics expected but not seen: File clang/test/Analysis/padding_c.c Line 194 (directive at clang/test/Analysis/padding_c.c:193): Excessive padding in 'struct DefaultAttrAlign' 1 error generated. llvm-svn: 255636	2015-12-15 09:37:01 +00:00
Benjamin Kramer	5c248d89f3	[libclang] Add a flag to create the precompiled preamble on the first parse. Summary: The current default is to create the preamble on the first reparse, aka second parse. This is useful for clients that do not want to block when opening a file because serializing the preamble takes a bit of time. However, this makes the reparse much more expensive and that may be on the critical path as it's the first interaction a user has with the source code. YouCompleteMe currently optimizes for the first code interaction by parsing the file twice when loaded. That's just unnecessarily slow and this flag helps to avoid that. Reviewers: doug.gregor, klimek Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D15490 llvm-svn: 255635	2015-12-15 09:30:31 +00:00
James Molloy	6045cc89bd	[PassManagerBuilder] Add a few more scalar optimization passes This patch does two things: 1. mem2reg is now run immediately after globalopt. Now that globalopt can localize variables more aggressively, it makes sense to lower them to SSA form earlier rather than later so they can benefit from the full set of optimization passes. 2. More scalar optimizations are run after the loop optimizations in LTO mode. The loop optimizations (especially indvars) can clean up scalar code sufficiently to make it worthwhile running more scalar passes. I've particularly added SCCP here as it isn't run anywhere else in the LTO pass pipeline. Mem2reg is super cheap and shouldn't affect compilation time at all. The rest of the added passes are in the LTO pipeline only so doesn't affect the vast majority of compilations, just the link step. llvm-svn: 255634	2015-12-15 09:24:01 +00:00
Mehdi Amini	4b8d75b596	Mark ThreadPool unittests as unsupported on PowerPC64 Bots are crashing unexpectingly, see: https://llvm.org/bugs/show_bug.cgi?id=25829 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255633	2015-12-15 09:10:28 +00:00
Mehdi Amini	942e52c70b	ThreadPool unittest: add a rough mechanism to mark UNSUPPORTED on a given platform From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255632	2015-12-15 09:10:25 +00:00
George Rimar	c7dc0be36a	Reapply fixed r255626 that broke buildbot: [ELF] - refactor of code in RelocationSection<ELFT>::writeTo() Just a little reformat of 'if' conditions, NFC. Differential revision: http://reviews.llvm.org/D15453 Fix was: * Renamed unsigned Rel; to unsigned Reloc; llvm-svn: 255631	2015-12-15 08:48:39 +00:00
Gabor Horvath	454564a2d9	[clang-tidy] Check for suspicious string assignments. It is possible to assign arbitrary integer types to strings. Sometimes it is the result of missing to_string call or apostrophes. Reviewers: alexfh Differential Revision: http://reviews.llvm.org/D15411 llvm-svn: 255630	2015-12-15 08:47:20 +00:00
Elena Demikhovsky	6015f5c823	Type legalizer for masked gather and scatter intrinsics. Full type legalizer that works with all vectors length - from 2 to 16, (i32, i64, float, double). This intrinsic, for example void @llvm.masked.scatter.v2f32(<2 x float>%data , <2 x float*>%ptrs , i32 align , <2 x i1>%mask ) requires type widening for data and type promotion for mask. Differential Revision: http://reviews.llvm.org/D13633 llvm-svn: 255629	2015-12-15 08:40:41 +00:00
George Rimar	b076446368	Revert of r255626 "[ELF] - refactor of code in RelocationSection<ELFT>::writeTo()" as it broke buildbot: http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/17836/steps/build_Lld/logs/stdio /Users/buildslave/as-bldslv9/lld-x86_64-darwin13/llvm.src/tools/lld/ELF/OutputSections.cpp:268:14: error: redefinition of 'Rel' unsigned Rel; ^ /Users/buildslave/as-bldslv9/lld-x86_64-darwin13/llvm.src/tools/lld/ELF/OutputSections.cpp:241:34: note: previous definition is here for (const DynamicReloc<ELFT> &Rel : Relocs) { That compiles fine on my MSVS 2015 thought. llvm-svn: 255628	2015-12-15 08:39:42 +00:00
Gabor Horvath	009c5d52e3	Add a new matcher to match character types. llvm-svn: 255627	2015-12-15 08:35:45 +00:00
George Rimar	e3556420c1	[ELF] - refactor of code in RelocationSection<ELFT>::writeTo() Just a little reformat of 'if' conditions, NFC. Differential revision: http://reviews.llvm.org/D15453 llvm-svn: 255626	2015-12-15 08:23:08 +00:00
Alexey Bataev	28c75417b2	[OPENMP 4.5] Parsing/sema for 'hint' clause of 'critical' directive. OpenMP 4.5 adds 'hint' clause to critical directive. Patch adds parsing/semantic analysis for this clause. llvm-svn: 255625	2015-12-15 08:19:24 +00:00
Craig Topper	cc03b49444	[IR] Add classof for GetElementPtrConstantExpr, CompareConstantExpr, InsertValueConstantExpr, and ExtractValueConstantExpr. All but CompareConstantExpr were being used in casts that were erroneously using ConstantExpr::classof due to inheritance. While there use cast<CompareConstantExpr> to simplify code slightly. I believe in one place we were always casting to ExtractValueConstantExpr when we were trying to choose between ExtractValueConstantExpr and InsertValueConstantExpr because of this. But since they have identical layouts this didn't cause any observable problems. llvm-svn: 255624	2015-12-15 06:11:36 +00:00
Craig Topper	1c3f28313e	Use CmpInst::Predicate instead of 'unsigned short' in some places. NFC llvm-svn: 255623	2015-12-15 06:11:33 +00:00
Simon Atanasyan	350311974b	[ELF][MIPS] Remove applying the redundant bit-mask The `mipsHigh` return type is `uint16_t` so we do not need to extract low 16-bits from return value explicitly. llvm-svn: 255622	2015-12-15 06:06:34 +00:00

1 2 3 4 5 ...

218017 Commits All Branches Search

218017 Commits

All Branches