llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	e304168853	Move PreprocessorOptions to std::shared_ptr from IntrusiveRefCntPtr llvm-svn: 291160	2017-01-05 19:11:36 +00:00
David Blaikie	f95113dacf	Move FailedModulesSet over to shared_ptr from IntrusiveRefCntPtr llvm-svn: 291159	2017-01-05 19:11:31 +00:00
Simon Pilgrim	a8bf97569a	[CostModel][X86] Move vXi64 MUL costs into existing tables. NFCI. Removes need for yet another LUT. llvm-svn: 291158	2017-01-05 19:01:50 +00:00
Andrew Kaylor	7353cf4623	[LICM] Small update to note changes made in hoistRegion Differential Revision: https://reviews.llvm.org/D28363 llvm-svn: 291157	2017-01-05 18:53:24 +00:00
David Blaikie	feaf9d1463	Move VariantMatcher's Payload to std::shared_ptr rather than IntrusiveRefCntPtr llvm-svn: 291156	2017-01-05 18:51:54 +00:00
David Blaikie	95dd362c77	Simplify ASTReader ctor by using in-class initializers for many member variables llvm-svn: 291155	2017-01-05 18:45:45 +00:00
David Blaikie	9d7c1ba5cf	Simplify ASTReader ctor by using in-class initializers (NSDMIs to the rest of you) for many member variables llvm-svn: 291154	2017-01-05 18:45:43 +00:00
Simon Pilgrim	430d34fc14	[CostModel][X86] Strip unused 256-bit vector shift costs. NFCI. Remove SSE2 256-bit entries - AVX targets will have used the SSE42 costs instead. llvm-svn: 291152	2017-01-05 18:36:48 +00:00
Sanjay Patel	686527c1e0	[x86] add test to show bug in select lowering; NFC llvm-svn: 291151	2017-01-05 18:35:44 +00:00
David Blaikie	61137e1a50	Use shared_ptr instead of IntrusiveRefCntPtr for ModuleFileExtension The intrusiveness wasn't needed here, so this simplifies/clarifies the ownership model. llvm-svn: 291150	2017-01-05 18:23:18 +00:00
Simon Pilgrim	b01e844241	[CostModel][X86] Include the cost of 256-bit upper subvector extract/insertion in AVX1 v4i64 MUL Matches other MUL/ADD/SUB 256-bit case on AVX1 llvm-svn: 291149	2017-01-05 18:20:25 +00:00
Joerg Sonnenberger	e9987a1d2f	Typo llvm-svn: 291148	2017-01-05 17:59:44 +00:00
Joerg Sonnenberger	d7baada5dd	Typo llvm-svn: 291147	2017-01-05 17:59:22 +00:00
Simon Pilgrim	f74700aa8c	[CostModel][X86] Merged SK_PermuteSingleSrc/SK_PermuteTwoSrc into common shuffle cost LUTs. NFCI. llvm-svn: 291146	2017-01-05 17:56:19 +00:00
Saleem Abdulrasool	58a0dcee80	thread_support: split out {,non-}recursive mutex Split out the recursive and non-recursive mutex. This split is needed for platforms which may use differing types for the two mutex (e.g. Win32 threads). llvm-svn: 291145	2017-01-05 17:54:45 +00:00
Matt Arsenault	ec63f62c58	Reapply r291025 ("AMDGPU: Remove unneccessary intermediate vector") Arrays are supposed to be static const llvm-svn: 291144	2017-01-05 17:36:11 +00:00
David Blaikie	0a0c275ffd	Migrate PathDiagnosticPiece to std::shared_ptr Simplifies and makes explicit the memory ownership model rather than implicitly passing/acquiring ownership. llvm-svn: 291143	2017-01-05 17:26:53 +00:00
Saleem Abdulrasool	16a6efe43d	test: add a requires registered target It seems that the ARM buildbots do not include x86 support. However, other x86 targets do not support the ARM target. Use a x86 triple and require the registered target. llvm-svn: 291142	2017-01-05 17:09:20 +00:00
Mike Aizatsky	dc58a7d618	Revert "[sancov] introducing SANCOV_OPTIONS" and related changes https://llvm.org/svn/llvm-project/compiler-rt/trunk@291068 llvm-svn: 291141	2017-01-05 16:55:56 +00:00
Chad Rosier	e20a3a4831	[AArch64][CostModel] Add coverage for bswap intrinsics. llvm-svn: 291140	2017-01-05 16:55:32 +00:00
Justin Lebar	e2cd288f57	[Docs] Update docs to indicate that CUDA compilation is supported on Windows. Subscribers: cfe-commits, llvm-commits Differential Revision: https://reviews.llvm.org/D28326 llvm-svn: 291139	2017-01-05 16:54:28 +00:00
Justin Lebar	b8f7a3b8b1	[CUDA] Rename keywords used in macro so they don't conflict with MSVC. Summary: MSVC seems to use "__in" and "__out" for its own purposes, so we have to pick different names in this macro. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28325 llvm-svn: 291138	2017-01-05 16:54:11 +00:00
Justin Lebar	11d5116904	[CUDA] Don't define functions that the CUDA headers themselves define on Windows. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28324 llvm-svn: 291137	2017-01-05 16:53:55 +00:00
Justin Lebar	86c4e63ff9	[CUDA] Let NVPTX inherit the host's calling conventions. Summary: When compiling device code, we may still see host code with explicit calling conventions. NVPTX needs to claim that it supports these CCs, so that (a) we don't raise noisy warnings, and (b) we don't break existing code which relies on the existence of these CCs when specializing templates. (If a CC doesn't exist, clang ignores it, so two template specializations which are different only insofar as one specifies a CC are considered identical and therefore are an error if that CC is not supported.) Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28323 llvm-svn: 291136	2017-01-05 16:53:38 +00:00
Justin Lebar	b662659355	[CUDA] More correctly inherit primitive types from the host during device compilation. Summary: CUDA lets users share structs between the host and device, so for that and other reasons, primitive types such as ptrdiff_t should be the same on both sides of the compilation. Our code to do this wasn't entirely successful. In particular, we did a bunch of work during the NVPTXTargetInfo constructor, only to override it in the NVPTX{32,64}TargetInfo constructors. It worked well enough on Linux and Mac, but Windows is LLP64, which is different enough to break it. This patch removes the NVPTX{32,64}TargetInfo classes entirely and fixes the bug described above. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28322 llvm-svn: 291135	2017-01-05 16:53:21 +00:00
Justin Lebar	0203f2c26e	[CUDA] Add __declspec spellings for CUDA attributes. Summary: CUDA attributes are spelled __declspec(__foo__) on Windows. Reviewers: tra Subscribers: cfe-commits, rnk Differential Revision: https://reviews.llvm.org/D28321 llvm-svn: 291134	2017-01-05 16:53:04 +00:00
Justin Lebar	33387ea045	[ToolChains] Use "static" instead of an anonymous namespace for a function. NFC llvm-svn: 291133	2017-01-05 16:52:47 +00:00
Xin Tong	9efb049fb3	Remove a unnecessary hasLoopInvariantOperands check in loop sink. Summary: Preheader instruction's operands will always be invariant w.r.t. the loop which its the preheader for. Memory aliases are handled in canSinkOrHoistInst. Reviewers: danielcdh, davidxl Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D28270 llvm-svn: 291132	2017-01-05 16:52:37 +00:00
Justin Lebar	58891907fe	[Driver] Driver changes to support CUDA compilation on Windows. Summary: For the most part this is straightforward: Just add a CudaInstallation object to the MSVC and MinGW toolchains. CudaToolChain has to override computeMSVCVersion so that Clang::constructJob passes the right version flag to cc1. We have to modify IsWindowsMSVC and friends in Clang::constructJob to be true when compiling CUDA device code on Windows for the same reason. Depends on: D28319 Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28320 llvm-svn: 291131	2017-01-05 16:52:29 +00:00
Justin Lebar	dda1d844fb	[CUDA] Make CUDAInstallationDetector take the host triple in its constructor. Summary: Previously it was taking the true target triple, which is not really what it needs: The location of the CUDA installation depends on the host OS. Reviewers: tra Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D28319 llvm-svn: 291130	2017-01-05 16:52:11 +00:00
Justin Lebar	4086fe5cd1	[TableGen] Only normalize the spelling of GNU-style attributes. Summary: When Sema looks up an attribute name, it strips off leading and trailing "__" if the attribute is GNU-style. That is, __attribute__((foo)) and __attribute__((__foo__)) are equivalent. This is only true for GNU-style attributes. In particular, __declspec(__foo__) is not equivalent to __declspec(foo), and Sema respects this difference. This patch fixes TableGen to match Sema's behavior. The spelling 'GNU<"__foo__">' should be normalized to 'GNU<"foo">', but 'Declspec<"__foo__">' should not be changed. This is necessary to make CUDA compilation work on Windows, because e.g. the __device__ attribute is spelled __declspec(__device__). Attr.td does not contain any Declspec spellings that start or end with "__", so this change should not affect any other attributes. Reviewers: rnk Subscribers: cfe-commits, tra Differential Revision: https://reviews.llvm.org/D28318 llvm-svn: 291129	2017-01-05 16:51:54 +00:00
Justin Lebar	1863d611f8	[Windows] Remove functions in intrin.h that are defined in Builtin.def. Summary: These duplicate declarations cause a problem for CUDA compiles on Windows. All implicitly-defined functions are host+device, and this applies to the declarations in Builtin.def. But then when we see the declarations in intrin.h, they have no attributes, so are host-only functions. This is an error. (A better fix might be to make these builtins host-only, but that is a much bigger change.) Reviewers: rnk Subscribers: cfe-commits, echristo Differential Revision: https://reviews.llvm.org/D28317 llvm-svn: 291128	2017-01-05 16:51:37 +00:00
Zvi Rackover	b10f7de3b5	[X86] Add test cases that cover pr31551. NFC. llvm-svn: 291127	2017-01-05 16:48:28 +00:00
Sanjay Patel	dea5a7bd53	less braces; NFC llvm-svn: 291126	2017-01-05 16:47:32 +00:00
Saleem Abdulrasool	7bf88b3c1f	test: add an explicit triple Not all targets use the integrated assembler. Specify a triple to ensure we use the integrated as for this. llvm-svn: 291125	2017-01-05 16:36:15 +00:00
Samuel Antao	f83efdb77a	[OpenMP] Add fields for flags in the offload entry descriptor. Summary: This patch adds two fields to the offload entry descriptor. One field is meant to signal Ctors/Dtors and `link` global variables, and the other is reserved for runtime library use. Currently, these fields are only filled with zeros in the current code generation, but that will change when `declare target` is added. The reason, we are adding these fields now is to make the code generation consistent with the runtime library proposal under review in https://reviews.llvm.org/D14031. Reviewers: ABataev, hfinkel, carlo.bertolli, kkwli0, arpith-jacob, Hahnfeld Subscribers: cfe-commits, caomhin, jholewinski Differential Revision: https://reviews.llvm.org/D28298 llvm-svn: 291124	2017-01-05 16:02:49 +00:00
Saleem Abdulrasool	888e289ed7	CodeGen: plumb header search down to the IAS inline assembly may use the `.include` directive to include other content into the file. Without the integrated assembler, the `-I` group gets passed to the assembler. Emulate this by collecting the header search paths and passing them to the IAS. Resolves PR24811! llvm-svn: 291123	2017-01-05 16:02:32 +00:00
Simon Pilgrim	bca02f9e20	[CostModel][X86] Add support for broadcast shuffle costs Currently only for broadcasts with input and output of the same width. Differential Revision: https://reviews.llvm.org/D27811 llvm-svn: 291122	2017-01-05 15:56:08 +00:00
Arpith Chacko Jacob	406acdba61	[OpenMP] Update target codegen for NVPTX device. This patch includes updates for codegen of the target region for the NVPTX device. It moves initializers from the compiler to the runtime and updates the worker loop to assume parallel work is retrieved from the runtime. A subsequent patch will update the codegen to retrieve the parallel work using calls to the runtime. It includes the removal of the inline attribute for the worker loop and disabling debug info in it. This allows codegen for a target directive and serial execution on the NVPTX device. Reviewers: ABataev Differential Revision: https://reviews.llvm.org/D28125 llvm-svn: 291121	2017-01-05 15:24:05 +00:00
Zvi Rackover	4b7d724d62	[X86] Optimize vector shifts with variable but uniform shift amounts Summary: For instructions such as PSLLW/PSLLD/PSLLQ a variable shift amount may be passed in an XMM register. The lower 64-bits of the register are evaluated to determine the shift amount. This patch improves the construction of the vector containing the shift amount. Reviewers: craig.topper, delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28353 llvm-svn: 291120	2017-01-05 15:11:43 +00:00
Teresa Johnson	2b60384581	[ThinLTO] Add parenthesis as per build warning Fixes a warning about "\|\|" and "&&" due to r291108. llvm-svn: 291119	2017-01-05 15:10:10 +00:00
Hafiz Abid Qadeer	66c007d507	Skip a test on darwin. My earlier commit today seem to cause a failure on a darwin buildbot. I am skipping the test while I investigate the failure. llvm-svn: 291118	2017-01-05 15:09:07 +00:00
Chad Rosier	3ccd1dffff	[AArch64] Remove mcpu option as this test is not target specific. NFC. llvm-svn: 291117	2017-01-05 15:05:03 +00:00
Tony Jiang	3a2f00b024	[PowerPC] Implement missing ISA 2.06 instructions. Instructions: fctidu[.], fctiwu[.], ftdiv, ftsqrt are not implemented. Implement them and add corresponding test cases in this patch. llvm-svn: 291116	2017-01-05 15:00:45 +00:00
Teresa Johnson	e27b058de3	[ThinLTO] Use DenseSet instead of SmallPtrSet for holding GUIDs Should fix some more bot failures from r291108. This should have been a DenseSet, since GUID is not a pointer type. It caused some bots to fail, but for some reason I wasnt't getting a build failure. llvm-svn: 291115	2017-01-05 14:59:56 +00:00
Simon Pilgrim	fd93a54fc8	Wdocumentation fix llvm-svn: 291114	2017-01-05 14:58:54 +00:00
Rafael Espindola	bd3ab097f6	Move code to the .cpp file. NFC. llvm-svn: 291113	2017-01-05 14:52:46 +00:00
Chad Rosier	e1dc73d9a7	[AArch64] Remove unused arguments from tests. NFC. llvm-svn: 291112	2017-01-05 14:48:53 +00:00
Teresa Johnson	01e7236748	[ThinLTO] Update new ModuleSummaryIndexYAML.h for r291108 Should fix bot failures due to r291108 which happened due to a change required in ModuleSummaryIndexYAML.h which was just added in r291069. llvm-svn: 291111	2017-01-05 14:40:15 +00:00
Rafael Espindola	7244708fcd	Detemplate SectionKey. NFC. llvm-svn: 291110	2017-01-05 14:35:41 +00:00

... 7 8 9 10 11 ...

251566 Commits All Branches Search

251566 Commits

All Branches