llvm-project

Commit Graph

Author	SHA1	Message	Date
Samuel Antao	d06239d359	[CUDA][OpenMP] Create generic offload action Summary: This patch replaces the CUDA specific action by a generic offload action. The offload action may have multiple dependences classier in “host” and “device”. The way this generic offloading action is used is very similar to what is done today by the CUDA implementation: it is used to set a specific toolchain and architecture to its dependences during the generation of jobs. This patch also proposes propagating the offloading information through the action graph so that that information can be easily retrieved at any time during the generation of commands. This allows e.g. the "clang tool” to evaluate whether CUDA should be supported for the device or host and ptas to easily retrieve the target architecture. This is an example of how the action graphs would look like (compilation of a single CUDA file with two GPU architectures) ``` 0: input, "cudatests.cu", cuda, (host-cuda) 1: preprocessor, {0}, cuda-cpp-output, (host-cuda) 2: compiler, {1}, ir, (host-cuda) 3: input, "cudatests.cu", cuda, (device-cuda, sm_35) 4: preprocessor, {3}, cuda-cpp-output, (device-cuda, sm_35) 5: compiler, {4}, ir, (device-cuda, sm_35) 6: backend, {5}, assembler, (device-cuda, sm_35) 7: assembler, {6}, object, (device-cuda, sm_35) 8: offload, "device-cuda (nvptx64-nvidia-cuda:sm_35)" {7}, object 9: offload, "device-cuda (nvptx64-nvidia-cuda:sm_35)" {6}, assembler 10: input, "cudatests.cu", cuda, (device-cuda, sm_37) 11: preprocessor, {10}, cuda-cpp-output, (device-cuda, sm_37) 12: compiler, {11}, ir, (device-cuda, sm_37) 13: backend, {12}, assembler, (device-cuda, sm_37) 14: assembler, {13}, object, (device-cuda, sm_37) 15: offload, "device-cuda (nvptx64-nvidia-cuda:sm_37)" {14}, object 16: offload, "device-cuda (nvptx64-nvidia-cuda:sm_37)" {13}, assembler 17: linker, {8, 9, 15, 16}, cuda-fatbin, (device-cuda) 18: offload, "host-cuda (powerpc64le-unknown-linux-gnu)" {2}, "device-cuda (nvptx64-nvidia-cuda)" {17}, ir 19: backend, {18}, assembler 20: assembler, {19}, object 21: input, "cuda", object 22: input, "cudart", object 23: linker, {20, 21, 22}, image ``` The changes in this patch pass the existent regression tests (keeps the existent functionality) and resulting binaries execute correctly in a Power8+K40 machine. Reviewers: echristo, hfinkel, jlebar, ABataev, tra Subscribers: guansong, andreybokhanko, tcramer, mkuron, cfe-commits, arpith-jacob, carlo.bertolli, caomhin Differential Revision: https://reviews.llvm.org/D18171 llvm-svn: 275645	2016-07-15 23:13:27 +00:00
Argyrios Kyrtzidis	d9849a972b	[index] Create different USR if a property is a class property. Avoids USR conflicts between class & instance properties of the same name. llvm-svn: 275630	2016-07-15 22:18:19 +00:00
Richard Smith	13fb860c78	Revert r275481, r275490. This broke modules bootstrap. llvm-svn: 275624	2016-07-15 21:33:46 +00:00
Matt Arsenault	c7536a5d60	AMDGPU: Remove legacy ldexp builtin llvm-svn: 275623	2016-07-15 21:33:06 +00:00
Matt Arsenault	c86671da09	AMDGPU: Update for rsq intrinsic changes llvm-svn: 275622	2016-07-15 21:33:02 +00:00
Saleem Abdulrasool	511f2e5a89	Sema: support __declspec(dll) on ObjC interfaces Extend the __declspec(dll) attribute to cover ObjC interfaces. This was requested by Microsoft for their ObjC support. Cover both import and export. This only adds the semantic analysis portion of the support, code-generation still remains outstanding. Add some basic initial documentation on the attributes that were previously empty. Tweak the previous tests to use the relative expected-warnings to make the tests easier to read. llvm-svn: 275610	2016-07-15 20:41:10 +00:00
Argyrios Kyrtzidis	d798c05526	[AST] Keep track of the left brace source location of a tag decl. This is useful for source modification tools. There will be a follow-up commit using it. llvm-svn: 275590	2016-07-15 18:11:33 +00:00
Wei Ding	ea41f356bb	AMDGPU: Add Clang Builtin for v_lerp_u8 Differential Revision: http://reviews.llvm.org/D22380 llvm-svn: 275577	2016-07-15 16:43:03 +00:00
Peter Collingbourne	03f8907f65	Frontend: Simplify ownership model for clang's output streams. This changes the CompilerInstance::createOutputFile function to return a std::unique_ptr<llvm::raw_ostream>, rather than an llvm::raw_ostream implicitly owned by the CompilerInstance. This in most cases required that I move ownership of the output stream to the relevant ASTConsumer. The motivation for this change is to allow BackendConsumer to be a client of interfaces such as D20268 which take ownership of the output stream. Differential Revision: http://reviews.llvm.org/D21537 llvm-svn: 275507	2016-07-15 00:55:40 +00:00
Richard Smith	6c35b2dea4	[modules] Don't pass interesting decls to the consumer for a module file that's passed on the command line but never actually used. We consider a (top-level) module to be used if any part of it is imported, either by the current translation unit, or by any part of a top-level module that is itself used. (Put another way, a module is used if an implicit modules build would have loaded its .pcm file.) llvm-svn: 275481	2016-07-14 21:50:09 +00:00
Roger Ferrer Ibanez	58b8e483f0	Reverting 275417 This change has triggered unexpected failures. llvm-svn: 275462	2016-07-14 20:05:30 +00:00
Roger Ferrer Ibanez	585ea9ddce	Diagnose taking address and reference binding of packed members This patch implements PR#22821. Taking the address of a packed member is dangerous since the reduced alignment of the pointee is lost. This can lead to memory alignment faults in some architectures if the pointer value is dereferenced. This change adds a new warning to clang emitted when taking the address of a packed member. A packed member is either a field/data member declared as attribute((packed)) or belonging to a struct/class declared as such. The associated flag is -Waddress-of-packed-member. Conversions (either implicit or via a valid casting) to pointer types with lower or equal alignment requirements (e.g. void* or char*) silence the warning. This change also adds a new error diagnostic when the user attempts to bind a reference to a packed member, regardless of the alignment. Differential Revision: https://reviews.llvm.org/D20561 llvm-svn: 275417	2016-07-14 14:10:43 +00:00
Aaron Ballman	745e752725	Correct the attribute documentation for the new XRay attributes. Fixes the documentation build. llvm-svn: 275404	2016-07-14 12:35:00 +00:00
Kelvin Li	a579b9196c	[OpenMP] Sema and parsing for 'target parallel for simd' pragma This patch is to implement sema and parsing for 'target parallel for simd' pragma. Differential Revision: http://reviews.llvm.org/D22096 llvm-svn: 275365	2016-07-14 02:54:56 +00:00
Adrian Prantl	284652beec	Add a comment mirroring the one in LLVM's Dwarf.h llvm-svn: 275356	2016-07-14 00:42:53 +00:00
Richard Smith	a547eb27fa	P0305R0: Semantic analysis and code generation for C++17 init-statement for 'if' and 'switch': if (stmt; condition) { ... } Patch by Anton Bikineev! Some minor formatting and comment tweets by me. llvm-svn: 275350	2016-07-14 00:11:03 +00:00
Aaron Ballman	7d2aecbc76	Add XRay flags to Clang. We implement two flags to control the XRay behaviour: -fxray-instrument: enables XRay annotation of IR -fxray-instruction-threshold: configures the threshold for function size (looking at IR instructions), and allow LLVM to decide whether to add the nop sleds later on in the process. Also implements the related xray_always_instrument and xray_never_instrument function attributes. Patch by Dean Michael Berris. llvm-svn: 275330	2016-07-13 22:32:15 +00:00
Yaron Keren	18c3d0674e	Implement FunctionDecl::getDefinition() to be consistent with VarDecl, TagDecl, EnumDecl, RecordDecl, CXXRecordDecl. Use getDefinition in two locations to make the code more readable. llvm-svn: 275303	2016-07-13 19:04:51 +00:00
Artem Dergachev	50aece03cb	[analyzer] Implement a methond to discover origin region of a symbol. This encourages checkers to make logical decisions depending on value of which region was the symbol under consideration introduced to denote. A similar technique is already used in a couple of checkers; they were modified to call the new method. Differential Revision: http://reviews.llvm.org/D22242 llvm-svn: 275290	2016-07-13 18:07:26 +00:00
Carlo Bertolli	70594e9282	[OpenMP] Initial implementation of parse+sema for OpenMP clause 'is_device_ptr' of target http://reviews.llvm.org/D22070 llvm-svn: 275282	2016-07-13 17:16:49 +00:00
Carlo Bertolli	2404b17192	[OpenMP] Initial implementation of parse+sema for clause use_device_ptr of 'target data' http://reviews.llvm.org/D21904 This patch is similar to the implementation of 'private' clause: it adds a list of private pointers to be used within the target data region to store the device pointers returned by the runtime. Please refer to the following document for a full description of what the runtime witll return in this case (page 10 and 11): https://github.com/clang-omp/OffloadingDesign I am happy to answer any question related to the runtime interface to help reviewing this patch. llvm-svn: 275271	2016-07-13 15:37:16 +00:00
Pierre Gousseau	533a893fa1	[PCH] Add a fno-pch-timestamp option to cc1 to disable inclusion of timestamps in PCH files. This is to allow distributed build systems, that do not preserve time stamps, to use PCH files. Second and last part of the patch proposed at: Differential Revision: http://reviews.llvm.org/D20867 llvm-svn: 275267	2016-07-13 14:21:11 +00:00
Tim Northover	00dc68dff6	AArch64: fix return type of vqmovun_high_*. These should be returning an unsigned quantity. llvm-svn: 275195	2016-07-12 17:38:50 +00:00
Clement Courbet	425175934e	[ASTMatchers] isSignedInteger() and isUnsignedInteger() Complementary to isInteger(), these match signed and unsigned integers respectively. Review: http://reviews.llvm.org/D21989 llvm-svn: 275157	2016-07-12 06:36:00 +00:00
David Majnemer	526793d14c	[MS ABI] Support throwing/catching __unaligned types We need to mark the appropriate bits in ThrowInfo and HandlerType so that the personality routine can correctly handle qualification conversions. llvm-svn: 275154	2016-07-12 04:42:50 +00:00
Erik Pilkington	f1996e567a	[NFC] Reorder fields of VersionTuple to reduce size Differential revision: http://reviews.llvm.org/D19934 llvm-svn: 275095	2016-07-11 20:00:48 +00:00
Eric Liu	4f8d99433d	Make tooling::applyAllReplacements return llvm::Expected<string> instead of empty string to indicate potential error. Summary: return llvm::Expected<> to carry error status and error information. This is the first step towards introducing "Error" into tooling::Replacements. Reviewers: djasper, klimek Subscribers: ioeric, klimek, cfe-commits Differential Revision: http://reviews.llvm.org/D21601 llvm-svn: 275062	2016-07-11 13:53:12 +00:00
Anastasia Stulova	4d85003964	[OpenCL] Improved diagnostics of OpenCL types. - Changes diagnostics for Blocks to be implicitly const qualified OpenCL v2.0 s6.12.5. - Added and unified diagnostics of some OpenCL special types: blocks, images, samplers, pipes. These types are intended for use with the OpenCL builtin functions only and, therefore, most regular uses are not allowed including assignments, arithmetic operations, pointer dereferencing, etc. Review: http://reviews.llvm.org/D21989 llvm-svn: 275061	2016-07-11 13:46:02 +00:00
Craig Topper	4d61a3c2d8	[AVX512] Replace masked AND/OR/XOR intrinsics with native code and remove the builtins. llvm-svn: 275049	2016-07-11 06:14:18 +00:00
Jan Vesely	d7e03a5bd9	AMDGPU: Export workitem builtins Reviewers: tstellardAMD Differential Revision: http://reviews.llvm.org/D20299 llvm-svn: 275030	2016-07-10 22:38:04 +00:00
Craig Topper	8a62061e37	[AVX512] Remove masked shufps/shudpd builtins. These are all handled with __builtin_shufflevector. llvm-svn: 275018	2016-07-10 16:35:54 +00:00
Sean Silva	9ac6ae2a99	Delete dead code. We were just setting DisableUnitAtATime to its default value. llvm-svn: 275005	2016-07-10 00:57:52 +00:00
David Majnemer	58fab355e2	[clang-cl] Add support for /Zd MASM (ML.exe and ML64.exe) and older versions of MSVC (CL.exe) support a flag called /Zd which is more-or-less -gline-tables-only. It seems nicer to support this flag instead of exposing -gline-tables-only. llvm-svn: 274991	2016-07-09 21:49:16 +00:00
David Majnemer	97d0517078	[AST] Tighten up some bitfields Optimize the bitfield types to conserve space for the MSVC ABI. llvm-svn: 274983	2016-07-09 19:26:19 +00:00
Craig Topper	95b61b0544	[X86] Use __builtin_ia32_vec_ext_v4hi and __builtin_ia32_vec_set_v4hi to implement pextrw/pinsertw MMX intrinsics instead of trying to use native IR. Without this we end up generating code that doesn't use mmx registers and probably doesn't work well with other mmx intrinsics. llvm-svn: 274968	2016-07-09 05:30:41 +00:00
Yaxun Liu	79c99fb7eb	[OpenCL] Add missing -cl-no-signed-zeros option into driver Add OCL option -cl-no-signed-zeros to driver options. Also added to opencl.cl testcases. Patch by Aaron En Ye Shi. Differential Revision: http://reviews.llvm.org/D22067 llvm-svn: 274923	2016-07-08 20:28:29 +00:00
Alexey Bader	c813c8113d	[OpenCL] Fix access qualifiers handling for typedefs OpenCL s6.6: "Access qualifier must be used with image object arguments of kernels and of user-defined functions [...] If no qualifier is provided, read_only is assumed". This does not define the behavior for image types used in typedef declaration, but following the spec logic, we should allow access qualifiers specification in typedefs, e.g.: typedef write_only image1d_t img1d_wo; Unlike cv-qualifiers, user cannot add access qualifier to a typedef type, i.e. this is not allowed: typedef image1d_t img1d; // note: previously declared 'read_only' here void foo(write_only img1d im) {} // error: multiple access qualifier Patch by Andrew Savonichev. Reviewers: Anastasia Stulova. Differential revision: http://reviews.llvm.org/D20948 llvm-svn: 274858	2016-07-08 15:34:59 +00:00
Alexander Kornienko	67d5821695	[ASTMatchers] Add missing forEachArgumentWithParam() to code sample Reviewers: klimek Subscribers: cfe-commits, klimek Patch by Martin Boehme! Differential Revision: http://reviews.llvm.org/D21799 llvm-svn: 274835	2016-07-08 10:51:00 +00:00
Vassil Vassilev	7baef47065	Recommit r274348 and r274349. The Windows failures should be fixed. Original commit message: "Add postorder traversal support to the RecursiveASTVisitor. This feature needs to be explicitly enabled by overriding shouldTraversePostOrder() as it has performance drawbacks for the iterative Stmt-traversal. Patch by Raphael Isemann! Reviewed by Richard Smith and Benjamin Kramer." llvm-svn: 274830	2016-07-08 08:33:56 +00:00
Craig Topper	a1bee4398c	[X86] Remove dead builtins that don't exist in the backend intrinsic file and don't have custom handling in CGBuiltins.cpp either. llvm-svn: 274825	2016-07-08 05:11:47 +00:00
Devin Coughlin	cad622742e	[analyzer] Add rudimentary handling of AtomicExpr. This proposed patch adds crude handling of atomics to the static analyzer. Rather than ignore AtomicExprs, as we now do, this patch causes the analyzer to escape the arguments. This is imprecise -- and we should model the expressions fully in the future -- but it is less wrong than ignoring their effects altogether. This is rdar://problem/25353187 Differential Revision: http://reviews.llvm.org/D21667 llvm-svn: 274816	2016-07-08 00:53:18 +00:00
Justin Lebar	c43ad9ee5a	[CUDA] Check that our CUDA install supports the requested architectures. Summary: Raise an error if you're using a CUDA installation that's too old for the requested architectures. In practice, this means that you need a CUDA 8 install to compile for sm_6*. Reviewers: tra Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D21869 llvm-svn: 274781	2016-07-07 18:17:52 +00:00
Justin Lebar	495f1a22af	[CUDA] Rename the __nvvm_bar0 builtin back to __syncthreads. The builtin was renamed in r274770. But __syncthreads is part of our user-facing API, so we need to keep the name as-is. Patch by Justin Bogner. llvm-svn: 274780	2016-07-07 18:15:03 +00:00
Justin Bogner	2d5de7e568	NVPTX: Use the nvvm builtins to read SRegs rather than the legacy ptx ones The ptx spellings were removed from LLVM in r274769. llvm-svn: 274770	2016-07-07 16:41:08 +00:00
David Majnemer	8aaf372bdf	[AST] Tighten up the bitfield in TemplateSpecializationType Optimize the bitfield types to conserve space for the MSVC ABI. llvm-svn: 274733	2016-07-07 04:43:11 +00:00
David Majnemer	6fbeee307e	[AST] Use ArrayRef in more interfaces ArrayRef is a little better than passing around a pointer/length pair. No functional change is intended. llvm-svn: 274732	2016-07-07 04:43:07 +00:00
Aaron Ballman	c1c6823976	Ensuring the bit-fields have the same type; MSVC will place the fields in different allocation units otherwise. llvm-svn: 274695	2016-07-06 22:06:19 +00:00
Justin Lebar	629076178a	[CUDA] Add utility functions for dealing with CUDA versions / architectures. Summary: Currently our handling of CUDA architectures is scattered all around clang. This patch centralizes it. A key advantage of this centralization is that you can now write a C++ switch on e.g. CudaArch and get a compile error if you don't handle one of the enum values. Reviewers: tra Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D21867 llvm-svn: 274681	2016-07-06 21:21:39 +00:00
Justin Bogner	2f8de9fb4f	NVPTX: Rename __builtin_ptx_shfl -> __nvvm_shfl To match "NVPTX: Make the llvm.nvvm.shfl intrinsics and builtin names consistent" in LLVM. llvm-svn: 274663	2016-07-06 19:52:32 +00:00
Aaron Ballman	5c574341f5	Add AST matchers for handling bit-fields and narrowing based on their width. llvm-svn: 274652	2016-07-06 18:25:16 +00:00

1 2 3 4 5 ...

20348 Commits