llvm-project

Commit Graph

Author	SHA1	Message	Date
Artem Belevich	52cc487ba8	[cuda] Include GPU binary into host object file and generate init/deinit code. - added -fcuda-include-gpubinary option to incorporate results of device-side compilation into host-side one. - generate code to register GPU binaries and associated kernels with CUDA runtime and clean-up on exit. - added test case for init/deinit code generation. Differential Revision: http://reviews.llvm.org/D9507 llvm-svn: 236765	2015-05-07 19:34:16 +00:00
Justin Bogner	65512647cc	InstrProf: Cede ownership of createProfileWeights to CGF The fact that PGO has a say in how these branch weights are determined isn't interesting to most of CodeGen, so it makes more sense for this API to be accessible via CodeGenFunction rather than CodeGenPGO. llvm-svn: 236380	2015-05-02 05:00:55 +00:00
Reid Kleckner	cb7a0a0562	Revert most of r236271, leaving only the datalayout change in lib/Basic/Targets.cpp llvm-svn: 236274	2015-04-30 22:29:25 +00:00
Reid Kleckner	af67602e14	Use 4 byte preferred aggregate alignment in datalayout on x86 Win32 llvm-svn: 236271	2015-04-30 22:13:05 +00:00
Reid Kleckner	be9843ce54	Revert r236128, LLVM isn't falling back in the right way llvm-svn: 236167	2015-04-29 21:55:21 +00:00
Reid Kleckner	0bb12a8981	Re-land r236052, the linker errors were fixed by LLVM r236123 Basic __finally blocks don't cause linker errors anymore (although they are miscompiled). llvm-svn: 236128	2015-04-29 17:17:17 +00:00
Nico Weber	ea721b64df	Revert r236052, it caused linker errors when building 32-bit applications. llvm-svn: 236082	2015-04-29 03:08:32 +00:00
Reid Kleckner	ddd40964f0	[SEH] Add 32-bit lowering code for __try This is just the clang-side of 32-bit SEH. LLVM still needs work, and it will determinstically fail to compile until it's feature complete. On x86, all outlined handlers have no parameters, but they do implicitly take the EBP value passed in and use it to address locals of the parent frame. We model this with llvm.frameaddress(1). This works (mostly), but __finally block inlining can break it. For now, we apply the 'noinline' attribute. If we really want to inline __finally blocks on 32-bit x86, we should teach the inliner how to untangle frameescape and framerecover. Promote the error diagnostic from codegen to sema. It now rejects SEH on non-Windows platforms. LLVM doesn't implement SEH on non-x86 Windows platforms, but there's nothing preventing it. llvm-svn: 236052	2015-04-28 22:19:32 +00:00
Justin Bogner	66242d6c5e	InstrProf: Stop using RegionCounter outside of CodeGenPGO (NFC) The RegionCounter type does a lot of legwork, but most of it is only meaningful within the implementation of CodeGenPGO. The uses elsewhere in CodeGen generally just want to increment or read counters, so do that directly. llvm-svn: 235664	2015-04-23 23:06:47 +00:00
David Majnemer	dc012fa266	Revert "Revert r234581, it might have caused a few miscompiles in Chromium." This reverts commit r234700. It turns out that the lifetime markers were not the cause of Chromium failing but a bug which was uncovered by optimizations exposed by the markers. llvm-svn: 235553	2015-04-22 21:38:15 +00:00
Reid Kleckner	ebaf28d13d	Reland r234613 (and follow-ups 234614, 234616, 234618) The frameescape intrinsic cannot be inlined, so I fixed the inliner in r234937. This should address PR23216. llvm-svn: 234942	2015-04-14 20:59:00 +00:00
Nico Weber	ad108337cf	Revert r234613 (and follow-ups 234614, 234616, 234618), it caused PR23216. llvm-svn: 234789	2015-04-13 20:04:22 +00:00
Nico Weber	f2a39a7b4e	Revert r234786, it contained a bunch of stuff I did not mean to commit. llvm-svn: 234787	2015-04-13 20:03:03 +00:00
Nico Weber	b31abb05fb	Revert r234613 (and follow-ups 234614, 234616, 234618), it caused PR23216. llvm-svn: 234786	2015-04-13 20:01:20 +00:00
Nico Weber	1c565c31b1	Revert r234581, it might have caused a few miscompiles in Chromium. If the revert helps, I'll get a repro this Monday. Else I'll put the change back in. llvm-svn: 234700	2015-04-11 23:51:38 +00:00
Reid Kleckner	11859afd5f	[SEH] Re-land r234532, but use internal linkage for all SEH helpers Even though these symbols are in a comdat group, the Microsoft linker really wants them to have internal linkage. I'm planning to tweak the mangling in a follow-up change. This is a straight revert with a 1-line fix. llvm-svn: 234613	2015-04-10 17:34:52 +00:00
Arnaud A. de Grandmaison	047a686d53	Remove threshold for inserting lifetime markers for named temporaries Now that TailRecursionElimination has been fixed with r222354, the threshold on size for lifetime marker insertion can be removed. This only affects named temporary though, as the patch for unnamed temporaries is still in progress. My previous commit (r222993) was not handling debuginfo correctly, but this could only be seen with some asan tests. Basically, lifetime markers are just instrumentation for the compiler's usage and should not affect debug information; however, the cleanup infrastructure was assuming it contained only destructors, i.e. actual code to be executed, and was setting the breakpoint for the end of the function to the closing '}', and not the return statement, in order to show some destructors have been called when leaving the function. This is wrong when the cleanups are only lifetime markers, and this is now fixed. llvm-svn: 234581	2015-04-10 10:13:52 +00:00
Nico Weber	bd51a6a99f	Revert r234532 for a bit, it very likely caused http://crbug.com/475768 llvm-svn: 234563	2015-04-10 04:33:03 +00:00
Reid Kleckner	0dbecf2b78	[SEH] Outline finally blocks using the new variable capture support WinEHPrepare was going to have to pattern match the control flow merge and split that the old lowering used, and that wasn't really feasible. Now we can teach WinEHPrepare to pattern match this, which is much simpler: %fp = call i8* @llvm.frameaddress(i32 0) call void @func(iN [01], i8* %fp) This prototype happens to match the prototype used by the Win64 SEH personality function, so this is really simple. llvm-svn: 234532	2015-04-09 20:37:24 +00:00
Sanjay Patel	359b105745	Process the -freciprocal-math optimization flag (PR20912) The driver currently accepts but ignores the -freciprocal-math flag. This patch passes the flag through and enables 'arcp' fast-math-flag generation in IR. Note that this change does not actually enable the optimization for any target. The reassociation optimization that this flag specifies was implemented by http://reviews.llvm.org/D6334 : http://llvm.org/viewvc/llvm-project?view=revision&revision=222510 Because the optimization is done in the backend rather than IR, the backend must be modified to understand instruction-level fast-math-flags or a new function-level attribute must be created. Also note that -freciprocal-math is independent of any target-specific usage of reciprocal estimate hardware instructions. That requires its own flag ('-mrecip'). https://llvm.org/bugs/show_bug.cgi?id=20912 llvm-svn: 234493	2015-04-09 15:03:23 +00:00
Reid Kleckner	31a1bb0c14	Reland "[SEH] Implement filter capturing in CodeGen" The test should be fixed. It was failing in NDEBUG builds due to a missing '*' character in a regex. In asserts builds, the pattern matched a single digit value, which became a double digit value in NDEBUG builds. Go figure. This reverts commit r234261. llvm-svn: 234447	2015-04-08 22:23:48 +00:00
Daniel Jasper	303c3ac925	Revert "[SEH] Implement filter capturing in CodeGen" Test fails: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/3182/ llvm-svn: 234306	2015-04-07 10:07:47 +00:00
Reid Kleckner	0ada50f17f	[SEH] Implement filter capturing in CodeGen While capturing filters aren't very common, we'd like to outline __finally blocks in the frontend to simplify -O0 EH preparation and reduce code size. Finally blocks are usually have captures, and this is the first step towards that. Currently we don't support capturing 'this' or VLAs. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D8825 llvm-svn: 234261	2015-04-06 23:51:44 +00:00
David Blaikie	1ed728c499	[opaque pointer type] More GEP API migrations Looks like the VTable code in particular will need some work to pass around the pointee type explicitly. llvm-svn: 234128	2015-04-05 22:45:47 +00:00
David Blaikie	fb901c7abf	[opaque pointer type] more GEP API migrations llvm-svn: 234097	2015-04-04 15:12:29 +00:00
Reid Kleckner	7ffc3fbb2f	C++14: Disable sized deallocation by default due to ABI breakage There are no widely deployed standard libraries providing sized deallocation functions, so we have to punt and ask the user if they want us to use sized deallocation. In the future, when such libraries are deployed, we can teach the driver to detect them and enable this feature. N3536 claimed that a weak thunk from sized to unsized deallocation could be emitted to avoid breaking backwards compatibility with standard libraries not providing sized deallocation. However, this approach and other variations don't work in practice. With the weak function approach, the thunk has to have default visibility in order to ensure that it is overridden by other DSOs providing sized deallocation. Weak, default visibility symbols are particularly expensive on MachO, so John McCall was considering disabling this feature by default on Darwin. It also changes behavior ELF linking behavior, causing certain otherwise unreferenced object files from an archive to be pulled into the link. Our second approach was to use an extern_weak function declaration and do an inline conditional branch at the deletion call site. This doesn't work because extern_weak only works on MachO if you have some archive providing the default value of the extern_weak symbol. Arranging to provide such an archive has the same challenges as providing the symbol in the standard library. Not to mention that extern_weak doesn't really work on COFF. Reviewers: rsmith, rjmccall Differential Revision: http://reviews.llvm.org/D8467 llvm-svn: 232788	2015-03-20 00:31:07 +00:00
Artem Belevich	f3d3db65de	Remove .CUDAIsDevice flags from CodeGenOpts as it's already available in LangOpts. Differential Revision: http://reviews.llvm.org/D8385 llvm-svn: 232749	2015-03-19 18:58:18 +00:00
Alexey Bataev	3eff5f46d7	[OPENMP] Rename methods of OpenMPRuntime class. NFC. llvm-svn: 230470	2015-02-25 08:32:46 +00:00
Reid Kleckner	4343922dde	Avoid using a COMDAT for sized delete on MachO llvm-svn: 229915	2015-02-19 21:13:45 +00:00
Reid Kleckner	66abf2f92f	Put the implicit weak sized deallocation funciton in C++14 in a comdat Fixes PR22635. llvm-svn: 229913	2015-02-19 21:01:34 +00:00
Larisse Voufo	e990a3f60c	Rename flags and options to match current naming: from -fdef-sized-delete to -fdefine-sized-deallocation, and from DefaultSizedDelete to DefineSizedDeallocation. llvm-svn: 229597	2015-02-18 01:04:10 +00:00
Larisse Voufo	5526f4f094	Revise the implementation logic of sized deallocation: Do not automatically generate weak definitions of the sized operator delete (in terms of unsized operator delete). Instead, provide the funcitonality via a new compiler flag, -fdef-sized-delete. The current implementation causes link-time ODR violations when the delete symbols are exported into the dynamic table. llvm-svn: 229241	2015-02-14 05:42:57 +00:00
Reid Kleckner	11c033e8aa	SEH: Use the SEHTryEpilogueStack instead of a separate bool We don't need a bool to track this now that we have a stack for it. llvm-svn: 228982	2015-02-12 23:40:45 +00:00
Reid Kleckner	a593000f01	Add the 'noinline' attribute to call sites within __try bodies LLVM doesn't support non-call exceptions, so inlining makes it harder to catch such asynchronous exceptions. llvm-svn: 228876	2015-02-11 21:40:48 +00:00
Reid Kleckner	aca01db706	Implement IRGen for SEH __finally and AbnormalTermination Previously we would simply double-emit the body of the __finally block, but that doesn't work when it contains any kind of Decl, which we can't double emit. This fixes that by emitting the block once and branching into a shared code region and then branching back out. llvm-svn: 228222	2015-02-04 22:37:07 +00:00
David Blaikie	4d52443c0e	DebugInfo: Attribute cleanup code to the end of the scope, not the end of the function. Now if you break on a dtor and go 'up' in your debugger (or you get an asan failure in a dtor) during an exception unwind, you'll have more context. Instead of all dtors appearing to be called from the '}' of the function, they'll be attributed to the end of the scope of the variable, the same as the non-exceptional dtor call. This doesn't /quite/ remove all uses of CurEHLocation (which might be nice to remove, for a few reasons) - it's still used to choose the location for some other work in the landing pad. It'd be nice to attribute that code to the same location as the exception calls within the block and to remove CurEHLocation. llvm-svn: 228181	2015-02-04 19:47:54 +00:00
David Blaikie	303facbeb8	DebugInfo: Fix line table for comparisons harder/better for the sake of C (& the GDB buildbot) llvm-svn: 227663	2015-01-31 01:10:11 +00:00
David Blaikie	298720d324	DebugInfo: Attribute implicit boolean tests to the expression being tested, not to the outer use of that expression. This is half a fix for a GDB test suite failure that expects to start at 'a' in the following code: void func(int a) if (a && b) ... But instead, without this change, the comparison was assigned to '&&' (well, worse actually - because there was a chained 'a && b && c' and it was assigned to the second '&&' because of a recursive application of this bug) and then the load folded into the comparison so breaking on the function started at '&&' instead of 'a'. The other part of this needs to be fixed in LLVM where it's ignoring the location of the icmp and instead using the location of the branch instruction. The fix to the conditional operator is actually a no-op currently, because the conditional operator's location coincides with 'a' (the start of the conditional expression) but should probably be '?' instead. See the FIXME in the test case that mentions the ARCMigration tool failures when I tried to make that change. llvm-svn: 227356	2015-01-28 19:50:09 +00:00
Reid Kleckner	1d59f99f5c	Initial support for Win64 SEH IR emission The lowering looks a lot like normal EH lowering, with the exception that the exceptions are caught by executing filter expression code instead of matching typeinfo globals. The filter expressions are outlined into functions which are used in landingpad clauses where typeinfo would normally go. Major aspects that still need work: - Non-call exceptions in __try bodies won't work yet. The plan is to outline the __try block in the frontend to keep things simple. - Filter expressions cannot use local variables until capturing is implemented. - __finally blocks will not run after exceptions. Fixing this requires work in the LLVM SEH preparation pass. The IR lowering looks like this: // C code: bool safe_div(int n, int d, int r) { __try { r = normal_div(n, d); } __except(_exception_code() == EXCEPTION_INT_DIVIDE_BY_ZERO) { return false; } return true; } ; LLVM IR: define i32 @filter(i8* %e, i8* %fp) { %ehptrs = bitcast i8* %e to i32 %ehrec = load i32 %ehptrs %code = load i32* %ehrec %matches = icmp eq i32 %code, i32 u0xC0000094 %matches.i32 = zext i1 %matches to i32 ret i32 %matches.i32 } define i1 zeroext @safe_div(i32 %n, i32 %d, i32* %r) { %rr = invoke i32 @normal_div(i32 %n, i32 %d) to label %normal unwind to label %lpad normal: store i32 %rr, i32* %r ret i1 1 lpad: %ehvals = landingpad {i8, i32} personality i32 (...) @__C_specific_handler catch i8* bitcast (i32 (i8, i8)* @filter to i8) %ehptr = extractvalue {i8, i32} %ehvals, i32 0 %sel = extractvalue {i8, i32} %ehvals, i32 1 %filter_sel = call i32 @llvm.eh.seh.typeid.for(i8 bitcast (i32 (i8, i8)* @filter to i8*)) %matches = icmp eq i32 %sel, %filter_sel br i1 %matches, label %eh.except, label %eh.resume eh.except: ret i1 false eh.resume: resume } Reviewers: rjmccall, rsmith, majnemer Differential Revision: http://reviews.llvm.org/D5607 llvm-svn: 226760	2015-01-22 01:36:17 +00:00
David Blaikie	66e4197f07	Reapply r225000 (reverted in r225555): DebugInfo: Generalize debug info location handling (and follow-up commits). Several pieces of code were relying on implicit debug location setting which usually lead to incorrect line information anyway. So I've fixed those (in r225955 and r225845) separately which should pave the way for this commit to be cleanly reapplied. The reason these implicit dependencies resulted in crashes with this patch is that the debug location would no longer implicitly leak from one place to another, but be set back to invalid. Once a call with no/invalid location was emitted, if that call was ever inlined it could produce invalid debugloc chains and assert during LLVM's codegen. There may be further cases of such bugs in this patch - they're hard to flush out with regression testing, so I'll keep an eye out for reports and investigate/fix them ASAP if they come up. Original commit message: Reapply "DebugInfo: Generalize debug info location handling" Originally committed in r224385 and reverted in r224441 due to concerns this change might've introduced a crash. Turns out this change fixes the crash introduced by one of my earlier more specific location handling changes (those specific fixes are reverted by this patch, in favor of the more general solution). Recommitted in r224941 and reverted in r224970 after it caused a crash when building compiler-rt. Looks to be due to this change zeroing out the debug location when emitting default arguments (which were meant to inherit their outer expression's location) thus creating call instructions without locations - these create problems for inlining and must not be created. That is fixed and tested in this version of the change. Original commit message: This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 225956	2015-01-14 07:38:27 +00:00
David Blaikie	f353d3ecd0	Revert "DebugInfo: Generalize debug info location handling" and related commits This reverts commit r225000, r225021, r225083, r225086, r225090. The root change (r225000) still has several issues where it's caused calls to be emitted without debug locations. This causes assertion failures if/when those calls are inlined. I'll work up some test cases and fixes before recommitting this. llvm-svn: 225555	2015-01-09 23:00:28 +00:00
David Blaikie	b9a23c9155	DebugInfo: Provide a less subtle way to set the debug location of simple ret instructions un-XFAILing the test XFAIL'd in r225086 after it regressed in r225083. llvm-svn: 225090	2015-01-02 22:07:26 +00:00
Pekka Jaaskelainen	3701450b06	OpenCL C: Add support for a set of floating point arithmetic relaxation flags: -cl-no-signed-zeros -cl-unsafe-math-optimizations -cl-finite-math-only -cl-fast-relaxed-math Propagate the info to FP instruction flags as well as function attributes where they are available. llvm-svn: 223928	2014-12-10 16:41:14 +00:00
Duncan P. N. Exon Smith	fb49491477	IR: Update clang for Metadata/Value split in r223802 Match LLVM API changes from r223802. llvm-svn: 223803	2014-12-09 18:39:32 +00:00
Justin Bogner	970ac60573	InstrProf: Use LLVM's -instrprof pass for profiling The logic for lowering profiling counters has been moved to an LLVM pass. Emit the intrinsics rather than duplicating the whole pass in clang. llvm-svn: 223683	2014-12-08 19:04:51 +00:00
Sameer Sahasrabuddhe	c6093fea03	Always emit kernel arg info for SPIR. http://llvm.org/bugs/show_bug.cgi?id=21555 Currently, kernel argument metadata is omitted unless the "-cl-kernel-arg-info" option is specified. But the SPIR 1.2 spec requires that all metadata except kernel_arg_name should always be emitted, and kernel_arg_name is only emitted when "-cl-kernel-arg-info" is specified. Patch ported by Ryan Burn from the Khronos SPIR generator. https://github.com/KhronosGroup/SPIR llvm-svn: 223340	2014-12-04 05:30:58 +00:00
Peter Collingbourne	1a0a9a3c75	UBSan now uses prologue data instead of prefix data As the semantics of prefix data has changed. See D6454. Patch by Ben Gamari! Test Plan: Testsuite Differential Revision: http://reviews.llvm.org/D6489 llvm-svn: 223190	2014-12-03 02:08:51 +00:00
Alexey Samsonov	e396bfc064	Bundle conditions checked by UBSan with sanitizer kinds they implement. Summary: This change makes CodeGenFunction::EmitCheck() take several conditions that needs to be checked (all of them need to be true), together with sanitizer kinds these checks are for. This would allow to split one call into UBSan runtime into several calls in case different sanitizer kinds would have different recoverability settings. Tests should be fixed accordingly, I'm working on it. Test Plan: regression test suite. Reviewers: rsmith Reviewed By: rsmith Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6219 llvm-svn: 221716	2014-11-11 22:03:54 +00:00
Alexey Samsonov	4c1a96f519	Propagate SanitizerKind into CodeGenFunction::EmitCheck() call. Make sure CodeGenFunction::EmitCheck() knows which sanitizer it emits check for. Make CheckRecoverableKind enum an implementation detail and move it away from header. Currently CheckRecoverableKind is determined by the type of sanitizer ("unreachable" and "return" are unrecoverable, "vptr" is always-recoverable, all the rest are recoverable). This will change in future if we allow to specify which sanitizers are recoverable, and which are not by -fsanitize-recover= flag. No functionality change. llvm-svn: 221635	2014-11-10 22:27:30 +00:00
Alexey Samsonov	edf99a92c0	Introduce a SanitizerKind enum to LangOptions. Use the bitmask to store the set of enabled sanitizers instead of a bitfield. On the negative side, it makes syntax for querying the set of enabled sanitizers a bit more clunky. On the positive side, we will be able to use SanitizerKind to eventually implement the new semantics for -fsanitize-recover= flag, that would allow us to make some sanitizers recoverable, and some non-recoverable. No functionality change. llvm-svn: 221558	2014-11-07 22:29:38 +00:00

1 2 3 4 5 ...

527 Commits