llvm-project

Commit Graph

Author	SHA1	Message	Date
Igor Laevsky	77f118f878	Add invoke related functionality into StatepointSite classes. Differential Revision: http://reviews.llvm.org/D7364 llvm-svn: 229838	2015-02-19 11:02:11 +00:00
Elena Demikhovsky	69e8b45b13	AVX-512: Full implementation for VRNDSCALESS/SD instructions and intrinsics. llvm-svn: 229837	2015-02-19 10:48:04 +00:00
Chandler Carruth	bcb6c5f62d	[x86] Add support for bit-wise blending and use it in the v8 and v16 lowering paths. I'm going to be leveraging this to simplify a lot of the overly complex lowering of v8 and v16 shuffles in pre-SSSE3 modes. Sadly, this isn't profitable on v4i32 and v2i64. There, the float and double blending instructions for pre-SSE4.1 are actually pretty good, and we can't beat them with bit math. And once SSE4.1 comes around we have direct blending support and this ceases to be relevant. Also, some of the test cases look odd because the domain fixer canonicalizes these to floating point domain. That's OK, it'll use the integer domain when it matters and some day I may be able to update enough of LLVM to canonicalize the other way. This restores almost all of the regressions from teaching x86's vselect lowering to always use vector shuffle lowering for blends. The remaining problems are because the v16 lowering path is still doing crazy things. I'll be re-arranging that strategy in more detail in subsequent commits to finish recovering the performance here. llvm-svn: 229836	2015-02-19 10:46:52 +00:00
Chandler Carruth	b89464a9b6	[x86,sdag] Two interrelated changes to the x86 and sdag code. First, don't combine bit masking into vector shuffles (even ones the target can handle) once operation legalization has taken place. Custom legalization of vector shuffles may exist for these patterns (making the predicate return true) but that custom legalization may in some cases produce the exact bit math this matches. We only really want to handle this prior to operation legalization. However, the x86 backend, in a fit of awesome, relied on this. What it would do is mark VSELECTs as expand, which would turn them into arithmetic, which this would then match back into vector shuffles, which we would then lower properly. Amazing. Instead, the second change is to teach the x86 backend to directly form vector shuffles from VSELECT nodes with constant conditions, and to mark all of the vector types we support lowering blends as shuffles as custom VSELECT lowering. We still mark the forms which actually support variable blends as legal so that the custom lowering is bypassed, and the legal lowering can even be used by the vector shuffle legalization (yes, i know, this is confusing. but that's how the patterns are written). This makes the VSELECT lowering much more sensible, and in fact should fix a bunch of bugs with it. However, as you'll see in the test cases, right now what it does is point out the hilarious deficiency of the new vector shuffle lowering when it comes to blends. Fortunately, my very next patch fixes that. I can't submit it yet, because that patch, somewhat obviously, forms the exact and/or pattern that the DAG combine is matching here! Without this patch, teaching the vector shuffle lowering to produce the right code infloops in the DAG combiner. With this patch alone, we produce terrible code but at least lower through the right paths. With both patches, all the regressions here should be fixed, and a bunch of the improvements (like using 2 shufps with no memory loads instead of 2 andps with memory loads and an orps) will stay. Win! There is one other change worth noting here. We had hilariously wrong vectorization cost estimates for vselect because we fell through to the code path that assumed all "expand" vector operations are scalarized. However, the "expand" lowering of VSELECT is vector bit math, most definitely not scalarized. So now we go back to the correct if horribly naive cost of "1" for "not scalarized". If anyone wants to add actual modeling of shuffle costs, that would be cool, but this seems an improvement on its own. Note the removal of 16 and 32 "costs" for doing a blend. Even in SSE2 we can blend in fewer than 16 instructions. ;] Of course, we don't right now because of OMG bad code, but I'm going to fix that. Next patch. I promise. llvm-svn: 229835	2015-02-19 10:36:19 +00:00
Daniel Jasper	22a3e79aaf	Make -fmodules-decluse and -fmodules-strict-decluse compatible options. They don't actually influence the result of the module compilation. llvm-svn: 229834	2015-02-19 09:56:13 +00:00
Mohit K. Bhakkad	88077324c6	[LSan][MIPS64] Enable LSan testing for mips64/mips64el Patch by Sagar Thakur Reviewers: petarj, earthdok, kcc. Subscribers: samsonov, dsanders, mohit.bhakkad, Anand.Takale, llvm-commits. Differential Revision: http://reviews.llvm.org/D7124 llvm-svn: 229833	2015-02-19 09:14:43 +00:00
Dmitry Vyukov	c0e912dd7b	tsan: fix PTRACE_ATTACH handling during stop-the-world If the thread receives a signal concurrently with PTRACE_ATTACH, we can get notification about the signal before notification about stop. In such case we need to forward the signal to the thread, otherwise the signal will be missed (as we do PTRACE_DETACH with arg=0) and any logic relying on signals will break. After forwarding we need to continue to wait for stopping, because the thread is not stopped yet. We do ignore delivery of SIGSTOP, because we want to make stop-the-world as invisible as possible. http://reviews.llvm.org/D7723 --This line, and those below, will be ignored-- M lib/sanitizer_common/sanitizer_stoptheworld_linux_libcdep.cc M test/tsan/signal_segv_handler.cc llvm-svn: 229832	2015-02-19 09:02:29 +00:00
Michael Kuperstein	ba5b04c798	Use std::bitset for SubtargetFeatures Previously, subtarget features were a bitfield with the underlying type being uint64_t. Since several targets (X86 and ARM, in particular) have hit or were very close to hitting this bound, switching the features to use a bitset. No functional change. Differential Revision: http://reviews.llvm.org/D7065 llvm-svn: 229831	2015-02-19 09:01:04 +00:00
Mohit K. Bhakkad	36f974d76b	[LSan] [MIPS] adding support of LSan for mips64/mips64el arch Patch by Sagar Thakur Reviewers: petarj, earthdok, kcc. Subscribers: samsonov, dsanders, mohit.bhakkad, Anand.Takale, llvm-commits. Differential Revision: http://reviews.llvm.org/D7013 llvm-svn: 229830	2015-02-19 07:30:39 +00:00
David Majnemer	12d3783add	Mark DR1940 as implemented llvm-svn: 229829	2015-02-19 07:29:01 +00:00
David Majnemer	6440548434	Mark DR1947 as implemented llvm-svn: 229828	2015-02-19 07:28:57 +00:00
David Majnemer	7da2302780	Mark DR1948 as implemented llvm-svn: 229827	2015-02-19 07:28:55 +00:00
David Majnemer	54dc82e1d2	Mark DR1994 as a duplicate of DR529 llvm-svn: 229826	2015-02-19 07:28:52 +00:00
David Majnemer	2dbad01349	Mark DR1968 as implemented llvm-svn: 229825	2015-02-19 07:28:49 +00:00
Davide Italiano	faafae33fa	[Support/Timer] Make GetMallocUsage() aware of jemalloc. Differential Revision: D7657 Reviewed by: shankarke, majnemer llvm-svn: 229824	2015-02-19 07:27:14 +00:00
NAKAMURA Takumi	8f2085ed16	CXXNameMangler::mangleUnresolvedPrefix(): Prune an obsolete \param, according to r229809. [-Wdocumentation] llvm-svn: 229823	2015-02-19 07:14:26 +00:00
David Majnemer	cb34c67c2a	Itanium ABI: Don't pass nullptr to a bool argument llvm-svn: 229822	2015-02-19 05:51:14 +00:00
Lang Hames	c6ba0bf33b	[Orc][Kaleidoscope] Fix typo in tutorial comment. llvm-svn: 229821	2015-02-19 05:33:30 +00:00
Tobias Grosser	d1e33e7061	ScopDetection: Only detect scops that have at least one read and one write Scops that only read seem generally uninteresting and scops that only write are most likely initializations where there is also little to optimize. To not waste compile time we bail early. Differential Revision: http://reviews.llvm.org/D7735 llvm-svn: 229820	2015-02-19 05:31:07 +00:00
Dmitri Gribenko	3e1551c96f	Provide the same ABI regardless of NDEBUG For projects depending on LLVM, I find it very useful to combine a release-no-asserts build of LLVM with a debug+asserts build of the dependent project. The motivation is that when developing a dependent project, you are debugging that project itself, not LLVM. In my usecase, a significant part of the runtime is spent in LLVM optimization passes, so I would like to build LLVM without assertions to get the best performance from this combination. Currently, `lib/Support/Debug.cpp` changes the set of symbols it provides depending on NDEBUG, while `include/llvm/Support/Debug.h` requires extra symbols when NDEBUG is not defined. Thus, it is not possible to enable assertions in an external project that uses facilities of `Debug.h`. This patch changes `Debug.cpp` and `Valgrind.cpp` to always define the symbols that other code may depend on when #including LLVM headers without NDEBUG. http://reviews.llvm.org/D7662 llvm-svn: 229819	2015-02-19 05:30:16 +00:00
Larisse Voufo	ee3c1b2a71	Update C++ implementation status page with recent changes w.r.t. to sized deallocation. llvm-svn: 229818	2015-02-19 04:34:13 +00:00
Alexey Bataev	054829b1bd	[MSVC] Improved lookup into dependent/non-dependent bases of dependent class Patch improves lookup into dependendt bases of dependent class and adds lookup into non-dependent bases. Differential Revision: http://reviews.llvm.org/D7173 llvm-svn: 229817	2015-02-19 04:28:23 +00:00
Rui Ueyama	3966c61536	PECOFF: Fix base relocation for an absolute symbol. Previously we wrongly emitted a base relocation entry for an absolute symbol. That made the loader to rewrite some instruction operands with wrong values only when a DLL is not loaded at the default address. That caused a misterious crash of some executable. Absolute symbols will of course never change value wherever the binary is loaded to memory. We shouldn't emit base relocations for absolute symbols. llvm-svn: 229816	2015-02-19 04:22:27 +00:00
Ben Langmuir	07236733be	Revert adding hostname to module hash I didn't realize how easily the hostname could change - for example just changing wireless networks seems to prompt it in some cases. Users can always set their own local module cache path to avoid this. This reverts commits r228592, 228594, 228601 and 228613. rdar://19287368 llvm-svn: 229815	2015-02-19 04:03:57 +00:00
Rui Ueyama	3e6490f1e8	PECOFF: use llvm-readobj to dump .reloc section When this test was written, no llvm tool could print out contents of base relocation section. Now llvm-readobj is able to dump it in a text format. Use that tool to make this test readable. llvm-svn: 229814	2015-02-19 04:02:17 +00:00
Justin Bogner	91f2e3c9c2	InstrProf: Always emit a coverage region for the condition of an if When tools like llvm-cov show regions, it's much easier to understand what's happening if the condition of an if shows a counter as well as the body. llvm-svn: 229813	2015-02-19 03:10:30 +00:00
Larisse Voufo	62d8aa5caf	Fix a test case. llvm-svn: 229812	2015-02-19 03:03:23 +00:00
Filipe Cabecinhas	54a2ba8b76	[Headers] Add tests for _mm256_insert_epi64 and fix its definition Summary: The definition for _mm256_insert_epi64 was taking an int, which would get truncated before being inserted in the vector. Original patch by Joshua Magee! Reviewers: bruno, craig.topper Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D7179 llvm-svn: 229811	2015-02-19 03:02:33 +00:00
Eric Fiselier	2d38959cd9	Mark more tuple tests as unsupported in C++98 && C++03 llvm-svn: 229810	2015-02-19 02:44:09 +00:00
David Majnemer	b8014dd7c0	Itanium ABI: Properly qualify the destructor-name We didn't have enough qualificaiton before the scope specifier and we had too much qualification in the destructor name itself. llvm-svn: 229809	2015-02-19 02:16:16 +00:00
Eric Fiselier	0a52cd7937	[libcxx] Mark most tuple tests UNSUPPORTED for c++03 and c++98. Summary: No declaration for the type `tuple` is given in c++03 or c++98 modes. Mark all tests that use the actual `tuple` type as UNSUPPORTED. Reviewers: jroelofs, mclow.lists, danalbert Reviewed By: danalbert Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D5956 llvm-svn: 229808	2015-02-19 02:10:42 +00:00
Lang Hames	56678fe634	[Orc][Kaleidoscope] Make the 'fully lazy' orc kaleidoscope tutorial lazier still. The new JIT doesn't IRGen stubs until they're referenced. llvm-svn: 229807	2015-02-19 01:32:43 +00:00
Lang Hames	af53ed1a7f	[Orc] Fix a bug in the compile callback manager: trampoline ids need to be fixed up before returning them to the available pool. llvm-svn: 229806	2015-02-19 01:31:25 +00:00
Eric Christopher	d84f5d30e2	Remove the local subtarget variable from the SystemZ asm printer and update the two calls accordingly. llvm-svn: 229805	2015-02-19 01:26:28 +00:00
Eric Christopher	0795a2ef0c	Remove a few more calls to TargetMachine::getSubtarget from the R600 port. llvm-svn: 229804	2015-02-19 01:10:55 +00:00
Eric Christopher	7edca437f5	Grab the subtarget off of the machine function for the R600 asm printer and clean up a bunch of uses. llvm-svn: 229803	2015-02-19 01:10:53 +00:00
Eric Christopher	96caeda730	Remove the DisasmEnabled AsmPrinter variable and just look it up on the subtarget where it's set anyhow than looking it up 2-3 times in the same place. llvm-svn: 229802	2015-02-19 01:10:49 +00:00
Filipe Cabecinhas	ec5d0e6423	Improve our handling of rtti/sanitize=vptr/sanitize=undefined This patch removes the huge blob of code that is dealing with rtti/exceptions/sanitizers and replaces it with: A ToolChain function which, for a given set of Args, figures out if rtti should be: - enabled - disabled implicitly - disabled explicitly A change in the way SanitizerArgs figures out what sanitizers to enable (or if it should error out, or warn); And a check for exceptions/rtti interaction inside addExceptionArgs. The RTTIMode algorithm is: - If -mkernel, -fapple-kext, or -fno-rtti are passed, rtti was disabled explicitly; - If -frtti was passed or we're not targetting the PS4, rtti is enabled; - If -fexceptions or -fcxx-exceptions was passed and we're targetting the PS4, rtti was enabled implicitly; - If we're targetting the PS4, rtti is disabled implicitly; - Otherwise, rtti is enabled; Since the only flag needed to pass to -cc1 is -fno-rtti if we want to disable it, there's no problem in saying rtti is enabled if we're compiling C code, so we don't look at the input file type. addExceptionArgs now looks at the RTTIMode and warns that rtti is being enabled implicitly if targetting the PS4 and exceptions are on. It also errors out if, targetting the PS4, -fno-rtti was passed, and exceptions were turned on. SanitizerArgs now errors out if rtti was disabled explicitly and the vptr sanitizer was enabled implicitly, but just turns off vptr if rtti is disabled but -fsanitize=undefined was passed. Also fixed tests, removed duplicate name from addExceptionArgs comment, and added one or two surrounding lines when running clang-format. This changes test/Driver/fsanitize.c to make it not expect a warning when passed -fsanitize=undefined -fno-rtti, but expect vptr to not be on. Removed all users and definition of SanitizerArgs::sanitizesVptr(). Reviewers: samsonov Subscribers: llvm-commits, samsonov, rsmith Differential Revision: http://reviews.llvm.org/D7525 llvm-svn: 229801	2015-02-19 01:04:49 +00:00
Peter Collingbourne	fb8002cbe0	MC: Remove NullStreamer hook, as it is redundant with NullTargetStreamer. llvm-svn: 229799	2015-02-19 00:45:07 +00:00
Peter Collingbourne	f4498a4fd3	llvm-mc: Use Target::createNullStreamer to fix crashes on target-specific asm directives. llvm-svn: 229798	2015-02-19 00:45:04 +00:00
Peter Collingbourne	20c7259ce9	Introduce Target::createNullTargetStreamer and use it from IRObjectFile. A null MCTargetStreamer allows IRObjectFile to ignore target-specific directives. Previously we were crashing. Differential Revision: http://reviews.llvm.org/D7711 llvm-svn: 229797	2015-02-19 00:45:02 +00:00
Michael Gottesman	e5ad66f8a9	[objc-arc] Introduce the concept of RCIdentity and rename all relevant functions to use that name. NFC. The RCIdentity root ("Reference Count Identity Root") of a value V is a dominating value U for which retaining or releasing U is equivalent to retaining or releasing V. In other words, ARC operations on V are equivalent to ARC operations on U. This is a useful property to ascertain since we can use this in the ARC optimizer to make it easier to match up ARC operations by always mapping ARC operations to RCIdentityRoots instead of pointers themselves. Then we perform pairing of retains, releases which are applied to the same RCIdentityRoot. In general, the two ways that we see RCIdentical values in ObjC are via: 1. PointerCasts 2. Forwarding Calls that return their argument verbatim. As such in ObjC, two RCIdentical pointers must always point to the same memory location. Previously this concept was implicit in the code and various methods that dealt with this concept were given functional names that did not conform to any name in the "ARC" model. This often times resulted in code that was hard for the non-ARC acquanted to understand resulting in unhappiness and confusion. llvm-svn: 229796	2015-02-19 00:42:38 +00:00
Michael Gottesman	dfa3e4b08a	[objc-arc-contract] Rename contractRelease => tryToContractReleaseIntoStoreStrong. NFC. Makes it clearer what this method is actually supposed to do. llvm-svn: 229795	2015-02-19 00:42:34 +00:00
Michael Gottesman	1827973f80	[objc-arc-contract] Refactor out tryToPeepholeInstruction into its own method. NFC. The main method of ObjCARCContract is really large and busy. By refactoring this out, it becomes easier to reason about. llvm-svn: 229794	2015-02-19 00:42:30 +00:00
Michael Gottesman	56bd6a077a	[objc-arc-contract] Reorganize the code a bit and make the debug output easier to read. llvm-svn: 229793	2015-02-19 00:42:27 +00:00
Richard Smith	64ecacf6cb	PR22566: a conversion from a floating-point type to bool is a narrowing conversion. llvm-svn: 229792	2015-02-19 00:39:05 +00:00
Duncan P. N. Exon Smith	3d62bbacb1	IR: Drop scope from MDTemplateParameter Follow-up to r229740, which removed `DITemplate*::getContext()` after my upgrade script revealed that scopes are always `nullptr` for template parameters. This is the other shoe: drop `scope:` from `MDTemplateParameter` and its two subclasses. (Note: a bitcode upgrade would be pointless, since the hierarchy hasn't been moved into place.) llvm-svn: 229791	2015-02-19 00:37:21 +00:00
Eric Christopher	ca929f2469	Avoid using a self-referential initializer and fix up uses. llvm-svn: 229790	2015-02-19 00:22:47 +00:00
Eric Christopher	111de895a0	80-column fixups. llvm-svn: 229789	2015-02-19 00:15:33 +00:00
Richard Smith	11152dd55f	Allow errors on use of a private module header to be disabled, to better support incremental transition to modules. llvm-svn: 229788	2015-02-19 00:10:28 +00:00

... 5 6 7 8 9 ...

194167 Commits All Branches Search

194167 Commits

All Branches