llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	8e9f2813cf	DI: Prune another example llvm-svn: 219053	2014-10-04 15:30:52 +00:00
Hal Finkel	64567a80d2	Emit @llvm.assume for non-parameter lvalue align_value-attribute loads We already add the align parameter attribute for function parameters that have the align_value attribute (or those with a typedef type having that attribute), which is an important special case, but does not handle pointers with value alignment assumptions that come into scope in any other way. To handle the general case, emit an @llvm.assume-based alignment assumption whenever we load the pointer-typed lvalue of an align_value-attributed variable (except for function parameters, which we already deal with at entry). I'll also note that this is more general than Intel's described support in: https://software.intel.com/en-us/articles/data-alignment-to-assist-vectorization which states that the compiler inserts __assume_aligned directives in response to align_value-attributed variables only for function parameters and for the initializers of local variables. I think that we can make the optimizer deal with this more-general scheme (which could lead to a lot of calls to @llvm.assume inside of loop bodies, for example), but if not, I'll rework this to be less aggressive. llvm-svn: 219052	2014-10-04 15:26:49 +00:00
Duncan P. N. Exon Smith	936675e281	DI: Update and prune metadata examples Update a couple of the examples of debug info metadata, and prune the rest. Point to the true reference implementation in the source. llvm-svn: 219051	2014-10-04 14:56:56 +00:00
Nikola Smiljanic	905bfda957	-ms-extensions: Allow __super in return stements. llvm-svn: 219050	2014-10-04 10:17:57 +00:00
David Majnemer	5da21da4f6	MS ABI: Disallow dllimported/exported variables from having TLS Windows TLS relies on indexing through a tls_index in order to get at the DLL's thread local variables. However, this index is not exported along with the variable: it is assumed that all accesses to thread local variables are inside the same module which created the variable in the first place. While there are several implementation techniques we could adopt to fix this (notably, the Itanium ABI gets this for free), it is not worth the heroics. Instead, let's just ban this combination. We could revisit this in the future if we need to. This fixes PR21111. llvm-svn: 219049	2014-10-04 06:51:54 +00:00
David Majnemer	7656f41809	Sema: Simplify checkAttributesAfterMerging Use getDLLAttr to factor out some common dllimport/dllexport code. llvm-svn: 219048	2014-10-04 06:16:45 +00:00
Chandler Carruth	808ec85ad0	[x86] Slap a triple on this test since it is poking around at the stack and calling conventions. Otherwise its too hard to craft a usefully generic set of assertions. llvm-svn: 219047	2014-10-04 04:22:55 +00:00
Chandler Carruth	99627bfbff	[x86] Enable the new vector shuffle lowering by default. Update the entire regression test suite for the new shuffles. Remove most of the old testing which was devoted to the old shuffle lowering path and is no longer relevant really. Also remove a few other random tests that only really exercised shuffles and only incidently or without any interesting aspects to them. Benchmarking that I have done shows a few small regressions with this on LNT, zero measurable regressions on real, large applications, and for several benchmarks where the loop vectorizer fires in the hot path it shows 5% to 40% improvements for SSE2 and SSE3 code running on Sandy Bridge machines. Running on AMD machines shows even more dramatic improvements. When using newer ISA vector extensions the gains are much more modest, but the code is still better on the whole. There are a few regressions being tracked (PR21137, PR21138, PR21139) but by and large this is expected to be a win for x86 generated code performance. It is also more correct than the code it replaces. I have fuzz tested this extensively with ISA extensions up through AVX2 and found no crashes or miscompiles (yet...). The old lowering had a few miscompiles and crashers after a somewhat smaller amount of fuzz testing. There is one significant area where the new code path lags behind and that is in AVX-512 support. However, there was extremely little support for that already and so this isn't a significant step backwards and the new framework will probably make it easier to implement lowering that uses the full power of AVX-512's table-based shuffle+blend (IMO). Many thanks to Quentin, Andrea, Robert, and others for benchmarking assistance. Thanks to Adam and others for help with AVX-512. Thanks to Hal, Eric, and many others for answering my incessant questions about how the backend actually works. =] I will leave the old code path in the tree until the 3 PRs above are at least resolved to folks' satisfaction. Then I will rip it (and 1000s of lines of code) out. =] I don't expect this flag to stay around for very long. It may not survive next week. llvm-svn: 219046	2014-10-04 03:52:55 +00:00
Jingyue Wu	4938e271c6	Add fake use to suppress defined-but-unused warnings llvm-svn: 219045	2014-10-04 03:50:10 +00:00
Chandler Carruth	200e87c0c5	[x86] Fix a bug in the VZEXT DAG combine that I just made more powerful. It turns out this combine was always somewhat flawed -- there are cases where nested VZEXT nodes can't be combined: if their types have a mismatch that can be observed in the result. While none of these show up in currently, once I switch to the new vector shuffle lowering a few test cases actually form such nested VZEXT nodes. I've not come up with any IR pattern that I can sensible write to exercise this, but it will be covered by tests once I flip the switch. llvm-svn: 219044	2014-10-04 02:51:03 +00:00
Richard Smith	a9d100178c	PR20991: ::decltype is not valid. llvm-svn: 219043	2014-10-04 01:57:39 +00:00
Chandler Carruth	7e26a67ffa	[x86] Sink a generic combine of VZEXT nodes from the lowering to VZEXT nodes to the DAG combining of them. This will allow the combine to fire on both old vector shuffle lowering and the new vector shuffle lowering and generally seems like a cleaner design. I've trimmed down the code a bit and tried to make it and the surrounding combine fairly clean while moving it around. llvm-svn: 219042	2014-10-04 01:05:48 +00:00
Nick Kledzik	15eba696f6	[mach-o] add -iphoneos_version_min as alias for -ios_version_min llvm-svn: 219041	2014-10-04 00:19:56 +00:00
Steven Wu	84610ba9b3	Fix the armv7 thumb builtins on darwin The arm builtins converted into thumb in r213481 are not working on darwin. On apple platforms, .thumb_func directive is required to generated correct symbols for thumb functions. <rdar://problem/18523605> llvm-svn: 219040	2014-10-04 00:18:59 +00:00
Nick Kledzik	09d00bb4d7	[mach-o] Add support for -dependency_info command line option This option is added by Xcode when it runs the linker. It produces a binary file which contains the file the linker used. Xcode uses the info to dynamically update it dependency tracking. To check the content of the binary file, the test case uses a python script to dump the binary file as text which FileCheck can check. llvm-svn: 219039	2014-10-04 00:16:13 +00:00
Matt Arsenault	c996175b57	R600/SI: Custom lower f64 -> i64 conversions llvm-svn: 219038	2014-10-03 23:54:56 +00:00
Matt Arsenault	f7c95e3eda	R600: Custom lower [s\|u]int_to_fp for i64 -> f64 llvm-svn: 219037	2014-10-03 23:54:41 +00:00
Matt Arsenault	6cda887776	R600/SI: Fix ftrunc f64 conformance failures. Re-add the tests since they were deleted at some point llvm-svn: 219036	2014-10-03 23:54:27 +00:00
Peter Collingbourne	11d08b8be1	Remove unused ALL_BINDINGS configuration variable. llvm-svn: 219035	2014-10-03 23:03:01 +00:00
Rafael Auler	6fd0afa195	[ELF] Fix bug in ELFFile::createAtoms() that caused lld to mislink musl When creating the graph edges of the atoms of an ELF file, special care must be taken with atoms that represent weak symbols. They cannot be the target of any Reference::kindLayoutAfter edge because they can be merged and point to other code, screwing up the final layout of the atoms. ELFFile::createAtoms() correctly handles this corner case. The problem is that createAtoms() assumed that there can be no zero-sized weak symbols, which is not true. Consider: my_weak_func1: my_weak_func2: my_weak_func3: code In this case, we have two zero-sized weak symbols, my_weak_func1 and my_weak_func2, and one non-zero weak symbol my_weak_func3. createAtoms() would correctly handle my_weak_func3, but not the first two symbols. This problem happens in the musl C library when a zero-sized weak symbol is merged and screws up the file layout. Since this musl code lives at the finalization hooks, any C program linked with LLD and musl was correctly executing, but segfaulting at the end. Reviewers: shankarke http://reviews.llvm.org/D5606 llvm-svn: 219034	2014-10-03 22:50:50 +00:00
Chandler Carruth	f3e880697a	[x86] Add a really preposterous number of patterns for matching all of the various ways in which blends can be used to do vector element insertion for lowering with the scalar math instruction forms that effectively re-blend with the high elements after performing the operation. This then allows me to bail on the element insertion lowering path when we have SSE4.1 and are going to be doing a normal blend, which in turn restores the last of the blends lost from the new vector shuffle lowering when I got it to prioritize insertion in other cases (for example when we don't have a blend instruction). Without the patterns, using blends here would have regressed sse-scalar-fp-arith.ll completely with the new vector shuffle lowering. For completeness, I've added RUN-lines with the new lowering here. This is somewhat superfluous as I'm about to flip the default, but hey, it shows that this actually significantly changed behavior. The patterns I've added are just ridiculously repetative. Suggestions on making them better very much welcome. In particular, handling the commuted form of the v2f64 patterns is somewhat obnoxious. llvm-svn: 219033	2014-10-03 22:43:17 +00:00
Enrico Granata	1a3576a450	These two tests were failing on the FreeBSD bot - one has to assume because FreeBSD comes with libc++. Skip them llvm-svn: 219032	2014-10-03 22:33:03 +00:00
Benjamin Kramer	719772c269	Remove stray enum keywords. MSVC sees this as a redeclaration at global scope. llvm-svn: 219031	2014-10-03 22:20:30 +00:00
Justin Bogner	fa9df7af07	test: Disable standard system includes in %clang_cc1 This adds -nostdsysteminc to the %clang_cc1 expansion, which should make it harder to accidentally write tests that depend on headers in /usr/include. It also updates a few tests that use -isysroot <x> and a darwin triple to omit the triple and use -isystem <x>/usr/include instead, making them a little bit more general. Incidentally, this fixes a test failure I'm seeing on darwin in Modules/stddef.c, that happens because my system finds a stddef.h in /usr/include. llvm-svn: 219030	2014-10-03 22:18:49 +00:00
Jingyue Wu	3c3b48805f	Suppress defined-but-unused warnings by adding a fake use llvm-svn: 219029	2014-10-03 22:16:40 +00:00
Chris Bieneman	489d1dce3f	Converting the ErrorHandlerMutex to a ManagedStatic to avoid the static constructor and destructor. llvm-svn: 219028	2014-10-03 22:03:12 +00:00
Jonathan Roelofs	b140a100a0	CFE Knob for: Add a thread-model knob for lowering atomics on baremetal & single threaded systems http://reviews.llvm.org/D4985 llvm-svn: 219027	2014-10-03 21:57:44 +00:00
Anna Zaks	0820e13e2a	[analyzer] Refactor and cleanup IsCompleteType There are three copies of IsCompleteType(...) functions in CSA and all of them are incomplete (I experienced crashes in some CSA's test cases). I have replaced these function calls with Type::isIncompleteType() calls. A patch by Aleksei Sidorin! llvm-svn: 219026	2014-10-03 21:49:03 +00:00
Anna Zaks	d79b840716	[analyzer] Make Malloc Checker track memory allocated by if_nameindex The MallocChecker does currently not track the memory allocated by if_nameindex. That memory is dynamically allocated and should be freed by calling if_freenameindex. The attached patch teaches the checker about these functions. Memory allocated by if_nameindex is treated as a separate allocation "family". That way the checker can verify it is freed by the correct function. A patch by Daniel Fahlgren! llvm-svn: 219025	2014-10-03 21:48:59 +00:00
Anna Zaks	2d2f137ed4	[analyzer] Make CStringChecker correctly calculate return value of mempcpy The return value of mempcpy is only correct when the destination type is one byte in size. This patch casts the argument to a char* so the calculation is also correct for structs, ints etc. A patch by Daniel Fahlgren! llvm-svn: 219024	2014-10-03 21:48:54 +00:00
Benjamin Kramer	de952d1180	Initialize MCObjectFileInfo when parsing ms-style asm. Otherwise we're left with an half-initialized bag of variables that may or may not explode later on. Should bring the MSVC buildbot back to life. llvm-svn: 219023	2014-10-03 21:48:23 +00:00
Chandler Carruth	0adda1e4d4	[x86] Adjust the patterns for lowering X86vzmovl nodes which don't perform a load to use blendps rather than movss when it is available. For non-loads, blendps is much faster. It can execute on two ports in Sandy Bridge and Ivy Bridge, and three ports on Haswell. This fixes one of the "regressions" from aggressively taking the "insertion" path in the new vector shuffle lowering. This does highlight one problem with blendps -- it isn't commuted as heavily as it should be. That's future work though. llvm-svn: 219022	2014-10-03 21:38:49 +00:00
Enrico Granata	0aca4b1aa0	These tests all seem to pass on my machine, marking them as non-Xfail on Darwin, or clang where applicable. Non-Apple folks, if these fail for you, maybe we can put some more helpful markers on them llvm-svn: 219020	2014-10-03 21:26:37 +00:00
Duncan P. N. Exon Smith	52fd68980c	DI: LLVM schema change: fold constants into string Update debug info testcases for the LLVM metadata schema change in r219010 to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219019	2014-10-03 21:08:48 +00:00
Jonathan Roelofs	b24884de70	Fix typo in TableGen documentation llvm-svn: 219018	2014-10-03 20:46:05 +00:00
Johannes Doerfert	a441783544	[Fix] Accidently changed the type of a libgomp argument in r219003. Only subsequent patches introduced tests for the signature in the generated IR, thus the tests were wrong too and are adjusted now. llvm-svn: 219017	2014-10-03 20:40:24 +00:00
Sid Manning	50600f39ab	Add unit tests to verify Hexagon emission. Add the test cases I overlooked, part of the original commit, http://reviews.llvm.org/D5523 llvm-svn: 219016	2014-10-03 20:33:03 +00:00
Adrian Prantl	adc41ca4ef	Add a reference to Phabricator.rst to docs/index.rst. llvm-svn: 219015	2014-10-03 20:17:32 +00:00
Richard Smith	1ed4229f6f	PR21145: Teach LLVM about C++14 sized deallocation functions. C++14 adds new builtin signatures for 'operator delete'. This change allows new/delete pairs to be removed in C++14 onwards, as they were in C++11 and before. llvm-svn: 219014	2014-10-03 20:17:06 +00:00
Fariborz Jahanian	aae7fefce8	Objective-C. Assortment of improvements pretty printing objective-C declarations, including printing of availability attributes on methods. llvm-svn: 219013	2014-10-03 20:05:33 +00:00
Reid Kleckner	cf6b0c64b9	Use __atomic_exchange_n instead of Clang's __sync_swap Also remove an extra extern "C" from a global variable redeclaration. This allows building libcxxabi with GCC on my system. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D5604 llvm-svn: 219012	2014-10-03 20:03:47 +00:00
Duncan P. N. Exon Smith	3c51fa6aae	Revert "Revert "DI: LLVM schema change: fold constants into string"" This reverts commit r218917, effectively reapplying r218913. Original commit message follows. -- Update debug info testcases for an LLVM metadata schema change to fold metadata constant operands into a single `MDString`. Part of PR17891. llvm-svn: 219011	2014-10-03 20:01:52 +00:00
Duncan P. N. Exon Smith	176b691d32	Revert "Revert "DI: Fold constant arguments into a single MDString"" This reverts commit r218918, effectively reapplying r218914 after fixing an Ocaml bindings test and an Asan crash. The root cause of the latter was a tightened-up check in `DILexicalBlock::Verify()`, so I'll file a PR to investigate who requires the loose check (and why). Original commit message follows. -- This patch addresses the first stage of PR17891 by folding constant arguments together into a single MDString. Integers are stringified and a `\0` character is used as a separator. Part of PR17891. Note: I've attached my testcases upgrade scripts to the PR. If I've just broken your out-of-tree testcases, they might help. llvm-svn: 219010	2014-10-03 20:01:09 +00:00
Adam Nemet	ff63a2dc51	[ISel] Keep matching state consistent when folding during X86 address match In the X86 backend, matching an address is initiated by the 'addr' complex pattern and its friends. During this process we may reassociate and-of-shift into shift-of-and (FoldMaskedShiftToScaledMask) to allow folding of the shift into the scale of the address. However as demonstrated by the testcase, this can trigger CSE of not only the shift and the AND which the code is prepared for but also the underlying load node. In the testcase this node is sitting in the RecordedNode and MatchScope data structures of the matcher and becomes a deleted node upon CSE. Returning from the complex pattern function, we try to access it again hitting an assert because the node is no longer a load even though this was checked before. Now obviously changing the DAG this late is bending the rules but I think it makes sense somewhat. Outside of addresses we prefer and-of-shift because it may lead to smaller immediates (FoldMaskAndShiftToScale is an even better example because it create a non-canonical node). We currently don't recognize addresses during DAGCombiner where arguably this canonicalization should be performed. On the other hand, having this in the matcher allows us to cover all the cases where an address can be used in an instruction. I've also talked a little bit to Dan Gohman on llvm-dev who added the RAUW for the new shift node in FoldMaskedShiftToScaledMask. This RAUW is responsible for initiating the recursive CSE on users (http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-September/076903.html) but it is not strictly necessary since the shift is hooked into the visited user. Of course it's safer to keep the DAG consistent at all times (e.g. for accurate number of uses, etc.). So rather than changing the fundamentals, I've decided to continue along the previous patches and detect the CSE. This patch installs a very targeted DAGUpdateListener for the duration of a complex-pattern match and updates the matching state accordingly. (Previous patches used HandleSDNode to detect the CSE but that's not practical here). The listener is only installed on X86. I tested that there is no measurable overhead due to this while running through the spec2k BC files with llc. The only thing we pay for is the creation of the listener. The callback never ever triggers in spec2k since this is a corner case. Fixes rdar://problem/18206171 llvm-svn: 219009	2014-10-03 20:00:34 +00:00
Tom Stellard	081e778d22	Implement async_work_group_copy builtin v3 This is a simple implementation which just copies data synchronously. v2: - Use size_t. v3: - Fix possible race condition by splitting the copy among multiple work items. llvm-svn: 219008	2014-10-03 19:49:39 +00:00
Tom Stellard	ed5bbfdb1b	Implement async_work_group_strided_copy builtin v2 This is a simple implementation which just copies data synchronously. v2: - Use size_t. llvm-svn: 219007	2014-10-03 19:49:37 +00:00
Tom Stellard	b5064f79ef	Implement wait_group_events builtin v2 This is a simple default implemetation which just calls barrier(). v2: - Only call barrier() once. llvm-svn: 219006	2014-10-03 19:49:34 +00:00
Johannes Doerfert	1356ac75d1	Put the parallel context alloca into the function entry block. We use lifetime markers to limit the actual life range (similar to clang). Differential Revision: http://reviews.llvm.org/D5582 llvm-svn: 219005	2014-10-03 19:12:05 +00:00
Johannes Doerfert	990cd4c2e2	Add option to limit the maximal number of parallel threads. Differential Revision: http://reviews.llvm.org/D5581 llvm-svn: 219004	2014-10-03 19:11:10 +00:00
Johannes Doerfert	12b355a2ce	[Refactor] Generalize parallel code generation + Generalized function names and comments + Removed OpenMP (omp) from the names and comments + Use common names (non OpenMP specific) for runtime library call creation methodes + Commented the parallel code generator and all its member functions + Refactored some values and methodes Differential Revision: http://reviews.llvm.org/D4990 llvm-svn: 219003	2014-10-03 19:10:13 +00:00

1 2 3 4 5 ...

183902 Commits All Branches Search

183902 Commits

All Branches