llvm-project

Commit Graph

Author	SHA1	Message	Date
Arnold Schwaighofer	d96e427eac	LoopVectorize: Add support for floating point min/max reductions Add support for min/max reductions when "no-nans-float-math" is enabled. This allows us to assume we have ordered floating point math and treat ordered and unordered predicates equally. radar://13723044 llvm-svn: 181144	2013-05-05 01:54:48 +00:00
Arnold Schwaighofer	e972d03f64	PatternMatch: Matcher for (un)ordered floating point min/max Add support for matching 'ordered' and 'unordered' floating point min/max constructs. In LLVM we can express min/max functions as a combination of compare and select. We have support for matching such constructs for integers but not for floating point. In floating point math there is no total order because of the presence of 'NaN'. Therefore, we have to be careful to preserve the original fcmp semantics when interpreting floating point compare select combinations as a minimum or maximum function. The resulting 'ordered/unordered' floating point maximum function has to select the same value as the select/fcmp combination it is based on. ordered_max(x,y) = max(x,y) iff x and y are not NaN, y otherwise unordered_max(x,y) = max(x,y) iff x and y are not NaN, x otherwise ordered_min(x,y) = min(x,y) iff x and y are not NaN, y otherwise unordered_min(x,y) = min(x,y) iff x and y are not NaN, x otherwise This matches the behavior of the underlying select(fcmp(olt/ult/.., L, R), L, R) construct. Any code using this predicate has to preserve this semantics. A follow-up patch will use this to implement floating point min/max reductions in the vectorizer. radar://13723044 llvm-svn: 181143	2013-05-05 01:54:46 +00:00
Arnold Schwaighofer	f5183729db	LoopVectorizer: Cleanup of miminimum/maximum pattern match code No need for setting the operands. The pointers are going to be bound by the matcher. radar://13723044 llvm-svn: 181142	2013-05-05 01:54:44 +00:00
Arnold Schwaighofer	a670a0a3aa	LoopVectorize: We don't need an identity element for min/max reductions We can just use the initial element that feeds the reduction. max(max(x, y), z) == max(max(x,y), max(x,z)) radar://13723044 llvm-svn: 181141	2013-05-05 01:54:42 +00:00
Dmitri Gribenko	010316ce34	ArrayRef<T>() -> None cleanup llvm-svn: 181140	2013-05-05 01:03:47 +00:00
Dmitri Gribenko	44ebbd5436	Replace ArrayRef<T>() with None, now that we have an implicit ArrayRef constructor from None Patch by Robert Wilhelm. llvm-svn: 181139	2013-05-05 00:41:58 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Nadav Rotem	d61dcfc4fd	whitespace llvm-svn: 181137	2013-05-04 23:27:32 +00:00
Nadav Rotem	42932bdcd0	Fix an odd comment. llvm-svn: 181136	2013-05-04 23:24:56 +00:00
Tim Northover	7b55b97dba	AArch64: enable MCJIT and tests now that everything passes. This removes dire warnings about AArch64 being unsupported and enables the tests when appropriate on this platform. llvm-svn: 181135	2013-05-04 20:14:22 +00:00
Tim Northover	b23d8dbbac	AArch64: implement 64-bit absolute relocation in MCJIT This is about the simplest relocation, but surprisingly rare in actual code. It occurs in (for example) the MCJIT test test-ptr-reloc.ll. llvm-svn: 181134	2013-05-04 20:14:14 +00:00
Tim Northover	37cde9755d	AArch64: add stubs to support long function calls on MCJIT As with global accesses, external functions could exist anywhere in memory. Therefore the stub must create a complete 64-bit address. This patch implements the fragment as (roughly): movz x16, #:abs_g3:somefunc movk x16, #:abs_g2_nc:somefunc movk x16, #:abs_g1_nc:somefunc movk x16, #:abs_g0_nc:somefunc br x16 In principle we could save 4 bytes by using a literal-load instead, but it is unclear that would be more efficient and can only be tested when real hardware is readily available. This allows (for example) the MCJIT test 2003-05-07-ArgumentTest to pass on AArch64. llvm-svn: 181133	2013-05-04 20:14:09 +00:00
Tim Northover	4d01c1e0e6	AArch64: implement relocations for global access The large memory model (default and main viable for JIT) emits addresses in need of relocation as movz x0, #:abs_g3:somewhere movk x0, #:abs_g2_nc:somewhere movk x0, #:abs_g1_nc:somewhere movk x0, #:abs_g0_nc:somewhere To support this we must implement those four relocations in the dynamic loader. This allows (for example) the test-global.ll MCJIT test to pass on AArch64. llvm-svn: 181132	2013-05-04 20:14:04 +00:00
Tim Northover	fa1b2f85da	AArch64: implement first relocation required for MCJIT R_AARCH64_PCREL32 is present in even trivial .eh_frame sections and so is required to compile any function without the "nounwind" attribute. This change implements very basic infrastructure in the RuntimeDyldELF file and allows (for example) the test-shift.ll MCJIT test to pass on AArch64. llvm-svn: 181131	2013-05-04 20:13:59 +00:00
Tim Northover	a958a57081	Build system changes to enable MCJIT on AArch64 These changes just allow AArch64 to take part in the MCJIT world when built correctly. llvm-svn: 181130	2013-05-04 20:13:52 +00:00
Tim Northover	6c26b327ef	AArch64: use __clear_cache under GCCish environments AArch64 is going to need some kind of cache-invalidation in order to successfully JIT since it has a weak memory-model. This is provided by a __clear_cache builtin in libgcc, which acts very much like the 32-bit ARM equivalent (on platforms where it exists). llvm-svn: 181129	2013-05-04 18:52:44 +00:00
Richard Osborne	2f75a0c0d8	Fix buildbot failure on 64 bit linux due to std::max() having different operand types. llvm-svn: 181128	2013-05-04 17:41:01 +00:00
Richard Osborne	0a7abb655b	[XCore] Remove unused operand type. llvm-svn: 181127	2013-05-04 17:30:05 +00:00
Richard Osborne	54ff84a8f8	[XCore] Make use of the target independent global address offset folding. This let us to remove some custom code that matched constant offsets from globals at instruction selection time as a special addressing mode. No intended functionality change. llvm-svn: 181126	2013-05-04 17:24:33 +00:00
Richard Osborne	a282fa5b60	[XCore] Simplify code that checks for an aligned base plus a constant. The code now makes use of ComputeMaskedBits, SelectionDAG::isBaseWithConstantOffset and TargetLowering::isGAPlusOffset where appropriate reducing the amount of logic needed in XCoreISelLowering. No intended functionality change. llvm-svn: 181125	2013-05-04 17:17:10 +00:00
Richard Osborne	8bbea9cde7	[XCore] Move lowering of thread local storage to a separate pass. Thread local storage is not supported by the XMOS linker so we handle thread local variables by lowering the variable to an array of n elements (where n is the number of hardware threads per core, currently 8 for all XMOS devices) indexed by the the current thread ID. Previously this lowering was spread across the XCoreISelLowering and the XCoreAsmPrinter classes. Moving this to a separate pass should be much cleaner. llvm-svn: 181124	2013-05-04 17:01:55 +00:00
Aaron Ballman	444eb6e23c	Properly parsing __declspec(safebuffers), though there is no semantic hookup. For more information about safebuffers, see MSDN: http://msdn.microsoft.com/en-us/library/dd778695(v=vs.110).aspx llvm-svn: 181123	2013-05-04 16:58:37 +00:00
Aaron Ballman	d428ff46fe	Reverting r181004 since it has broken test/Sema/wchar.c. llvm-svn: 181122	2013-05-04 16:56:22 +00:00
Tim Northover	85dcbde239	AArch64: assert code model is small for TLS accesses Supporting TLS in the large memory model is rather difficult at the moment, so make sure no-one gets into difficulties by mistake. llvm-svn: 181121	2013-05-04 16:54:11 +00:00
Tim Northover	885698a25c	AArch64: support literal pool access in large memory model. llvm-svn: 181120	2013-05-04 16:54:07 +00:00
Tim Northover	8ff187df5f	AArch64: support large code model for jump-tables llvm-svn: 181119	2013-05-04 16:54:00 +00:00
Tim Northover	9fc1cddb21	AArch64: implement support for blockaddress in large code model llvm-svn: 181118	2013-05-04 16:53:53 +00:00
Tim Northover	2dbef3452c	AArch64: implement large code model access to global variables. The MOVZ/MOVK instruction sequence may not be the most efficient (a literal-pool load could be better) but adding that would require reinstating the ConstantIslands pass. For now the sequence is correct, and that's enough. Beware, as of commit GNU ld does not appear to support the relocations needed for this. Its primary purpose (for now) will be to support JITed code, since in that case there is no guarantee of where your code will end up in memory relative to external symbols it references. llvm-svn: 181117	2013-05-04 16:53:46 +00:00
Richard Osborne	df9e574105	[XCore] Use static relocation model by default. This allows us to get get rid of a hack in XCoreTargetObjectFile where the the DataRel* sections were overridden. llvm-svn: 181116	2013-05-04 16:40:58 +00:00
Enea Zaffanella	b2d998f356	Moved pretty printer test for thread local storage in its own file and specified the triple. llvm-svn: 181115	2013-05-04 11:26:59 +00:00
Benjamin Kramer	c2f5f29b8d	Lex: Fix quadratic behavior when unescaping _Pragma strings. No functionality change. llvm-svn: 181114	2013-05-04 10:37:20 +00:00
Enea Zaffanella	acb8ecd664	In VarDecl nodes, store the thread storage class specifier as written. llvm-svn: 181113	2013-05-04 08:27:07 +00:00
Tim Northover	fee13d1e11	Allow host triple to be correctly overridden in CMake builds The intended semantics mirror autoconf, where the user is able to specify a host triple, but if it's left to the build system then "config.guess" is invoked for the default. This also renames the LLVM_HOSTTRIPLE define to LLVM_HOST_TRIPLE to fit in with the style of the surrounding defines. llvm-svn: 181112	2013-05-04 07:36:23 +00:00
Tim Northover	8ec8c4bf89	AArch64: teach Clang about __clear_cache intrinsic libgcc provides a __clear_cache intrinsic on AArch64, much like it does on 32-bit ARM. llvm-svn: 181111	2013-05-04 07:15:13 +00:00
Richard Smith	fa7a7b4f36	Tweaks to C++ status: add a link to Faisal's generic lambdas implementation, and mark "clarifying memory allocation" as done, since it turns out that our optimizations here (such as they are) already conform to the new rules. llvm-svn: 181110	2013-05-04 07:12:37 +00:00
Richard Smith	452d6b0331	More colours for C++ status page, and mark relaxed constexpr as partially complete. llvm-svn: 181109	2013-05-04 07:02:10 +00:00
Richard Smith	2a7d481faf	Implement most of N3638 (return type deduction for normal functions). Missing (somewhat ironically) is support for the new deduction rules in lambda functions, plus PCH support for return type patching. llvm-svn: 181108	2013-05-04 07:00:32 +00:00
Richard Smith	22262abd78	Don't build a call expression referring to a function which we're not allowed to use. This makes very little difference right now (other than suppressing follow-on errors in some cases), but will matter more once we support deduced return types (we don't want expressions with undeduced return types in the AST). llvm-svn: 181107	2013-05-04 06:44:46 +00:00
Jason Molenda	513db4d9f4	Change ProcessKDP::UpdateThreadList's call to FindThreadByProtocolID to not let it update the thread list or else we'll infinite recurse call back to UpdateThreadList. llvm-svn: 181106	2013-05-04 05:51:02 +00:00
Rafael Espindola	914d31ffd3	Optimize llvm-link too. This takes the linking of almost all modules in a clang build from 6:32 to 0:19. llvm-svn: 181105	2013-05-04 05:30:49 +00:00
Rafael Espindola	aa9918aac7	Fix a performance bug in the Linker. Now that we hava a convinient place to keep it, remeber the set of identified structs as we merge modules. This speeds up the linking of all the bitcode files in clang with the gold plugin and -plugin-opt=emit-llvm (i.e., link only, no codegen) from 5:25 minutes to 13.6 seconds! Patch by Xiaofei Wan! llvm-svn: 181104	2013-05-04 05:05:18 +00:00
Richard Smith	489e4e0369	Say 'decltype(auto)' not 'auto' as appropriate in mismatched-deduction diagnostic. llvm-svn: 181103	2013-05-04 04:19:27 +00:00
Rafael Espindola	287f18b4b8	Implement Linker::LinkModules with Linker::linkInModule. Flipping which one is the implementation will let us optimize linkInModule. llvm-svn: 181102	2013-05-04 04:08:02 +00:00
Wei Pan	17fbf6edc2	Implement template support for CapturedStmt - Sema tests added and CodeGen tests are pending Differential Revision: http://llvm-reviews.chandlerc.com/D728 llvm-svn: 181101	2013-05-04 03:59:06 +00:00
Rafael Espindola	3df61b7bef	Now that Linker.cpp is almost empty, merge it into LinkModules.cpp. Also remove unused includes. llvm-svn: 181100	2013-05-04 03:48:37 +00:00
Rafael Espindola	a8023c1c9f	Last batch of cleanups to Linker.h. Update comments, fix * placement, fix method names that are not used in clang, add a linkInModule that takes a Mode and put it in Linker.cpp. llvm-svn: 181099	2013-05-04 03:06:50 +00:00
Rafael Espindola	0229acaa0f	Don't construct or delete a module on the Linker. The linker is now responsible only for actually linking the modules, it is up to the clients to create and destroy them. llvm-svn: 181098	2013-05-04 02:43:00 +00:00
Rafael Espindola	02a071aca8	Don't store the context in the Linker. llvm-svn: 181097	2013-05-04 02:34:41 +00:00
Rafael Espindola	40bbfa1080	Remove unused members and constructor arguments. llvm-svn: 181096	2013-05-04 02:28:57 +00:00
Rafael Espindola	75f9655afb	Add missing header. llvm-svn: 181095	2013-05-04 02:21:46 +00:00

1 2 3 4 5 ...

148880 Commits All Branches Search

148880 Commits

All Branches