llvm-project

Commit Graph

Author	SHA1	Message	Date
Wan Xiaofei	b2c8cdc766	Change data structure to memorize computed result in ScalarEvolution Replace std::map with SmallVector to memorize the cached result since SCEV usually belongs to little Loop/BB Linear scan on SmallVector is faster than std::map. Code reviewer : Andrew Trick. Test result : Pass Unit Test & LLVM Test Suite 401.bzip2 0.425721 0.419981 101.37% 403.gcc 24.53855 24.2667 101.12% 429.mcf 0.060847 0.059944 101.51% 433.milc 0.646009 0.636119 101.55% 444.namd 1.383928 1.370614 100.97% 445.gobmk 5.836575 5.800225 100.63% 450.soplex 1.911257 1.895963 100.81% 456.hmmer 1.039565 1.032534 100.68% 458.sjeng 0.897401 0.885567 101.34% 464.h264ref 3.645908 3.577991 101.90% 470.lbm 0.049456 0.048398 102.19% 471.omnetpp 5.638575 5.60435 100.61% bitmnp01 0.045738 0.045291 100.99% cjpegv2data 0.304359 0.302833 100.50% idctrn01 0.046433 0.045763 101.46% quake2 4.534416 4.4952 100.87% quake 2.688566 2.659208 101.10% xcsoar 12.42545 12.30385 100.99% linpack 0.038739 0.03803 101.86% matrix01 0.053564 0.0528 101.45% nbench 0.402867 0.395803 101.78% tblook01 0.021265 0.021015 101.19% ttsprk01 0.066384 0.065566 101.25% llvm-svn: 194459	2013-11-12 09:40:41 +00:00
Arnaud A. de Grandmaison	f5f040fa1e	CalcSpillWeights: allow overriding the spill weight normalizing function This will enable the PBQP register allocator to provide its own normalizing function. No functional change. llvm-svn: 194417	2013-11-11 19:56:14 +00:00
Chad Rosier	d3684a0566	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Arnaud A. de Grandmaison	ea3ac1612c	CalcSpillWeights: give a more descriptive name to calculateSpillWeights Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint. No functional change. llvm-svn: 194404	2013-11-11 19:04:45 +00:00
Chad Rosier	35575e737c	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Peter Zotov	d2cf791ad8	[llvm-c] Remove dead typedef llvm-svn: 194379	2013-11-11 14:47:01 +00:00
Pete Cooper	a8b685cd7b	Don't universally enable initialiser lists on GCC. Thanks for catching this Chandler llvm-svn: 194365	2013-11-11 05:14:42 +00:00
Pete Cooper	020832fb6e	Add LLVM_HAS_INITIALIZER_LISTS for upcoming C++11 support. Use it in ArrayRef llvm-svn: 194362	2013-11-11 03:58:00 +00:00
Arnaud A. de Grandmaison	760c1e0b0a	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was thought that CalcSpillWeights does not need to be a pass. This change will make it possible to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functional change. llvm-svn: 194356	2013-11-10 17:46:31 +00:00
Chandler Carruth	90a835d2a0	[PM] Start sketching out the new module and function pass manager. This is still just a skeleton. I'm trying to pull together the experimentation I've done into committable chunks, and this is the first coherent one. Others will follow in hopefully short order that move this more toward a useful initial implementation. I still expect the design to continue evolving in small ways as I work through the different requirements and features needed here though. Keep in mind, all of this is off by default. Currently, this mostly exercises the use of a polymorphic smart pointer and templates to hide the polymorphism for the pass manager from the pass implementation. The next step will be more significant, adding the first framework of analysis support. llvm-svn: 194325	2013-11-09 13:09:08 +00:00
Chandler Carruth	7caea41545	Move the old pass manager infrastructure into a legacy namespace and give the files a legacy prefix in the right directory. Use forwarding headers in the old locations to paper over the name change for most clients during the transitional period. No functionality changed here! This is just clearing some space to reduce renaming churn later on with a new system. Even when the new stuff starts to go in, it is going to be hidden behind a flag and off-by-default as it is still WIP and under development. This patch is specifically designed so that very little out-of-tree code has to change. I'm going to work as hard as I can to keep that the case. Only direct forward declarations of the PassManager class are impacted by this change. llvm-svn: 194324	2013-11-09 12:26:54 +00:00
Filip Pizlo	dfc9b586ae	This exposes the new calling conventions (WebKit_JS and AnyReg) via the C API by adding them to the enumeration in Core.h. llvm-svn: 194323	2013-11-09 06:00:03 +00:00
Chandler Carruth	42fabdead0	Switch to allow implicit construction. In many cases, we're wrapping a derived type and this makes it much easier to write this code. llvm-svn: 194321	2013-11-09 05:55:03 +00:00
Chandler Carruth	64b0556071	Add a polymorphic_ptr<T> smart pointer data type. It's a somewhat silly unique ownership smart pointer which is deep copyable by assuming it can call a T::clone() method to allocate a copy of the owned data. This is mostly useful with containers or other collections of uniquely owned data in C++98 where they might copy. With C++11 we can likely remove this in favor of move-only types and containers wrapped around those types. llvm-svn: 194315	2013-11-09 04:06:02 +00:00
NAKAMURA Takumi	5f847c007b	include/llvm/CodeGen/PBQP: Update @param(s) in comments. [-Wdocumentation] llvm-svn: 194314	2013-11-09 03:54:05 +00:00
NAKAMURA Takumi	866975c26c	Fix whitespace. llvm-svn: 194313	2013-11-09 03:53:55 +00:00
Lang Hames	fb82630a91	Re-apply r194300 with fixes for warnings. llvm-svn: 194311	2013-11-09 03:08:56 +00:00
Nick Lewycky	59886d00ec	Revert r194300 which broke the build. llvm-svn: 194308	2013-11-09 02:01:25 +00:00
Juergen Ributzka	87ed906b2e	[Stackmap] Materialize the jump address within the patchpoint noop slide. This patch moves the jump address materialization inside the noop slide. This enables patching of the materialization itself or its complete removal. This patch also adds the ability to define scratch registers that can be used safely by the code called from the patchpoint intrinsic. At least one scratch register is required, because that one is used for the materialization of the jump address. This patch depends on D2009. Differential Revision: http://llvm-reviews.chandlerc.com/D2074 Reviewed by Andy llvm-svn: 194306	2013-11-09 01:51:33 +00:00
Lang Hames	1662b832d9	Rewrite the PBQP graph data structure. The new graph structure replaces the node and edge linked lists with vectors. Free lists (well, free vectors) are used for fast insertion/deletion. The ultimate aim is to make PBQP graphs cheap to clone. The motivation is that the PBQP solver destructively consumes input graphs while computing a solution, forcing the graph to be fully reconstructed for each round of PBQP. This imposes a high cost on large functions, which often require several rounds of solving/spilling to find a final register allocation. If we can cheaply clone the PBQP graph and incrementally update it between rounds then hopefully we can reduce this cost. Further, once we begin pooling matrix/vector values (future work), we can cache some PBQP solver metadata and share it between cloned graphs, allowing the PBQP solver to re-use some of the computation done in earlier rounds. For now this is just a data structure update. The allocator and solver still use the graph the same way as before, fully reconstructing it between each round. I expect no material change from this update, although it may change the iteration order of the nodes, causing ties in the solver to break in different directions, and this could perturb the generated allocations (hopefully in a completely benign way). Thanks very much to Arnaud Allard de Grandmaison for encouraging me to get back to work on this, and for a lot of discussion and many useful PBQP test cases. llvm-svn: 194300	2013-11-09 00:14:07 +00:00
Juergen Ributzka	9969d3e6e8	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a particular order into a specified set of registers. Instead it is up to the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Lang Hames	3078977d28	Add a method to get the object-file appropriate stack map section. Thanks to Eric Christopher for the tips on the appropriate way to do this. llvm-svn: 194282	2013-11-08 22:14:49 +00:00
Arnaud A. de Grandmaison	f7a60a8e01	Revert "CalculateSpillWeights does not need to be a pass" Temporarily revert my previous commit until I understand why it breaks 3 target tests. llvm-svn: 194272	2013-11-08 18:19:19 +00:00
Arnaud A. de Grandmaison	ed812f6590	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was thought that CalcSpillWeights does not need to be a pass. This change will make it possible to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functional change. llvm-svn: 194269	2013-11-08 17:56:29 +00:00
Jordan Rose	09e604333e	Add ImmutableSet profiling info for 'bool'. Useful for tri-state maps: true, false, and "no data yet". llvm-svn: 194266	2013-11-08 17:23:49 +00:00
Artyom Skrobov	08b2257f14	Export MCDisassembler's SubtargetInfo, to allow architecture-aware disassembly llvm-svn: 194260	2013-11-08 16:07:43 +00:00
NAKAMURA Takumi	29c3b55897	llvm-c/Support.h: Add a newline at eof. llvm-svn: 194203	2013-11-07 13:54:24 +00:00
Simon Atanasyan	0f756cd70b	Add DT_VERSYM dynamic table entry tag definition. llvm-svn: 194149	2013-11-06 12:23:52 +00:00
Peter Zotov	f7e64feb33	[llvm-c] Add parameter names in Target.h for C99 compliance llvm-svn: 194146	2013-11-06 11:52:40 +00:00
Peter Zotov	7b61b75c21	[llvm-c] Improve TargetMachine bindings Original patch by Chris Wailes llvm-svn: 194143	2013-11-06 10:25:18 +00:00
Peter Zotov	6b5e8b9409	[llvm-c] Correctly check for existence of native AsmParser, AsmPrinter, Disassembler Also, properly name the functions. llvm-svn: 194141	2013-11-06 09:45:53 +00:00
Peter Zotov	04f5981996	[llvm-c] Add functions for initializing native AsmPrinter, AsmParser & Disassembler Original patch by Chris Wailes llvm-svn: 194140	2013-11-06 09:21:35 +00:00
Peter Zotov	34ddbf1a7e	[llvm-c] Expose LLVMLoadLibraryPermanently Original patch by Chris Wailes llvm-svn: 194139	2013-11-06 09:21:31 +00:00
Peter Zotov	285eed6073	[llvm-c] Expose IRReader interface Original patch by Chris Wailes llvm-svn: 194137	2013-11-06 09:21:15 +00:00
Peter Zotov	cd93b370d5	[llvm-c] Implement LLVMPrintValueToString Original patch by Chris Wailes llvm-svn: 194135	2013-11-06 09:21:01 +00:00
Andrew Trick	34e2f0c4ea	Rewrite SCEV's backedge taken count computation. Patch by Michele Scandale! Rewrite of the functions used to compute the backedge taken count of a loop on LT and GT comparisons. I decided to split the handling of LT and GT cases because the trick "a > b == -a < -b" in some cases prevents the trip count computation due to the multiplication by -1 on the two operands of the comparison. This issue comes from the conservative computation of value range of SCEVs: taking the negative SCEV of an expression that has a small positive range (e.g. [0,31]), we would have a SCEV with a fullset as value range. Indeed, in the new rewritten function I tried to better handle the maximum backedge taken count computation when MAX/MIN expressions are used to handle the cases where no entry guard is found. Some tests have been modified in order to check the new value correctly (I manually checked them, and reasoning about possible overflow, the new values seem correct). I finally added a new test case related to the multiplication by -1 issue on GT comparisons. llvm-svn: 194116	2013-11-06 02:08:26 +00:00
Rafael Espindola	03cb49e159	Remove another unused, and IMHO, not very desirable feature of ErrorOr. One of the uses of the IsValid flag is to support default constructing a ErrorOr that is not a Error or a Value. There is not much value in doing that IMHO. If ErrorOr was to have a default constructor, it should be implemented by default constructing the value, but even that looks unnecessary. The other use is to avoid calling destructors on moved objects. This looks wrong. If the data being moved has non trivial treatment of moves (an std::vector for example), it is its destructor that should handle it, not ~ErrorOr. With this change ErrorOr becomes a fairly simple wrapper and should always be better than using an error_code + value in an API. llvm-svn: 194109	2013-11-05 23:41:57 +00:00
Dmitri Gribenko	75e12236cc	Convert comments to documentation comments (// -> ///) Patch by MathOnNapkins llvm-svn: 194093	2013-11-05 21:28:42 +00:00
Rafael Espindola	2b11ad4fe9	Use error_code in GVMaterializer. They just propagate out the bitcode reader error, so we don't need a new enum. llvm-svn: 194091	2013-11-05 19:36:34 +00:00
Jiangning Liu	d7c52676f6	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194085	2013-11-05 17:42:05 +00:00
Peter Zotov	ae0344b07f	[llvm-c] (PR16190) Add LLVMIsA* functions for ConstantDataSequential and subclasses Original patch by David Monniaux llvm-svn: 194074	2013-11-05 12:55:37 +00:00
Alp Toker	e5e3bc0c04	Fix symbol defines in config.h.cmake These were incorrectly pointing to HAVE_LOG despite being checked for correctly in config-ix.cmake. Patch by James Lyon! llvm-svn: 194051	2013-11-05 07:27:18 +00:00
Yuchen Wu	30672d9086	Support for reading run counts in llvm-cov. This patch enables llvm-cov to correctly output the run count stored in the GCDA file. GCOVProfiling currently does not generate this information, so the GCDA run data had to be hacked on from a GCDA file generated by gcc. This is corrected by a subsequent patch. With the run and program data included, both llvm-cov and gcov produced the same output. llvm-svn: 194033	2013-11-05 01:11:58 +00:00
Rafael Espindola	2bad63c341	Fix MSVC build by not putting an error_code directly in a union. llvm-svn: 194032	2013-11-05 01:07:06 +00:00
Rafael Espindola	ca35ffe6a2	Simplify ErrorOr. ErrorOr had quite a bit of complexity and indirection to be able to hold a user type with the error. That feature is not used anymore. This patch removes it, it will live in svn history if we ever need it again. If we do need it again, IMHO there is one thing that should be done differently: Holding extra info in the error is not a property of a function also returning a value or not. The ability to hold extra info should be in the error type and ErrorOr templated over it so that we don't need the funny looking ErrorOr<void>. llvm-svn: 194030	2013-11-05 00:28:01 +00:00
Hal Finkel	081eaef6fa	Add a runtime unrolling parameter to the LoopUnroll pass constructor As with the other loop unrolling parameters (the unrolling threshold, partial unrolling, etc.) runtime unrolling can now also be controlled via the constructor. This will be necessary for moving non-trivial unrolling late in the pass manager (after loop vectorization). No functionality change intended. llvm-svn: 194027	2013-11-05 00:08:03 +00:00
Cameron McInally	d80f7d34de	Add support for AVX512 masked vector blend intrinsics. llvm-svn: 194006	2013-11-04 19:14:56 +00:00
Zoran Jovanovic	8a80aa76c8	Support for microMIPS branch instructions. llvm-svn: 193992	2013-11-04 14:53:22 +00:00
Elena Demikhovsky	46eeaba93b	AVX-512: fixed a typo in builtin name llvm-svn: 193988	2013-11-04 11:48:23 +00:00
Filip Pizlo	c10ca90324	Make the pretty stack trace be an opt-in, rather than opt-out, facility. Enable pretty stack traces by default if you use PrettyStackTraceProgram, so that existing LLVM-based tools will continue to get it without any changes. llvm-svn: 193971	2013-11-04 02:22:25 +00:00
Elena Demikhovsky	dacddb0bab	AVX-512: added VPCONFLICT instruction and intrinsics, added EVEX_KZ to tablegen llvm-svn: 193959	2013-11-03 13:46:31 +00:00
Bob Wilson	d8d92d90fa	Convert calls to __sinpi and __cospi into __sincospi_stret This adds an SimplifyLibCalls case which converts the special __sinpi and __cospi (float & double variants) into a __sincospi_stret where appropriate to remove duplicated work. Patch by Tim Northover llvm-svn: 193943	2013-11-03 06:48:38 +00:00
Filip Pizlo	9f89e59bb9	Add a comment to note that LLVMDisablePrettyStackTrace() is likely not a good long-term solution. llvm-svn: 193939	2013-11-03 04:38:31 +00:00
Filip Pizlo	9f50ccd1a3	When LLVM is embedded in a larger application, it's not OK for LLVM to intercept crashes. LLVM already has the ability to disable this functionality. This patch exposes it via the C API. llvm-svn: 193937	2013-11-03 00:29:47 +00:00
Rafael Espindola	586af97a30	move getSymbolNMTypeChar to the one program that needs it: nm. llvm-svn: 193933	2013-11-02 21:16:09 +00:00
Yuchen Wu	dbcf19758d	Added command-line option to output llvm-cov to file. Added -o option to llvm-cov. If no output file is specified, it defaults to STDOUT. llvm-svn: 193899	2013-11-02 00:09:17 +00:00
Rafael Espindola	716e7405d3	Remove linkonce_odr_auto_hide. linkonce_odr_auto_hide was an incomplete attempt to implement a way for the linker to hide symbols that are known to be available in every TU and whose addresses are not relevant for a particular DSO. It was redundant in that all its uses are equivalent to linkonce_odr+unnamed_addr. Unlike those, it has never been connected to clang or llvm's optimizers, so it was effectively dead. Given that nothing produces it, this patch just nukes it (other than the llvm-c enum value). llvm-svn: 193865	2013-11-01 17:09:14 +00:00
Kevin Enderby	3c5ac81032	Add to the disassembler C API output reference types for Objective-C data structures. This allows tools such as darwin's otool(1) that use the LLVM disassembler to take a pointer value being loaded by an instruction and add a comment describing what is being referenced, to make following disassembly of Objective-C programs more readable. For example disassembling the Mac OS X TextEdit app one will see comments like the following: movq 0x20684(%rip), %rsi ## Objc selector ref: standardUserDefaults movq 0x21985(%rip), %rdi ## Objc class ref: _OBJC_CLASS_$_NSUserDefaults movq 0x1d156(%rip), %r14 ## Objc message: +[NSUserDefaults standardUserDefaults] leaq 0x23615(%rip), %rdx ## Objc cfstring ref: @"SelectLinePanel" callq 0x10001386c ## Objc message: -[[%rdi super] initWithWindowNibName:] These diffs also include putting quotes around C strings in literal pools and using "symbol address" in the comment when adding a symbol name to the comment to tell these types of references apart: leaq 0x4f(%rip), %rax ## literal pool for: "Hello world" movq 0x1c3ea(%rip), %rax ## literal pool symbol address: ___stack_chk_guard Of course the easy changes are in the LLVM disassembler and the hard work is up to the implementer of the SymbolLookUp() call back. rdar://10602439 llvm-svn: 193833	2013-11-01 00:00:07 +00:00
Chad Rosier	74b65cd811	[AArch64] Add support for NEON scalar fixed-point convert to floating-point instructions. llvm-svn: 193816	2013-10-31 22:36:59 +00:00
Andrew Trick	a3a11dedca	Add new calling convention for WebKit Java Script. llvm-svn: 193812	2013-10-31 22:12:01 +00:00
Andrew Trick	153ebe6d2a	Add support for stack map generation in the X86 backend. Originally implemented by Lang Hames. llvm-svn: 193811	2013-10-31 22:11:56 +00:00
Rafael Espindola	282a47037b	Use LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN instead of the "dso list". There are two ways one could implement hiding of linkonce_odr symbols in LTO: * LLVM tells the linker which symbols can be hidden if not used from native files. * The linker tells LLVM which symbols are not used from other object files, but will be put in the dso symbol table if present. GOLD's API is the second option. It was implemented almost 1:1 in llvm by passing the list down to internalize. LLVM already had partial support for the first option. It is also very similar to how ld64 handles hiding these symbols when not doing LTO. This patch then * removes the APIs for the DSO list. * marks LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN all linkonce_odr unnamed_addr global values and other linkonce_odr whose address is not used. * makes the gold plugin responsible for handling the API mismatch. llvm-svn: 193800	2013-10-31 20:51:58 +00:00
Chad Rosier	20e1f20d69	[AArch64] Add support for NEON scalar shift immediate instructions. llvm-svn: 193790	2013-10-31 19:28:44 +00:00
Manman Ren	a4290bed81	Cleanup: update comments. llvm-svn: 193773	2013-10-31 17:25:22 +00:00
Andrew Trick	74f4c749cf	Lower stackmap intrinsics directly to their target opcode in the DAG builder. llvm-svn: 193769	2013-10-31 17:18:24 +00:00
Andrew Trick	50231ff8ab	Add experimental stackmap intrinsics to definition file and documentation. llvm-svn: 193767	2013-10-31 17:18:14 +00:00
Andrew Trick	a2efd99bdf	Enable variable arguments support for intrinsics. llvm-svn: 193766	2013-10-31 17:18:11 +00:00
Rafael Espindola	4b102d0ead	Remove another unused flag. llvm-svn: 193756	2013-10-31 15:58:33 +00:00
Rafael Espindola	74e1d0a0a0	Remove unused flag. llvm-svn: 193752	2013-10-31 15:49:39 +00:00
Cameron McInally	394d557f41	Add AVX512 unmasked integer broadcast intrinsics and support. llvm-svn: 193748	2013-10-31 13:56:31 +00:00
Rafael Espindola	6554e5a94d	Merge CallGraph and BasicCallGraph. llvm-svn: 193734	2013-10-31 03:03:55 +00:00
Rafael Espindola	6f1b2852fc	Produce .weak_def_can_be_hidden for some linkonce_odr values With this patch llvm produces a weak_def_can_be_hidden for linkonce_odr if they are also unnamed_addr or don't have their address taken. There is not a lot of documentation about .weak_def_can_be_hidden, but from the old discussion about linkonce_odr_auto_hide and the name of the directive this looks correct: these symbols can be hidden. Testing this with the ld64 in Xcode 5 linking clang reduces the number of exported symbols from 21053 to 19049. llvm-svn: 193718	2013-10-30 22:08:11 +00:00
Simon Atanasyan	6a2aaecd66	[Mips] Add more SHF_MIPS_xxx ELF section flags. llvm-svn: 193713	2013-10-30 20:41:45 +00:00
Rui Ueyama	00e24e48b6	Add {start,end}with_lower methods to StringRef. startswith_lower is occasionally useful and I think worth adding. endwith_lower is added for completeness. Differential Revision: http://llvm-reviews.chandlerc.com/D2041 llvm-svn: 193706	2013-10-30 18:32:26 +00:00
Daniel Sanders	d5f554f0bb	[mips][msa] Correct definition of bins[lr] and CHECK-DAG-ize related tests llvm-svn: 193695	2013-10-30 15:45:42 +00:00
Daniel Sanders	ab94b537d7	[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693	2013-10-30 15:20:38 +00:00
Chad Rosier	be020d0309	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193691	2013-10-30 15:19:37 +00:00
Cameron McInally	d184466d1b	Refactor the AVX512 intrinsics. Cluster the intrinsics into the appropriate vector extension class within the .td file. llvm-svn: 193690	2013-10-30 15:19:10 +00:00
Howard Hinnant	811c96fa0e	Rehash but don't grow when full of tombstones. This problem was found and fixed by José Fonseca in March 2011 for SmallPtrSet, committed r128566. But as far as I can tell, all other llvm hash tables retain the same problem: the bucket count can grow without bound while size() remains near constant by repeated insert/erase cycles that tend to fill the container with tombstones. Here is a demo that has been reduced to a trivial case: int main() { llvm::DenseSet<unsigned> d; for (unsigned i = 0; i < 0xFFFFFFF; ++i) { d.insert(i); d.erase(i); } } While the container size() never grows above 1, the bucket count grows like this: nb = 64 nb = 128 nb = 256 nb = 512 nb = 1024 nb = 2048 nb = 4096 nb = 8192 nb = 16384 nb = 32768 nb = 65536 nb = 131072 nb = 262144 nb = 524288 nb = 1048576 nb = 2097152 nb = 4194304 nb = 8388608 nb = 16777216 nb = 33554432 nb = 67108864 nb = 134217728 nb = 268435456 The above program currently consumes a few GB ram. This patch brings the memory consumption down by several orders of magnitude, and keeps the bucket count at 64 for the above test. llvm-svn: 193689	2013-10-30 15:10:54 +00:00
Daniel Sanders	d74b130cc9	[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687	2013-10-30 14:45:14 +00:00
Josh Magee	7245f1d85d	Reformat code with clang-format. Differential Revision: http://llvm-reviews.chandlerc.com/D2057 llvm-svn: 193672	2013-10-30 02:25:14 +00:00
NAKAMURA Takumi	c6823c760c	StackProtector.h: Fix trailing comments for doxygen. [-Wdocumentation] s!//<!///<! llvm-svn: 193669	2013-10-30 00:49:39 +00:00
NAKAMURA Takumi	8970f5386c	Trailing whitespace in a comment line. llvm-svn: 193668	2013-10-30 00:49:33 +00:00
Josh Magee	3f1c0e35e6	[stackprotector] Update the StackProtector pass to perform datalayout analysis. This modifies the pass to classify every SSP-triggering AllocaInst according to an SSPLayoutKind (LargeArray, SmallArray, AddrOf). This analysis is collected by the pass and made available for use, but no other pass uses it yet. The next patch will make use of this analysis in PEI and StackSlot passes. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1789 llvm-svn: 193653	2013-10-29 21:16:16 +00:00
Matt Arsenault	87596662cd	Update comment llvm-svn: 193651	2013-10-29 21:04:19 +00:00
Matt Arsenault	a1ca46d003	Workaround MSVC 32-bit miscompile of getCondCodeAction. Use 32-bit types for the array instead of 64. This should generally be better anyway. In optimized + assert builds, I saw a failure when a cond code / type combination that is never set was loading a non-zero value and hitting the != Promote assert. It turns out when loading the 64-bit value to do the shift, the assembly loads the 2 32-bit halves from non-consecutive addresses. The address of the second half of the loaded uint64_t doesn't include the offset of the array in the struct. Instead of being offset + 4, it's just + 4. I'm not entirely sure why this wasn't observed before. setCondCodeAction isn't heavily used by the in-tree targets, and not with the higher valued vector SimpleValueTypes. Only PPC is using one of the > 32 valued types, and that is probably never used by anyone on a 32-bit MSVC compiled host. I ran into this when upgrading LLVM versions, so I guess the value loaded from the nonsense address happened to work out before. No test since I'm not really sure if / how it can be reproduced with the current in tree targets, and it's not supposed to change anything. llvm-svn: 193650	2013-10-29 20:59:29 +00:00
Rafael Espindola	88034af278	Remove declared but not implemented function. llvm-svn: 193637	2013-10-29 18:31:14 +00:00
Rafael Espindola	e133ed88b5	Move getSymbol to TargetLoweringObjectFile. This allows constructing a Mangler with just a TargetMachine. llvm-svn: 193630	2013-10-29 17:28:26 +00:00
Rafael Espindola	79858aa3df	Add a helper getSymbol to AsmPrinter. llvm-svn: 193627	2013-10-29 17:07:16 +00:00
Zoran Jovanovic	507e084a18	Support for microMIPS jump instructions llvm-svn: 193623	2013-10-29 16:38:59 +00:00
Rafael Espindola	5d1b745689	Clarify that GlobalVariables definitions must have an initializer. llvm-svn: 193609	2013-10-29 13:44:11 +00:00
Anders Waldenborg	213a63fe53	llvm-c: Make LLVM{Get,Set}Alignment work on {Load,Store}Inst too Patch by Peter Zotov Differential Revision: http://llvm-reviews.chandlerc.com/D1910 llvm-svn: 193597	2013-10-29 09:02:02 +00:00
Alp Toker	6a03374526	Fix "existant" typos llvm-svn: 193579	2013-10-29 02:35:28 +00:00
Joerg Sonnenberger	fc18473400	Move the STT_FILE symbols out of the normal symbol table processing for ELF. They can overlap with the other symbols, e.g. if a source file "foo.c" contains a function "foo" with a static variable "c". llvm-svn: 193569	2013-10-29 01:06:17 +00:00
Alexey Samsonov	a56bbf0c8c	DWARF parser: Use ArrayRef to represent form sizes and simplify DWARFDIE::extractFast() interface. No functionality change. llvm-svn: 193560	2013-10-28 23:41:49 +00:00
Alexey Samsonov	48cbda5850	DebugInfo: Introduce the notion of "form classes" Summary: Use DWARF4 table of form classes to fetch attributes from DIE in a more consistent way. This shouldn't change the functionality and serves as a refactoring for upcoming change: DW_AT_high_pc has different semantics depending on its form class. Reviewers: dblaikie, echristo Reviewed By: echristo CC: echristo, llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1961 llvm-svn: 193553	2013-10-28 23:01:48 +00:00
Logan Chien	8cbb80d159	[arm] Implement eabi_attribute, cpu, and fpu directives. This commit allows the ARM integrated assembler to parse and assemble the code with .eabi_attribute, .cpu, and .fpu directives. To implement the feature, this commit moves the code from AttrEmitter to ARMTargetStreamers, and several new test cases related to cortex-m4, cortex-r5, and cortex-a15 are added. Besides, this commit also changes the Subtarget->isFPOnlySP() to Subtarget->hasD16() to match the usage of .fpu directive. This commit changes the test cases: * Several .eabi_attribute directives in 2010-09-29-mc-asm-header-test.ll are removed because the .fpu directive already covers the functionality. * In the Cortex-A15 test case, the value for Tag_Advanced_SIMD_arch has been changed from 1 to 2, which is more precise. llvm-svn: 193524	2013-10-28 17:51:12 +00:00
Richard Sandiford	39c1ce4dc1	Keep TBAA info when rewriting SelectionDAG loads and stores Most SelectionDAG code drops the TBAA info when creating a new form of a load and store (e.g. during legalization, or when converting a plain load to an extending one). This patch tries to catch all cases where the TBAA information can legitimately be carried over. The patch adds alternative forms of getLoad() and getExtLoad() that take a MachineMemOperand instead of individual fields. (The corresponding getTruncStore() already exists.) The idea is to use the MachineMemOperand forms when all fields are carried over (size, pointer info, isVolatile, isNonTemporal, alignment and TBAA info). If some adjustment is being made, e.g. to narrow the load, then we still pass the individual fields but also pass the TBAA info. llvm-svn: 193517	2013-10-28 11:17:59 +00:00
Elena Demikhovsky	199c823555	AVX-512: PMIN/PMAX intrinsics and patterns Patch by Cameron McInally <cameron.mcinally@nyu.edu> llvm-svn: 193497	2013-10-27 08:18:37 +00:00
Shuxin Yang	2e1890e18b	Revert r193251 : Use address-taken to disambiguate global variable and indirect memops. llvm-svn: 193489	2013-10-27 03:08:44 +00:00
Wan Xiaofei	be640b28c0	Quick look-up for block in loop. This patch implements quick look-up for block in loop by maintaining a hash set for blocks. It improves the efficiency of loop analysis a lot, the biggest improvement could be 5-6% (458.sjeng). Below are the compilation times for our benchmark in llc before & after the patch. Benchmark llc - trunk llc - patched 401.bzip2 0.339081 100.00% 0.329657 102.86% 403.gcc 19.853966 100.00% 19.605466 101.27% 429.mcf 0.049823 100.00% 0.048451 102.83% 433.milc 0.514898 100.00% 0.510217 100.92% 444.namd 1.109328 100.00% 1.103481 100.53% 445.gobmk 4.988028 100.00% 4.929114 101.20% 456.hmmer 0.843871 100.00% 0.825865 102.18% 458.sjeng 0.754238 100.00% 0.714095 105.62% 464.h264ref 2.9668 100.00% 2.90612 102.09% 471.omnetpp 4.556533 100.00% 4.511886 100.99% bitmnp01 0.038168 100.00% 0.0357 106.91% idctrn01 0.037745 100.00% 0.037332 101.11% libquake2 3.78689 100.00% 3.76209 100.66% libquake_ 2.251525 100.00% 2.234104 100.78% linpack 0.033159 100.00% 0.032788 101.13% matrix01 0.045319 100.00% 0.043497 104.19% nbench 0.333161 100.00% 0.329799 101.02% tblook01 0.017863 100.00% 0.017666 101.12% ttsprk01 0.054337 100.00% 0.053057 102.41% Reviewer : Andrew Trick <atrick@apple.com>, Hal Finkel <hfinkel@anl.gov> Approver : Andrew Trick <atrick@apple.com> Test : Pass make check-all & llvm test-suite llvm-svn: 193460	2013-10-26 03:08:02 +00:00
Andrew Trick	57243da70f	Fix SCEVExpander: don't try to expand quadratic recurrences outside a loop. Partial fix for PR17459: wrong code at -O3 on x86_64-linux-gnu (affecting trunk and 3.3) When SCEV expands a recurrence outside of a loop it attempts to scale by the stride of the recurrence. Chained recurrences don't work that way. We could compute binomial coefficients, but would have to guarantee that the chained AddRec's are in a perfectly reduced form. llvm-svn: 193438	2013-10-25 21:35:56 +00:00
Rafael Espindola	1d19c8f03a	Change MemoryBuffer::getFile to take a Twine. llvm-svn: 193429	2013-10-25 19:06:52 +00:00
David Blaikie	65cc969f50	DIEHash: Summary hashing of nested types llvm-svn: 193427	2013-10-25 18:38:43 +00:00
Rafael Espindola	64cc1b0043	Call destroy from ~BasicCallGraph. This fixes a memory leak found by valgrind. Calling it from the base class destructor would not destroy the BasicCallGraph bits. FIXME: BasicCallGraph is the only thing that inherits from CallGraph. Can we merge the two? llvm-svn: 193412	2013-10-25 15:01:34 +00:00
Rafael Espindola	fe3be1153f	Use c comments. llvm-svn: 193404	2013-10-25 12:59:02 +00:00
Tim Northover	1744d0ad83	ARM: allow .thumb_func to be separated from symbol definition When assembling, a .thumb_func directive is supposed to be applicable to the next symbol definition, even if there are intervening directives. We were racing ahead to try and find it, and this commit should fix the issue. Patch by Gabor Ballabas llvm-svn: 193403	2013-10-25 12:49:50 +00:00
Tim Northover	a564d329c2	LegalizeDAG: allow libcalls for max/min atomic operations ARM processors without ldrex/strex need to be able to make libcalls for all atomic operations, including the newer min/max versions. The alternative would probably be expanding these operations in terms of cmpxchg (as x86 does always), but in the configurations where this matters code-size tends to be paramount so the libcall is more desirable. llvm-svn: 193398	2013-10-25 09:30:20 +00:00
Richard Smith	a2d566fa98	Fix ODR violation. llvm-svn: 193391	2013-10-25 03:29:42 +00:00
Yuchen Wu	14ae8e6195	Support for reading program counts in llvm-cov. llvm-cov will now be able to read program counts from the GCDA file and output it in the same format as gcov. The program summary tag was identified from gcov-io.h as "\0\0\0\a3". There is currently a bug in GCOVProfiling.cpp which does not generate the run- or program-counting IR, so this change was tested manually by modifying the GCDA file and comparing the gcov and llvm-cov outputs. llvm-svn: 193389	2013-10-25 02:22:21 +00:00
David Blaikie	d8c5b4e8ef	MCStreamer: Reimplement the virtual EmitRawText as a protected member, EmitRawTextImpl, to avoid string literal ambiguities Also improve the implementation of EmitRawText(Twine) so it doesn't bother using the SmallString buffer if the Twine is a simple StringRef anyway. llvm-svn: 193378	2013-10-24 22:43:10 +00:00
Reid Kleckner	ddac15108a	lto.h: Use lto_bool_t instead of int to restore the ABI This reverts commit r193255 and instead creates an lto_bool_t typedef that points to bool, _Bool, or unsigned char depending on what is available. Only recent versions of MSVC provide a stdbool.h header. Reviewers: rafael.espindola Differential Revision: http://llvm-reviews.chandlerc.com/D2019 llvm-svn: 193377	2013-10-24 22:26:04 +00:00
Eric Christopher	dd542ef786	Formatting and whitespace. llvm-svn: 193370	2013-10-24 21:04:51 +00:00
John Thompson	6cd5bd4a3d	Reverting my r193344 checkin due to build breakage. llvm-svn: 193350	2013-10-24 14:52:56 +00:00
John Thompson	e38e57206f	Added std::string as a built-in type for mapping. llvm-svn: 193344	2013-10-24 13:36:58 +00:00
Nuno Lopes	340b0463e6	fix PR17635: false positive with packed structures LLVM optimizers may widen accesses to packed structures that overflow the structure itself, but should be in bounds up to the alignment of the object llvm-svn: 193317	2013-10-24 09:17:24 +00:00
Zonr Chang	b2453337d2	Include missing Compiler.h for using LLVM_ENUM_INT_TYPE. llvm-svn: 193315	2013-10-24 08:17:39 +00:00
Elena Demikhovsky	dd0794e51b	AVX-512: added VCVTPH2PS, VCVTPS2PH with intrinsics llvm-svn: 193312	2013-10-24 07:16:35 +00:00
Yuchen Wu	887c20ffc2	Fixed llvm-cov to count edges instead of blocks. This was a fundamental flaw in llvm-cov where it treated the values in the GCDA files as block counts instead of edge counts. This created incorrect line counts when branching was present. Instead, the edge counts should be summed to obtain the correct block count. The fix was tested using custom test files as well as single source files from the test-suite directory. The behaviour can be verified by reading the GCOV documentation that describes the GCDA spec ("ARC_COUNTS gives the counter values for those arcs that are instrumented") and the header description provided by GCOVProfiling.cpp ("instruments the code that runs to records (sic) the edges between blocks that run and emit a complementary "gcda" file on exit"). llvm-svn: 193299	2013-10-24 01:51:04 +00:00
Andrew Kaylor	c89fc826b2	Optimizing MCJIT module state tracking Patch co-developed with Yaron Keren. llvm-svn: 193291	2013-10-24 00:19:14 +00:00
Yuchen Wu	a5ffd3f994	Fixed doxygen comment to match Module.cpp llvm-svn: 193273	2013-10-23 21:25:44 +00:00
Yuchen Wu	48342ee908	Use a map instead of vector to store line counts. There are a few motivations for this: - Using a map allows for checking if line is in map. This differentiates unexecutable lines (such as comments) from unexecuted logical lines of code. "#####" is now outputted in this case, in line with gcov. - Source files are no longer read in twice: once when storing the line counts, and once when outputting the data. - Greatly simplifies the function FileInfo::addLineCount(). llvm-svn: 193264	2013-10-23 19:45:03 +00:00
NAKAMURA Takumi	fb9c241597	llvm-c/Target.h: Tweak "inline" for msvc to use __inline instead. FIXME: I don't think it'd be smart. llvm-svn: 193256	2013-10-23 17:56:52 +00:00
NAKAMURA Takumi	b13d51c6eb	llvm-c/lto.h: Avoid use of bool. llvm-svn: 193255	2013-10-23 17:56:46 +00:00
NAKAMURA Takumi	a3a8135f45	include/llvm-c: Whitespace. llvm-svn: 193253	2013-10-23 17:56:29 +00:00
Shuxin Yang	e4fb375995	Use address-taken to disambiguate global variable and indirect memops. Major steps include: 1). introduces a not-addr-taken bit-field in GlobalVariable 2). GlobalOpt pass sets "not-address-taken" if it proves a global variable doesn't have its address taken. 3). AA uses this info for disambiguation. llvm-svn: 193251	2013-10-23 17:28:19 +00:00
Benjamin Kramer	325ec89508	Mark zero-argument functions explicitly in C headers. Pacifies GCC's -Wstrict-prototypes. llvm-svn: 193249	2013-10-23 16:57:34 +00:00
Zoran Jovanovic	e7ae8af896	Support for microMIPS relocations 1. llvm-svn: 193247	2013-10-23 16:14:44 +00:00
Tom Stellard	8d7d4deafe	SelectionDAG: Pass along the original argument/element type in ISD::InputArg For some targets, it is useful to be able to look at the original type of an argument without having to dig through the original IR. This also fixes a bug in SelectionDAGBuilder where InputArg.PartOffset was not taking into account the offset of structure elements. Patch by: Justin Holewinski Tom Stellard: - Changed the type of ArgVT to EVT, so it can store non-simple types like v3i32. llvm-svn: 193214	2013-10-23 00:44:24 +00:00
Bob Wilson	68bf30a8b4	Fix llvm-cov counts to be 64-bit integers to avoid overflows. Line counts in llvm-cov are read in as 64-bit integers but were being truncated to 32-bit in collectLineCounts(), which caused overflow for large counts. This patch fixes all counts to be uint64_t. Patch by Yuchen Wu! llvm-svn: 193172	2013-10-22 17:43:47 +00:00
Benjamin Kramer	57f30bce64	Speling fixes. llvm-svn: 193165	2013-10-22 15:18:03 +00:00
Wan Xiaofei	2f8dc08b8c	Using FoldingSet in SelectionDAG::getVTList. VTList has a long life cycle through the module and getVTList is frequently called. In the current getVTList, sequential search over a std::vector is used, which is inefficient in big modules. This patch uses FoldingSet to implement a hashing mechanism when searching. Reviewer: Nadav Rotem Test : Pass unit tests & LNT test suite llvm-svn: 193150	2013-10-22 08:02:02 +00:00
Anders Waldenborg	47b3bd3fbb	llvm-c: Add LLVMPrintTypeToString Differential Revision: http://llvm-reviews.chandlerc.com/D1963 llvm-svn: 193149	2013-10-22 06:58:34 +00:00
Bob Wilson	3461bedbfd	Change llvm-cov output formatting to be more similar to gcov. - Replaced tabs with proper padding - print() takes two arguments, which are the GCNO and GCDA filenames - Files are listed at the top of output, appended by line 0 - Stripped strings of trailing \0s - Removed last two lines of whitespace in output Patch by Yuchen Wu! llvm-svn: 193148	2013-10-22 05:09:41 +00:00
Adrian Prantl	efe4520b72	fix two typos. llvm-svn: 193133	2013-10-21 23:55:19 +00:00
Chad Rosier	e012cb3783	[AArch64] Add the constraint to NEON scalar mla/mls instructions. llvm-svn: 193117	2013-10-21 20:11:47 +00:00
Matt Arsenault	bc4242114e	Remove unused TargetLowering field. llvm-svn: 193113	2013-10-21 20:04:01 +00:00
Matt Arsenault	51f9f77494	Fix CodeGen for vectors of pointers with address spaces. llvm-svn: 193112	2013-10-21 20:03:58 +00:00
Matt Arsenault	4ed49b5301	Remove unused SCEV functions llvm-svn: 193097	2013-10-21 18:08:09 +00:00
Andrew Kaylor	4fba04942d	Improving MCJIT/RuntimeDyld thread safety llvm-svn: 193094	2013-10-21 17:42:06 +00:00
David Blaikie	f244319cac	DebugInfo: Put each kind of constant (form, attribute, tag, etc) into its own enum for ease of use. This allows various variables to be more self-documenting and easier to debug by being of specific types without overlapping enum values. Precommit review by Eric Christopher. llvm-svn: 193091	2013-10-21 17:28:37 +00:00
Rafael Espindola	3d7fc25c7c	Optimize more linkonce_odr values during LTO. When a linkonce_odr value that is on the dso list is not unnamed_addr we can still look to see if anything is actually using its address. If not, it is safe to hide it. This patch implements that by moving GlobalStatus to Transforms/Utils and using it in Internalize. llvm-svn: 193090	2013-10-21 17:14:55 +00:00
Matheus Almeida	70fbf77546	[mips][msa] Fix definition of SLD instruction. The second parameter of the SLD intrinsic is the number of columns (GPR) to slide left the source array. llvm-svn: 193076	2013-10-21 11:47:56 +00:00
Michael J. Spencer	c064a9abff	[Support][YAML] Add support for accessing tags and tag handle substitution. llvm-svn: 193004	2013-10-18 22:38:04 +00:00
Hans Wennborg	ce69d77cec	MC asm parser: allow ?'s in symbol names, and handle @'s in names in MS asm This is another (final?) stab at making us able to parse our own asm output on Windows. Symbols on Windows often contain @'s and ?'s in their names. Our asm parser didn't like this. ?'s were not allowed, and @'s were interpreted as trying to reference PLT/GOT/etc. We can't just add quotes around the bad names, since e.g. for MinGW, we use gas to assemble, and it doesn't like quotes in some places (notably in .def directives). This commit makes us allow ?'s in symbol names, and @'s in symbol names for MS assembly. Differential Revision: http://llvm-reviews.chandlerc.com/D1978 llvm-svn: 193000	2013-10-18 20:46:28 +00:00
David Majnemer	451b7dd1ef	CodeGen: Emit a libcall if the target doesn't support 16-byte wide atomics There are targets that support i128 sized scalars but cannot emit instructions that modify them directly. The proper thing to do is to emit a libcall. This fixes PR17481. llvm-svn: 192957	2013-10-18 08:03:43 +00:00
Alexey Samsonov	742e6b8efd	[DebugInfo] Remove unneeded struct member and hide struct definition. No functionality change. llvm-svn: 192954	2013-10-18 07:13:32 +00:00
Alexey Samsonov	5b5a7865e7	[DebugInfo] Remove dead code. llvm-svn: 192952	2013-10-18 07:03:16 +00:00
Anders Waldenborg	959f04077c	llvm-c: Add LLVMIntPtrType{,ForAS}InContext All of the Core API functions have versions which accept explicit context, in addition to ones which work on global context. This commit adds functions which accept explicit context to the Target API for consistency. Patch by Peter Zotov Differential Revision: http://llvm-reviews.chandlerc.com/D1912 llvm-svn: 192913	2013-10-17 18:51:01 +00:00
Chad Rosier	37d29173aa	[AArch64] Add support for NEON scalar three register different instruction class. The instruction class includes the signed saturating doubling multiply-add long, signed saturating doubling multiply-subtract long, and the signed saturating doubling multiply long instructions. llvm-svn: 192908	2013-10-17 18:12:29 +00:00

19189 Commits