llvm-project

Commit Graph

Author	SHA1	Message	Date
Chad Rosier	8b3014ea04	[ms-inline asm] Add the inline assembly dialect, AsmDialect, to the InlineAsm class. llvm-svn: 163175	2012-09-04 22:46:24 +00:00
Chad Rosier	38d24e6751	[ms-inline asm] Remove the Inline Asm Non-Standard Dialect attribute. This implementation does not co-exist well with how the sideeffect and alignstack attributes are handled. This reverts r161641. llvm-svn: 163174	2012-09-04 22:29:45 +00:00
Bill Wendling	6bbe48967a	Move the GCOVFormat enums into their own namespace per the LLVM coding standard. llvm-svn: 163008	2012-08-31 17:31:28 +00:00
NAKAMURA Takumi	fa81438042	Apply "/Og-" also to MSC15(aka VS9) on VMCore/Function.cpp. llvm-svn: 162917	2012-08-30 16:22:26 +00:00
Eli Friedman	79a6b30d8a	Make atomic load and store of pointers work. Tighten verification of atomic operations so other unexpected operations don't slip through. Based on patch by Logan Chien. PR11786/PR13186. llvm-svn: 162146	2012-08-17 23:24:29 +00:00
Bill Wendling	34bc34ecae	Change the `linker_private_weak_def_auto' linkage to `linkonce_odr_auto_hide' to make it more consistent with its intended semantics. The `linker_private_weak_def_auto' linkage type was meant to automatically hide globals which never had their addresses taken. It has nothing to do with the `linker_private' linkage type, which outputs the symbols with a `l' (ell) prefix among other things. The intended semantic is more like the `linkonce_odr' linkage type. Change the name of the linkage type to `linkonce_odr_auto_hide'. And therefore changing the semantics so that it produces the correct output for the linker. Note: The old linkage name `linker_private_weak_def_auto' will still parse but is not a synonym for `linkonce_odr_auto_hide'. This should be removed in 4.0. <rdar://problem/11754934> llvm-svn: 162114	2012-08-17 18:33:14 +00:00
Rafael Espindola	9a16735e22	Assert that dominates is not given a multiple edge. Finding out if we have multiple edges between two blocks is linear. If the caller is iterating all edges leaving a BB that would be a square time algorithm. It is more efficient to have the callers handle that case. Currently the only callers are: * GVN: already avoids the multiple edge case. * Verifier: could only hit this assert when looking at an invalid invoke. Since it already rejects the invoke, just avoid computing the dominance for it. llvm-svn: 162113	2012-08-17 18:21:28 +00:00
Rafael Espindola	cc80cdebb9	Teach GVN to reason about edges dominating uses. This allows it to handle cases where some fact like a=b dominates a use in a phi, but doesn't dominate the basic block itself. This feature could also be implemented by splitting critical edges, but at least with the current algorithm reasoning about the dominance directly is faster. The time for running "opt -O2" in the testcase in pr10584 is 1.003 times slower and on gcc as a single file it is 1.0007 times faster. llvm-svn: 162023	2012-08-16 15:09:43 +00:00
Nick Lewycky	58564d5aa6	Fix a typo that led to a failure to correctly verify bitcast instructions. Patch by Stephen Hines! llvm-svn: 161921	2012-08-15 02:37:07 +00:00
Eric Christopher	97f6ea9f34	Typo. llvm-svn: 161826	2012-08-14 01:09:10 +00:00
Eli Friedman	4c923b3b3f	The normal edge of an invoke is not allowed to branch to a block with a landingpad. Enforce it in the verifier, and fix the regression tests to match. llvm-svn: 161697	2012-08-10 20:55:20 +00:00
Rafael Espindola	1187077f81	Move BasicBlockEdge to the cpp file. No functionality change. llvm-svn: 161663	2012-08-10 14:05:55 +00:00
Chad Rosier	09f74b5517	[ms-inline asm] Add a new Inline Asm Non-Standard Dialect attribute. This new attribute is intended to be used by the backend to determine how the inline asm string should be parsed/printed. This patch adds the ia_nsdialect attribute and also adds a test case to ensure the IR is correctly parsed, but there is no functional change at this time. The standard dialect is assumed to be AT&T. Therefore, this attribute should only be added to MS-style inline assembly statements, which use the Intel dialect. If we ever support more dialects we'll need to add additional state to the attribute. llvm-svn: 161641	2012-08-10 00:00:22 +00:00
Rafael Espindola	59564079e9	The dominance computation already has logic for computing if an edge dominates a use or a BB, but it is inline in the handling of the invoke instruction. This patch refactors it so that it can be used in other cases. For example, in define i32 @f(i32 %x) { bb0: %cmp = icmp eq i32 %x, 0 br i1 %cmp, label %bb2, label %bb1 bb1: br label %bb2 bb2: %cond = phi i32 [ %x, %bb0 ], [ 0, %bb1 ] %foo = add i32 %cond, %x ret i32 %foo } GVN should be able to replace %x with 0 in any use that is dominated by the true edge out of bb0. In the above example the only such use is the one in the phi. llvm-svn: 161429	2012-08-07 17:30:46 +00:00
Benjamin Kramer	3849fcbe0e	Postpone the deletion of the old name in StructType::setName to allow using a slice of the old name. Fixes PR13522. Add a rudimentary unit test to exercise the behavior. llvm-svn: 161296	2012-08-04 09:47:02 +00:00
Bill Wendling	8555a37c04	Move the "findUsedStructTypes" functionality outside of the Module class. The "findUsedStructTypes" method is very expensive to run. It needs to be optimized so that LTO can run faster. Splitting this method out of the Module class will help this occur. For instance, it can keep a list of seen objects so that it doesn't process them over and over again. llvm-svn: 161228	2012-08-03 00:30:35 +00:00
Micah Villmow	7b473d9f72	Add support for v16i32/v16i64 into the code generator. This is required for backends that use i32/i64 vectors for the getSetCCResultType function. llvm-svn: 160814	2012-07-26 21:22:00 +00:00
Chandler Carruth	1f41bf0c3f	Fix a dangling StringRef bug in the auto upgrader. In one case, we reset CI's name, and then used the StringRef pointing at its old name. I'm fixing it by storing the name in a std::string, and hoisting the renaming logic to happen always. This is nicer anyways as it will allow the upgraded IR to have the same names as the input IR in more cases. Another bug found by AddressSanitizer. Woot. llvm-svn: 160572	2012-07-20 21:09:18 +00:00
Benjamin Kramer	347d559323	Pull the simple parts of DenseMapInfo<DebugLoc> inline and prune includes. llvm-svn: 160507	2012-07-19 15:00:34 +00:00
Bill Wendling	ea6397f67b	Remove tabs. llvm-svn: 160477	2012-07-19 00:11:40 +00:00
Victor Oliveira	aa9ccee921	Adding some debug information to PassManager llvm-svn: 160446	2012-07-18 19:59:29 +00:00
Joel Jones	b84f7bea09	More replacing of target-dependent intrinsics with target-independent intrinsics. The second instruction(s) to be handled are the vector versions of count set bits (ctpop). The changes here are to clang so that it generates a target independent vector ctpop when it sees an ARM dependent vector bits set count. The changes in llvm are to match the target independent vector ctpop and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector pop counts with target-independent ctpops. There are also changes to an existing test case in llvm for ARM vector count instructions and to a test for the bitcode upgrade. <rdar://problem/11892519> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160410	2012-07-18 00:02:16 +00:00
Aaron Ballman	ed9b0a9114	MSVC's implementation of isalnum will assert on characters > 255, so we need to use an unsigned char to ensure the integer promotion happens properly. This fixes an assert in debug builds with CodeGen\X86\utf8.ll llvm-svn: 160286	2012-07-16 16:18:18 +00:00
Joel Jones	43cb87839c	This is one of the first steps at moving to replace target-dependent intrinsics with target-independent intrinsics. The first instruction(s) to be handled are the vector versions of count leading zeros (ctlz). The changes here are to clang so that it generates a target independent vector ctlz when it sees an ARM dependent vector ctlz. The changes in llvm are to match the target independent vector ctlz and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector ctlzs with target-independent ctlzs. There are also changes to an existing test case in llvm for ARM vector count instructions and a new test for the bitcode upgrade. <rdar://problem/11831778> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160200	2012-07-13 23:25:25 +00:00
Galina Kistanova	fc25990582	Fixed few warnings; trimmed empty lines. llvm-svn: 160159	2012-07-13 01:25:27 +00:00
Bill Wendling	786de35fa0	Use the DebugInfo wrappers instead of mucking about with the MDNode directly. llvm-svn: 159881	2012-07-07 00:52:35 +00:00
Bill Wendling	56543735c9	Print the name last. llvm-svn: 159879	2012-07-06 23:43:12 +00:00
Bill Wendling	3270582ceb	Check if it's a scope last, because several things are scopes. llvm-svn: 159873	2012-07-06 23:06:16 +00:00
Bill Wendling	aa02e36fa8	Add a print method to the ObjC property object. llvm-svn: 159848	2012-07-06 19:12:31 +00:00
Bill Wendling	5ef3159820	Remove trailing comma in array initialization list. llvm-svn: 159843	2012-07-06 17:49:19 +00:00
Bill Wendling	7154c43eff	Remove unnecessary 'llvm::'. llvm-svn: 159842	2012-07-06 17:47:36 +00:00
Bill Wendling	16d944ce11	Remove unnecessary 'llvm::'. llvm-svn: 159841	2012-07-06 17:46:28 +00:00
Eric Christopher	174266960e	Untabify and move a function near similar functions dealing with struct types. llvm-svn: 159801	2012-07-06 02:35:57 +00:00
Nuno Lopes	0d44a50426	PHINode::hasConstantValue(): return undef if the PHI is fully recursive. Thanks Duncan for the idea llvm-svn: 159687	2012-07-03 21:15:40 +00:00
Bill Wendling	a0bc1083be	Use the DebugInfo's 'print()' method to emit the comments. These give quite a bit more information about the DebugInfo and makes it more readable. llvm-svn: 159680	2012-07-03 20:01:02 +00:00
Nuno Lopes	90c76dfb17	improve PHINode::hasConstantValue() to detect recursive cases like %phi = phi(%phi,42) as constant llvm-svn: 159666	2012-07-03 17:10:28 +00:00
Chandler Carruth	aafe0918bc	Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h This was always part of the VMCore library out of necessity -- it deals entirely in the IR. The .cpp file in fact was already part of the VMCore library. This is just a mechanical move. I've tried to go through and re-apply the coding standard's preferred header sort, but at 40-ish files, I may have gotten some wrong. Please let me know if so. I'll be committing the corresponding updates to Clang and Polly, and Duncan has DragonEgg. Thanks to Bill and Eric for giving the green light for this bit of cleanup. llvm-svn: 159421	2012-06-29 12:38:19 +00:00
Bill Wendling	098d906dbb	Update the CMake files. llvm-svn: 159417	2012-06-29 09:01:47 +00:00
Bill Wendling	f799efdedc	The DIBuilder class is just a wrapper around debug info creation (a.k.a. MDNodes). The module doesn't belong in Analysis. Move it to the VMCore instead. llvm-svn: 159414	2012-06-29 08:32:07 +00:00
Nuno Lopes	2f49284f12	make the verifier accept @llvm.donothing as the only intrinsic that can be invoked While at it, merge 2 tests and FileCheckize them llvm-svn: 159388	2012-06-28 22:57:00 +00:00
Benjamin Kramer	92658b8149	Devirtualize DIScope and subclasses. Nothing in here makes use of the virtuality. llvm-svn: 159349	2012-06-28 14:25:45 +00:00
Hal Finkel	74e5225c92	Refactor operation equivalence checking in BBVectorize by extending Instruction::isSameOperationAs. Maintaining this kind of checking in different places is dangerous, extending Instruction::isSameOperationAs consolidates this logic into one place. Here I've added an optional flags parameter and two flags that are important for vectorization: CompareIgnoringAlignment and CompareUsingScalarTypes. llvm-svn: 159329	2012-06-28 05:42:26 +00:00
Bill Wendling	a2ccbf0f85	Only print out the tag if it's there. llvm-svn: 159328	2012-06-28 02:17:58 +00:00
Bill Wendling	74ac023cf6	Don't output an empty string. llvm-svn: 159327	2012-06-28 02:12:20 +00:00
Bill Wendling	5cb50c5bd5	Use the interface through DIDescriptor to get the tag/version for a debug info MDNode. llvm-svn: 159317	2012-06-28 00:41:44 +00:00
Bill Wendling	3b2ab9eaaa	Fix cmake failure from moving files around. llvm-svn: 159314	2012-06-28 00:18:12 +00:00
Bill Wendling	e38859dc8e	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h. The reasoning is because the DebugInfo module is simply an interface to the debug info MDNodes and has nothing to do with analysis. llvm-svn: 159312	2012-06-28 00:05:13 +00:00
Nuno Lopes	07594cba7c	improve optimization of invoke instructions: - simplifycfg: invoke undef/null -> unreachable - instcombine: invoke new -> invoke expect(0, 0) (an arbitrary NOOP intrinsic; only done if the allocated memory is unused, of course) - verifier: allow invoke of intrinsics (to make the previous step work) llvm-svn: 159146	2012-06-25 17:11:47 +00:00
NAKAMURA Takumi	704de074b8	llvm/lib: [CMake] Add explicit dependency to intrinsics_gen. llvm-svn: 159112	2012-06-24 13:32:01 +00:00
NAKAMURA Takumi	cca44e219f	VMCore/CMakeLists.txt: [CMake][MSVC] Add "/Og-" to Function.cpp on msvc10. Otherwise, it took over 20 minutes to compile. FIXME: Suppressing optimizations to core libraries would not be good thing. llvm-svn: 159097	2012-06-24 03:48:29 +00:00
Hans Wennborg	ac9fb36c31	Clean-up after r159077. Remove temporary GlobalVariable constructors now that Clang has been updated (r159078). llvm-svn: 159079	2012-06-23 12:14:23 +00:00
Hans Wennborg	cbe34b4cc9	Extend the IL for selecting TLS models (PR9788) This allows the user/front-end to specify a model that is better than what LLVM would choose by default. For example, a variable might be declared as @x = thread_local(initialexec) global i32 42 if it will not be used in a shared library that is dlopen'ed. If the specified model isn't supported by the target, or if LLVM can make a better choice, a different model may be used. llvm-svn: 159077	2012-06-23 11:37:03 +00:00
Stepan Dyatkovskiy	a6c8cc307b	Fixed r158979. Original message: Performance optimizations: - SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges. - Optimized IntItem, added APInt value caching. - Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only. llvm-svn: 158997	2012-06-22 14:53:30 +00:00
Duncan Sands	83884a1042	Revert commit 158979 (dyatkovskiy) since it is causing several buildbots to fail. Original commit message: Performance optimizations: - SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges. - Optimized IntItem, added APInt value caching. - Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only. On my machine these optimizations gave about 4-6% of compile-time improvement. llvm-svn: 158986	2012-06-22 10:35:06 +00:00
Stepan Dyatkovskiy	fcfa633bf8	Performance optimizations: - SwitchInst: case values stored separately from Operands List. It allows to make faster access to individual case value numbers or ranges. - Optimized IntItem, added APInt value caching. - Optimized IntegersSubsetGeneric: added optimizations for cases when subset is single number or when subset consists from single numbers only. On my machine these optimizations gave about 4-6% of compile-time improvement. llvm-svn: 158979	2012-06-22 07:35:13 +00:00
Nuno Lopes	f9abcb7ba9	revert r158660, since Chris has some issues with this patch (namely using code to represent information only used by the compiler) Original commit msg: add the 'alloc' metadata node to represent the size of offset of buffers pointed to by pointers. This metadata can be attached to any instruction returning a pointer llvm-svn: 158688	2012-06-18 23:34:26 +00:00
Nuno Lopes	b7c941bad9	add the 'alloc' metadata node to represent the size of offset of buffers pointed to by pointers. This metadata can be attached to any instruction returning a pointer llvm-svn: 158660	2012-06-18 16:04:04 +00:00
Hal Finkel	16ddd4b66b	Move the Metadata merging methods from GVN and make them public in MDNode. There are other passes, BBVectorize specifically, that also need some of this functionality. llvm-svn: 158605	2012-06-16 20:33:37 +00:00
Duncan Sands	318a89ddac	When linearizing a multiplication, return at once if we see a factor of zero, since then the entire expression must equal zero (similarly for other operations with an absorbing element). With this in place a bunch of reassociate code for handling constants is dead since it is all taken care of when linearizing. No intended functionality change. llvm-svn: 158398	2012-06-13 09:42:13 +00:00
Craig Topper	71dc02d659	Fix intrinsics for XOP frczss/sd instructions. These instructions only take one source register and zero the upper bits of the destination rather than preserving them. llvm-svn: 158396	2012-06-13 07:18:53 +00:00
Duncan Sands	d7aeefebd6	Now that Reassociate's LinearizeExprTree can look through arbitrary expression topologies, it is quite possible for a leaf node to have huge multiplicity, for example: x0 = x*x, x1 = x0*x0, x2 = x1*x1, ... rapidly gives a value which is x raised to a vast power (the multiplicity, or weight, of x). This patch fixes the computation of weights by correctly computing them no matter how big they are, rather than just overflowing and getting a wrong value. It turns out that the weight for a value never needs more bits to represent than the value itself, so it is enough to represent weights as APInts of the same bitwidth and do the right overflow-avoiding dance steps when computing weights. As a side-effect it reduces the number of multiplies needed in some cases of large powers. While there, in view of external uses (eg by the vectorizer) I made LinearizeExprTree static, pushing the rank computation out into users. This is progress towards fixing PR13021. llvm-svn: 158358	2012-06-12 14:33:56 +00:00
Nadav Rotem	17ee58a792	Add AutoUpgrade support for the SSE4 ptest intrinsics. Patch by Michael Kuperstein. llvm-svn: 158295	2012-06-10 18:42:51 +00:00
Craig Topper	3352ba55b9	Replace XOP vpcom intrinsics with fewer intrinsics that take the immediate as an argument. llvm-svn: 158278	2012-06-09 16:46:13 +00:00
Craig Topper	2c5ccd8af7	Simplify the fma4 renaming code. llvm-svn: 157902	2012-06-03 16:48:52 +00:00
Craig Topper	720c7bde5c	Autoupgrade support the rename of x86.fma4 intrinsics to x86.fma from r157898. llvm-svn: 157899	2012-06-03 08:07:25 +00:00
Benjamin Kramer	bde9176663	Fix typos found by http://github.com/lyda/misspell-check llvm-svn: 157885	2012-06-02 10:20:22 +00:00
Stepan Dyatkovskiy	0e46d8a08c	PR1255: case ranges. IntRange converted from struct to class. So main change everywhere is replacement of ".Low/High" with ".getLow/getHigh()" llvm-svn: 157884	2012-06-02 09:42:43 +00:00
Rafael Espindola	103c2cfbbd	Use dominates(Instruction, Use) in the verifier. This removes a bit of context from the verifier errors, but reduces code duplication in a fairly critical part of LLVM and makes dominates easier to test. llvm-svn: 157845	2012-06-01 21:56:26 +00:00
Stepan Dyatkovskiy	bd7303b7f7	PR1255: case ranges. IntItem cleanup. IntItemBase, IntItemConstantIntImp and IntItem merged into IntItem. All arithmetic operators were propagated from APInt. Also added comparison operators <,>,<=,>=. Currently you will find a set of macros that propagate operators from APInt to IntItem in the beginning of IntegerSubset. Note that THESE MACROS WILL BE REMOVED after all passes are case-ranges compatible. Also note that these macros are a much smaller pain than something like this: if (V->getValue().ugt(AnotherV->getValue())) { ... } These changes made IntItem a full featured integer object. It allows making the IntegerSubset class generic (move out all ConstantInt references inside and add unit-tests) in next commits. llvm-svn: 157810	2012-06-01 10:06:14 +00:00
Rafael Espindola	e3c5f3e5b1	Fix typos noticed by Benjamin Kramer. Also make the checks stronger and test that we reject ranges that overlap a previous wrapped range. llvm-svn: 157749	2012-05-31 16:04:26 +00:00
Rafael Espindola	97d7787788	Require intervals in the range metadata to be in a canonical form: They must be non contiguous, non overlapping and sorted by the lower end. While this is technically a backward incompatibility, every frontend currently produces range metadata with a single interval and we don't have any pass that merges intervals yet, so no existing bitcode files should be rejected by this. llvm-svn: 157741	2012-05-31 13:45:46 +00:00
Stepan Dyatkovskiy	58107dd547	ConstantRangesSet renamed to IntegersSubset. CRSBuilder renamed to IntegersSubsetMapping. llvm-svn: 157612	2012-05-29 12:26:47 +00:00
Stepan Dyatkovskiy	e3e19cbb13	PR1255: Case Ranges Implemented IntItem - the wrapper around APInt. Why not use APInt directly right now? 1. It will be very difficult to implement case ranges as a series of small patches. We got several large and heavy patches. Each patch will be about 90-120 kb. If you replace ConstantInt with APInt in SwitchInst you will need to change at the same time all Readers, Writers and absolutely all passes that use SwitchInst. 2. We can implement an APInt pool inside and save memory space. E.g. we use several switches that work with 256 bit items (switch on signatures, or strings). We can avoid value duplicates in this case. 3. IntItem can be easily replaced with APInt. 4. Currently we can interpret IntItem both as ConstantInt and as APInt. It allows providing SwitchInst methods that work with ConstantInt for non-updated passes. Why do I need it right now? Currently I need to update the SimplifyCFG pass (EqualityComparisons). I need to work with APInts directly a lot, so pieces of code like ConstantInt *V = ...; if (V->getValue().ugt(AnotherV->getValue())) { ... } will look awful. Much better this way: IntItem V = ConstantIntVal->getValue(); if (AnotherV < V) { } Of course any reviews are welcome. P.S.: I'm also going to rename ConstantRangesSet to IntegersSubset, and CRSBuilder to IntegersSubsetMapping (allows mapping individual subsets of integers to the BasicBlocks). Since in future these classes will be founded on APInt, it will be possible to use them in more generic ways. llvm-svn: 157576	2012-05-28 12:39:09 +00:00
Chris Lattner	3cb6f83ebb	switch AttrListPtr::get to take an ArrayRef, simplifying a lot of clients. llvm-svn: 157556	2012-05-28 01:47:44 +00:00
Chris Lattner	5be972d8a2	simplify code. llvm-svn: 157555	2012-05-28 01:37:08 +00:00
Chris Lattner	144b619684	Reimplement the intrinsic verifier to use the same table as Intrinsic::getDefinition, making it stronger and more sane. Delete the code from tblgen that produced the old code. Besides being a path forward in intrinsic sanity, this also eliminates a bunch of machine generated code that was compiled into Function.o llvm-svn: 157545	2012-05-27 19:37:05 +00:00
Chris Lattner	f39c278384	move some code around so that Verifier.cpp can get access to the intrinsic info table. llvm-svn: 157540	2012-05-27 18:28:35 +00:00
Chris Lattner	c464416107	enhance the intrinsic info table to encode what kind of Any argument it is (at the cost of 45 bytes of extra table space) so that the verifier can start using it. llvm-svn: 157536	2012-05-27 16:39:08 +00:00
Tobias Grosser	6b31d170a4	Add half support to LLVM (for OpenCL) Submitted by: Anton Lokhmotov <Anton.Lokhmotov@arm.com> Approved by: o Anton Korobeynikov o Micah Villmow o David Neto llvm-svn: 157393	2012-05-24 15:59:06 +00:00
Patrik Hägglund	ca210d8432	Fixed typo in r156905. llvm-svn: 157320	2012-05-23 12:34:56 +00:00
Chris Lattner	4f18aa8f04	small refinement to r157218 to save a tiny amount of table size in the common case. llvm-svn: 157312	2012-05-23 05:19:18 +00:00
Nuno Lopes	ad40c0a425	revert my previous patches that introduced an additional parameter to the objectsize intrinsic. After a lot of discussion, we realized it's not the best option for run-time bounds checking llvm-svn: 157255	2012-05-22 15:25:31 +00:00
Pete Cooper	243efd7ac3	Added address space qualifier to intrinsic PointerType arguments. llvm-svn: 157218	2012-05-21 23:21:28 +00:00
Stepan Dyatkovskiy	e89dafd876	PR1255 (case ranges: work with ConstantRangesSet instead of ConstantInt) related changes for Execution and Verifier. llvm-svn: 157183	2012-05-21 10:44:40 +00:00
Benjamin Kramer	1ed0fa452c	Move CallbackVHs dtor inline, it can be devirtualized in many cases. Move the other virtual methods out of line as they are only called from within Value.cpp anyway. llvm-svn: 157123	2012-05-19 19:15:25 +00:00
Chris Lattner	a3b0f52a72	enhance the intrinsic info stuff to emit encodings that don't fit in 32-bits into a separate side table, using the handy SequenceToOffsetTable class. This encodes all these weird things into another 256 bytes, allowing all intrinsics to be encoded this way. llvm-svn: 156995	2012-05-17 15:55:41 +00:00
Manuel Klimek	0fc33af2a7	Fix compile error. llvm-svn: 156986	2012-05-17 09:32:05 +00:00
Chris Lattner	a57c797c58	Genericize the intrinsics descriptor decoding a bit to make room for future expansion, no functionality change yet though. llvm-svn: 156979	2012-05-17 05:13:57 +00:00
Chris Lattner	3e34a7b93d	finish encoding all of the interesting details of intrinsics. Now intrinsics are only rejected because they can't be encoded into a 32-bit unit, not because they contain an unencodable feature. llvm-svn: 156978	2012-05-17 05:03:24 +00:00
Chris Lattner	827b253c63	strengthen the intrinsic descriptor stuff to be able to handle sin, cos and other intrinsics that use passed-in arguments. llvm-svn: 156977	2012-05-17 04:30:58 +00:00
Chris Lattner	7f0e7bae25	Significantly reduce the compiled size of Functions.cpp by turning a big blob of tblgen generated code (for Intrinsic::getType) into a table. This handles common cases right now, but I plan to extend it to handle all cases and merge in type verification logic as well in follow-on patches. llvm-svn: 156905	2012-05-16 06:34:44 +00:00
Bill Wendling	ea857e1b9f	Use ArrayRef instead of an explicit vector type. llvm-svn: 156755	2012-05-14 07:53:40 +00:00
Stepan Dyatkovskiy	0beab5e1cd	Recommitted r156374 with critical fixes in BitcodeReader/Writer: Ordinary patch for PR1255. Added new case-ranges orientated methods for adding/removing cases in SwitchInst. After this patch cases will be internally represented as ConstantArray-s instead of ConstantInt, externally cases are wrapped within the ConstantRangesSet object. Old methods of SwitchInst also work well, but are marked as deprecated. So on this stage we have no side effects except that I added support for case ranges in BitcodeReader/Writer, of course test for Bitcode is also added. Old "switch" format is also supported. llvm-svn: 156704	2012-05-12 10:48:17 +00:00
Jay Foad	ca0c499609	Teach Function::hasAddressTaken that BlockAddress doesn't really take the address of a function. llvm-svn: 156703	2012-05-12 08:30:16 +00:00
Joel Jones	3d90a9ae65	Fix a problem with incomplete equality testing of PHINodes in Instruction::IsIdenticalToWhenDefined. This manifested itself when inlining two calls to the same function. The inlined function had a switch statement that returned one of a set of global variables. Without this modification, the two phi instructions that chose values from the branches of the switch instruction inlined from the callee were considered equivalent and jump-threading replaced a load for the first switch value with a phi selecting from the second switch, thereby producing incorrect code. This patch has been tested with "make check-all", "lnt runtest nt", and llvm self-hosted, and on the original program that had this problem, wireshark. <rdar://problem/11025519> llvm-svn: 156548	2012-05-10 15:59:41 +00:00
Hans Wennborg	b7ef2fe8ae	Introduce llvm-c function LLVMPrintModuleToFile. This lets you save the textual representation of the LLVM IR to a file. Before this patch it could only be printed to STDERR from llvm-c. Patch by Carlo Kok! llvm-svn: 156479	2012-05-09 16:54:17 +00:00
Nuno Lopes	01547b3ad2	change the objectsize intrinsic signature: add a 3rd parameter to denote the maximum runtime performance penalty that the user is willing to accept. This commit only adds the parameter. Code taking advantage of it will follow. llvm-svn: 156473	2012-05-09 15:52:43 +00:00
Stepan Dyatkovskiy	5eafce5c88	Rejected r156374: Ordinary PR1255 patch. Due to clang-x86_64-debian-fnt buildbot failure. llvm-svn: 156377	2012-05-08 08:33:21 +00:00
Craig Topper	7daf897678	Remove 256-bit AVX non-temporal store intrinsics. Similar was previously done for 128-bit. llvm-svn: 156375	2012-05-08 06:58:15 +00:00
Stepan Dyatkovskiy	b6a4640163	Ordinary patch for PR1255. Added new case-ranges orientated methods for adding/removing cases in SwitchInst. After this patch cases will be internally represented as ConstantArray-s instead of ConstantInt, externally cases are wrapped within the ConstantRangesSet object. Old methods of SwitchInst also work well, but are marked as deprecated. So on this stage we have no side effects except that I added support for case ranges in BitcodeReader/Writer, of course test for Bitcode is also added. Old "switch" format is also supported. llvm-svn: 156374	2012-05-08 06:36:08 +00:00
Dan Gohman	1ccecdb2fd	Reapply r155682, making constant folding more consistent, with a fix to work properly with how the code handles all-undef PHI nodes. llvm-svn: 155721	2012-04-27 17:50:22 +00:00
NAKAMURA Takumi	6008dfdb70	Revert r155682, "Use ConstantExpr::getExtractElement when constant-folding vectors" It broke stage2 build. stage1/clang sometimes crashed. llvm-svn: 155699	2012-04-27 07:59:20 +00:00
Dan Gohman	90f3798f26	Use ConstantExpr::getExtractElement when constant-folding vectors instead of getAggregateElement. This has the advantage of being more consistent and allowing higher-level constant folding to proceed even if an inner extract element cannot be folded. Make ConstantFoldInstruction call ConstantFoldConstantExpression on the instruction's operands, making it more consistent with ConstantFoldConstantExpression itself. This makes sure that ConstantExprs get TargetData-aware folding before being handed off as operands for further folding. This causes more expressions to be folded, but due to a known shortcoming in constant folding, this currently has the side effect of stripping a few more nuw and inbounds flags in the non-targetdata side of constant-fold-gep.ll. This is mostly harmless. This fixes rdar://11324230. llvm-svn: 155682	2012-04-27 00:54:36 +00:00
Bill Wendling	0156f44a68	Don't forget to reset 'first operand' flag when we're setting the MDNodeOperand value. llvm-svn: 155599	2012-04-26 00:38:42 +00:00
Nadav Rotem	450d69a5ee	ConstantFoldSelectInstruction swapped the operands of the select. Fix 12592. Patch by Matt Pharr. llvm-svn: 155480	2012-04-24 20:18:49 +00:00
Bill Wendling	e32c23a5e0	Cleanup whitespace. llvm-svn: 155328	2012-04-23 00:23:33 +00:00
Bill Wendling	3d0ec2bedb	Limit the number of times we recurse through this algorithm. All of the instructions are processed. So there's no need to look at them if they're used as operands of other instructions. llvm-svn: 155327	2012-04-23 00:22:55 +00:00
Bill Wendling	32854e2727	Add a flag to the struct type finder to collect only those types which have names. This saves collecting types we normally don't care about. llvm-svn: 155300	2012-04-21 23:59:16 +00:00
Bill Wendling	1bf41faca2	Revert r155241, which is causing some breakage. llvm-svn: 155253	2012-04-20 23:11:38 +00:00
Bill Wendling	c5fae47a63	If we discover all of the named structs in a module, then don't bother to process any more Values. llvm-svn: 155241	2012-04-20 21:56:24 +00:00
Craig Topper	d3c9e404ba	Remove AVX vpermil intrinsics. I removed their uses from clang headers and builtins a while back. llvm-svn: 154985	2012-04-18 05:24:00 +00:00
Eric Christopher	7df0240e52	Typo. llvm-svn: 154879	2012-04-16 23:54:31 +00:00
Duncan Sands	9af6298293	Remove support for the special 'fast' value for fpmath accuracy for the moment. llvm-svn: 154850	2012-04-16 19:39:33 +00:00
Duncan Sands	05f4df8d72	Make it possible to indicate relaxed floating point requirements at the IR level through the use of 'fpmath' metadata. Currently this only provides a 'fpaccuracy' value, which may be a number in ULPs or the keyword 'fast', however the intent is that this will be extended with additional information about NaN's, infinities etc later. No optimizations have been hooked up to this so far. llvm-svn: 154822	2012-04-16 16:28:59 +00:00
Duncan Sands	34bd91a49f	Rename "fpaccuracy" metadata to the more generic "fpmath". That's because I'm thinking of generalizing it to be able to specify other freedoms beyond accuracy (such as that NaN's don't have to be respected). I'd like the 3.1 release (the first one with this metadata) to have the more generic name already rather than having to auto-upgrade it in 3.2. llvm-svn: 154744	2012-04-14 12:36:06 +00:00
Dan Gohman	4f8ced58a7	Def here is an Instruction, so !isa<Instruction>(Def) is always false, as Eli noticed. llvm-svn: 154641	2012-04-13 00:50:57 +00:00
Dan Gohman	73273275a4	Add forms of dominates and isReachableFromEntry that accept a Use directly instead of a user Instruction. This allows them to test whether a def dominates a particular operand if the user instruction is a PHI. llvm-svn: 154631	2012-04-12 23:31:46 +00:00
Benjamin Kramer	2335a5cb85	Cache the hash value of the operands in the MDNode. FoldingSet is implemented as a chained hash table. When there is a hash collision during insertion, which is common as we fill the table until a load factor of 2.0 is hit, we walk the chained elements, comparing every operand with the new element's operands. This can be very expensive if the MDNode has many operands. We sacrifice a word of space in MDNode to cache the full hash value, reducing compares on collision to a minimum. MDNode grows from 28 to 32 bytes + operands on x86. On x86_64 the new bits fit nicely into existing padding, not growing the struct at all. The actual speedup depends a lot on the test case and is typically between 1% and 2% for C++ code with clang -c -O0 -g. llvm-svn: 154497	2012-04-11 14:06:54 +00:00
Benjamin Kramer	7a426b5f2e	Compute hashes directly with hash_combine instead of taking a detour through FoldingSetNodeID. llvm-svn: 154495	2012-04-11 14:06:39 +00:00
Bill Wendling	c4c568b2d9	The MDString class stored a StringRef to the string which was already in a StringMap. This was redundant and unnecessarily bloated the MDString class. Because the MDString class is a "Value" and will never have a "name", and because the Name field in the Value class is a pointer to a StringMap entry, we repurpose the Name field for an MDString. It stores the StringMap entry in the Name field, and uses the normal methods to get the string (name) back. PR12474 llvm-svn: 154429	2012-04-10 20:12:16 +00:00
Duncan Sands	af06b26c8e	Express the number of ULPs in fpaccuracy metadata as a real rather than a rational number, eg as 2.5 rather than 5, 2. OK'd by Peter Collingbourne. llvm-svn: 154387	2012-04-10 08:22:43 +00:00
Bill Wendling	5c0068f807	Remove the 'Parent' pointer from the MDNodeOperand class. An MDNode has a list of MDNodeOperands allocated directly after it as part of its allocation. Therefore, the Parent of the MDNodeOperands can be found by walking back through the operands to the beginning of that list. Mark the first operand's value pointer as being the 'first' operand so that we know where the beginning of said list is. This saves a lot of space during LTO with -O0 -g flags. llvm-svn: 154280	2012-04-08 10:20:49 +00:00
Bill Wendling	9b2503a006	Allow subclasses of the ValueHandleBase to store information as part of the value pointer by making the value pointer into a pointer-int pair with 2 bits available for flags. llvm-svn: 154279	2012-04-08 10:16:43 +00:00
Bill Wendling	e2cf674310	The speedup doesn't appear to have been from this, but was an anomaly of my testing machine. llvm-svn: 153951	2012-04-03 11:19:21 +00:00
Bill Wendling	dd91e73409	Reserve space for the eventual filling of the vector. This gives a small speedup. llvm-svn: 153949	2012-04-03 10:50:09 +00:00
Duncan Sands	26a80f3ddb	I noticed in passing that the Metadata getIfExists method was creating a new node and returning it if one didn't exist. llvm-svn: 153798	2012-03-31 08:20:11 +00:00
Rafael Espindola	a53c46aaa3	Handle unreachable code in the dominates functions. This changes users when needed for correctness, but still doesn't clean up code that now unnecessarily checks for reachability. llvm-svn: 153755	2012-03-30 16:46:21 +00:00
Douglas Gregor	c0f6380464	Add missing include of <new> llvm-svn: 153436	2012-03-26 14:04:17 +00:00
Rafael Espindola	8e5b40eb08	Remove always true variable. llvm-svn: 153392	2012-03-24 20:02:25 +00:00
Rafael Espindola	ef9f5504ea	First part of PR12251. Add documentation and verifier support for the range metadata. llvm-svn: 153359	2012-03-24 00:14:51 +00:00
Eric Christopher	bdb64495c4	Fix up cmake build. llvm-svn: 153306	2012-03-23 03:55:14 +00:00
Eric Christopher	3c0d51661f	Take out the debug info probe stuff. It's making some changes to the PassManager annoying and should be reimplemented as a decorator on top of existing passes (as should the timing data). llvm-svn: 153305	2012-03-23 03:54:05 +00:00
Chris Lattner	2cc6f9dd90	add load/store volatility control to the C API, patch by Yiannis Tsiouris! llvm-svn: 153238	2012-03-22 03:54:15 +00:00
Chandler Carruth	4d1d34fbfc	Extend the inline cost calculation to account for bonuses due to correlated pairs of pointer arguments at the callsite. This is designed to recognize the common C++ idiom of begin/end pointer pairs when the end pointer is a constant offset from the begin pointer. With the C-based idiom of a pointer and size, the inline cost saw the constant size calculation, and this provides the same level of information for begin/end pairs. In order to propagate this information we have to search for candidate operations on a pair of pointer function arguments (or derived from them) which would be simplified if the pointers had a known constant offset. Then the callsite analysis looks for such pointer pairs in the argument list, and applies the appropriate bonus. This helps LLVM detect that half of bounds-checked STL algorithms (such as hash_combine_range, and some hybrid sort implementations) disappear when inlined with a constant size input. However, it's not a complete fix due to the inaccuracy of our cost metric for constants in general. I'm looking into that next. Benchmarks showed no significant code size change, and very minor performance changes. However, specific code such as hashing is showing significantly cleaner inlining decisions. llvm-svn: 152752	2012-03-14 23:19:53 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Chandler Carruth	97f6f03c42	Refactor some methods to look through bitcasts and GEPs on pointers into a common collection of methods on Value, and share their implementation. We had two variations in two different places already, and I need the third variation for inline cost estimation. Reviewed by Duncan Sands on IRC, but further comments here welcome. llvm-svn: 152490	2012-03-10 08:39:09 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows changing the case successor and case value. Usage of iterator allows totally removing the resolveXXXX methods. All indexing conversions are done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst *SI = ... // initialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock *BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want to initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Chandler Carruth	d4ba3eb480	Switch this code to use hash_combine_range rather than incremental calls to hash_combine. One of the interfaces could already do this, and the other can just use a small buffer. This is a much more efficient way to use the hash_combine interface, although I don't have any particular benchmark where this code was hot, so I can't measure much of an impact. It at least doesn't slow anything down. llvm-svn: 152200	2012-03-07 03:22:32 +00:00
Chandler Carruth	cee7a12d40	Cache the sized-ness of struct types, once we reach the steady state of "is sized". This prevents every query to isSized() from recursing over every sub-type of a struct type. This could get very slow for extremely deep nesting of structs, as in 177.mesa. This change is a 45% speedup for 'opt -O2' of 177.mesa.linked.bc, and likely a significant speedup for other cases as well. It even impacts -O0 cases because so many parts of the code try to check whether a type is sized. Thanks for the review from Nick Lewycky and Benjamin Kramer on IRC. llvm-svn: 152197	2012-03-07 02:33:09 +00:00
Jay Foad	cc5fd3e25d	Change ConstantAggrUniqueMap to use Chandler's new hashing implementation. Patch by Meador Inge llvm-svn: 152116	2012-03-06 10:43:52 +00:00
Chandler Carruth	71bd7d1e54	Replace the hashing functions on APInt and APFloat with overloads of the new hash_value infrastructure, and replace their implementations using hash_combine. This removes a complete copy of Jenkins's lookup3 hash function (which is both significantly slower and lower quality than the one implemented in hash_combine) along with a somewhat scary xor-only hash function. Now that APInt and APFloat can be passed directly to hash_combine, simplify the rest of the LLVMContextImpl hashing to use the new infrastructure. llvm-svn: 152004	2012-03-04 12:02:57 +00:00
Chandler Carruth	1d03a3b6b1	Rewrite LLVM's generalized support library for hashing to follow the API of the proposed standard hashing interfaces (N3333), and to use a modified and tuned version of the CityHash algorithm. Some of the highlights of this change: -- Significantly higher quality hashing algorithm with very well distributed results, and extremely few collisions. Should be close to a checksum for up to 64-bit keys. Very little clustering or clumping of hash codes, to better distribute load on probed hash tables. -- Built-in support for reserved values. -- Simplified API that composes cleanly with other C++ idioms and APIs. -- Better scaling performance as keys grow. This is the fastest algorithm I've found and measured for moderately sized keys (such as show up in some of the uniquing and folding use cases) -- Support for enabling per-execution seeds to prevent table ordering or other artifacts of hashing algorithms to impact the output of LLVM. The seeding would make each run different and highlight these problems during bootstrap. This implementation was tested extensively using the SMHasher test suite, and passed with flying colors, doing better than the original CityHash algorithm even. I've included a unittest, although it is somewhat minimal at the moment. I've also added (or refactored into the proper location) type traits necessary to implement this, and converted users of GeneralHash over. My only immediate concern with this implementation is the performance of hashing small keys. I've already started working to improve this, and will continue to do so. Currently, the only algorithms faster produce lower quality results, but it is likely there is a better compromise than the current one. Many thanks to Jeffrey Yasskin who did most of the work on the N3333 paper, pair-programmed some of this code, and reviewed much of it. Many thanks also go to Geoff Pike and Jyrki Alakuijala, the original authors of CityHash on which this is heavily based, and Austin Appleby who created MurmurHash and the SMHasher test suite. Also thanks to Nadav, Tobias, Howard, Jay, Nick, Ahmed, and Duncan for all of the review comments! If there are further comments or concerns, please let me know and I'll jump on 'em. llvm-svn: 151822	2012-03-01 18:55:25 +00:00
Benjamin Kramer	acd78d5092	Emit the "is an intrinsic overloaded" table as a bitfield. llvm-svn: 151792	2012-03-01 02:16:57 +00:00
Rafael Espindola	654320a0bb	Use the DT dominates function in the verifier. llvm-svn: 151470	2012-02-26 02:23:37 +00:00
Rafael Espindola	94df267db3	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Rafael Espindola	bfa7579801	Don't call dominates on unreachable instructions. llvm-svn: 151468	2012-02-26 02:14:25 +00:00
Nick Lewycky	d489edff7f	Remove spurious emacs mode marker. llvm-svn: 151440	2012-02-25 07:20:06 +00:00
Jay Foad	529776c786	Reinstate r151049 now that GeneralHash is fixed. llvm-svn: 151248	2012-02-23 09:17:40 +00:00
Chad Rosier	5dfe6dab25	Remove extra semi-colons. llvm-svn: 151169	2012-02-22 17:25:00 +00:00
Jay Foad	af3cf11fec	Revert r151049 cos it broke the buildbots. llvm-svn: 151052	2012-02-21 11:44:46 +00:00

4366 Commits