llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	c7363f1147	AsmWriter/Bitcode: MDSubrange llvm-svn: 229003	2015-02-13 01:10:38 +00:00
Duncan P. N. Exon Smith	193a4fdafd	IR: Add MDExpression::ExprOperand Port `DIExpression::Operand` over to `MDExpression::ExprOperand`. The logic is needed directly in `MDExpression` to support printing in assembly. llvm-svn: 229002	2015-02-13 01:07:46 +00:00
Duncan P. N. Exon Smith	3b631d291e	Support: Add dwarf::getOperationEncoding() llvm-svn: 229001	2015-02-13 01:05:00 +00:00
Duncan P. N. Exon Smith	8f46ee61c1	Support: Rewrite LocationAtom and OperationEncodingString(), NFC Use `Dwarf.def` more. llvm-svn: 229000	2015-02-13 01:04:08 +00:00
Akira Hatanaka	c43df5187c	[LinkModules] Change the way ModuleLinker merges triples. This commit makes the following changes: - Stop issuing a warning when the triples' string representations do not match exactly if the Triple objects generated from the strings compare equal. - On Apple platforms, choose the triple that has the larger minimum version number. rdar://problem/16743513 Differential Revision: http://reviews.llvm.org/D7591 llvm-svn: 228999	2015-02-13 00:40:41 +00:00
Eric Christopher	dc3a8a4a66	PPCFrameLowering's FramePointerOffset can be computed at initialization time. Do so. llvm-svn: 228998	2015-02-13 00:39:38 +00:00
Eric Christopher	736d39e189	The TOC save offset can be computed at compile time, do so and propagate changes. llvm-svn: 228997	2015-02-13 00:39:36 +00:00
Eric Christopher	f71609b5dd	The return save offset can be computed at initialization time - do so and save the value. llvm-svn: 228996	2015-02-13 00:39:27 +00:00
Michael Zolotukhin	8ec536e3dd	Testcase for r228988. llvm-svn: 228995	2015-02-13 00:35:45 +00:00
Chandler Carruth	10a9926ab5	[unroll] Don't use a map from pointer to bool. Use a set. This is much more efficient. In particular, the query with the user instruction has to insert a false for every missing instruction into the set. This is just a cleanup along the way to fixing the underlying algorithm problems here. llvm-svn: 228994	2015-02-13 00:29:39 +00:00
NAKAMURA Takumi	34d46fa297	llvm/test/Transforms/LoopVectorize/PowerPC/small-loop-rdx.ll REQUIRES +Asserts due to -debug. llvm-svn: 228989	2015-02-13 00:21:34 +00:00
Michael Zolotukhin	1b48019751	Prevent division by 0. When we try to estimate number of potentially removed instructions in loop unroller, we analyze first N iterations and then scale the computed number by TripCount/N. We should bail out early if N is 0. llvm-svn: 228988	2015-02-13 00:17:03 +00:00
Chandler Carruth	186ad60815	[unroll] Update the new analysis logic from r228265 to use modern coding conventions for function names consistently. Some were already using this but not all. llvm-svn: 228987	2015-02-13 00:00:24 +00:00
Rafael Espindola	b6a812ebb1	Add support for having multiple sections with the same name and comdat. Using this in combination with -ffunction-sections allows LLVM to output a .o file with multiple sections named .text. This saves space by avoiding long unique names of the form .text.<C++ mangled name>. llvm-svn: 228980	2015-02-12 23:29:51 +00:00
David Majnemer	a12fcb790f	X86: Don't crash if we can't decode the pshufb mask Constant pool entries are uniqued by their contents regardless of their type. This means that a pshufb can have a shuffle mask which isn't a simple array of bytes. The code path which attempts to decode the mask didn't check for failure, causing PR22559. llvm-svn: 228979	2015-02-12 23:26:26 +00:00
Rafael Espindola	e4bcad4754	Learn that __DATA,__objc_classrefs is not atomized via symbols. This should hopefully fix objc on AArch64. llvm-svn: 228976	2015-02-12 23:11:59 +00:00
David Blaikie	64a3f3084e	Add missing override. llvm-svn: 228974	2015-02-12 22:58:53 +00:00
Olivier Sallenave	05e69157b6	Change max interleave factor to 12 for POWER7 and POWER8. llvm-svn: 228973	2015-02-12 22:57:58 +00:00
Simon Pilgrim	b4a0df9a4a	Ensure integer domain on general shuffle stack folding tests llvm-svn: 228972	2015-02-12 22:47:45 +00:00
David Blaikie	7548aeeb9f	Remove typedef of a pointer type used in a gep to simplify migration of geps to a typeless-pointer future. I'd modify my migration tool to account for this, but this is the only instance of a typedef'd pointer type to a gep I found in the whole test suite, so it didn't seem worthwhile. llvm-svn: 228970	2015-02-12 22:45:25 +00:00
Hal Finkel	271e9f2870	[SDAG] Don't try to use FP_EXTEND/FP_ROUND for int<->fp promotions The PowerPC backend has long promoted some floating-point vector operations (such as select) to integer vector operations. Unfortunately, this behavior was broken by r216555. When using FP_EXTEND/FP_ROUND for promotions, we must check that both the old and new types are floating-point types. Otherwise, we must use BITCAST as we did prior to r216555 for everything. llvm-svn: 228969	2015-02-12 22:43:52 +00:00
Duncan P. N. Exon Smith	b93569d182	IR: Stop abusing DW_TAG_base_type for compile unit arrays The sub-arrays for compile units have for a long time been initialized to distinct temporary nodes with the `DW_TAG_base_type` tag, with no other operands. These invalid `DIBasicType`s are later replaced with appropriate arrays. This seems like a poor man's assertion that the arrays do eventually get replaced. These days, temporaries in the graph will cause assertions when writing bitcode or assembly, so this isn't necessary. Use temporary empty tuples instead. Note that the whole idea of using temporaries and then replacing them later is wasteful here. We never actually want to merge compile units by uniquing based on content. Compile units should use `getDistinct()` instead of `get()`, and then their operands can be freely replaced later on. llvm-svn: 228967	2015-02-12 21:52:11 +00:00
Zachary Turner	39e988c63c	Attempt to fix the build again. llvm-svn: 228964	2015-02-12 21:25:58 +00:00
Zachary Turner	0e4b101222	Attempt to fix Linux builds after r228960. llvm-svn: 228962	2015-02-12 21:17:07 +00:00
Rafael Espindola	3105fd8335	Remove mostly unused setters. Most of the code was setting the TargetOptions directly. llvm-svn: 228961	2015-02-12 21:16:34 +00:00
Zachary Turner	c074de041b	Add concrete type overloads to PDBSymbol::findChildren(). Frequently you only want to iterate over children of a specific type (e.g. functions). Previously you would get back a generic interface that allowed iteration over the base symbol type, which you would have to dyn_cast<> each one of. With this patch, we allow the user to specify the concrete type as a template parameter, and it will return an iterator which returns instances of the concrete type directly. llvm-svn: 228960	2015-02-12 21:09:24 +00:00
Reed Kotler	aa150ed780	Add bulk of returning of values to Mips fast-isel Summary: Implement the bulk of returning values in Mips fast-isel Test Plan: reatabi.ll Passes test-suite at -O0,-O2 and with mips32r2 and mips32r1. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits, aemerson, rfuhler Differential Revision: http://reviews.llvm.org/D5920 llvm-svn: 228958	2015-02-12 21:05:12 +00:00
Bjorn Steinbrink	6f972a13f6	Fix a crash in the assumption cache when inlining indirect function calls Summary: Instances of the AssumptionCache are per function, so we can't re-use the same AssumptionCache instance when recursing in the CallAnalyzer to analyze a different function. Instead we have to pass the AssumptionCacheTracker to the CallAnalyzer so it can get the right AssumptionCache on demand. Reviewers: hfinkel Subscribers: llvm-commits, hans Differential Revision: http://reviews.llvm.org/D7533 llvm-svn: 228957	2015-02-12 21:04:22 +00:00
Benjamin Kramer	e8cb17f282	Update test case. llvm-svn: 228956	2015-02-12 20:40:19 +00:00
Benjamin Kramer	443c7967ea	InstCombine: Allow folding of xor into icmp by changing the predicate for vectors The loop vectorizer can create this pattern. llvm-svn: 228954	2015-02-12 20:26:46 +00:00
Simon Pilgrim	295eaad2b3	Relaxed over-zealous alignment requirement for VEX-encoded AES instructions llvm-svn: 228953	2015-02-12 20:01:03 +00:00
Michael Zolotukhin	56c918bcce	Add a testcase for r228432. llvm-svn: 228951	2015-02-12 19:57:24 +00:00
Benjamin Kramer	193c94b459	Try to fix the MSVC build. 0xFFFFFFFFFFFFFFFFLL doesn't fit in a long long so it should have type 'unsigned long long'. MSVC thinks it's a (signed) __int64. llvm-svn: 228950	2015-02-12 19:53:49 +00:00
Michael Kuperstein	a07d9b9a4b	gold-plugin: delete the output file for OT_DISABLE bfd creates the output file early, so calling exit(0) is not enough, the file needs to be explicitly deleted. Patch by: H.J. Lu <hjl.tools@gmail.com> llvm-svn: 228946	2015-02-12 18:21:50 +00:00
Rafael Espindola	203c5b9f39	On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 228939	2015-02-12 17:46:49 +00:00
Rafael Espindola	29786d4c16	Put each jump table in an independent section if the function is too. This allows the linker to GC both, fixing pr22557. llvm-svn: 228937	2015-02-12 17:16:46 +00:00
Benjamin Kramer	40957cc2ce	Fix accidental bit flip. llvm-svn: 228936	2015-02-12 16:30:00 +00:00
Benjamin Kramer	71e1eb5ab4	CoverageMapping: Bitvectorize code. No functionality change. llvm-svn: 228934	2015-02-12 16:18:07 +00:00
James Molloy	e805ad95dc	[LoopRerolling] Be more forgiving with instruction order. We can't solve the full subgraph isomorphism problem. But we can allow obvious cases, where for example two instructions of different types are out of order. Due to them having different types/opcodes, there is no ambiguity. llvm-svn: 228931	2015-02-12 15:54:14 +00:00
Benjamin Kramer	5f6a907288	MathExtras: Bring Count(Trailing|Leading)Ones and CountPopulation in line with countTrailingZeros Update all callers. llvm-svn: 228930	2015-02-12 15:35:40 +00:00
Tim Northover	be0fda3c33	Triple: refactor redundant code. Should be no functional change, since most of the logic removed was completely pointless (after some previous refactoring) and the rest duplicated elsewhere. Patch by Kamil Rytarowski. llvm-svn: 228926	2015-02-12 15:12:13 +00:00
Michael Kuperstein	f4d1aca568	[X86] Call frame optimization - allow stack-relative movs to be folded into a push Since we track esp precisely, there's no reason not to allow this. llvm-svn: 228924	2015-02-12 14:17:35 +00:00
Andrea Di Biagio	b08862c4f0	[TTI] Teach the cost heuristic how to query TLI to check if a zext/trunc is 'free' for the target. Now that SimplifyCFG uses TTI for the cost heuristic, we can teach BasicTTIImpl how to query TLI in order to get a more accurate cost for truncates and zero-extends. Before this patch, the basic cost heuristic in TargetTransformInfoImplCRTPBase would have conservatively returned a 'default' TCC_Basic for all zero-extends, and TCC_Free for truncates on native types. This patch improves the heuristic so that we query TLI (if available) to get more accurate answers. If TLI is available, then methods 'isZExtFree' and 'isTruncateFree' can be used to check if a zext/trunc is free for the target. Added more test cases to SimplifyCFG/X86/speculate-cttz-ctlz.ll. With this change, SimplifyCFG is now able to speculate a 'cheap' cttz/ctlz immediately followed by a free zext/trunc. Differential Revision: http://reviews.llvm.org/D7585 llvm-svn: 228923	2015-02-12 14:17:24 +00:00
Benjamin Kramer	fe412882c2	BitVector: Remove manual bit width dispatch, this is handled by templates NFC. llvm-svn: 228922	2015-02-12 14:02:58 +00:00
Benjamin Kramer	baa4f7474e	MathExtras: Parametrize count(Trailing|Leading)Zeros on the type size. Otherwise we will always select the generic version for e.g. unsigned long if uint64_t is typedef'd to 'unsigned long long'. Also remove enable_if hacks in favor of static_assert. llvm-svn: 228921	2015-02-12 13:47:29 +00:00
Asiri Rathnayake	e045e378ad	ARM: Fix another regression introduced in r223113 The changes in r223113 (ARM modified-immediate syntax) have broken instructions like: mov r0, #~0xffffff00 The problem is that I've added a spurious range check on the immediate operand to ensure that it lies between INT32_MIN and UINT32_MAX. While this range check is correct in theory, it causes problems because the operand is stored in an int64_t (by MC). So valid 32-bit constants like #~0xffffff00 become out of range. The solution is to simply remove this range check. It is not possible to validate the range of the immediate operand with the current setup because: 1) The operand is stored in an int64_t by MC, 2) The immediate can be of the forms #imm, #-imm, #~imm or even #((~imm)) etc. So we just chop the value to 32 bits and use it. Also noted that the original range check was not tested by any of the unit tests. I've added a new test to cover #~imm kind of operands. Change-Id: I411e90d84312a2eff01b732bb238af536c4a7599 llvm-svn: 228920	2015-02-12 13:37:28 +00:00
Dmitry Vyukov	2e8d82e607	tsan: do not instrument not captured values I've built some tests in WebRTC with and without this change. With this change number of __tsan_read/write calls is reduced by 20-40%, binary size decreases by 5-10% and execution time drops by ~5%. For example: $ ls -l old/modules_unittests new/modules_unittests -rwxr-x--- 1 dvyukov 41708976 Jan 20 18:35 old/modules_unittests -rwxr-x--- 1 dvyukov 38294008 Jan 20 18:29 new/modules_unittests $ objdump -d old/modules_unittests | egrep "callq.__tsan_(read|write|unaligned)" | wc -l 239871 $ objdump -d new/modules_unittests | egrep "callq.__tsan_(read|write|unaligned)" | wc -l 148365 http://reviews.llvm.org/D7069 llvm-svn: 228917	2015-02-12 09:55:28 +00:00
Elena Demikhovsky	d2cb3c8876	AVX-512: Fixed the "test" operation for i1 type Using KORTESTW for comparison i1 value with zero was wrong since the instruction tests 16 bits. KORTESTW may be used with KSHIFTL+KSHIFTR that clean the 15 upper bits. I removed (X86cmp i1, 0) pattern and zero-extend i1 to i8 and then use TESTB. There are some cases where i1 is in the mask register and the upper bits are already zeroed. Then KORTESTW is the better solution, but it is subject for optimization. Meanwhile, I'm fixing the correctness issue. llvm-svn: 228916	2015-02-12 08:40:34 +00:00
Michael Kuperstein	db95d04be4	[X86] A heuristic to estimate the size impact for converting stack-relative parameter movs to pushes This gives a rough estimate of whether using pushes instead of movs is profitable, in terms of size. We go over all calls in the MachineFunction and compute: a) For each callsite that can not use pushes, the penalty of not having a reserved call frame. b) For each callsite that can use pushes, the gain of actually replacing the movs with pushes (and the potential penalty of having to readjust the stack). Differential Revision: http://reviews.llvm.org/D7561 llvm-svn: 228915	2015-02-12 08:36:35 +00:00
Ahmed Bougacha	24433a7005	[CodeGen] Don't blindly combine (fp_round (fp_round x)) to (fp_round x). We used to do this DAG combine, but it's not always correct: If the first fp_round isn't a value preserving truncation, it might introduce a tie in the second fp_round, that wouldn't occur in the single-step fp_round we want to fold to. In other words, double rounding isn't the same as rounding. Differential Revision: http://reviews.llvm.org/D7571 llvm-svn: 228911	2015-02-12 06:15:29 +00:00
George Burgess IV	33305e7280	Fixed a bug where CFLAA would crash the compiler. We would crash if we couldn't locate a Function that either Location's Value belonged to. Now we just print out a debug message and return conservatively. llvm-svn: 228901	2015-02-12 03:07:07 +00:00
Chandler Carruth	63aaa98d94	[slp] Fix a nasty bug in the SLP vectorizer that Joerg pointed out. Apparently some code finally started to tickle this after my canonicalization changes to instcombine. The bug stems from trying to form a vector type out of scalars that aren't compatible at all. In this example, from x86_mmx values. The code in the vectorizer that checks for reasonable types was checking for aggregates or vectors, but there are lots of other types that should just never reach the vectorizer. Debugging this was made more confusing by the lie in an assert in VectorType::get() -- it isn't that the types are primitive. The types must be integer, pointer, or floating point types. No other types are allowed. I've improved the assert and added a helper to the vectorizer to handle the element type validity checks. It now re-uses the VectorType static function and then further excludes weird target-specific types that we probably shouldn't be touching here (x86_fp80 and ppc_fp128). Neither of these are really reachable anyways (neither 80-bit nor 128-bit things will get vectorized) but it seems better to just eagerly exclude such nonsense. I've added a test case, but while it definitely covers two of the paths through this code there may be more paths that would benefit from test coverage. I'm not familiar enough with the SLP vectorizer to synthesize test cases for all of these, but was able to update the code itself by inspection. llvm-svn: 228899	2015-02-12 02:30:56 +00:00
Hal Finkel	7a0516ea66	[PowerPC] Mark jumps as expensive (using CR bits) On PowerPC, which has a full set of logical operations on (its multiple sets of) condition-register bits, it is not profitable to break up complex conditions feeding a jump into multiple jumps. We can turn off this feature of CGP/SDAGBuilder by marking jumps as "expensive". P7 test-suite speedups (no regressions): MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2 -0.626647% +/- 0.323583% MultiSource/Benchmarks/Olden/power/power -18.2821% +/- 8.06481% llvm-svn: 228895	2015-02-12 01:02:52 +00:00
Zachary Turner	36f807c860	Revert "Change Path::filename_pos() to skip the drive letter." This reverts commit 228874. For some reason users reported seeing Clang taking up 25+GB of memory and bringing down machines with this change. Reverting until we figure it out. llvm-svn: 228890	2015-02-12 00:05:49 +00:00
Rafael Espindola	bbcdb9da19	Invert the section relocation map. It now points from rel section to section. Use it to set sh_info, avoiding a brittle name lookup. llvm-svn: 228889	2015-02-11 23:38:33 +00:00
Rafael Espindola	62118a1fe3	Use the existing SymbolTableIndex instead of doing a lookup. NFC. llvm-svn: 228888	2015-02-11 23:33:46 +00:00
Rafael Espindola	fbfbdc4377	Create the Section -> Rel Section map when it is first needed. NFC. Saves a walk over every section. llvm-svn: 228886	2015-02-11 23:17:48 +00:00
Tim Northover	02438033e8	DeadArgElim: aggregate Return assessment properly. I mistakenly thought the liveness of each "RetVal(F, i)" depended only on F. It actually depends on the index too, which means we need to be careful about how the results are combined before return. In particular if a single Use returns Live, that counts for the entire object, at the granularity we're considering. llvm-svn: 228885	2015-02-11 23:13:11 +00:00
Rafael Espindola	ef6baea74e	Remove unused argument. NFC. llvm-svn: 228884	2015-02-11 23:11:18 +00:00
David Majnemer	ab2b25bc97	Unbreak buildbots The next offset should be updated as well. llvm-svn: 228883	2015-02-11 22:51:55 +00:00
Rafael Espindola	fbd0ddf082	Don't recompute the entire section map just to add 3 entries. NFC. llvm-svn: 228881	2015-02-11 22:41:26 +00:00
David Majnemer	3df3c61e91	MC, COFF: Align section contents to a four byte boundary llvm-svn: 228879	2015-02-11 22:22:30 +00:00
Zachary Turner	3e76643a95	Change Path::filename_pos() to skip the drive letter. For Windows, filename_pos() tries to find the filename by searching for separators after the last :. Instead, it should really check for the only location that a : is valid, which is in the second character, and search for separators after that. llvm-svn: 228874	2015-02-11 21:16:35 +00:00
Rafael Espindola	d966522377	Remove unused argument. NFC. llvm-svn: 228873	2015-02-11 21:08:00 +00:00
Mehdi Amini	9730116bd6	Reassociate: cannot negate a INT_MIN value Summary: When trying to canonicalize negative constants out of multiplication expressions, we need to check that the constant is not INT_MIN which cannot be negated. Reviewers: mcrosier Reviewed By: mcrosier Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7286 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 228872	2015-02-11 19:54:44 +00:00
Tom Stellard	0648588e7d	R600/SI: Disable subreg liveness This is temporary while we try to fix a crash in the register coalescer. llvm-svn: 228861	2015-02-11 18:24:53 +00:00
Simon Pilgrim	2a9a745328	[X86][SSE] Added dual vector truncation tests. llvm-svn: 228857	2015-02-11 18:14:35 +00:00
Adrian Prantl	18a25b016e	Allow DIBuilder::replaceVTableHolder() to work with temporary nodes, tested via the clang test CodeGenCXX/vtable-holder-self-reference.cpp . llvm-svn: 228854	2015-02-11 17:45:10 +00:00
Adrian Prantl	9a8049238e	Add a trackIfUnresolved to DIBuilder::createInheritance(), tested via the clang test CodeGenCXX/vtable-holder-self-reference.cpp . llvm-svn: 228853	2015-02-11 17:45:08 +00:00
Adrian Prantl	534a81a9ec	Generalize DIBuilder's createReplaceableForwardDecl() to a more flexible createReplaceableCompositeType() that allows to create non-forward-declared temporary nodes. Paired commit with CFE. llvm-svn: 228852	2015-02-11 17:45:05 +00:00
Tom Stellard	de5b7b180a	R600: Split AMDGPUPassConfig into R600PassConfig and GCNPassConfig llvm-svn: 228850	2015-02-11 17:11:51 +00:00
Tom Stellard	c65b36061a	R600: Create an R600TargetMachine for pre-gcn GPUs No functionality change. R600TargetMachine inherits from AMDGPUTargetMachine. llvm-svn: 228849	2015-02-11 17:11:50 +00:00
Tom Stellard	502ef4e791	R600/SI: Fix -march in test llvm-svn: 228848	2015-02-11 17:11:48 +00:00
Jan Wen Voung	c11b45a2ea	Gold-plugin: Broaden scope of get/release_input_file to scope of Module. Summary: Move calls to get_input_file and release_input_file out of getModuleForFile(). Otherwise release_input_file may end up unmapping a view of the file while the view is still being used by the Module (on 32-bit hosts). Fix for PR22482. Test Plan: Add test using --no-map-whole-files. Reviewers: rafael, nlewycky Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7539 llvm-svn: 228842	2015-02-11 16:12:50 +00:00
Jonas Paulsson	bf8d0cc699	Fix SelectionDAG compile time issue with alias analysis. Add new token factor node and its users to worklist if alias analysis is turned on, in DAGCombiner::visitTokenFactor(). Alias analysis may cause a lot of new token factors to be inserted into the DAG, and they need to be optimized to avoid significant slow-downs. Reviewed by Hal Finkel. llvm-svn: 228841	2015-02-11 16:10:31 +00:00
Sanjay Patel	afe251649b	fixed to test features, not CPUs llvm-svn: 228836	2015-02-11 15:00:41 +00:00
Sanjay Patel	b53d82cbc5	fixed to test features, not CPUs llvm-svn: 228835	2015-02-11 15:00:19 +00:00
Sanjay Patel	8b88bc91bd	fixed to test features, not CPUs llvm-svn: 228834	2015-02-11 14:58:25 +00:00
Rafael Espindola	25d2c20c0c	Don't repeat name in comment and clang-format a function. llvm-svn: 228831	2015-02-11 14:44:17 +00:00
Marek Olsak	fa6607d0b6	R600/SI: Enable a lot of existing tests for VI (squashed commits) This is a union of these commits: * R600/SI: Enable more tests for VI which need no changes * R600/SI: Enable V_BCNT tests for VI Differences: - v_bcnt_..._e32 -> _e64 - s_load_dword* inline offset is in bytes instead of dwords * R600/SI: Enable all tests for VI which use S_LOAD_DWORD The inline offset is changed from dwords to bytes. * R600/SI: Enable LDS tests for VI Differences: - the s_load_dword inline offset changed from dwords to bytes - the tests checked very little on CI, so they have been fixed to check all instructions that "SI" checked * R600/SI: Enable lshr tests for VI * R600/SI: Fix divrem64 tests - "v_lshl_64" was missing "b" before "64" - added VI-NOT checks * R600/SI: Enable the SI.tid test for VI * R600/SI: Enable the frem test for VI Also, the frem_f64 checking is added for CI-VI. * R600/SI: Add VI tests for rsq.clamped llvm-svn: 228830	2015-02-11 14:26:46 +00:00
Andrea Di Biagio	2a0e435db1	[TTI] Improved cost heuristic for cttz/ctlz calls. This patch is a follow-up of r228826 (see code-review: D7506). Now that SimplifyCFG uses TargetTransformInfo for cost analysis, we have to fix the cost heuristic for intrinsic calls to cttz/ctlz. This patch defines method 'getIntrinsicCost' in BasicTTIImpl: now, BasicTTIImpl queries TLI to check if a call to cttz/ctlz is cheap for the target. Added test cases in Transforms/SimplifyCFG/X86 to verify that on x86, SimplifyCFG only speculates a call to cttz/ctlz if it is cheap. Differential Revision: http://reviews.llvm.org/D7554 llvm-svn: 228829	2015-02-11 14:22:18 +00:00
James Molloy	99f06df8ac	Make buildbots better. This testcase change was associated incorrectly to a followup commit in my git tree, not the base commit. Sorry! llvm-svn: 228827	2015-02-11 12:24:09 +00:00
James Molloy	7c336576a5	[SimplifyCFG] Swap to using TargetTransformInfo for cost analysis. We're already using TTI in SimplifyCFG, so remove the hard-baked "cheapness" heuristic and use TTI directly. Generally NFC intended, but we're using a slightly different heuristic now so there is a slight test churn. Test changes: * combine-comparisons-by-cse.ll: Removed unneeded branch check. * 2014-08-04-muls-it.ll: Test now doesn't branch but emits muleq. * coalesce-subregs.ll: Superfluous block check. * 2008-01-02-hoist-fp-add.ll: fadd is safe to speculate. Change to udiv. * PhiBlockMerge.ll: Superfluous CFG checking code. Main checks still present. * select-gep.ll: A variable GEP is not expensive, just TCC_Basic, according to the TTI. llvm-svn: 228826	2015-02-11 12:15:41 +00:00
Daniel Sanders	a19216c8f4	[mips] Merge disassemblers into a single implementation. Summary: Currently we have Mips32 and Mips64 disassemblers and this causes the target triple to affect the disassembly despite all the relevant information being in the ELF header. These implementations do not need to be separate. This patch merges them together such that the appropriate tables are checked for the subtarget (e.g. Mips64 is checked when GP64 is enabled). Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7498 llvm-svn: 228825	2015-02-11 11:28:56 +00:00
James Molloy	f147359376	[LoopReroll] Introduce the concept of DAGRootSets. A DAGRootSet models an induction variable being used in a rerollable loop. For example: x[i*3+0] = y1 x[i*3+1] = y2 x[i*3+2] = y3 Base instruction -> i*3 +---+----+ / | \ ST[y1] +1 +2 <-- Roots | | ST[y2] ST[y3] There may be multiple DAGRootSets, for example: x[i*2+0] = ... (1) x[i*2+1] = ... (1) x[i*2+4] = ... (2) x[i*2+5] = ... (2) x[(i+1234)*2+5678] = ... (3) x[(i+1234)*2+5679] = ... (3) This concept is similar to the "Scale" member used previously, but allows multiple independent sets of roots based off the same induction variable. llvm-svn: 228821	2015-02-11 09:19:47 +00:00
David Majnemer	fad5a31160	AsmParser: Validate alloca's type An alloca's type should not be weird things like metadata. llvm-svn: 228820	2015-02-11 09:13:11 +00:00
David Majnemer	04578fcfa5	DataLayout: Report when the preferred alignment is less than the ABI llvm-svn: 228819	2015-02-11 09:13:09 +00:00
David Majnemer	d7677e7a8d	Verifier: Check for null operands in !llvm.module.flags llvm-svn: 228818	2015-02-11 09:13:06 +00:00
Michael Kuperstein	1921d3d6f3	[X86] Split information collection from actual transformation in call frame optimization This splits collecting information from actually performing the transformation, so that we can add a heuristic in between the two. NFC. Differential Revision: http://reviews.llvm.org/D7497 llvm-svn: 228817	2015-02-11 08:53:55 +00:00
Arnaud A. de Grandmaison	de79026d5e	[PBQP] Cautiously update edge costs in the solver The NodeMetadata are maintained in an incremental way. When an edge between 2 nodes has its cost updated, in the course of graph reduction for example, the NodeMetadata need first to have the old edge cost removed, then the new edge cost added. Only once the NodeMetadata have been fully updated, it becomes safe to consider promoting the nodes to the ConservativelyAllocatable or OptimallyReducible sets. Previously, this promotion was occurring right after removing the old cost, and this was breaking the assumption that a ConservativelyAllocatable should not be spilled. This patch also adds asserts to: - enforce the invariant that a node's reduction can not be downgraded, - ensure that only not provably allocatable or optimally reducible nodes can be spilled. llvm-svn: 228816	2015-02-11 08:25:36 +00:00
David Majnemer	9fd8cdc009	Verifier: Make sure !llvm.ident's operand isn't null llvm-svn: 228815	2015-02-11 08:23:20 +00:00
David Majnemer	300745351f	AsmParser: Don't crash when insertvalue has bad operands llvm-svn: 228813	2015-02-11 07:43:58 +00:00
David Majnemer	19b51054af	AsmParser: Switch some vectors to maps This speeds up parsing .ll files with metadata nodes with large IDs. llvm-svn: 228812	2015-02-11 07:43:56 +00:00
Peter Collingbourne	d20eff0ea6	Fix build for CMake < 2.8.12. llvm-svn: 228810	2015-02-11 05:58:57 +00:00
Zachary Turner	3bd47cee78	Use ADDITIONAL_HEADER_DIRS in all LLVM CMake projects. This allows IDEs to recognize the entire set of header files for each of the core LLVM projects. Differential Revision: http://reviews.llvm.org/D7526 Reviewed By: Chris Bieneman llvm-svn: 228798	2015-02-11 03:28:02 +00:00
Justin Bogner	d24e185784	InstrProf: Lower coverage mappings by setting their sections appropriately Add handling for __llvm_coverage_mapping to the InstrProfiling pass. We need to make sure the constant and any profile names it refers to are in the correct sections, which is easier and cleaner to do here where we have to know about profiling sections anyway. This is really tricky to test without a frontend, so I'm committing the test for the fix in clang. If anyone knows a good way to test this within LLVM, please let me know. Fixes PR22531. llvm-svn: 228793	2015-02-11 02:52:44 +00:00
Andrew Kaylor	7ad134a746	Temporary workaround to fix MSVC 2012 build problems llvm-svn: 228788	2015-02-11 02:16:34 +00:00
Reid Kleckner	b3775df32e	Fix invalid LLVM IR in PruneEH tests llvm-svn: 228786	2015-02-11 02:06:47 +00:00
Reid Kleckner	96d011315a	Don't promote asynch EH invokes of nounwind functions to calls If the landingpad of the invoke is using a personality function that catches asynch exceptions, then it can catch a trap. Also add some landingpads to invalid LLVM IR test cases that lack them. Over-the-shoulder reviewed by David Majnemer. llvm-svn: 228782	2015-02-11 01:23:16 +00:00
Tom Stellard	94b7231740	R600/SI: Store immediate offsets > 12-bits in soffset This will save us from having to extend these offsets to 64-bits and storing them in a pair of vgprs. llvm-svn: 228776	2015-02-11 00:34:35 +00:00

113225 Commits