llvm-project

Commit Graph

Author	SHA1	Message	Date
David Majnemer	88542a0a69	[SCCP] Remove duplicate code SCCP has code identical to changeToUnreachable's behavior, switch it over to just call changeToUnreachable. No functionality change intended. llvm-svn: 258654	2016-01-24 06:26:47 +00:00
David Majnemer	35c46d3e0b	[InstCombine, SCCP] Consolidate code used to remove instructions InstCombine and SCCP both want to remove dead code in a very particular way but using identical means to do so. Share the code between the two. No functionality change is intended. llvm-svn: 258653	2016-01-24 05:26:18 +00:00
David Majnemer	b7d49268c2	[WinEH] Don't miscompile cleanups which conditionally unwind to caller A cleanup can have paths which unwind or end up in unreachable. If there is an unreachable path and a path which unwinds to caller, we would mistakenly inject an unwind path to a catchswitch on the unreachable path. This results in a verifier assertion firing because the cleanup unwinds to two different places: to the caller and to the catchswitch. This occurred because we used getCleanupRetUnwindDest to determine if the cleanuppad had no cleanuprets. This is incorrect: getCleanupRetUnwindDest returns null for cleanuprets which unwind to caller. llvm-svn: 258651	2016-01-23 23:54:33 +00:00
Manuel Jacob	0af37b21c8	Remove duplicate documentation in ConstantFolding.cpp. NFC. The documentation for these functions is already present in the header file. llvm-svn: 258649	2016-01-23 22:49:54 +00:00
Manuel Jacob	5d073c4508	Remove duplicate documentation in Attributes.cpp. NFC. The documentation for these methods is already present in the header. llvm-svn: 258648	2016-01-23 22:42:24 +00:00
Manuel Jacob	25510fcf5c	Update outdated method documentation in Attributes.h. NFC. Nowadays the alignment attribute is not the only integer attribute. llvm-svn: 258647	2016-01-23 22:38:39 +00:00
Simon Pilgrim	02c1b54a4a	[SelectionDAG] Generalised the CONCAT_VECTORS creation to support BUILD_VECTOR and UNDEF folding. llvm-svn: 258646	2016-01-23 22:27:54 +00:00
Simon Pilgrim	0423b382d3	[X86][SSE] Generalised TRUNC -> PACKSS/PACKUS code. NFC. Generalised mask generation / subvector extraction to use the input/output types directly instead of an if/else through all the currently accepted types. llvm-svn: 258645	2016-01-23 22:02:48 +00:00
Simon Pilgrim	b9b8fcd831	Tidied up TRUNC combine code. NFC. Make use of DAG.getBitcast and use clang-format to reduce number of lines (and make it more readable). llvm-svn: 258644	2016-01-23 21:50:40 +00:00
Justin Lebar	561d5a1758	[CUDA] Add Target::isNVPTX(). Summary: Helper so we don't have to enumerate nvptx && nvptx64 everywhere. Reviewers: echristo Subscribers: llvm-commits, jhen, tra Differential Revision: http://reviews.llvm.org/D16494 llvm-svn: 258639	2016-01-23 21:12:22 +00:00
Justin Lebar	3a5f5798a1	[CUDA] Die gracefully when trying to output an LLVM alias. Summary: Previously, we would just output "foo = bar" in the assembly, and then ptxas would choke. Now we die before emitting any invalid code. Reviewers: echristo Subscribers: jholewinski, llvm-commits, jhen, tra Differential Revision: http://reviews.llvm.org/D16490 llvm-svn: 258638	2016-01-23 21:12:20 +00:00
Justin Lebar	2a161f986f	[CUDA] Make empty parameter lists in nvptx function decls easier to read. Summary: Before: .func (.param .b32 func_retval0) _ZL21__nvvm_reflect_anchorv( ) { After: .func (.param .b32 func_retval0) _ZL21__nvvm_reflect_anchorv() { Reviewers: bkramer Subscribers: llvm-commits, tra, jhen, echristo, jholewinski Differential Revision: http://reviews.llvm.org/D16512 llvm-svn: 258637	2016-01-23 21:12:17 +00:00
Benjamin Kramer	58e1998520	Don't check if a list is empty with ilist::size. ilist::size() is O(n) while ilist::empty() is O(1) llvm-svn: 258636	2016-01-23 20:58:09 +00:00
NAKAMURA Takumi	e2b032a5b0	ObjectTransformLayerTest.cpp: Rework r258633. [-Winconsistent-missing-override] Sorry for the noise. llvm-svn: 258635	2016-01-23 20:48:50 +00:00
NAKAMURA Takumi	7f957926a4	ObjectTransformLayerTest.cpp: Fix a warning. [-Wredundant-move] llvm-svn: 258634	2016-01-23 20:45:55 +00:00
NAKAMURA Takumi	63fb066a7d	ObjectTransformLayerTest.cpp: Fix a warning. [-Winconsistent-missing-override] llvm-svn: 258633	2016-01-23 20:45:50 +00:00
Kostya Serebryany	9768e7f06b	[libFuzzer] add -abort_on_timeout option llvm-svn: 258631	2016-01-23 19:34:19 +00:00
Joseph Tremoulet	23d02f6149	[ORC] Update ObjectTransformLayer signature Summary: Update ObjectTransformLayer::addObjectSet to take the object set by value rather than reference and pass it to the base layer with move semantics rather than copy, to match r258185's changes to ObjectLinkingLayer. Update the unit test to verify that ObjectTransformLayer's signature stays in sync with ObjectLinkingLayer's. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16414 llvm-svn: 258630	2016-01-23 18:36:01 +00:00
Sanjay Patel	a41ecae46e	regenerate checks and note some near-term improvements For the moment, this file takes way too long to run (see inline comments), but that should be a temporary problem. The fact that the compile time is so slow for a target that doesn't support maskmov may be a bug worth investigating too. llvm-svn: 258629	2016-01-23 17:52:56 +00:00
Akira Hatanaka	1235d280d8	[Bitcode] Insert the darwin wrapper at the beginning of a file when the target is macho. It looks like the check for macho was accidentally dropped in r132959. I don't have a test case, but I'll add one if anyone knows how this can be tested. llvm-svn: 258627	2016-01-23 16:02:10 +00:00
Aaron Ballman	add830b5d1	Silence a -Wparentheses warning; NFC. llvm-svn: 258626	2016-01-23 15:42:21 +00:00
Simon Pilgrim	ead22d095e	Added missing comment. NFC. llvm-svn: 258624	2016-01-23 14:38:02 +00:00
NAKAMURA Takumi	0933b4280a	AlignOf.h: Satisfy both g++-4.7 and msc18. llvm-svn: 258623	2016-01-23 13:52:09 +00:00
Simon Pilgrim	fd66169341	[X86][SSE] Remove INSERTPS dependencies from unreferenced operands. If the INSERTPS zeroes out all the referenced elements from either of the 2 input vectors (and the input is not already UNDEF), then set that input to UNDEF to reduce dependencies. llvm-svn: 258622	2016-01-23 13:37:07 +00:00
Haicheng Wu	dd5e9d2159	[LIR] Add support for structs and hand unrolled loops Now LIR can turn the following code into memset: typedef struct foo { int a; int b; } foo_t; void bar(foo_t *f, unsigned n) { for (unsigned i = 0; i < n; ++i) { f[i].a = 0; f[i].b = 0; } } void test(foo_t *f, unsigned n) { for (unsigned i = 0; i < n; i += 2) { f[i] = 0; f[i+1] = 0; } } llvm-svn: 258620	2016-01-23 06:52:41 +00:00
Matthias Braun	327bca776c	Inline variable into assert Seems like some compilers still give unused variable warnings for bool var = ...; (void)var; so I have to inline the variable. llvm-svn: 258619	2016-01-23 06:49:29 +00:00
NAKAMURA Takumi	9974fa9c8c	AArch64ISelLowering.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 258618	2016-01-23 06:34:59 +00:00
Junmo Park	75e9d64aa2	Remove extra whitespace. NFC. llvm-svn: 258617	2016-01-23 06:34:36 +00:00
David Majnemer	6e51070dda	[PruneEH] Don't try to insert a terminator after another terminator LLVM's BasicBlock has a single terminator, it is not valid to have two. llvm-svn: 258616	2016-01-23 06:00:44 +00:00
Manuel Jacob	45cc9bb581	Put space after pointer type in test. NFC. llvm-svn: 258615	2016-01-23 05:47:34 +00:00
Matt Arsenault	8aa5678125	AMDGPU: Replace some deprecated intrinsic uses in tests llvm-svn: 258614	2016-01-23 05:42:49 +00:00
Matt Arsenault	325cca33ec	AMDGPU: Run instnamer on a few tests This will make future test updates easier llvm-svn: 258613	2016-01-23 05:42:43 +00:00
Matt Arsenault	7713162c32	AMDGPU: Remove more unused intrinsics Replace tests with lrp with basic IR expansion llvm-svn: 258612	2016-01-23 05:42:38 +00:00
David Majnemer	4bf0b6b483	[PruneEH] FuncletPads must not have undef operands Instead of RAUW with undef, replace the first non-token instruction with unreachable. This fixes PR26263. llvm-svn: 258611	2016-01-23 05:41:29 +00:00
David Majnemer	2d728ec55d	[PruneEH] Unify invoke and call handling in DeleteBasicBlock No functionality change is intended. llvm-svn: 258610	2016-01-23 05:41:27 +00:00
David Majnemer	146d781717	[PruneEH] Reuse code from removeUnwindEdge PruneEH had functionality identical to removeUnwindEdge. Consolidate around removeUnwindEdge. No functionality change is intended. llvm-svn: 258609	2016-01-23 05:41:22 +00:00
Matt Arsenault	f75257aaa6	AMDGPU: Move amdgcn intrinsic handling into SITargetLowering llvm-svn: 258608	2016-01-23 05:32:20 +00:00
Matt Arsenault	f1341406bf	AMDGPU: Remove IntrNoMem from llvm.SI.sendmsg This has side effects. llvm-svn: 258607	2016-01-23 05:32:18 +00:00
Matt Arsenault	2a93bb6365	AMDGPU: Remove Feature64BitPtr This is a leftover from AMDIL that doesn't do anything and doesn't belong here. llvm-svn: 258606	2016-01-23 05:32:14 +00:00
Matthias Braun	fdef49b183	AArch64ISel: Fix ccmp code selection matching deep expressions. Some of the conditions necessary to produce ccmp sequences were only checked in recursive calls to emitConjunctionDisjunctionTree() after some of the earlier expressions were already built. Move all checks over to isConjunctionDisjunctionTree() so they are all checked before we start emitting instructions. Also rename some variable to better reflect their usage. llvm-svn: 258605	2016-01-23 04:05:22 +00:00
Matthias Braun	985bdf9084	AArch64ISelLowering: Reduce maximum recursion depth of isConjunctionDisjunctionTree() This function will exhibit exponential runtime (2**n) so we should rather use a lower limit. llvm-svn: 258604	2016-01-23 04:05:18 +00:00
Matthias Braun	fd13c14669	Fix wrong indentation llvm-svn: 258603	2016-01-23 04:05:16 +00:00
NAKAMURA Takumi	7cfaab2bcc	AlignOf.h: Appease g++-4.7 for now. Will fix later. llvm-svn: 258600	2016-01-23 02:22:36 +00:00
Derek Schuff	65194682e9	[WebAssembly] Fix RegNumbering for the stack pointer Previously it failed to add NumArgRegs to the offset and so clobbered an already-used register. Now just start the numbering after the arg regs and don't duplicate the add. Test coverage for this coming shortly with the implementation of byval. llvm-svn: 258597	2016-01-23 01:20:43 +00:00
Kostya Serebryany	160dcba81f	[libFuzzer] add more fields to DictionaryEntry to count the number of uses and successes llvm-svn: 258589	2016-01-22 23:55:14 +00:00
Reid Kleckner	841e5a0398	[cmake] Disable manifest generation when LLD is the linker Running mt.exe to make the manifest is really slow. Disabling manifest generation doesn't seem to break anything. llvm-svn: 258581	2016-01-22 23:27:13 +00:00
David Majnemer	f1ff538456	[WinEH] Let cleanups post-dominated by unreachable get executed Cleanups in C++ are a little weird. They are guaranteed to be reliably executed if, and only if, there is a viable catch handler which can handle the exception. This means that reachability of a cleanup is lexically determined by it being nested within a try-block which unwinds to a catch. It cannot be reasoned about by examining the control flow edges leaving a cleanup. Usually this is not a problem. It becomes a problem when there are no edges out of a cleanup because we believed that code post-dominated by the cleanup is dead. In LLVM's case, this code is what informs the personality routine about the presence of a suitable catch handler. However, the lack of edges to that catch handler makes the handler become unreachable which causes us to remove it. By removing the handler, the cleanup becomes unreachable. Instead, inject a catch-all handler with every cleanup that has no unwind edges. This will allow us to properly unwind the stack. This fixes PR25997. llvm-svn: 258580	2016-01-22 23:20:43 +00:00
Kevin Enderby	1829c686bf	Fix the code that leads to the incorrect trigger of the report_fatal_error() in MachOObjectFile::getSymbolByIndex() when a Mach-O file has a symbol table load command but the number of symbols is zero. The code in MachOObjectFile::symbol_begin_impl() should not be assuming there is a symbol at index 0, in case there is no symbol table load command or the count of symbols is zero. So I also fixed that. And needed to fix MachOObjectFile::symbol_end_impl() to also do the same thing for no symbol table or one with zero entries. The code in MachOObjectFile::getSymbolByIndex() should trigger the report_fatal_error() for programmatic errors for any index when there is no symbol table load command and not return the end iterator. So also fixed that. Note there is no test case as this is a programmatic error. The test case using the file macho-invalid-bad-symbol-index has a symbol table load command whose number of symbols (nsyms) is zero, which was incorrectly testing the bad triggering of the report_fatal_error() in MachOObjectFile::getSymbolByIndex(). This test case is an invalid Mach-O file but not for that reason. It appears this Mach-O file used to have an nsyms value of 11, and what makes this Mach-O file invalid is the counts and indexes into the symbol table of the dynamic load command are now invalid because the number of symbol table entries (nsyms) is now zero. Which can be seen with the existing llvm-objdump: % llvm-objdump -private-headers macho-invalid-bad-symbol-index … Load command 4 cmd LC_SYMTAB cmdsize 24 symoff 4216 nsyms 0 stroff 4392 strsize 144 Load command 5 cmd LC_DYSYMTAB cmdsize 80 ilocalsym 0 nlocalsym 8 (past the end of the symbol table) iextdefsym 8 (greater than the number of symbols) nextdefsym 2 (past the end of the symbol table) iundefsym 10 (greater than the number of symbols) nundefsym 1 (past the end of the symbol table) ... And the native darwin tools generate an error for this file: % nm macho-invalid-bad-symbol-index nm: object: macho-invalid-bad-symbol-index truncated or malformed object (ilocalsym plus nlocalsym in LC_DYSYMTAB load command extends past the end of the symbol table) I added new checks for the indexes and sizes for these in the constructor of MachOObjectFile. And added comments for what would be proper diagnostic messages. And changed the test case using macho-invalid-bad-symbol-index to test for the new error now produced. Also added a test with a valid Mach-O file with a symbol table load command where the number of symbols is zero that shows the report_fatal_error() is not called. llvm-svn: 258576	2016-01-22 22:49:55 +00:00
Ivan Krasin	df91910bd4	Use std::piecewise_constant_distribution instead of ad-hoc binary search. Summary: Fix the issue with the most recently discovered unit receiving much less attention. Note: this is the second attempt (prev: r258473). Now, libc++ build is fixed. Reviewers: aizatsky, kcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16487 llvm-svn: 258571	2016-01-22 22:28:27 +00:00
Weiming Zhao	13e7cb294c	Fix LivePhysRegs::addLiveOuts Summary: The test for returnBB was flipped, which may cause the ARM ld/st opt pass to use callee-saved regs in returnBB when shrink-wrap is used. Reviewers: t.p.northover, apazos, MatzeB Subscribers: mcrosier, zzheng, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D16434 llvm-svn: 258569	2016-01-22 22:21:34 +00:00
Sanjay Patel	908ea7312a	fixed to test features, not CPU models llvm-svn: 258568	2016-01-22 22:20:56 +00:00
Sanjay Patel	c4efadb665	fix typos; NFC llvm-svn: 258567	2016-01-22 22:09:41 +00:00
Owen Anderson	bb33e7b4ba	Strip local symbols when using externalized debug info. When we build LLVM with externalized debug info, all debugging and symbolication related data is extracted into dSYM files prior to stripping. As such, there is no need to preserve local symbols in LLVM binaries after dSYM creation. This shrinks libLLVM.dylib from 58MB to 55MB on my system. llvm-svn: 258566	2016-01-22 22:07:24 +00:00
Davide Italiano	72d0d89b59	[gold] Remove inconsistent llvm_unreachable(). Differential Revision: http://reviews.llvm.org/D16429 llvm-svn: 258561	2016-01-22 21:36:49 +00:00
Matt Arsenault	7766b951d6	AMDGPU: Remove GCCBuiltin from intrinsics that need mangling If the intrinsic is overloaded and works on multiple types, it cannot resolve to a single corresponding builtin and requires handling in clang. This just causes crashes now. llvm-svn: 258559	2016-01-22 21:30:46 +00:00
Matt Arsenault	10ca39ca8b	AMDGPU: Add new name for barrier intrinsic llvm-svn: 258558	2016-01-22 21:30:43 +00:00
Matt Arsenault	bef34e21c7	AMDGPU: Rename intrinsics to use amdgcn prefix The intrinsic target prefix should match the target name as it appears in the triple. This is not yet complete, but gets most of the important ones. llvm.AMDGPU.* intrinsics used by mesa and libclc are still handled for compatibility for now. llvm-svn: 258557	2016-01-22 21:30:34 +00:00
Sergei Larin	94be2dee7e	Make sure that any new and optimized objects created during GlobalOPT copy all the attributes from the base object. Summary: Make sure that any new and optimized objects created during GlobalOPT copy all the attributes from the base object. A good example of improper behavior in the current implementation is section information associated with the GlobalObject. If a section was set for it, and GlobalOpt is creating/modifying a new object based on this one (often copying the original name), without this change new object will be placed in a default section, resulting in inappropriate properties of the new variable. The argument here is that if customer specified a section for a variable, any changes to it that compiler does should not cause it to change that section allocation. Moreover, any other properties worth representation in copyAttributesFrom() should also be propagated. Reviewers: jmolloy, joker-eph, joker.eph Subscribers: slarin, joker.eph, rafael, tobiasvk, llvm-commits Differential Revision: http://reviews.llvm.org/D16074 llvm-svn: 258556	2016-01-22 21:18:20 +00:00
Nico Weber	7849ad0f72	Make InstProfWriter compile again after 258544 with MSVC. \src\llvm-rw\include\llvm/Support/AlignOf.h(254) : error C2872: 'detail' : ambiguous symbol could be 'llvm::detail' or 'llvm::support::detail' llvm-svn: 258553	2016-01-22 21:13:04 +00:00
Sanjay Patel	3388d1fc6d	function names start with a lowercase letter; NFC llvm-svn: 258552	2016-01-22 21:11:47 +00:00
Sanjoy Das	95639746e5	[PlaceSafepoints] Introduce a -spp-no-statepoints flag Summary: This change adds a `-spp-no-statepoints` flag to PlaceSafepoints that bypasses the code that wraps newly introduced polls and existing calls in gc.statepoint. With `-spp-no-statepoints` enabled, PlaceSafepoints effectively becomes a safepoint poll insertion pass. The eventual goal is to "constant fold" this option, along with `-rs4gc-use-deopt-bundles` to `true`, once clients using gc.statepoint are okay doing so. Reviewers: pgavlin, reames, JosephTremoulet Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16439 llvm-svn: 258551	2016-01-22 21:02:55 +00:00
Xinliang David Li	e1574f0cdf	[PGO] Remove use of static variable. /NFC Make the variable a member of the writer trait object owned now by the writer. Also use a different generator interface to pass the infoObject from the writer. llvm-svn: 258544	2016-01-22 20:25:56 +00:00
Ahmed Bougacha	8e491e2d02	[AArch64] Cleanup ccmp test check labels. NFC. llvm-svn: 258541	2016-01-22 20:02:26 +00:00
Rafael Espindola	c0b103b133	Typo fix and simplification. Thanks to Justin Bogner for the suggestion. llvm-svn: 258540	2016-01-22 19:58:18 +00:00
Xinliang David Li	46ad363ba4	Revert 258486 -- for a better fix coming soon llvm-svn: 258538	2016-01-22 19:53:31 +00:00
Matt Arsenault	0b783ef076	AMDGPU: Fix crash with invariant markers The promote alloca pass didn't handle these intrinsics and crashed. These intrinsics should accept any address space, but for now just erase them to avoid breaking. llvm-svn: 258537	2016-01-22 19:47:54 +00:00
Jingyue Wu	585ec8671d	[NVPTX] expand mul_lohi to mul_lo and mul_hi Summary: Fixes PR26186. Reviewers: grosser, jholewinski Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D16479 llvm-svn: 258536	2016-01-22 19:47:26 +00:00
Rafael Espindola	d5607d35d4	Add ArrayRef support to EndianStream. Using an array instead of ArrayRef would allow type inference, but (short of using C99) one would still need to write typedef uint16_t VT[]; LE.write(VT{0x1234, 0x5678}); llvm-svn: 258535	2016-01-22 19:44:46 +00:00
Ahmed Bougacha	78d6efdb93	[AArch64] Simplify emitConditionalCompare calls. NFC. Now that both callsites are identical, we can simplify the prototype and make it easier to reason about the 2-CC case. llvm-svn: 258534	2016-01-22 19:43:57 +00:00
Ahmed Bougacha	99209b90a4	[AArch64] Lower 2-CC FCCMPs (one/ueq) using AND'ed CCs. The current behavior is incorrect, as the two CCs returned by changeFPCCToAArch64CC, intended to be OR'ed, are instead used in an AND ccmp chain. Consider: define i32 @t(float %a, float %b, float %c, float %d, i32 %e, i32 %f) { %cc1 = fcmp one float %a, %b %cc2 = fcmp olt float %c, %d %and = and i1 %cc1, %cc2 %r = select i1 %and, i32 %e, i32 %f ret i32 %r } Assuming (%a < %b) and (%c < %d); we used to do: fcmp s0, s1 # nzcv <- 1000 orr w8, wzr, #0x1 # w8 <- 1 csel w9, w8, wzr, mi # w9 <- 1 csel w8, w8, w9, gt # w8 <- 1 fcmp s2, s3 # nzcv <- 1000 cset w9, mi # w9 <- 1 tst w8, w9 # (w8 & w9) == 1, so: nzcv <- 0000 csel w0, w0, w1, ne # w0 <- w0 We now do: fcmp s2, s3 # nzcv <- 1000 fccmp s0, s1, #0, mi # mi, so: nzcv <- 1000 fccmp s0, s1, #8, le # !le, so: nzcv <- 1000 csel w0, w0, w1, pl # !pl, so: w0 <- w1 In other words, we transformed: (c < d) && ((a < b) || (a > b)) into: (c < d) && (a u>= b) && (a u<= b) whereas, per De Morgan's, we wanted: (c < d) && !((a u>= b) && (a u<= b)) Note that this problem doesn't occur in the test-suite. changeFPCCToAArch64CC produces disjunct CCs; here, one -> mi/gt. We can't represent that in the fccmp chain; it can't express arbitrary OR sequences, as one comment explains: In general we can create code for arbitrary "... (and (and A B) C)" sequences. We can also implement some "or" expressions, because "(or A B)" is equivalent to "not (and (not A) (not B))" and we can implement some negation operations. [...] However there is no way to negate the result of a partial sequence. Instead, introduce changeFPCCToANDAArch64CC, which produces the conjunct cond codes: - (a one b) == ((a olt b) || (a ogt b)) == ((a ord b) && (a une b)) - (a ueq b) == ((a uno b) || (a oeq b)) == ((a ule b) && (a uge b)) Note that, at first, one might think that, when PushNegate is true, we should use the disjunct CCs, in effect doing: (a || b) = !(!a && !(b)) = !(!a && !(b1 || b2)) <- changeFPCCToAArch64CC(b, b1, b2) = !(!a && !b1 && !b2) However, we can take advantage of the fact that the CC is already negated, which lets us avoid special-casing PushNegate and doing the simpler to reason about: (a || b) = !(!a && (!b)) = !(!a && (b1 && b2)) <- changeFPCCToANDAArch64CC(!b, b1, b2) = !(!a && b1 && b2) This makes both emitConditionalCompare cases behave identically, and produces correct ccmp sequences for the 2-CC fcmps. llvm-svn: 258533	2016-01-22 19:43:54 +00:00
Ahmed Bougacha	6345b9ecfa	[AArch64] Assert that CCMP isel didn't fail inconsistently. We verify that the op tree is eligible for CCMP emission in isConjunctionDisjunctionTree, but it's also possible that emitConjunctionDisjunctionTree fails later. The initial check is useful, as it avoids building nodes that will get discarded. Still, make sure that inconsistencies don't happen with an assert. llvm-svn: 258532	2016-01-22 19:43:43 +00:00
Sanjoy Das	acc43d197d	[RS4GC] Use OB_deopt instead of "deopt" llvm-svn: 258529	2016-01-22 19:20:40 +00:00
Krzysztof Parzyszek	7b413c6c63	[Hexagon] Use general purpose registers to spill pred/mod registers into Patch by Tobias Edler Von Koch. llvm-svn: 258527	2016-01-22 19:15:58 +00:00
Matt Arsenault	429f28066c	AMDGPU: Fix getArchTypePrefix llvm-svn: 258525	2016-01-22 19:09:12 +00:00
Matt Arsenault	59bd3014f2	AMDGPU: Rename some r600 intrinsics to use correct TargetPrefix These ones aren't directly emitted by mesa and inserted by a pass. llvm-svn: 258523	2016-01-22 19:00:09 +00:00
Matt Arsenault	bb4ff5f5b6	AMDGPU: Remove unused R600 intrinsics llvm-svn: 258522	2016-01-22 18:52:14 +00:00
David Majnemer	734d7c3272	[WinEH] Make collectFuncletMembers non-recursive Use a worklist for the pre-order DFS instead of using recursion. No functionality change is intended. llvm-svn: 258521	2016-01-22 18:49:50 +00:00
Kevin Enderby	f681ec5db1	Fix MachOObjectFile::getSymbolName() to not call report_fatal_error() but to return object_error::parse_failed. Then made the code in llvm-nm do for Mach-O files what is done in the darwin native tools which is to print "bad string index" for bad string indexes. Updated the error message in the llvm-objdump test, and added tests to show llvm-nm prints "bad string index" and a test to print the actual bad string index value which in this case is 0xfe000002 when printing the fields as raw hex. llvm-svn: 258520	2016-01-22 18:47:14 +00:00
Matt Arsenault	7898b90ee1	AMDGPU: Change control flow intrinsics to use amdgcn prefix These aren't supposed to be used outside of the backend, so there aren't any users to worry about. llvm-svn: 258516	2016-01-22 18:42:55 +00:00
Matt Arsenault	8d903029e8	AMDGPU: Don't use separate mulhu/mulhs Pats llvm-svn: 258515	2016-01-22 18:42:49 +00:00
Matt Arsenault	ee0930821a	AMDGPU: Remove random TGSI intrinsic I don't think this was ever used. llvm-svn: 258514	2016-01-22 18:42:44 +00:00
Matt Arsenault	0cbaa1762b	AMDGPU: Remove AMDGPU.fract intrinsic Mesa doesn't use this, and this is pattern matched already from fsub x, (ffloor x) llvm-svn: 258513	2016-01-22 18:42:38 +00:00
Xinliang David Li	3865fdc4cc	[PGO] add an interface needed by icall promotion llvm-svn: 258509	2016-01-22 18:13:34 +00:00
Craig Topper	674d238bcc	[TableGen] Make a class member local to the function that populates it and consumes it later. NFC llvm-svn: 258490	2016-01-22 05:59:43 +00:00
Craig Topper	6664c18518	[TableGen] Reorder fields in AsmWriterOperand to remove padding and reduce size. NFC llvm-svn: 258489	2016-01-22 05:59:40 +00:00
Craig Topper	db75cc184a	[TableGen] Remove the CGIOpNo from AsmWriterOperand as it's not used for anything. NFC llvm-svn: 258488	2016-01-22 05:59:37 +00:00
Xinliang David Li	876c2024e2	[PGO] eliminate use of static variable llvm-svn: 258486	2016-01-22 05:48:40 +00:00
JF Bastien	4383a34268	NFC WebAssembly: update links I got a vanity URL, and moved the github waterfall repo. llvm-svn: 258484	2016-01-22 04:21:49 +00:00
Dan Gohman	0bf3ae84ca	[SelectionDAG] Fold more offsets into GlobalAddresses This reapplies r258296 and r258366, and also fixes an existing bug in SelectionDAG.cpp's isMemSrcFromString, neglecting to account for the offset in a GlobalAddressSDNode, which is uncovered by those patches. llvm-svn: 258482	2016-01-22 03:57:34 +00:00
Manuel Jacob	cc13c2cf47	Replace Type::getInt32Ty() and comparison by isIntegerTy(32). NFC. llvm-svn: 258480	2016-01-22 03:30:27 +00:00
Ivan Krasin	d84f74cab7	Revert r258473 as it's breaking the build with libc++ Reviewers: kcc Differential Revision: http://reviews.llvm.org/D16441 llvm-svn: 258479	2016-01-22 03:21:52 +00:00
Eduard Burtescu	68e7f49f8e	[opaque pointer types] [NFC] DataLayout::getIndexedOffset: take source element type instead of pointer type and rename to getIndexedOffsetInType. Summary: Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16282 llvm-svn: 258478	2016-01-22 03:08:27 +00:00
Eduard Burtescu	e2a6917849	[opaque pointer types] [NFC] FindAvailableLoadedValue: take LoadInst instead of just the pointer. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16422 llvm-svn: 258477	2016-01-22 01:51:51 +00:00
Eduard Burtescu	093ae49077	[opaque pointer types] [NFC] gep_type_{begin,end} now take source element type and address space. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16436 llvm-svn: 258474	2016-01-22 01:33:43 +00:00
Ivan Krasin	b008fd4d89	Use std::piecewise_constant_distribution instead of ad-hoc binary search. Summary: Fix the issue with the most recently discovered unit receiving much less attention. Note: I had to change the seed for one test to make it pass. Alternatively, the number of runs could be increased. I believe that the average time of 'foo' discovery is not increased, just seed=1 was particularly convenient for the previous PRNG scheme used. Reviewers: aizatsky, kcc Subscribers: llvm-commits, kcc Differential Revision: http://reviews.llvm.org/D16419 llvm-svn: 258473	2016-01-22 01:32:34 +00:00
Eduard Burtescu	1423921a24	[opaque pointer types] [NFC] Add an explicit type argument to ConstantFoldLoadFromConstPtr. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16418 llvm-svn: 258472	2016-01-22 01:17:26 +00:00
Pirama Arumuga Nainar	71e9a2a4c4	Do not lower VSETCC if operand is an f16 vector Summary: SETCC with f16 vectors has OperationAction set to Expand but still gets lowered to FCM* intrinsics based on its result type. This patch skips lowering of VSETCC if the operand is an f16 vector. v4 and v8 tests included. Reviewers: ab, jmolloy Subscribers: srhines, llvm-commits Differential Revision: http://reviews.llvm.org/D15361 llvm-svn: 258471	2016-01-22 01:16:57 +00:00
Reid Kleckner	b7ecfa5b09	Revert "[SelectionDAG] Fold more offsets into GlobalAddresses" This reverts r258296 and the follow up r258366. With this change, we miscompiled the following program on Windows: #include <string> #include <iostream> static const char kData[] = "asdf jkl;"; int main() { std::string s(kData + 3, sizeof(kData) - 3); std::cout << s << '\n'; } llvm-svn: 258465	2016-01-22 01:09:29 +00:00
Kostya Serebryany	b5e984992a	[libFuzzer] don't do expensive memmem if the result will not be used llvm-svn: 258462	2016-01-22 01:04:58 +00:00
Teresa Johnson	6cba37ce75	[ThinLTO] Do metadata linking during batch function importing Summary: Since we are currently not doing incremental importing there is no need to link metadata as a postpass. The module linker will only link in the imported subroutines due to the functionality added by r256003. (Note that the metadata postpass linking functionality is still used by llvm-link, and may be needed here in the future if a more incremental strategy is adopted.) Reviewers: joker.eph Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16424 llvm-svn: 258458	2016-01-22 00:15:53 +00:00
Eduard Burtescu	2f4758b1cc	[opaque pointer types] [NFC] Take advantage of get{Source,Result}ElementType when folding GEPs. Summary: Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16302 llvm-svn: 258456	2016-01-21 23:42:06 +00:00
Sanjay Patel	577472141c	move function definitions so we don't need separate declarations ; NFCI llvm-svn: 258455	2016-01-21 23:38:43 +00:00
Sanjay Patel	9beec21fcf	[LibCallSimplifier] refactor FP function signature checks ; NFCI Use the helper function added in r258428. The check should really be hoisted to the caller of all of these optimize* functions, but that's another step. llvm-svn: 258446	2016-01-21 22:58:01 +00:00
Sanjay Patel	042aed90ab	avoid variable shadowing; NFC llvm-svn: 258445	2016-01-21 22:41:16 +00:00
Sanjay Patel	0e603fc3a7	remove unnecessary variable; NFC llvm-svn: 258444	2016-01-21 22:31:18 +00:00
Reid Kleckner	18ec96f0fc	Avoid unnecessary stack realignment in musttail thunks with SSE2 enabled The X86 musttail implementation finds register parameters to forward by running the calling convention algorithm until a non-register location is returned. However, assigning a vector memory location has the side effect of increasing the function's stack alignment. We shouldn't increase the stack alignment when we are only looking for register parameters, so this change conditionalizes it. llvm-svn: 258442	2016-01-21 22:23:22 +00:00
Simon Pilgrim	5ba1c127fc	[X86][SSE] Improve i16 splatting shuffles Better handling of the annoying pshuflw/pshufhw ops which only shuffle lower/upper halves of a vector. Added vXi16 unary shuffle support for cases where i16 elements (from the same half of the source) are being splatted to the whole of one of the halves. This avoids the general lowering case which must shuffle the 32-bit elements first - meaning that we used to end up with unnecessary duplicate pshuflw/pshufhw shuffles. Note this has the side effect of a lot of SSSE3 test cases no longer needing to use PSHUFB, as it falls below the 3 op combine threshold for when PSHUFB is typically worth it. I've raised PR26183 to discuss if the threshold should be changed and whether we need to make it more specific to the target CPU. Differential Revision: http://reviews.llvm.org/D14901 llvm-svn: 258440	2016-01-21 22:07:41 +00:00
Dimitry Andric	ca0516014c	In test-release.sh, only run `uname -s` once. NFC. llvm-svn: 258439	2016-01-21 22:07:17 +00:00
Lang Hames	3db630b5e7	[RuntimeDyld][AArch64] Add support for the MachO ARM64_RELOC_SUBTRACTOR reloc. llvm-svn: 258438	2016-01-21 21:59:50 +00:00
Dimitry Andric	4a5f8a19c7	Let test-release.sh checkout subprojects directly into the target tree, instead of using symlinks Summary: In the past I have run into several problems with the way `test-release.sh` creates all the subproject directories as siblings, and then uses symlinks to stitch them all together. In some scenarios this leads to clang not being able to find header files, etc. This patch changes the script so it directly exports into the correct target locations for each subproject. Reviewers: hans Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D16420 llvm-svn: 258436	2016-01-21 21:57:49 +00:00
David L Kreitzer	4d7257dfa1	Fix for two constant propagation problems in GVN with the assume intrinsic instruction. Patch by Yuanrui Zhang. Differential Revision: http://reviews.llvm.org/D16100 llvm-svn: 258435	2016-01-21 21:32:35 +00:00
Kevin Enderby	1f472eace5	Fix MachOObjectFile::getSymbolSection() to not call report_fatal_error() but to return object_error::parse_failed. Then made the code in llvm-nm do for Mach-O files what is done in the darwin native tools which is to print "(?,?)" or just "s" for bad section indexes. Also added a test to show it prints the bad section index of "42" when printing the fields as raw hex. llvm-svn: 258434	2016-01-21 21:13:27 +00:00
Sanjay Patel	fcc7c1a0ba	[LibCallSimplifier] don't get fooled by a fake fmin() This is similar to the bug/fix: https://llvm.org/bugs/show_bug.cgi?id=26211 http://reviews.llvm.org/rL258325 The fmin() test case reveals another bug caused by sloppy code duplication. It will crash without this patch because fp128 is a valid floating-point type, but we would think that we had matched a function that used doubles. The new helper function can be used to replace similar checks that are used in several other places in this file. llvm-svn: 258428	2016-01-21 20:19:54 +00:00
Rong Xu	950af1558f	Fix buildbot failure due to r258420 Include the needed header file to fix the buildbot failure due to r258420 [PGO] Passmanagerbuilder change that enable IR level PGO instrumentation. llvm-svn: 258423	2016-01-21 19:06:24 +00:00
David Majnemer	3af5bf30e3	[InstCombine] Simplify (x >> y) <= x This commit extends the patterns recognised by InstSimplify to also handle (x >> y) <= x in the same way as (x /u y) <= x. The missing optimisation was found investigating why LLVM did not optimise away bound checks in a binary search: https://github.com/rust-lang/rust/pull/30917 Patch by Andrea Canciani! Differential Revision: http://reviews.llvm.org/D16402 llvm-svn: 258422	2016-01-21 18:55:54 +00:00
Chad Rosier	406808e344	Partially revert "Add command line options to force function/loop alignments." This partially reverts r256571 in favor of the solution in r258409. llvm-svn: 258421	2016-01-21 18:49:15 +00:00
Rong Xu	34abbfb78e	[PGO] Passmanagerbuilder change that enable IR level PGO instrumentation This patch includes the passmanagerbuilder change that enables IR level PGO instrumentation. It adds two passmanagerbuilder options: -profile-generate=<profile_filename> and -profile-use=<profile_filename>. The new options are primarily for debug purpose. Reviewers: davidxl, silvas Differential Revision: http://reviews.llvm.org/D15828 llvm-svn: 258420	2016-01-21 18:28:59 +00:00
Adam Nemet	af761104ba	[TTI] Add getCacheLineSize Summary: And use it in PPCLoopDataPrefetch.cpp. @hfinkel, please let me know if your preference would be to preserve the ppc-loop-prefetch-cache-line option in order to be able to override the value of TTI::getCacheLineSize for PPC. Reviewers: hfinkel Subscribers: hulx2000, mcrosier, mssimpso, hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D16306 llvm-svn: 258419	2016-01-21 18:28:36 +00:00
Rong Xu	ed9fec7365	[PGO] IR level instrumentation of indirect call value profiling This patch adds the instrumentation for indirect call value profiling. It finds all the indirect call-sites and generates instrprof_value_profile intrinsic calls. A new opt level option -disable-vp is introduced to disable this instrumentation. Reviewers: davidxl, betulb, vsk Differential Revision: http://reviews.llvm.org/D16016 llvm-svn: 258417	2016-01-21 18:11:44 +00:00
Sanjay Patel	4e971da272	make helper functions static; NFCI llvm-svn: 258416	2016-01-21 18:01:57 +00:00
Manuel Jacob	f3ee254bc2	Undo r258163 "Move part of an if condition into an assertion. NFC." This undoes the change made in r258163. The assertion fails if `Ptr` is of a vector type. The previous code doesn't look completely correct either, so I'll investigate this more. llvm-svn: 258411	2016-01-21 17:36:14 +00:00
Philip Reames	82e0f15f86	Fix a type in a comment Thanks to Sean Silva for pointing it out. llvm-svn: 258410	2016-01-21 17:32:12 +00:00
Geoff Berry	10494aca05	[BlockPlacement] Add option to align all non-fall-through blocks. Summary: This option is being added for testing purposes. Reviewers: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16410 llvm-svn: 258409	2016-01-21 17:25:52 +00:00
Matthew Simpson	486bace5cc	Revert "[SLP] Truncate expressions to minimum required bit width" This reverts commit r258404. llvm-svn: 258408	2016-01-21 17:17:20 +00:00
Teresa Johnson	f5aa64f25f	Use early return to simplify code (NFC) Follow on to r258405. llvm-svn: 258407	2016-01-21 17:16:53 +00:00
Vedant Kumar	61035fa3cb	[GCOV] Avoid emitting profile arcs for module and skeleton CUs Do not emit profile arc files and note files for module and skeleton CU's. Our users report seeing unexpected .gcda and .gcno files in their projects when using gcov-style profiling with modules or frameworks. The unwanted files come from these modules. This is not very helpful for end-users. Further, we've seen reports of instrumented programs crashing while writing these files out (due to I/O failures). rdar://problem/22838296 Reviewed-by: aprantl Differential Revision: http://reviews.llvm.org/D15997 llvm-svn: 258406	2016-01-21 17:04:42 +00:00
Teresa Johnson	6f508afce1	[ThinLTO] Avoid unnecessary hash lookups during metadata linking (NFC) Replace sequences of count() followed by operator[] with either find() or insert(), depending on the context. llvm-svn: 258405	2016-01-21 16:46:40 +00:00
Matthew Simpson	cb17d72170	[SLP] Truncate expressions to minimum required bit width This change attempts to produce vectorized integer expressions in bit widths that are narrower than their scalar counterparts. The need for demotion arises especially on architectures in which the small integer types (e.g., i8 and i16) are not legal for scalar operations but can still be used in vectors. Like similar work done within the loop vectorizer, we rely on InstCombine to perform the actual type-shrinking. We use the DemandedBits analysis and ComputeNumSignBits from ValueTracking to determine the minimum required bit width of an expression. Differential revision: http://reviews.llvm.org/D15815 llvm-svn: 258404	2016-01-21 16:31:55 +00:00
Scott Egerton	2455701117	[mips] Allowed dla instructions on 32-bit architectures. Summary: This is now the same as the behaviour of the GNU assembler. This was done as it is required in order to build the Linux kernel with the integrated assembler enabled. Reviewers: dsanders, vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D13594 llvm-svn: 258400	2016-01-21 15:11:01 +00:00
Teresa Johnson	e0373a6796	Revert obsolete llvm-link -preserve-modules option/test This testing mode is now obsolete with the change to linkInModule to take a std::unique_ptr to Module. llvm-svn: 258399	2016-01-21 14:28:52 +00:00
Igor Breger	7a000f5bb2	AVX512: Masked move intrinsic implementation. Implemented intrinsic for the following instructions (reg move) : VMOVDQU8/16, VMOVDQA32/64, VMOVAPS/PD. Differential Revision: http://reviews.llvm.org/D16316 llvm-svn: 258398	2016-01-21 14:18:11 +00:00
Michael Zuckerman	21a30a42a9	[AVX512] Adding VPERMT2B and VPERMI2B Intrinsics Differential Revision: http://reviews.llvm.org/D16398 llvm-svn: 258397	2016-01-21 13:36:01 +00:00
Krzysztof Parzyszek	14f9535eec	PR26172: unnecessary indirection in HexagonCopyToCombine.cpp llvm-svn: 258395	2016-01-21 12:45:17 +00:00
Marina Yatsina	ff262fa807	[X86] - Removing warning on legal cases caused by commit r258132 There's an overloading of the "movsd" and "cmpsd" instructions, e.g. movsd can be either "Move Data from String to String" or "Move or Merge Scalar Double-Precision Floating-Point Value". The former should produce warnings when parsing a memory operand that is not ESI/EDI, but the latter should not. Fixed the code to produce warnings only after making sure we're dealing with the first case. Expanded the tests of the produced warnings + fixed RUN line of the test so that it would check both stdout and stderr Differential Revision: http://reviews.llvm.org/D16359 llvm-svn: 258393	2016-01-21 11:37:06 +00:00
Manuel Jacob	e902459c4b	Change ConstantFoldInstOperands to take Instruction instead of opcode and type. NFC. Summary: The previous form, taking opcode and type, is moved to an internal helper and the new form, taking an instruction, is a wrapper around this helper. Although this is a slight cleanup on its own, the main motivation is to refactor the constant folding API to ease migration to opaque pointers. This will be follow-up work. Reviewers: eddyb Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D16383 llvm-svn: 258391	2016-01-21 06:33:22 +00:00
Manuel Jacob	925d029461	Introduce ConstantFoldCastOperand function and migrate some callers of ConstantFoldInstOperands to use it. NFC. Summary: Although this is a slight cleanup on its own, the main motivation is to refactor the constant folding API to ease migration to opaque pointers. This will be follow-up work. Reviewers: eddyb Subscribers: zzheng, dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D16380 llvm-svn: 258390	2016-01-21 06:31:08 +00:00
Manuel Jacob	a61ca37b6d	Introduce ConstantFoldBinaryOpOperands function and migrate some callers of ConstantFoldInstOperands to use it. NFC. Summary: Although this is a slight cleanup on its own, the main motivation is to refactor the constant folding API to ease migration to opaque pointers. This will be follow-up work. Reviewers: eddyb Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D16378 llvm-svn: 258389	2016-01-21 06:26:35 +00:00
Tom Stellard	de008d338c	AMDGPU/SI: Pass whether to use the SI scheduler via Target Attribute Summary: Currently the SI scheduler can be selected via command line option, but it turned out it would be better if it was selectable via a Target Attribute. This patch adds "si-scheduler" attribute to the backend. Reviewers: tstellarAMD, echristo Subscribers: echristo, arsenm Differential Revision: http://reviews.llvm.org/D16192 llvm-svn: 258386	2016-01-21 04:28:34 +00:00
Xinliang David Li	b4fc4cbee6	re-submit test case (with right format-version) llvm-svn: 258384	2016-01-21 02:35:59 +00:00
Andrew Wilkins	7ab4dc76c4	llvm-go: call llvm-config with components Summary: Add components back into calls to llvm-config, which was accidentally removed in r258283. Reviewers: pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16392 llvm-svn: 258383	2016-01-21 02:33:39 +00:00
David Majnemer	8cc30787b0	Rename MCLineEntry to MCDwarfLineEntry MCLineEntry gives the impression that it is generic MC machinery. However, it is specific to DWARF. llvm-svn: 258381	2016-01-21 01:59:03 +00:00
Kostya Serebryany	2f13f223c7	[libFuzzer] don't use std::vector in one more hot path llvm-svn: 258380	2016-01-21 01:52:14 +00:00
Andrew Wilkins	a7a8ab71aa	[GlobalISel] make library an optional component Summary: Mark the LLVMGlobalISel library as optional in LLVMBuild.txt, since the library is only built if LLVM_BUILD_GLOBAL_ISEL is set. Without doing this, llvm-config includes the library in the list of components regardless of whether it's built, and then will error out when asked for the library names/paths. Reviewers: qcolombet Subscribers: joker.eph, llvm-commits, vkalintiris Differential Revision: http://reviews.llvm.org/D16386 llvm-svn: 258379	2016-01-21 01:41:03 +00:00
Quentin Colombet	25d8e21949	[GlobalISel] Move generic opcodes description to their own file. Differential Revision: http://reviews.llvm.org/D16384 llvm-svn: 258378	2016-01-21 01:37:18 +00:00
Xinliang David Li	d75eaf2b17	Revert 258376 -- wrong version llvm-svn: 258377	2016-01-21 01:21:00 +00:00
Xinliang David Li	bc05e4e4c0	[Coverage] Add a test case for comdat The binary contains two (merged) covmap sections which have duplicate CovMapRecords from comdat (template instantiation). This test makes sure the reader reads it properly. It also tests that the coverage data from different instantiations of the same template function are properly merged in show output. llvm-svn: 258376	2016-01-21 00:57:42 +00:00
Mike Aizatsky	e313f8f8ff	[libfuzzer] use %p for printing addresses llvm-svn: 258370	2016-01-21 00:02:09 +00:00
Rafael Espindola	394524d940	Remove redundant argument. It is already a member variable. llvm-svn: 258369	2016-01-21 00:00:53 +00:00
Reid Kleckner	400f39308c	[readobj] Print CodeOffset first, it's easier to read llvm-svn: 258368	2016-01-20 23:21:14 +00:00
Dan Gohman	760bef5e50	[SelectionDAG] Fix constant offset folding to avoid commuting non-commutative operators. This fixes a miscompile in MultiSource/Benchmarks/MiBench/consumer-lame introduced in r258296. llvm-svn: 258366	2016-01-20 23:16:59 +00:00
Chad Rosier	816a1ab9d9	MachineScheduler: Add a command line option to disable post scheduler. llvm-svn: 258364	2016-01-20 23:08:32 +00:00
Chad Rosier	6338d7c390	MachineScheduler: Honor optnone functions in the pre-ra scheduler. llvm-svn: 258363	2016-01-20 22:38:25 +00:00
Rafael Espindola	55a7ae5cc7	Simplify the logic. NFC. Found while reviewing the change for PR26152. llvm-svn: 258362	2016-01-20 22:38:23 +00:00
Manuel Jacob	4e3b446ae8	Run clang-format over ConstantFolding.h, fixing inconsistent indentation. NFC. llvm-svn: 258361	2016-01-20 22:27:06 +00:00
Sanjay Patel	cd4377c74d	don't repeat function names in comments; NFC llvm-svn: 258360	2016-01-20 22:24:38 +00:00
David Blaikie	8ecf9938b2	Orc: Simplify lambda by using std::set's initializer_list ctor llvm-svn: 258359	2016-01-20 22:24:26 +00:00
Lang Hames	f129d6fb50	[Orc] Try to turn Orc execution unit tests back on for Linux. The fix in r258324 (plus r258354) should allow Orc execution tests to run on Linux. llvm-svn: 258358	2016-01-20 22:16:14 +00:00
George Burgess IV	1030d68e48	Fix typo in an error string. NFC. llvm-svn: 258357	2016-01-20 22:15:23 +00:00
Evgeniy Stepanov	9fb70f53ce	Fix PR26152. Fix the condition for when the new global takes over the name of the existing one to be the negation of the condition for the new global to get internal linkage. llvm-svn: 258355	2016-01-20 22:05:50 +00:00
Evgeniy Stepanov	b640415f9b	Fix build warning. error: field 'CCMgr' will be initialized after field 'IndirectStubsMgr' [-Werror,-Wreorder] : DL(TM.createDataLayout()), CCMgr(std::move(CCMgr)), llvm-svn: 258354	2016-01-20 22:02:07 +00:00
Tom Stellard	d1efda8e9e	AMDGPU/SI: Promote i1 SETCC operations Summary: While working on uniform branching, I've hit a few cases where we emit i1 SETCC operations. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16233 llvm-svn: 258352	2016-01-20 21:48:24 +00:00
Matt Arsenault	7836f895fe	AMDGPU: Fix old comments that mention AMDIL llvm-svn: 258350	2016-01-20 21:22:21 +00:00
Matt Arsenault	7ba334a7d9	AMDGPU: Remove AMDGPU.trunc intrinsic llvm-svn: 258348	2016-01-20 21:05:53 +00:00
Matt Arsenault	15fbe49daf	AMDGPU: Remove AMDIL.fraction intrinsic llvm-svn: 258347	2016-01-20 21:05:49 +00:00
Matt Arsenault	7cccd2672e	AMDGPU: Remove AMDIL.round.nearest intrinsic llvm-svn: 258346	2016-01-20 21:05:40 +00:00
Quentin Colombet	105cf2b179	[GlobalISel] Add the proper cmake plumbing. This patch adds the necessary plumbing to cmake to build the sources related to GlobalISel. To build the sources related to GlobalISel, we need to add -DBUILD_GLOBAL_ISEL=ON. By default, this is OFF, thus GlobalISel sources will not impact people that do not explicitly opt-in. Differential Revision: http://reviews.llvm.org/D15983 llvm-svn: 258344	2016-01-20 20:58:56 +00:00
Matt Arsenault	1c9e4ef0df	AMDGPU: Remove abs intrinsic llvm-svn: 258343	2016-01-20 20:58:29 +00:00
Matt Arsenault	f7e6e89718	AMDGPU: Remove min/max intrinsics This removes support for mesa 11.0.x llvm-svn: 258342	2016-01-20 20:50:19 +00:00
Sanjoy Das	a34ce95b60	Add a "gc-transition" operand bundle Summary: This adds a new kind of operand bundle to LLVM denoted by the `"gc-transition"` tag. Inputs to `"gc-transition"` operand bundle are lowered into the "transition args" section of `gc.statepoint` by `RewriteStatepointsForGC`. This removes the last bit of functionality that was unsupported in the deopt bundle based code path in `RewriteStatepointsForGC`. Reviewers: pgavlin, JosephTremoulet, reames Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16342 llvm-svn: 258338	2016-01-20 19:50:25 +00:00
Simon Atanasyan	2d0d8530e3	[llvm-readobj][ELF] Teach llvm-readobj to show arch specific ELF section's flags Some architecture specific ELF section flags might have the same value (for example SHF_X86_64_LARGE and SHF_HEX_GPREL) and we have to check machine architectures to select an appropriate set of possible flags. The patch selects architecture specific flags into separate arrays `ElfxxxSectionFlags` and combines `ElfSectionFlags` and `ElfxxxSectionFlags` before pass to the `StreamWriter::printFlags()` method. Differential Revision: http://reviews.llvm.org/D16269 llvm-svn: 258334	2016-01-20 19:15:18 +00:00
Quentin Colombet	2d7fa7065f	[GlobalISel] Add a generic machine opcode for ADD. The selection process being split into separate passes, we need generic opcodes to translate the LLVM IR to target independent code. This patch adds an opcode for addition: G_ADD. Differential Revision: http://reviews.llvm.org/D15472 llvm-svn: 258333	2016-01-20 19:14:55 +00:00
Sanjay Patel	f44bd38092	fix typo; NFC llvm-svn: 258332	2016-01-20 18:59:48 +00:00
Sanjay Patel	545a456235	fix formatting; NFC llvm-svn: 258330	2016-01-20 18:59:16 +00:00
Rafael Espindola	b718237dfc	Accept subtractions involving a weak symbol. When a symbol S shows up in an expression in assembly there are two possible interpretations * The expression is referring to the value of S in this file. * The expression is referring to the value after symbol resolution. In the first case the assembler can reason about the value and try to produce a relocation. In the second case, that is only possible if the symbol cannot be preempted. Assemblers are not very consistent about which interpretation gets used. This changes MC to agree with GAS in the case of an expression of the form "Sym - WeakSym". llvm-svn: 258329	2016-01-20 18:57:48 +00:00
Sanjay Patel	bd2dc67142	[LibCallSimplifier] don't get fooled by a fake sqrt() The test case will crash without this patch because the subsequent call to hasUnsafeAlgebra() assumes that the call instruction is an FPMathOperator (ie, returns an FP type). This part of the function signature check was omitted for the sqrt() case, but seems to be in place for all other transforms. Before: http://reviews.llvm.org/rL257400 ...we would have needlessly continued execution in optimizeSqrt(), but the bug was harmless because we'd eventually fail some other check and return without damage. This should fix: https://llvm.org/bugs/show_bug.cgi?id=26211 Differential Revision: http://reviews.llvm.org/D16198 llvm-svn: 258325	2016-01-20 17:41:14 +00:00
Lang Hames	6c3e790e78	[Orc] Fix a use-after-move bug in the Orc C-bindings stack. llvm-svn: 258324	2016-01-20 17:39:52 +00:00
Sanjay Patel	1c600c6e83	80-cols; NFC llvm-svn: 258323	2016-01-20 16:41:43 +00:00
Keith Walker	8c44bf1b89	Write AArch64 big endian data fixup entries as BE. There was support for writing the AArch64 big endian data fixup entries in the .eh_frame section in BE. This is changed to write all such fixup entries in BE with no restriction on the section. This is similar to the existing support for fixup entries for ARM. A test is added to check the length field in the .debug_line section as this is an example of where such a fixup occurs. Differential Revision: http://reviews.llvm.org/D16064 llvm-svn: 258320	2016-01-20 15:59:14 +00:00
Tom Stellard	77a177722f	Correctly initialize SIAnnotateControlFlow Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16304 llvm-svn: 258319	2016-01-20 15:48:27 +00:00
Michael Zuckerman	65c40afb03	[AVX512] Adding VPERMB Intrinsics Differential Revision: http://reviews.llvm.org/D16296 llvm-svn: 258316	2016-01-20 15:24:56 +00:00
Marina Yatsina	701938d64e	Fixing bug in rL258132: [X86] Adding support for missing variations of X86 string related instructions There was a bug in my rL258132 because there's an overloading of the "movsd" and "cmpsd" instructions, e.g. movsd can be either "Move Data from String to String" (the case I wanted to handle) or "Move or Merge Scalar Double-Precision Floating-Point Value" (the case that causes the asserts). Added code for escaping the unfamiliar scenarios and falling back to old behaviour. Also changed the asserts to llvm_unreachable. llvm-svn: 258312	2016-01-20 14:03:47 +00:00
Krzysztof Parzyszek	2451c4835a	Proper handling of diamond-like cases in if-conversion If converter was somewhat careless about "diamond" cases, where there was no join block, or in other words, where the true/false blocks did not have analyzable branches. In such cases, it was possible for it to remove (needed) branches, resulting in a loss of entire basic blocks. Differential Revision: http://reviews.llvm.org/D16156 llvm-svn: 258310	2016-01-20 13:14:52 +00:00
Igor Breger	d3341f5021	AVX512: Store (MOVNTPD, MOVNTPS, MOVNTDQ) using non-temporal hint intrinsic implementation. Differential Revision: http://reviews.llvm.org/D16350 llvm-svn: 258309	2016-01-20 13:11:47 +00:00
Oliver Stannard	f7696f8267	[AArch64] Fix two bugs in the .inst directive The AArch64 .inst directive was implemented using EmitIntValue, which resulted in both $x and $d (code and data) mapping symbols being emitted at the same address. This fixes it to only emit the $x mapping symbol. EmitIntValue also emits the value in big-endian order when targeting big-endian systems, but instructions are always emitted in little-endian order for AArch64. Differential Revision: http://reviews.llvm.org/D16349 llvm-svn: 258308	2016-01-20 12:54:31 +00:00
Dylan McKay	cc018c1713	[AVR] Defined calling conventions. NFC. llvm-svn: 258300	2016-01-20 09:30:01 +00:00
Petr Pavlu	eba3039238	[LTO] Fix error reporting when a file passed to libLTO is invalid or non-existent This addresses PR26060 where function lto_module_create() could return nullptr but lto_get_error_message() returned an empty string. The error() call after LTOModule::createFromFile() in llvm-lto is then removed because any error from this function should go through the diagnostic handler in llvm-lto which will exit the program. The error() call was added because this previously did not happen when the file was non-existent. This is fixed by the patch. (The situation that llvm-lto reports an error when the input file does not exist is tested by llvm/tools/llvm-lto/error.ll). Differential Revision: http://reviews.llvm.org/D16106 llvm-svn: 258298	2016-01-20 09:03:42 +00:00
Ivan Krasin	3b1c260d22	[Verifier] Fix performance regression for LTO builds Summary: Fix a significant performance regression by introducing GlobalValueVisited field and reusing the map. This is a follow up to r257823 that slowed down linking Chrome with LTO by 2.5x. If you revert this commit, please, also revert r257823. BUG=https://llvm.org/bugs/show_bug.cgi?id=26214 Reviewers: pcc, loladiro, joker.eph Subscribers: krasin1, joker.eph, loladiro, pcc Differential Revision: http://reviews.llvm.org/D16338 llvm-svn: 258297	2016-01-20 08:41:22 +00:00
Dan Gohman	edf98c5682	[SelectionDAG] Fold more offsets into GlobalAddresses SelectionDAG previously missed opportunities to fold constants into GlobalAddresses in several areas. For example, given `(add (add GA, c1), y)`, it would often reassociate to `(add (add GA, y), c1)`, missing the opportunity to create `(add GA+c, y)`. This isn't often visible on targets such as X86 which effectively reassociate adds in their complex address-mode folding logic, however it is currently visible on WebAssembly since it currently has very simple address mode folding code that doesn't reassociate anything. This patch fixes this by making SelectionDAG fold offsets into GlobalAddresses at the same times that it folds constants together, so that it doesn't miss any opportunities to perform such folding. Differential Revision: http://reviews.llvm.org/D16090 llvm-svn: 258296	2016-01-20 07:03:08 +00:00
Dan Gohman	e5d3c15d7d	[WebAssembly] Tighten up some regexes in some tests. llvm-svn: 258295	2016-01-20 05:55:09 +00:00
Dan Gohman	8394756937	[WebAssembly] Minor code cleanups. NFC. llvm-svn: 258294	2016-01-20 05:54:22 +00:00
Dan Gohman	26cf4f3689	[WebAssembly] Remove the Relooper code, as it is not currently being used. llvm-svn: 258293	2016-01-20 05:50:29 +00:00
Lang Hames	3c43dc27ab	[Orc] 'this' qualify more lambda-captured members. More workaround attempts for GCC ICEs. llvm-svn: 258288	2016-01-20 05:10:59 +00:00
Lang Hames	5959df89e9	[Orc] More qualifications of lambda-captured member variables to fix GCC ICEs. llvm-svn: 258286	2016-01-20 04:32:05 +00:00
Dan Gohman	7e64917fd1	[WebAssembly] Don't stackify stores across instructions with side effects. llvm-svn: 258285	2016-01-20 04:21:16 +00:00
Andrew Wilkins	dfd6088c3f	tools/llvm-config: improve shared library support Summary: This is a re-commit of r257003, which was reverted, along with the fixes from http://reviews.llvm.org/D15986. r252532 added support for reporting the monolithic library when LLVM_BUILD_LLVM_DYLIB is used. This would only be done if the individual components were not found, and the dynamic library is found. This diff extends this as follows: - If LLVM_LINK_LLVM_DYLIB is set, then prefer the shared library, even if all component libraries exist. - Two flags, --link-shared and --link-static are introduced to provide explicit guidance. If --link-shared is passed and the shared library does not exist, an error results. Additionally, changed the expected shared library names from (e.g.) LLVM-3.8.0 to LLVM-3.8. The former exists only in an installation (and then only in CMake builds I think?), and not in the build tree; this breaks usage of llvm-config during builds, e.g. by llvm-go. Reviewers: DiamondLovesYou, beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15986 llvm-svn: 258283	2016-01-20 04:03:09 +00:00
Lang Hames	efa5f6c170	[Orc] Qualify captured variable to work around GCC ICE. llvm-svn: 258278	2016-01-20 03:12:40 +00:00
Xinliang David Li	da656fe50e	Fix a bug in test llvm-svn: 258276	2016-01-20 02:49:53 +00:00
Joseph Tremoulet	b41632bf0f	[Inliner/WinEH] Honor implicit nounwinds Summary: Funclet EH tables require that a given funclet have only one unwind destination for exceptional exits. The verifier will therefore reject e.g. two cleanuprets with different unwind dests for the same cleanup, or two invokes exiting the same funclet but to different unwind dests. Because catchswitch has no 'nounwind' variant, and because IR producers are not required to annotate calls which will not unwind as 'nounwind', it is legal to nest a call or an "unwind to caller" catchswitch within a funclet pad that has an unwind destination other than caller; it is undefined behavior for such a call or catchswitch to unwind. Normally when inlining an invoke, calls in the inlined sequence are rewritten to invokes that unwind to the callsite invoke's unwind destination, and "unwind to caller" catchswitches in the inlined sequence are rewritten to unwind to the callsite invoke's unwind destination. However, if such a call or "unwind to caller" catchswitch is located in a callee funclet that has another exceptional exit with an unwind destination within the callee, applying the normal transformation would give that callee funclet multiple unwind destinations for its exceptional exits. There would be no way for EH table generation to determine which is the "true" exit, and the verifier would reject the function accordingly. Add logic to the inliner to detect these cases and leave such calls and "unwind to caller" catchswitches as calls and "unwind to caller" catchswitches in the inlined sequence. This fixes PR26147. Reviewers: rnk, andrew.w.kaylor, majnemer Subscribers: alexcrichton, llvm-commits Differential Revision: http://reviews.llvm.org/D16319 llvm-svn: 258273	2016-01-20 02:15:15 +00:00
Xinliang David Li	59411db520	[PGO] Add a new interface to be used by Indirect Call Promotion llvm-svn: 258271	2016-01-20 01:26:34 +00:00
Eduard Burtescu	23c4d83aa3	[NFC] Replace several manual GEP loops with gep_type_iterator. Reviewers: dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16335 llvm-svn: 258262	2016-01-20 00:26:52 +00:00

126699 Commits