llvm-project

Commit Graph

Author	SHA1	Message	Date
Manman Ren	e221a870d3	Swift Calling Convention: swifterror target-independent change. At IR level, the swifterror argument is an input argument with type ErrorObject*. For targets that support swifterror, we want to optimize it to behave as an inout value with type ErrorObject; it will be passed in a fixed physical register. The main idea is to track the virtual registers for each swifterror value. We define swifterror values as AllocaInsts with swifterror attribute or a function argument with swifterror attribute. In SelectionDAGISel.cpp, we set up swifterror values (SwiftErrorVals) before handling the basic blocks. When iterating over all basic blocks in RPO, before actually visiting the basic block, we call mergeIncomingSwiftErrors to merge incoming swifterror values when there are multiple predecessors or to simply propagate them. There, we create a virtual register for each swifterror value in the entry block. For predecessors that are not yet visited, we create virtual registers to hold the swifterror values at the end of the predecessor. The assignments are saved in SwiftErrorWorklist and will be materialized at the end of visiting the basic block. When visiting a load from a swifterror value, we copy from the current virtual register assignment. When visiting a store to a swifterror value, we create a virtual register to hold the swifterror value and update SwiftErrorMap to track the current virtual register assignment. Differential Revision: http://reviews.llvm.org/D18108 llvm-svn: 265433	2016-04-05 18:13:16 +00:00
Sanjay Patel	fd16e62d56	add tests to show missing optimization from D18230 llvm-svn: 265431	2016-04-05 18:09:36 +00:00
Sanjay Patel	4064158ccc	add example usage and workflow to --help output llvm-svn: 265430	2016-04-05 18:00:47 +00:00
David Blaikie	9b49256fa8	llvm-dwp: Simplify hashing code a bit llvm-svn: 265426	2016-04-05 17:51:40 +00:00
Sanjay Patel	6ecf1b6760	[InstCombine] regenerate checks utils/update_test_checks.py was improved with: http://reviews.llvm.org/rL265414 to CHECK-NEXT the first line of the IR function. This ensures that nothing bad has happened before that. llvm-svn: 265417	2016-04-05 17:24:54 +00:00
Sanjay Patel	0484879fe7	[x86] regenerate checks utils/update_test_checks.py was improved with: http://reviews.llvm.org/rL265414 to include the first line of the function (expected to be a comment line). This ensures that nothing bad has happened before the first actual line of checked asm. It also matches the existing behavior of the old script. llvm-svn: 265416	2016-04-05 17:12:19 +00:00
JF Bastien	c6ba5ead5e	WebAssembly: fix cfg-stackify test It was broken by reshuffling induced by r265397 'Don't delete empty preheaders in CodeGenPrepare if it would create a critical edge'. llvm-svn: 265415	2016-04-05 17:01:52 +00:00
Sanjay Patel	96241e78ed	check or check-next the first line of the function too We could make this an option if people don't like it. But since part of the reason for using a script to generate checks is to prevent lazy checking that lets bugs crawl through, let's have the script check the first line too. For asm tests, it ensures that nothing unexpected has happened before the first line of asm. This matches the existing behavior of update_llc_test_checks.py. More discussion in PR22897: https://llvm.org/bugs/show_bug.cgi?id=22897 llvm-svn: 265414	2016-04-05 16:49:07 +00:00
Valery Pykhtin	020c29e2b7	[TableGen] AsmMatcherEmitter.cpp: replace a sequence of "if" statements with a "switch" in emitValidateOperandClass. Differential Revision: http://reviews.llvm.org/D18394 llvm-svn: 265412	2016-04-05 16:18:16 +00:00
Jacques Pienaar	42991b3e5a	[lanai] LanaiSetflagAluCombiner more conservative Summary: LanaiSetflagAluCombiner could previously combine instructions across basic building blocks even when not legal. Make the LanaiSetflagAluCombiner more conservative to avoid this. Reviewers: eliben Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D18746 llvm-svn: 265411	2016-04-05 16:18:13 +00:00
Sam Parker	0d3a3a537c	[ARM] Cleanup of smul and smla instruction descriptions Removed the SDNode argument passed to the AI_smul and AI_smla multiclass definitions as they are always mul. Differential Revision: http://reviews.llvm.org/D18791 llvm-svn: 265409	2016-04-05 16:01:25 +00:00
Konstantin Zhuravlyov	e63e02cb0c	[AMDGPU] Emit linkonce and linkonce_odr symbols Differential Revision: http://reviews.llvm.org/D18726 llvm-svn: 265408	2016-04-05 16:00:58 +00:00
Haicheng Wu	3618fa786f	[BlockPlacement] Remove an unnecessary continue NFC. llvm-svn: 265407	2016-04-05 15:37:08 +00:00
Rafael Espindola	aafcf758c9	Use ArrayRef for contiguous areas in ELF. NFC. This just simplifies the code a bit. More so in lld. llvm-svn: 265403	2016-04-05 14:47:22 +00:00
Chuang-Yu Cheng	f0eba83571	Add missing test for the "Don't delete empty preheaders" added in r265397 Author: Tom Jablin (tjablin) llvm-svn: 265402	2016-04-05 14:21:32 +00:00
Rafael Espindola	1d3c43b293	Centralize the definition of a few types. NFC. llvm-svn: 265399	2016-04-05 14:10:18 +00:00
Chuang-Yu Cheng	d3fb38cae5	Don't delete empty preheaders in CodeGenPrepare if it would create a critical edge Presently, CodeGenPrepare deletes all nearly empty (only phi and branch) basic blocks. This pass can delete loop preheaders which frequently creates critical edges. A preheader can be a convenient place to spill registers to the stack. If the entrance to a loop body is a critical edge, then spills may occur in the loop body rather than immediately before it. This patch protects loop preheaders from deletion in CodeGenPrepare even if they are nearly empty. Since the patch alters the CFG, it affects a large number of test cases. In most cases, the changes are merely cosmetic (basic blocks have different names or instruction orders change slightly). I am somewhat concerned about the test/CodeGen/Mips/brdelayslot.ll test case. If the loop preheader is not deleted, then the MIPS backend does not take advantage of a branch delay slot. Consequently, I would like some close review by a MIPS expert. The patch also partially subsumes D16893 from George Burgess IV. George correctly notes that CodeGenPrepare does not actually preserve the dominator tree. I think the dominator tree was usually not valid when CodeGenPrepare ran, but I am using LoopInfo to mark preheaders, so the dominator tree is now always valid before CodeGenPrepare. Author: Tom Jablin (tjablin) Reviewers: hfinkel george.burgess.iv vkalintiris dsanders kbarton cycheng http://reviews.llvm.org/D16984 llvm-svn: 265397	2016-04-05 14:06:20 +00:00
Peter Zotov	0a2fa0a13b	[llvm-c] Expose LLVM{Get,Set}ModuleIdentifier Patch by Nicole Mazzuca <npmazzuca@gmail.com>. Differential Revision: http://reviews.llvm.org/D18736 llvm-svn: 265394	2016-04-05 13:56:59 +00:00
Simon Dardis	d9d41f531e	[mips] MIPSR6 Compact jump support This patch adds support for compact jumps similar to the previous compact branch support for MIPSR6. Unlike compact branches, compact jumps do not have a forbidden slot. As MipsInstrInfo::getEquivalentCompactForm can determine the correct expansion for jumps and branches for both microMIPS and MIPSR6, remove the unnecessary distinction in the delay slot filler. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders llvm-svn: 265390	2016-04-05 12:50:29 +00:00
Justin Holewinski	c79979299a	[NVPTX] Handle ldg created from sign-/zero-extended load Reviewers: jingyue Subscribers: jholewinski Differential Revision: http://reviews.llvm.org/D18053 llvm-svn: 265389	2016-04-05 12:38:01 +00:00
David L Kreitzer	188de5ae69	Adds the ability to use an epilog remainder loop during loop unrolling and makes this the default behavior. Patch by Evgeny Stupachenko (evstupac@gmail.com). Differential Revision: http://reviews.llvm.org/D18158 llvm-svn: 265388	2016-04-05 12:19:35 +00:00
Tamas Berghammer	849045f2aa	Set the thumb flag for thumb symbols coming from an ELF file Without setting the flag there is no way to determine if a symbol points to an arm or to a thumb function as the LSB of the address is masked out in all getter functions. Note: Currently the thumb flag is only used for MachO files so adding a test to this change is not possible. It will be used by the upcoming fix for llvm-objdump for disassembling thumb functions, which is easily testable. Differential revision: http://reviews.llvm.org/D17956 llvm-svn: 265387	2016-04-05 12:11:40 +00:00
Haojian Wu	591ae46820	Add parentheses around `&&` within `||` to avoid compiler warning message. Summary: The assert code is introduced by r265370. Reviewers: bkramer Subscribers: tejohnson Differential Revision: http://reviews.llvm.org/D18786 llvm-svn: 265383	2016-04-05 09:07:47 +00:00
Dmitry Polukhin	a3d5b0b218	[IFUNC] Use GlobalIndirectSymbol when aliases and ifuncs have something similar Second part extracted from http://reviews.llvm.org/D15525 Use GlobalIndirectSymbol in all cases when aliases and ifuncs have something in common. Differential Revision: http://reviews.llvm.org/D18754 llvm-svn: 265382	2016-04-05 08:47:51 +00:00
Etienne Bergeron	1562f69feb	[Support] Fix an invalid character escaping in string literal (unittest). Summary: A character within a string literal is not escaped correctly. In this case, there is no semantic change because the invalid character turns out to be NUL anyway. note: "\0x12" is equivalent to {0, 'x', '1', '2'} and not { 12 }. This issue was found by clang-tidy. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D18747 llvm-svn: 265376	2016-04-05 01:46:26 +00:00
Teresa Johnson	fb7c764496	[ThinLTO] Refactor some common code into getGlobalValueInfo method (NFC) Refactor common code that queries the ModuleSummaryIndex for a value's GlobalValueInfo struct into getGlobalValueInfo helper methods, which will also be used by D18763. llvm-svn: 265370	2016-04-05 00:40:16 +00:00
JF Bastien	86d8d87640	Docs: dampen story time for atomics Story time was nice a few years ago, but by now it's nice to state how things are, rather than explain the diff from ye olden atomic history. These were dark times. llvm-svn: 265369	2016-04-05 00:31:25 +00:00
JF Bastien	1c3c223b65	Lanai: fix -Wsign-compare warning llvm-svn: 265368	2016-04-05 00:20:27 +00:00
Teresa Johnson	f4cf1c3eb4	Don't fold double constant to an integer if dest type not integral Summary: I encountered this issue when constant folding during inlining tried to fold away a bitcast of a double to an x86_mmx, which is not an integral type. The test case exposes the same issue with a smaller code snippet during early CSE. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18528 llvm-svn: 265367	2016-04-04 23:50:46 +00:00
JF Bastien	393b79ee00	Lanai: fix -Wpedantic warnings Extra semicolon. llvm-svn: 265365	2016-04-04 23:47:30 +00:00
Reid Kleckner	7de6761561	Fix non-determinism in order of LLVM attributes We were using array_pod_sort on an array of type 'Attribute', which wraps a pointer to AttributeImpl. For the most part this didn't matter because the printing code prints enum attributes in a defined order, but integer attributes such as 'align' and 'dereferenceable' were not ordered. Furthermore, AttributeImpl::operator< was broken for integer attributes. An integer attribute is a kind and an integer value, and both pieces need to be compared. By fixing the comparison operator, we can go back to std::sort, and things look good now. This should fix clang arm-swiftcall.c test failures on Windows. llvm-svn: 265361	2016-04-04 23:06:05 +00:00
Sanjay Patel	e77c7de459	use range loop; NFCI llvm-svn: 265360	2016-04-04 23:05:06 +00:00
Sanjay Patel	769b5fd546	fix typos; NFC llvm-svn: 265356	2016-04-04 22:45:56 +00:00
Amaury Sechet	56f056c01f	Style update in Core.h/Core.cpp . NFC llvm-svn: 265353	2016-04-04 22:00:25 +00:00
Justin Bogner	35c6903f22	Revert "CodeGen: Remove dead code in TailDuplicate" It seems this is reachable after all. It hit on 7zip-benchmark in lnt on ppc64: http://lab.llvm.org:8011/builders/clang-ppc64be-linux-lnt/builds/2317 This reverts r265347. llvm-svn: 265352	2016-04-04 21:41:54 +00:00
Matthias Braun	7511abd5c1	MachineScheduler: Ignore COPYs with undef/dead op in CopyConstrain mutation. There is no problem with the code today, but the fix will avoid a crash in test/CodeGen/AMDGPU/subreg-coalescer-undef-use.ll once the DetectDeadLanes pass is added. llvm-svn: 265351	2016-04-04 21:23:46 +00:00
Matthias Braun	571e3481e7	test: Always treat .mir files as tests even outside of CodeGen/MIR We missed a handful of .mir tests that existed outside the test/CodeGen/MIR directory. Also fix the three powerpc .mir tests that nobody noticed were broken. llvm-svn: 265350	2016-04-04 21:23:44 +00:00
Teresa Johnson	3c35e0999b	Clean up calls to WriteBitcodeToFile (NFC) Remove a default parameter value being passed unnecessarily, which also reduces the changes required when this parameter is changed in D18763. Document the remaining non-default bool value passed for another parameter. llvm-svn: 265348	2016-04-04 21:19:31 +00:00
Justin Bogner	9ab8131a57	CodeGen: Remove dead code in TailDuplicate I noticed that this isn't covered by our existing tests and spent some time trying to come up with an example it actually hits. I tried hand rolling something based on the explanation in the comment, but couldn't get anything that didn't abort tail duplication earlier for one reason or another. Then, I tried cranking tail-dup-size up so this would fire more and ran a bootstrap of clang and the nightly test suite - those don't hit this either. This reverts r132816 and replaces it with an assert. llvm-svn: 265347	2016-04-04 21:11:40 +00:00
Teresa Johnson	7ddec63d8f	clang-format llvm-as.cpp (NFC) This reduces unrelated changes in other patches (such as D18763) when changes to this file are clang formatted. llvm-svn: 265346	2016-04-04 21:06:17 +00:00
Hans Wennborg	a47a692341	Re-commit r265039 "[X86] Merge adjacent stack adjustments in eliminateCallFramePseudoInstr (PR27140)" The original commit miscompiled things on 32-bit Windows, e.g. a Clang bootstrap. It turns out that mergeSPUpdates() was a bit too generous in what it interpreted as a stack adjustment, causing the following code: addl $12, %esp leal -4(%ebp), %esp To be "optimized" into simply: addl $8, %esp This commit tightens up mergeSPUpdates() and includes a new test (test14 in movtopush.ll) for this situation. llvm-svn: 265345	2016-04-04 21:02:46 +00:00
Zia Ansari	a82a58a4e5	Enable unroll for constant bound loops when TripCount is not modulo of unroll factor, reducing it to maximum power-of-2 that satisfies threshold limit. Commit for Evgeny Stupachenko (evstupac@gmail.com) Differential Revision: http://reviews.llvm.org/D18290 llvm-svn: 265337	2016-04-04 19:24:46 +00:00
Teresa Johnson	03e93bab7f	Fix bot errors from r265327, exact GUID which depends on path E.g. http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/21919 The source file path name will affect exact GUID, don't try to match exact value. llvm-svn: 265334	2016-04-04 19:11:00 +00:00
Sean Silva	f4517a15b8	Beef up some dllexport tests. Adds some dllexport tests to verify that: - Variables in bss are exported appropriately - Non-dllexport symbols aliased to dllexport symbols are not exported - Symbols declared as dllexport but are not defined are not exported We plan to enable dllimport/dllexport support for the PS4, and these additional tests are for points we noticed in our internal testing. Patch by Warren Ristow! Differential Revision: http://reviews.llvm.org/D18682 llvm-svn: 265333	2016-04-04 19:10:55 +00:00
Chandler Carruth	613eec8210	Revert r263460: [SpillPlacement] Fix a quadratic behavior in spill placement. That commit looks wonderful and awesome. Sadly, it greatly exacerbates PR17409 and effectively regresses build time for a lot of (very large) code when compiled with ASan or MSan. We thought this could be fixed forward by landing D15302 which at last fixes that PR, but some issues were discovered and it looks like that got reverted, so reverting this as well temporarily. As soon as the fix for PR17409 lands and sticks, we should re-land this patch as it won't trigger more significant test cases hitting that bug. Many thanks to Quentin and Wei here as they're doing all the awesome hard work!!! llvm-svn: 265331	2016-04-04 18:57:50 +00:00
Betul Buyukkurt	18131c4216	[PGO] Avoid instrumenting direct callees at value sites. Direct callees that are cast to other function prototypes show up in the Call/Invoke instructions as ConstantExprs. Currently llvm::CallSite's getCalledFunction() fails to return the callees in such expressions as direct calls. Value profiling should avoid instrumenting such cases. Mostly NFC. llvm-svn: 265330	2016-04-04 18:56:36 +00:00
Matthias Braun	870c34f0cf	ARM, AArch64, X86: Check preserved registers for tail calls. We can only perform a tail call to a callee that preserves all the registers that the caller needs to preserve. This situation happens with calling conventions like preserve_mostcc or cxx_fast_tls. It was explicitly handled for fast_tls and failing for preserve_most. This patch generalizes the check to any calling convention. Related to rdar://24207743 Differential Revision: http://reviews.llvm.org/D18680 llvm-svn: 265329	2016-04-04 18:56:13 +00:00
Teresa Johnson	916495d894	[ThinLTO] Add option to dump value name to GUID mapping Summary: Useful for debugging since we lose this correlation after the permodule summary/VST is read and until we later materialize source modules in the function importer. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D18555 llvm-svn: 265327	2016-04-04 18:52:58 +00:00
Teresa Johnson	0beb858e97	[ThinLTO] Augment FunctionImport dump with value name to GUID map Summary: To aid in debugging, dump out the correlation between value names and GUID for each source module when it is materialized. This will make it easier to comprehend the earlier summary-based function importing debug trace which only has access to and prints the GUIDs. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D18556 llvm-svn: 265326	2016-04-04 18:52:23 +00:00
Sanjay Patel	87a50c4f26	fix documentation comments; NFC llvm-svn: 265321	2016-04-04 18:25:06 +00:00
Brendon Cahoon	86f783e315	[DependenceAnalysis] Check if result of getConstantPart is null A seg-fault occurs due to a reference of a null pointer, which is the value returned by getConstantPart. This function returns null if the constant part is not found. The code that calls this function needs to check for the null return value. Differential Revision: http://reviews.llvm.org/D18718 llvm-svn: 265319	2016-04-04 18:13:18 +00:00
Derek Schuff	73900c6876	Replace MachineRegisterInfo::isSSA() with a MachineFunctionProperty Use the MachineFunctionProperty mechanism to indicate whether a MachineFunction is in SSA form instead of a custom method on MachineRegisterInfo. NFC Differential Revision: http://reviews.llvm.org/D18574 llvm-svn: 265318	2016-04-04 18:03:29 +00:00
Wei Mi	fb5252cac1	Revert r265309 and r265312 because they caused some errors I need to investigate. llvm-svn: 265317	2016-04-04 17:45:03 +00:00
Paul Robinson	f88cc148b6	Document standard substitutions defined by lit. Patch by Guilherme Bufolo! Differential Revision: http://reviews.llvm.org/D18752 llvm-svn: 265314	2016-04-04 17:14:45 +00:00
Derek Schuff	1dbf7a571f	Add MachineFunctionProperty checks for AllVRegsAllocated for target passes Summary: This adds the same checks that were added in r264593 to all target-specific passes that run after register allocation. Reviewers: qcolombet Subscribers: jyknight, dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18525 llvm-svn: 265313	2016-04-04 17:09:25 +00:00
Wei Mi	cdaf1df657	Fix unused var warning caused by r265309. llvm-svn: 265312	2016-04-04 17:03:58 +00:00
Wei Mi	ffbc9c7f3b	Replace analyzeSiblingValues with new algorithm to fix its compile time issue. The patch is to solve PR17409 and its duplicates. analyzeSiblingValues is an N x N complexity algorithm where N is the number of siblings generated by reg splitting. Although it causes significant compile time issues when N is large, it is also important for performance since it removes redundant spills and enables rematerialization. To solve the compile time issue, the patch removes analyzeSiblingValues and replaces it with lower cost alternatives containing two parts. The first part creates a new spill hoisting method in postOptimization of register allocation. It does spill hoisting at once after all the spills are generated instead of inside every instance of selectOrSplit. The second part queries the define expr of the original register for rematerialization and keeps it always available during register allocation even if it is already dead. It deletes those dead instructions only in postOptimization. With the two parts in the patch, it can remove analyzeSiblingValues without sacrificing performance. Differential Revision: http://reviews.llvm.org/D15302 llvm-svn: 265309	2016-04-04 16:42:40 +00:00
Daniel Sanders	b3c2764f89	[mips] Range check simm32 and fold MIPS16's imm32 into simm32. Summary: At this point we should be able to enable IAS by default for O32 without breaking check-all, or recursion. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18439 llvm-svn: 265302	2016-04-04 15:32:49 +00:00
Ulrich Weigand	99ac5045ab	[SystemZ] Add compare-and-branch instructions to MC This adds MC support for fused compare + indirect branch instructions, ie. CRB, CGRB, CLRB, CLGRB, CIB, CGIB, CLIB, CLGIB. They aren't actually generated yet -- this is preparation for their use for conditional returns in the next iteration of D17339. Author: koriakin Differential Revision: http://reviews.llvm.org/D18742 llvm-svn: 265296	2016-04-04 14:26:43 +00:00
Ulrich Weigand	a9ac6d6cc2	[SystemZ] Support ATOMIC_FENCE A cross-thread sequentially consistent fence should be lowered into z/Architecture's BCR serialization instruction, instead of causing a fatal error in the back-end. Author: bryanpkc Differential Revision: http://reviews.llvm.org/D18644 llvm-svn: 265292	2016-04-04 12:45:44 +00:00
Ulrich Weigand	f557d08325	[SystemZ] Support llvm.frameaddress/llvm.returnaddress intrinsics Enable the SystemZ back-end to lower FRAMEADDR and RETURNADDR, which previously would cause the back-end to crash. Currently, only a frame count of zero is supported. Author: bryanpkc Differential Revision: http://reviews.llvm.org/D18514 llvm-svn: 265291	2016-04-04 12:44:55 +00:00
NAKAMURA Takumi	e4a77057a3	Fixup r265277 [-Wdocumentation] llvm-svn: 265290	2016-04-04 11:54:48 +00:00
Elena Demikhovsky	e99c561391	AVX-512: Truncating store for i1 vectors Implemented truncstore for KNL and skylake-avx512. Covered vectors from v2i1 to v64i1. We save the value in bits (not in bytes) - v32i1 is saved in 4 bytes. Differential Revision: http://reviews.llvm.org/D18740 llvm-svn: 265283	2016-04-04 07:17:47 +00:00
Duncan P. N. Exon Smith	8e65f8ddfd	ValueMapper: Remove old FIXMEs; almost NFC Remove a few old FIXMEs from the original commit of the Metadata/Value split in r223802. These are commented out assertions to the effect that calls between mapValue and mapMetadata never return nullptr. (The only behaviour change is that Mapper::mapSimpleMetadata memoizes the nullptr return.) When I originally rewrote the mapping code, I thought we could be stricter in the new metadata hierarchy and never return nullptr when RF_NullMapMissingGlobalValues was off. It's still not entirely clear to me why these assertions failed (a few months ago, I had a theory that I forgot to write down, but that's helping no one). Understood or not, I no longer see how these commented-out assertions would be useful. I'm relegating them to the annals of source control before making significant changes to ValueMapper.cpp. llvm-svn: 265282	2016-04-04 04:59:56 +00:00
Davide Italiano	a017306063	[DebugInfo] Fix tests in Assembler/ Each DISubprogram with isDefinition : true must belong to a compile unit. llvm-svn: 265281	2016-04-04 02:11:34 +00:00
Duncan P. N. Exon Smith	fef609f15e	IR: Lazily create ReplaceableMetadataImpl on MDNode RAUW support on MDNode usually requires an extra allocation for ReplaceableMetadataImpl. This is only strictly necessary if there are tracking references to the MDNode. Make the construction of ReplaceableMetadataImpl lazy, so that we don't get allocations if we don't need them. Since MDNode::isResolved now checks MDNode::isTemporary and MDNode::NumUnresolved instead of whether a ReplaceableMetadataImpl is allocated, the internal changes are intrusive (at various internal checkpoints, isResolved now has a different answer). However, there should be no real functionality change here; just slightly lazier allocation behaviour. The external semantics should be identical. llvm-svn: 265279	2016-04-03 21:23:52 +00:00
Duncan P. N. Exon Smith	bd088744be	IR: Make MDNode::Context private, NFC llvm-svn: 265278	2016-04-03 21:10:00 +00:00
Amaury Sechet	7c2883cf85	Various style fix in Core.h/Core.cpp . NFC llvm-svn: 265277	2016-04-03 21:06:04 +00:00
Duncan P. N. Exon Smith	756e1c3db4	ValueMapper: Disallow metadata mapping recursion through mapValue This adds an assertion to maintain the property from r265273. When Mapper::mapSimpleMetadata calls Mapper::mapValue, it should not find its way back to mapMetadataImpl. This guarantees that mapSimpleMetadata is not involved in any recursion. Since Mapper::mapValue calls out to arbitrary materializers, we need to save a bit on the ValueMap to make this assertion effective. There should be no functionality change here. This co-recursion should already have been impossible. llvm-svn: 265276	2016-04-03 20:54:51 +00:00
Duncan P. N. Exon Smith	a997856b3d	Work around MSVC failure from r265273 http://lab.llvm.org:8011/builders/sanitizer-windows/builds/19726 llvm-svn: 265275	2016-04-03 20:42:21 +00:00
Simon Pilgrim	0edd3d771a	[X86] Removed duplicate code. llvm-svn: 265274	2016-04-03 20:40:35 +00:00
Duncan P. N. Exon Smith	c6065e3a25	ValueMapper: Avoid recursion in mapSimplifiedMetadata, NFC The main change is to delay materializing GlobalValue initializers from Mapper::mapValue until Mapper::~Mapper. This effectively removes all recursion from mapSimplifiedMetadata, as promised in r265270. mapSimplifiedMetadata calls mapValue for ConstantAsMetadata nodes to find the mapped constant, and now it shouldn't be possible for mapValue to indirectly re-invoke mapMetadata. I'll add an assertion to that effect in a follow-up (separated so that the assertion can easily be reverted independently, if it comes to that). This is a step toward a broader goal: converting Mapper::mapMetadataImpl from a recursive to an iterative algorithm. When a BlockAddress points at a BasicBlock inside an unmaterialized function body, we need to delay it until the function body is materialized in Mapper::~Mapper. This commit creates a temporary BasicBlock and returns a new BlockAddress, then RAUWs the BasicBlock once it is known. This situation should be extremely rare since a BlockAddress is usually used from within the function it's referencing (and BlockAddress itself is rare). There should be no observable functionality change. llvm-svn: 265273	2016-04-03 20:17:45 +00:00
Peter Zotov	8efe38a1e2	[CodeGenPrepare] Fix r265264 (again). Don't require TLI for SinkCmpExpression, like it wasn't before r265264. llvm-svn: 265271	2016-04-03 19:32:13 +00:00
Duncan P. N. Exon Smith	ae8bd4bd11	ValueMapper: Split out mapSimpleMetadata, NFC Split out a helper for mapping metadata without operands. This is any metadata that is not an MDNode, and any MDNode where the answer is known without looking at operands. Through some weird twists, this function is co-recursive: mapSimpleMetadata => MapValue => materializeInitFor => linkFunctionBody => RemapInstructions => MapMetadata => mapSimpleMetadata I plan to break the recursion in a follow-up. llvm-svn: 265270	2016-04-03 19:31:01 +00:00
Duncan P. N. Exon Smith	829dc87a68	ValueMapper: Introduce Mapper helper class, NFC Remove a bunch of boilerplate from ValueMapper.cpp by using a new file-local class called Mapper. llvm-svn: 265268	2016-04-03 19:06:24 +00:00
Simon Pilgrim	d74f6e22f2	[X86][SSE] Refreshed MOVMSK sign bit tests llvm-svn: 265267	2016-04-03 18:59:42 +00:00
Simon Pilgrim	cd0dfc93eb	[X86][SSE] Support for MOVMSK signbit extraction instructions Add support for lowering with the MOVMSK instruction to extract vector element signbits to a GPR. This is an early step towards more optimal handling of vector comparison results. Differential Revision: http://reviews.llvm.org/D18741 llvm-svn: 265266	2016-04-03 18:22:03 +00:00
Peter Zotov	f87e550e89	[CodeGenPrepare] Fix r265264. The case where there was no TargetLowering was not handled, leading to null pointer dereferences. llvm-svn: 265265	2016-04-03 17:11:53 +00:00
Peter Zotov	0b6d7bc682	[CodeGenPrepare] Avoid sinking soft-FP comparisons Sinking comparisons in CGP can undo the job of hoisting them done earlier by LICM, and soft-FP makes this an expensive mistake. A common pattern that produces floating point comparisons uniform over a loop is an explicit check for division by zero. If the divisor is hoisted out of the loop, the comparison can also be, but hoisting the function that unwinds is never legal, since it may cause side effects in the loop body prior to the unwinding to not be executed. Differential Revision: http://reviews.llvm.org/D18744 llvm-svn: 265264	2016-04-03 16:36:17 +00:00
Simon Pilgrim	20d1d4f045	[X86] Tidied up X86ISD instruction nodes. NFCI. Tidied up comments, stripped trailing whitespace, split apart nodes that aren't related. No change in ordering although there is definitely some scope for it. llvm-svn: 265263	2016-04-03 14:14:32 +00:00
Peter Zotov	0218d0f383	Mark some FP intrinsics as safe to speculatively execute Floating point intrinsics in LLVM are generally not speculatively executed, since most of them are defined to behave the same as libm functions, which set errno. However, the only error that can happen when executing ceil, floor, nearbyint, rint and round libm functions per POSIX.1-2001 is -ERANGE, and that requires the maximum value of the exponent to be smaller than the number of mantissa bits, which is not the case with any of the floating point types supported by LLVM. The trunc and copysign functions never set errno per POSIX.1-2001. Differential Revision: http://reviews.llvm.org/D18643 llvm-svn: 265262	2016-04-03 12:30:46 +00:00
Elena Demikhovsky	5e426f7356	AVX-512: Load and Extended Load for i1 vectors Implemented load+{sign|zero}_extend for i1 vectors Fixed failures in i1 vector load. Covered loading of v2i1, v4i1, v8i1, v16i1, v32i1, v64i1 vectors for KNL and SKX. Differential Revision: http://reviews.llvm.org/D18737 llvm-svn: 265259	2016-04-03 08:41:12 +00:00
Davide Italiano	d4f5a059e0	[SimplifyLibCalls] Garbage collect dead code. We already skip optimizations if the return value of printf() is used, so CI->use_empty() is always true. Differential Revision: http://reviews.llvm.org/D18656 llvm-svn: 265253	2016-04-03 01:46:52 +00:00
Jacques Pienaar	796975d311	[lanai] Fix for LanaiDelaySlotFiller and LanaiMCInstLower.cpp Summary: * Fix to stop delay slot filler from inserting SP modifying instructions in the newly expanded call/return instructions. * In LowerSymbol the outermost type was not LanaiMCExpr if there was a binary expression * Remove printExpr in LanaiInstPrinter Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D18734 llvm-svn: 265251	2016-04-03 00:49:27 +00:00
Zoran Jovanovic	2b7cc5a4ae	[mips][microMIPS] Revert commits r264245 and r264248. Commit r264245 was the reason for failing tests in LLVM test suite. Commit r264248 depends on the first one. llvm-svn: 265249	2016-04-02 23:06:13 +00:00
Simon Pilgrim	b2e837d875	[X86][SSE] Added 1024-bit vector comparison tests More examples of PR22603, poor vector splitting for AVX512F targets as well as missing uses of PACKSS/MOVMSK llvm-svn: 265248	2016-04-02 21:33:09 +00:00
Simon Pilgrim	8fd7d852d5	[X86][AVX512] Added AVX512 comparison tests llvm-svn: 265247	2016-04-02 21:24:42 +00:00
Saleem Abdulrasool	85b43639b1	AArch64: support .cpu directive Add support for the AArch64 .cpu directive. This is a slightly involved directive since the parameter is actually a variable encoded string. The general structure is: <cpu>[[+-]<feature>]* We now map some of the supported string names for features for internal representation of feature flags. If we encounter one which we do not support, bail out as we cannot validate the assembly any longer. Resolves PR27010. llvm-svn: 265240	2016-04-02 19:29:52 +00:00
Duncan P. N. Exon Smith	6d72d166dc	Linker: Split mapUnneededSubprograms into two; almost NFC Split the loop through compile units in mapUnneededSubprograms in two. First, visit imported entities to ensure that we've visited all needed subprograms. Second, visit subprograms, and drop the ones we don't need. Hypothetically this protects against a subprogram from one compile unit being referenced from an imported entity in a different compile unit. I don't think that's valid IR (a debug info expert could confirm), but I think the refactor makes the code more clear. llvm-svn: 265233	2016-04-02 17:54:01 +00:00
Duncan P. N. Exon Smith	751114b39d	Remove redundant assertion after cast, NFC llvm-svn: 265232	2016-04-02 17:41:52 +00:00
Duncan P. N. Exon Smith	0d60a9887f	Linker: Avoid unnecessary work when moving named metadata IRLinker::mapUnneededSubprograms has to be sure that any "needed" subprograms get linked in. Rather than traversing through imported entities using llvm::getSubprogram, call MapMetadata. The latter memoizes the result in the ValueMap (sharing work with IRLinker::linkNamedMDNodes proper), and makes the local SmallPtrSet redundant. llvm-svn: 265231	2016-04-02 17:39:31 +00:00
Mehdi Amini	8958c40430	Rename FunctionIndex into GlobalValueIndex to reflect the recent changes (NFC) The index used to contain only Function, but now contains GlobalValue in general. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265230	2016-04-02 17:29:47 +00:00
Duncan P. N. Exon Smith	4b520e5ef6	Linker: Remove IRMover::isMetadataUnneeded indirection; almost NFC Instead of checking live during MapMetadata whether a subprogram is needed, seed the ValueMap with `nullptr` up-front. There is a small hypothetical functionality change. Previously, calling MapMetadataOp on a node whose "scope:" chain led to an unneeded subprogram would return nullptr. However, if that were ever called, then the subprogram would be needed; a situation that the IRMover is supposed to avoid a priori! Besides cleaning up the code a little, this restores a nice property: MapMetadataOp returns the same as MapMetadata. llvm-svn: 265229	2016-04-02 17:12:00 +00:00
Duncan P. N. Exon Smith	da4a56d1ab	ValueMapper: Add support for seeding metadata with nullptr Support seeding a ValueMap with nullptr for Metadata entries, a situation I didn't consider in the Metadata/Value split. I added a ValueMapper::getMappedMD accessor that returns an Optional<Metadata*> with the mapped (possibly null) metadata. IRMover needs to use this to avoid modifying the map when it's checking for unneeded subprograms. I updated a call from bugpoint since I find the new code clearer. llvm-svn: 265228	2016-04-02 17:04:38 +00:00
Duncan P. N. Exon Smith	ddbb1cd45a	Document end of anonymous namespaces, NFC Prevent clang-format from deleting the preceding newline. llvm-svn: 265227	2016-04-02 16:45:51 +00:00
Duncan P. N. Exon Smith	520f8542ff	Bitcode: Try to emit metadata in function blocks Whenever metadata is only referenced by a single function, emit the metadata just in that function block. This should improve lazy-loading by reducing the amount of metadata in the global block. For now, this should catch all DILocations, and anything else that happens to be referenced only by a single function. It's also a first step toward a couple of possible future directions (which this commit does not implement): 1. Some debug info metadata is only referenced from compile units and individual functions. If we can drop the link from the compile unit, this optimization will get more powerful. 2. Any uniqued metadata that isn't referenced globally can in theory be emitted in every function block that references it (trading off bitcode size and full-parse time vs. lazy-load time). Note: this assumes the new BitcodeReader error checking from r265223. The metadata stored in function blocks gets purged after parsing each function, which means unresolved forward references will get lost. Since all the global metadata should have already been resolved by the time we get to the function metadata blocks we just need to check for that case. (If for some reason we need to handle bitcode that fails the checks in r265223, the fix is to store about-to-be-dropped unresolved nodes in MetadataList::shrinkTo until they can be handled successfully by a future call to MetadataList::tryToResolveCycles.) llvm-svn: 265226	2016-04-02 15:22:57 +00:00
Duncan P. N. Exon Smith	0b76b723f4	Fix doxygen comments from r265224, NFC llvm-svn: 265225	2016-04-02 15:16:56 +00:00
Duncan P. N. Exon Smith	9342911f31	BitcodeWriter: Further unify function metadata, NFC Further unify the handling of function-local metadata with global metadata, by exposing the same interface in ValueEnumerator. Both contexts use the same accessors: - getMDStrings(): get the strings for this block. - getNonMDStrings(): get the non-strings for this block. A future commit will start adding strings to the function-block. llvm-svn: 265224	2016-04-02 15:09:42 +00:00
Duncan P. N. Exon Smith	8742de9b20	BitcodeReader: Check for unresolved function metadata A follow-up commit will start using function metadata blocks more heavily. This commit adds some error checking to confirm that metadata is fully resolved before (and after) materializing each function. This is valid even when reading very old bitcode from before the metadata/value split. The global metadata block always came before the function blocks. However, in case somehow this causes a regression (i.e., an old LLVM did produce such bitcode after all) I'm committing separately. llvm-svn: 265223	2016-04-02 14:55:01 +00:00
Simon Pilgrim	a2c8da9e06	[X86][AVX] Added vector float truncation (double2float) tests llvm-svn: 265222	2016-04-02 14:09:17 +00:00
Mehdi Amini	1e5fddda3d	Reverts r265219. Unintentionally committed... time to call the day off! From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265221	2016-04-02 05:35:03 +00:00
Mehdi Amini	89038a1071	Fix "warning: variable 'XX' set but not used" in release build (variable used in assertion, NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265220	2016-04-02 05:34:19 +00:00
Mehdi Amini	5921a3ae66	wip From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265219	2016-04-02 05:34:14 +00:00
Mehdi Amini	b049431bec	constify GlobalValue::getGUID() and GlobalValue::getGlobalIdentifier() (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265217	2016-04-02 05:25:27 +00:00
Mehdi Amini	024a79f780	Revert "ThinLTO: add module caching handling." This reverts commit r265214, unintentionally committed. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265216	2016-04-02 05:08:18 +00:00
Mehdi Amini	ad5741b075	Create a typedef GlobalValue::GUID for uint64_t and RAUW (NFC) Summary: This should make the code more readable, especially all the map declarations. Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18721 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265215	2016-04-02 05:07:53 +00:00
Mehdi Amini	2cd609482d	ThinLTO: add module caching handling. Reviewers: tejohnson Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D18494 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265214	2016-04-02 05:07:08 +00:00
Mehdi Amini	e70901552c	80 lines column after renaming "shouldDiscardValueNames" (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265212	2016-04-02 03:59:58 +00:00
Mehdi Amini	50af49fcdc	Rename Context::discardValueNames() to shouldDiscardValueNames() (NFC) Suggested by Sean Silva. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265211	2016-04-02 03:46:17 +00:00
Mehdi Amini	27814980a3	Add Cache Pruning support Incremental LTO will use a cache to store object files. This patch handles the pruning part of the cache, exposing a few knobs: - Pruning interval: the implementation keeps a "timestamp" file in the directory and will scan it only after a given interval since the last modification of the timestamp file. This is for performance reasons; we don't want to scan the folder continuously. - Entry expiration: this is the time after which a file that hasn't been used is removed from the cache. - Maximum size: expressed as a percentage of the available disk space, it helps avoid exhausting the disk space. http://reviews.llvm.org/D18422 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265209	2016-04-02 03:28:26 +00:00
Hans Wennborg	fa6e414eef	Fix -Wpedantic warning about extra semi-colon llvm-svn: 265204	2016-04-02 01:03:41 +00:00
Rong Xu	0eb3603626	[PGO] Use a helper function to find all indirect call-sites Use a helper function to find all the indirect call-sites in a function. Also split the code into a separate file as this will be used by the indirect-call-promotion transformation. Differential Revision: http://reviews.llvm.org/D18704 llvm-svn: 265199	2016-04-01 23:16:44 +00:00
Tim Northover	5dad9df9f7	AArch64: avoid clobbering SP for dead MOVimm pseudos. We were producing ORR, which actually defines a GPR32sp rather than a GPR32. Should fix PR23209. llvm-svn: 265198	2016-04-01 23:14:52 +00:00
Nico Weber	73853ab4f8	Make DIASession work if msdia*.dll isn't registered. This fixes various symbolization test failures for me when I build with a hermetic VS2015 without having run the 2015 installer. http://reviews.llvm.org/D18707 llvm-svn: 265193	2016-04-01 22:21:51 +00:00
Adrian Prantl	cf0961f5ea	Add missing emissionKind flags to the DICompileUnits of several old testcases. llvm-svn: 265192	2016-04-01 22:18:43 +00:00
Mehdi Amini	5a2e5d324e	ThinLTO: special handling for LinkOnce functions These functions can be dropped by the compiler if they are no longer referenced in the current module. However there is a chance that another module is still referencing them because of the import. Multiple solutions can be used: - Always import LinkOnce when a caller is imported. This ensures that every module with a call to a LinkOnce has the definition and will be able to emit it if it emits the call. - Turn the LinkOnce into Weak, so that it is always emitted. - Turn all LinkOnce into available_externally and come back after all modules are codegen'ed to emit only one copy of the linkonce, when there is still a reference to it. This patch implements the second option, with an optimization that only one module will turn the LinkOnce into Weak, while the others will turn it into available_externally, so that there is exactly one copy emitted for the whole compilation. http://reviews.llvm.org/D18346 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265190	2016-04-01 21:53:50 +00:00
Manman Ren	9bfd0d03e9	Swift Calling Convention: add swifterror attribute. A ``swifterror`` attribute can be applied to a function parameter or an AllocaInst. This commit does not include any target-specific change. The target-specific optimization will come as a follow-up patch. Differential Revision: http://reviews.llvm.org/D18092 llvm-svn: 265189	2016-04-01 21:41:15 +00:00
Simon Pilgrim	3243b21dae	[X86][SSE] Regenerated vector float tests - fabs / floor(etc.) / fneg / float2double llvm-svn: 265186	2016-04-01 21:30:48 +00:00
Simon Pilgrim	1e5bf0a256	[X86][SSE] Vector i64 load tests llvm-svn: 265185	2016-04-01 21:06:17 +00:00
Simon Pilgrim	275b2bcb76	[X86][SSE] Regenerated comparison mask and float immediate tests llvm-svn: 265184	2016-04-01 21:00:00 +00:00
Simon Pilgrim	a372a0f295	[X86][SSE] Regenerated the vec_extract tests. llvm-svn: 265183	2016-04-01 20:55:19 +00:00
David Blaikie	66b1bb45b5	Update owners to reflect recent changes llvm-svn: 265182	2016-04-01 20:40:49 +00:00
Rong Xu	92c2eae4e1	Fix buildbot lldb-amd64-ninja-netbsd7 failure llvm-svn: 265180	2016-04-01 20:15:04 +00:00
Simon Pilgrim	f739d8a2ed	[X86][SSE] Regenerated the vec_insert tests. llvm-svn: 265179	2016-04-01 19:42:23 +00:00
James Y Knight	e6a4646372	Remove useless check for ThreadModel==Single in ARMISelLowering. NFC. ThreadModel::Single is already handled by ARMPassConfig adding LowerAtomicPass to the pass list, which lowers all atomics to non-atomic ops and deletes fences. So by the time we get to ISel, there are no atomic fences left, so they don't need special handling. llvm-svn: 265178	2016-04-01 19:33:19 +00:00
Peter Collingbourne	dd711b93e0	LowerBitSets: Move declarations to separate namespace. Should fix modules build. llvm-svn: 265176	2016-04-01 18:46:50 +00:00
Mike Aizatsky	f13cbee12e	[libfuzzer] adding license headers to cpp files Differential Revision: http://reviews.llvm.org/D18705 llvm-svn: 265174	2016-04-01 18:38:58 +00:00
Simon Pilgrim	6121118663	[X86][SSE] Regenerated vec_partial tests. llvm-svn: 265173	2016-04-01 18:30:29 +00:00
Sanjay Patel	9b5b5c82ca	[x86] add an SSE2 + fast-unaligned accesses run for memset nonzero tests Was there really no other way to splat a byte in SSE2? punpcklbw {{.*#+}} xmm0 = xmm0[0,0,1,1,2,2,3,3,4,4,5,5,6,6,7,7] pshuflw {{.*#+}} xmm0 = xmm0[0,0,0,0,4,5,6,7] pshufd {{.*#+}} xmm0 = xmm0[0,0,1,1] llvm-svn: 265172	2016-04-01 18:29:25 +00:00
Simon Pilgrim	858194640e	[X86][SSE] Regenerated vec_logical tests. llvm-svn: 265171	2016-04-01 18:28:23 +00:00
Tom Stellard	354a43c7bc	AMDGPU: Implement {BUFFER,FLAT}_ATOMIC_CMPSWAP{,_X2} Summary: Implement BUFFER_ATOMIC_CMPSWAP{,_X2} instructions on all GCN targets, and FLAT_ATOMIC_CMPSWAP{,_X2} on CI+. 32-bit instruction variants tested manually on Kabini and Bonaire. Tests and parts of code provided by Jan Veselý. Patch by: Vedran Miletić Reviewers: arsenm, tstellarAMD, nhaehnle Subscribers: jvesely, scchan, kanarayan, arsenm Differential Revision: http://reviews.llvm.org/D17280 llvm-svn: 265170	2016-04-01 18:27:37 +00:00
Simon Pilgrim	1b14082488	[X86][SSE] Regenerated vector sdiv to shifts tests Added SSE + AVX1 tests as well as AVX2 llvm-svn: 265169	2016-04-01 18:18:40 +00:00
Mike Aizatsky	01c0f8d8a3	[sancov] save entry block from pruning (it is always full dominator) llvm-svn: 265168	2016-04-01 18:13:19 +00:00
Sanjay Patel	d3e3d48cb9	[x86] add an SSE1 run for these tests Note however that this is identical to the existing SSE2 run. What we really want is yet another run for an SSE2 machine that also has fast unaligned 16-byte accesses. llvm-svn: 265167	2016-04-01 18:11:30 +00:00
Simon Pilgrim	b8283631a5	[X86][SSE] Regenerated vec_setcc tests. llvm-svn: 265164	2016-04-01 17:55:02 +00:00
Simon Pilgrim	41e31ff6bd	[X86][SSE] Regenerated the vec_set tests. Replaced lots of dodgy greps with actual codegen llvm-svn: 265163	2016-04-01 17:40:25 +00:00
Sanjay Patel	9f413364d5	[x86] avoid intermediate splat for non-zero memsets (PR27100) Follow-up to http://reviews.llvm.org/D18566 and http://reviews.llvm.org/D18676 - where we noticed that an intermediate splat was being generated for memsets of non-zero chars. That was because we told getMemsetStores() to use a 32-bit vector element type, and it happily obliged by producing that constant using an integer multiply. The 16-byte test that was added in D18566 is now equivalent for AVX1 and AVX2 (no splats, just a vector load), but we have PR27141 to track that splat difference. Note that the SSE1 path is not changed in this patch. That can be a follow-up. This patch should resolve PR27100. llvm-svn: 265161	2016-04-01 17:36:45 +00:00
Chad Rosier	8787a81023	[AArch64] Fix a typo. NFC. llvm-svn: 265160	2016-04-01 17:34:38 +00:00
David Majnemer	fe3f9d1721	[InstCombine] Don't sink an instr after a catchswitch A catchswitch is a terminator, instructions cannot be inserted after it. llvm-svn: 265158	2016-04-01 17:28:17 +00:00
David Majnemer	6f1f85f0e1	[SLPVectorizer] Don't insert an extractelement before a catchswitch A catchswitch cannot be preceded by another instruction in the same basic block (other than a PHI node). Instead, insert the extract element right after the materialization of the vectorized value. This isn't optimal but is a reasonable compromise given the constraints of WinEH. This fixes PR27163. llvm-svn: 265157	2016-04-01 17:28:15 +00:00
Rong Xu	8e8fe859e0	[PGO] Refactor PGOFuncName meta data code to be used in clang Refactor the code that gets and creates PGOFuncName meta data so that it can be used in clang's value profile annotation. Differential Revision: http://reviews.llvm.org/D18623 llvm-svn: 265149	2016-04-01 16:43:30 +00:00
Sanjay Patel	a05e0ff223	[x86] avoid intermediate splat for non-zero memsets (PR27100) Follow-up to D18566 - where we noticed that an intermediate splat was being generated for memsets of non-zero chars. That was because we told getMemsetStores() to use a 32-bit vector element type, and it happily obliged by producing that constant using an integer multiply. The tests that were added in the last patch are now equivalent for AVX1 and AVX2 (no splats, just a vector load), but we have PR27141 to track that splat difference. In the new tests, the splat via shuffling looks ok to me, but there might be some room for improvement depending on uarch there. Note that the SSE1/2 paths are not changed in this patch. That can be a follow-up. This patch should resolve PR27100. Differential Revision: http://reviews.llvm.org/D18676 llvm-svn: 265148	2016-04-01 16:27:14 +00:00
Benjamin Kramer	99c67b31cb	[ADT] Make StringMap's tombstone aligned. This avoids undefined behavior when casting pointers to it. Also make sure that we don't cast to a derived StringMapEntry before checking for tombstone, as that may have different alignment requirements. llvm-svn: 265145	2016-04-01 15:51:51 +00:00
Vedant Kumar	7784171721	[PGOProfile] Rename a test to make it more reusable, NFC llvm-svn: 265144	2016-04-01 15:45:33 +00:00
Valery Pykhtin	5b3559c1ec	[AMDGPU] fix MADAK/MADMK instructions operand namings to match encoding fields. $vsrc1 -> $src1, $k -> $imm Differential Revision: http://reviews.llvm.org/D18659 llvm-svn: 265141	2016-04-01 13:13:12 +00:00
Andrea Di Biagio	8c48841907	[x86] Remove redundant call to setTargetDAGCombine for BUILD_VECTOR node type. Since revision 235394, we no longer perform target specific combines on build_vector nodes. No functional change intended. llvm-svn: 265138	2016-04-01 12:25:44 +00:00
Simon Pilgrim	7ec092d0f8	[X86][AVX512] Regenerated intrinsics tests llvm-svn: 265135	2016-04-01 11:57:51 +00:00
Sagar Thakur	48973d21e1	[MIPS][LLVM-MC] Fix JR encoding for MIPSR6 ISA Summary: The assembler was picking the wrong JR variant because the pre-R6 one was still enabled at R6. Author: nitesh.jain Reviewers: vkalintiris, dsanders Subscribers: dsanders, llvm-commits, mohit.bhakkad, sagar, bhushan, jaydeep Differential: D18387 llvm-svn: 265134	2016-04-01 11:55:33 +00:00
Benjamin Kramer	398e95c181	[ThinLTO] Fix uninitialized flags. Found by msan. Patch by Adrian Kuegel! llvm-svn: 265133	2016-04-01 11:49:59 +00:00
Andrey Turetskiy	958eb46443	[X86] Introduce Lakemont CPU. Add a new Intel MCU CPU Lakemont, which doesn't support X87. Differential Revision: http://reviews.llvm.org/D18650 llvm-svn: 265128	2016-04-01 10:16:15 +00:00
James Molloy	b876c72bcc	Fix for pr24346: arm asm label calculation error in sub Some ARM instructions encode 32-bit immediates as an 8-bit integer (0-255) and a 4-bit rotation (0-30, even) in their least significant 12 bits. The original fixup, FK_Data_4, patches the instruction by the value bit-to-bit, regardless of the encoding. For example, assuming the labels L1 and L2 are 0x0 and 0x104 respectively, the following instruction: add r0, r0, #(L2 - L1) ; expects 0x104, i.e., 260 would be assembled to the following, which adds 1 to r0, instead of 260: e2800104 add r0, r0, #4, 2 ; equivalently 1 The new fixup kind fixup_arm_mod_imm takes care of the encoding: e2800f41 add r0, r0, #260 Patch by Ting-Yuan Huang! llvm-svn: 265122	2016-04-01 09:40:47 +00:00
Oliver Stannard	a5520b02a5	[AArch64] Better errors for out-of-range fixups When a fixup that can be resolved by the assembler is out of range, we should report an error in the source, rather than crashing. Differential Revision: http://reviews.llvm.org/D18402 llvm-svn: 265120	2016-04-01 09:14:50 +00:00
Mehdi Amini	215d59e7b0	ThinLTO: move ObjCARCContractPass in the CodeGen pipeline This is to be coherent with Full LTO. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265118	2016-04-01 08:22:59 +00:00
Jeroen Ketema	b7f633b8d3	[OCaml] Use LLVMCreateMessage with constant strings when calling llvm_raise The llvm_string_of_message function, called by llvm_raise, calls LLVMDisposeMessage, which expects the message to be dynamically allocated; it fails freeing the message otherwise. So always dynamically allocate with LLVMCreateMessage. Differential Revision: http://reviews.llvm.org/D18675 llvm-svn: 265116	2016-04-01 07:56:17 +00:00
Jeroen Ketema	c110fbc213	[OCaml] Reinstate data_layout Expose LLVMCreateTargetMachineData as data_layout. As r263530 did for go. From that commit: "LLVMGetTargetDataLayout was removed from the C API, and then TargetMachine.TargetData was removed. Later, LLVMCreateTargetMachineData was added to the C API" Differential Revision: http://reviews.llvm.org/D18677 llvm-svn: 265115	2016-04-01 07:54:24 +00:00
Mehdi Amini	43b657b5c7	Add a libLTO API to stop/restart ThinLTO between optimizations and CodeGen This allows the linker to instruct ThinLTO to perform only the optimization part or only the codegen part of the process. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265113	2016-04-01 06:47:02 +00:00
Chuang-Yu Cheng	f8b592f213	[PPC64] Bug fix: when enabling sibling-call-opt and shrink-wrapping, the tail call branch instruction might disappear Bug Pattern: # BB#0: # %entry cmpldi 3, 0 beq- 0, .LBB0_2 # BB#1: # %exit lwz 4, 0(3) #TC_RETURNd8 LVComputationKind 0 .LBB0_2: # %cond.false mflr 0 std 0, 16(1) stdu 1, -96(1) .Ltmp0: .cfi_def_cfa_offset 96 .Ltmp1: .cfi_offset lr, 16 bl __assert_fail nop The branch instruction for the tail call return is not generated because the shrink-wrapping pass chooses a new Restore Point, %cond.false, so the %exit block is not sent to emitEpilogue. Thanks to Kit for his opinions! Reviewers: nemanjai hfinkel tjablin kbarton http://reviews.llvm.org/D17606 llvm-svn: 265112	2016-04-01 06:44:32 +00:00
Mehdi Amini	d7ad221c16	Add a module Hash in the bitcode and the combined index, implementing a kind of "build-id" This is intended to be used for ThinLTO incremental build. Differential Revision: http://reviews.llvm.org/D18213 This is a recommit of r265095 after fixing the Windows issues. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265111	2016-04-01 05:33:11 +00:00
Mehdi Amini	eed269329c	Fix MSVC warning "comparison of integers of different signs" (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265110	2016-04-01 05:19:14 +00:00
Mehdi Amini	180441f09a	Fix S390 big endian detection From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265109	2016-04-01 05:12:24 +00:00
Mehdi Amini	7ef783d1fa	Const correctness in raw_sha1_ostream (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265108	2016-04-01 05:12:18 +00:00
Mehdi Amini	4cd5702578	Add support for computing SHA1 in LLVM Provide a class to generate a SHA1 from a sequence of bytes, and a convenience raw_ostream adaptor. This will be used to provide a "build-id" by hashing the Module block when writing bitcode. ThinLTO will use this information for incremental build. Reapply r265094 which was reverted in r265102 because it broke MSVC bots (constexpr is not supported). http://reviews.llvm.org/D16325 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265107	2016-04-01 04:30:16 +00:00
Sean Silva	3de5ef96c9	Improve CHECK-NOT robustness of dllexport tests This changes some dllexport tests, to verify that some symbols that should not be exported are not, in a way that improves the robustness of CHECK-SAME interaction with CHECK-NOT. We plan to enable dllimport/dllexport support for the PS4, and these changes are for points we noticed in our internal testing. Patch by Warren Ristow! llvm-svn: 265106	2016-04-01 03:54:03 +00:00
Michael Kuperstein	7bab713188	Use range-based for loops. NFC. llvm-svn: 265105	2016-04-01 03:45:08 +00:00
Mehdi Amini	85fb9e058e	Revert "Add support for computing SHA1 in LLVM" This reverts commit r265096, r265095, and r265094. Windows build is broken, and the validation does not pass. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265102	2016-04-01 03:03:21 +00:00
Sanjoy Das	f83ab6de56	Don't insert stackrestore on deoptimizing returns They're not necessary (since the stack pointer is trivially restored on return), and the way LLVM inserts the stackrestore calls breaks the IR (we get a stackrestore between the deoptimize call and the return). llvm-svn: 265101	2016-04-01 02:51:30 +00:00
Sanjoy Das	18b92968ea	Don't insert lifetime end markers on deoptimizing returns They're not necessary (since the lifetime of the alloca is trivially over due to the return), and the way LLVM inserts the lifetime.end markers breaks the IR (we get a lifetime end marker between the deoptimize call and the return). llvm-svn: 265100	2016-04-01 02:51:26 +00:00
Sanjoy Das	9d41a8f269	Don't use an i64 return type with webkit_jscc Re-enable an assertion enabled by Justin Lebar in rL265092. rL265092 was breaking test/CodeGen/X86/deopt-intrinsic.ll because webkit_jscc does not like non-i64 return types. Change the test case to not do that. llvm-svn: 265099	2016-04-01 02:51:21 +00:00
Matthias Braun	cc7fba40fe	AArch64ISelLowering: Remove unused variables/arguments; NFC llvm-svn: 265098	2016-04-01 02:49:17 +00:00
Chuang-Yu Cheng	35c6181982	Fix Sub-register Rewriting in Aggressive Anti-Dependence Breaker Previously, HandleLastUse would delete RegRef information for sub-registers if they were dead even if their corresponding super-register were still live. If the super-register were later renamed, then the definitions of the sub-register would not be updated appropriately. This patch alters the behavior so that RegInfo information for sub-registers is only deleted when the sub-register and super-register are both dead. This resolves PR26775. This is the mirror image of Hal's r227311 commit. Author: Tom Jablin (tjablin) Reviewers: kbarton uweigand nemanjai hfinkel http://reviews.llvm.org/D18448 llvm-svn: 265097	2016-04-01 02:05:29 +00:00
Mehdi Amini	4ea9e9c9bb	Add missing test for the "Module hash in bitcode" added in r265095 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265096	2016-04-01 01:37:52 +00:00
Mehdi Amini	4c2ed3337d	Add a module Hash in the bitcode and the combined index, implementing a kind of "build-id" This is intended to be used for ThinLTO incremental build. Differential Revision: http://reviews.llvm.org/D18213 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265095	2016-04-01 01:30:06 +00:00
Mehdi Amini	3689ae14eb	Add support for computing SHA1 in LLVM Provide a class to generate a SHA1 from a sequence of bytes, and a convenience raw_ostream adaptor. This will be used to provide a "build-id" by hashing the Module block when writing bitcode. ThinLTO will use this information for incremental build. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265094	2016-04-01 01:29:54 +00:00
Justin Lebar	98981e5573	Revert "Protect some assertions with NDEBUG rather than DEBUG()." This reverts r265092, because it breaks CodeGen/X86/deopt-intrinsic.ll. llvm-svn: 265093	2016-04-01 01:23:23 +00:00
Justin Lebar	c814e8e4ab	Protect some assertions with NDEBUG rather than DEBUG(). DEBUG() only runs if you pass -debug, but these assertions are generally useful. llvm-svn: 265092	2016-04-01 01:09:12 +00:00
Justin Lebar	96418481bc	[NVPTX] Add a truncate DAG node to some calls. Summary: Previously, we were running afoul of the assertion EVT(CLI.Ins[i].VT) == InVals[i].getValueType() && "LowerCall emitted a value with the wrong type!" in SelectionDAGBuilder.cpp when running the NVPTX/i8-param.ll test. This is because our backend (for some reason) treats small return values as i32, but it wasn't ever truncating the i32 back down to the expected width in the DAG. Unclear to me whether this fixes any actual bugs -- in this test, at least, the generated code is unchanged. Reviewers: jingyue Subscribers: llvm-commits, tra, jholewinski Differential Revision: http://reviews.llvm.org/D17872 llvm-svn: 265091	2016-04-01 01:09:10 +00:00
Justin Lebar	efcc81cbb4	[NVPTX] Read __CUDA_FTZ from module flags in NVVMReflect. Summary: Previously the NVVMReflect pass would read its configuration from command-line flags or a static configuration given to the pass at instantiation time. This doesn't quite work for clang's use-case. It needs to pass a value for __CUDA_FTZ down on a per-module basis. We use a module flag for this, so the NVVMReflect pass needs to be updated to read said flag. Reviewers: tra, rnk Subscribers: cfe-commits, jholewinski Differential Revision: http://reviews.llvm.org/D18672 llvm-svn: 265090	2016-04-01 01:09:07 +00:00
Justin Lebar	645c3014a1	[NVPTX] Annotate some instructions as hasSideEffects = 0. Summary: Tablegen tries to infer this from the selection DAG patterns defined for the instructions, but it can't always. An instructive example is CLZr64. CLZr32 is correctly inferred to have no side-effects, but the selection DAG pattern for CLZr64 is slightly more complicated, and in particular the ctlz DAG node is not at the root of the pattern. Thus tablegen can't infer that CLZr64 has no side-effects. Reviewers: jholewinski Subscribers: jholewinski, tra, llvm-commits Differential Revision: http://reviews.llvm.org/D17472 llvm-svn: 265089	2016-04-01 01:09:05 +00:00
Justin Lebar	acc47105f8	[ifcnv] Add brief comment explaining what ifcnv is. llvm-svn: 265088	2016-04-01 01:09:03 +00:00
Mehdi Amini	64719159d0	Fix Windows build (typo in disk_space() implementation) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265087	2016-04-01 00:52:05 +00:00
Akira Hatanaka	e9148dd62f	[LoopVectorize] Don't unconditionally print vectorization diagnostics when compiling with LTO. r244523 added a new class DiagnosticInfoOptimizationRemarkAnalysisAliasing for optimization analysis remarks related to pointer aliasing without guarding it in isDiagnosticEnabled in LLVMContext.cpp. This caused the diagnostic message to be printed unconditionally when compiling with LTO. This commit cleans up isDiagnosticEnabled and makes sure all the vectorization optimization remarks are guarded. rdar://problem/25382153 llvm-svn: 265084	2016-04-01 00:34:39 +00:00
Mehdi Amini	e2d8f1b8fc	Add disk_space() to llvm::fs Summary: Adapted from Boost::filesystem. (This is a reapply by reverting commit r265080 and fixing the WinAPI part) Differential Revision: http://reviews.llvm.org/D18467 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265082	2016-04-01 00:18:08 +00:00
Adrian Prantl	b8089516a5	testcase gardening: update the emissionKind enum to the new syntax. (NFC) llvm-svn: 265081	2016-04-01 00:16:49 +00:00
Mehdi Amini	640de72a1e	Revert "Add disk_space() to llvm::fs" This reverts commit r265074 and r265068. Breaks windows build From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265080	2016-04-01 00:13:31 +00:00
Adrian Prantl	0235e95a4b	Fix a capitalization error in r265077. llvm-svn: 265079	2016-04-01 00:00:27 +00:00
Adrian Prantl	b939a25707	Move the DebugEmissionKind enum from DIBuilder into DICompileUnit. This mostly cosmetic patch moves the DebugEmissionKind enum from DIBuilder into DICompileUnit. DIBuilder is not the right place for this enum to live in — a metadata consumer should not have to include DIBuilder.h. I also added a Verifier check that checks that the emission kind of a DICompileUnit is actually legal. http://reviews.llvm.org/D18612 <rdar://problem/25427165> llvm-svn: 265077	2016-03-31 23:56:58 +00:00
Hans Wennborg	649159df3c	Follow-up to r265036: I got these iterators mixed up llvm-svn: 265076	2016-03-31 23:55:16 +00:00
Mehdi Amini	073442e183	clang-format the large enum definitions in LLVMBitCodes.h Suggestion from Duncan in a review thread. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265075	2016-03-31 23:27:37 +00:00
Mehdi Amini	e503a71df1	Use const ref instead of value for Twine in the disk_space() API Thanks Rui for noticing! From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265074	2016-03-31 23:14:45 +00:00
Matthias Braun	19832b616e	lit: python3 compatibility fix llvm-svn: 265070	2016-03-31 23:08:55 +00:00
Peter Collingbourne	acff7d4a25	Create thin archive in GNU format to fix test on OS X. llvm-svn: 265069	2016-03-31 23:07:50 +00:00
Mehdi Amini	4c82356ad3	Add disk_space() to llvm::fs Summary: Adapted from Boost::filesystem. (This is a reapply by reverting commit r265062 and fixing the WinAPI part) Differential Revision: http://reviews.llvm.org/D18467 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265068	2016-03-31 23:05:26 +00:00
Tim Shen	2e24d0c0c1	Move asm-printer-topological-order.ll to PowerPC backend llvm-svn: 265067	2016-03-31 22:32:10 +00:00
Peter Collingbourne	a69d9e5aab	Object: Add function for moving thin archive object buffer vector out of Archive. Differential Revision: http://reviews.llvm.org/D18664 llvm-svn: 265066	2016-03-31 22:08:57 +00:00
Peter Collingbourne	f84646cd95	Object: Correctly read thin archives containing absolute paths. Differential Revision: http://reviews.llvm.org/D18666 llvm-svn: 265065	2016-03-31 22:08:31 +00:00
Tim Shen	800ed436e5	[AsmPrinter] Print aliases in topological order Print aliases in topological order, that is, for any alias a = b, b must be printed before a. This is because on some targets (e.g. PowerPC) linker expects aliases in such an order to generate correct TOC information. GCC also prints aliases in topological order. llvm-svn: 265064	2016-03-31 22:08:19 +00:00
Chandler Carruth	b472856a73	Fix PR26940 where compile times regressed massively. Patch by Jonas Paulsson. Original description: Bugfix in buildSchedGraph() to make -dag-maps-huge-region work properly I found that the reduction of the maps did in fact never happen in this test case. This was because all the stores / loads were made with addresses from arguments and they thus became "unknown" stores / loads. Fixed by removing continue statements and making sure that the test for reduction always takes place. Differential Revision: http://reviews.llvm.org/D18673 llvm-svn: 265063	2016-03-31 21:55:58 +00:00
Mehdi Amini	b880144703	Revert "Add disk_space() to llvm::fs" Breaks windows bot. This reverts commit r265050. This reverts commit r265055. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265062	2016-03-31 21:55:35 +00:00
Evgeniy Stepanov	f74f091ea6	Preserve blockaddress use edges in the module splitter. "blockaddress" can not apply to an external function. All blockaddress constant uses must belong to the same module as the definition of the target function. llvm-svn: 265061	2016-03-31 21:55:11 +00:00
David Majnemer	ae272d718e	[NVPTX] Infer __nvvm_reflect as nounwind, readnone This patch simply mirrors the attributes we give to @llvm.nvvm.reflect to the __nvvm_reflect libdevice call. This shaves about 30% of the code in libdevice away because of CSE opportunities. It also helps us figure out that libdevice implementations of transcendental functions don't have side-effects. llvm-svn: 265060	2016-03-31 21:29:57 +00:00
Simon Pilgrim	9a7689db91	Wdocumentation parameter fix llvm-svn: 265055	2016-03-31 21:13:49 +00:00
Sanjay Patel	4d71160d5d	fix typo; NFC llvm-svn: 265054	2016-03-31 21:00:48 +00:00
Simon Pilgrim	326014ad99	Fixed signed/unsigned warning llvm-svn: 265052	2016-03-31 20:57:36 +00:00
Jun Bum Lim	760afcb338	[AArch64] Allow loads with imp-def to be handled in getMemOpBaseRegImmOfsWidth() Summary: This change will allow loads with imp-def to be clustered in machine-scheduler pass. areMemAccessesTriviallyDisjoint() can also handle loads with imp-def. Reviewers: mcrosier, jmolloy, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18665 llvm-svn: 265051	2016-03-31 20:53:47 +00:00
Mehdi Amini	9defda528e	Add disk_space() to llvm::fs Summary: Adapted from Boost::filesystem. Reviewers: bruno, silvas Subscribers: tberghammer, danalbert, llvm-commits, srhines Differential Revision: http://reviews.llvm.org/D18467 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265050	2016-03-31 20:48:27 +00:00
Hal Finkel	0f02ccd955	[PowerPC] Cleanup test/CodeGen/PowerPC/qpx-load-splat.ll Removing unnecessary attributes and metadata... llvm-svn: 265049	2016-03-31 20:45:00 +00:00
Sanjay Patel	61e13249d8	[x86] add memset tests to show another potential improvement llvm-svn: 265048	2016-03-31 20:40:32 +00:00
Hal Finkel	fc35391f2b	[PowerPC] Add a late MI-level pass for QPX load/splat simplification Chapter 3 of the QPX manual states that, "Scalar floating-point load instructions, defined in the Power ISA, cause a replication of the source data across all elements of the target register." Thus, if we have a load followed by a QPX splat (from the first lane), the splat is redundant. This adds a late MI-level pass to remove the redundant splats in some of these cases (specifically when both occur in the same basic block). This optimization is scheduled just prior to post-RA scheduling. It can't happen before anything that might replace the load with some already-computed quantity (i.e. store-to-load forwarding). llvm-svn: 265047	2016-03-31 20:39:41 +00:00
Hans Wennborg	132cd62121	Revert r265039 "[X86] Merge adjacent stack adjustments in eliminateCallFramePseudoInstr (PR27140)" I think it might have caused these build breakages: http://lab.llvm.org:8011/builders/clang-x86-win2008-selfhost/builds/7234/steps/build%20stage%202/logs/stdio http://lab.llvm.org:8011/builders/sanitizer-windows/builds/19566/steps/run%20tests/logs/stdio llvm-svn: 265046	2016-03-31 20:27:30 +00:00
Simon Pilgrim	aab59b7a28	[X86][SSE] Some basic tests for variable shuffles We don't really support non-constant shuffle masks, but these tests are for cases where BUILD_VECTOR is made up from vector extracts (as well as undef/zero scalars). llvm-svn: 265045	2016-03-31 20:26:30 +00:00
Evgeniy Stepanov	a614ab7b71	Preserve extern_weak linkage in CloneModule. Only force "extern" linkage if the function used to be a definition in the source module. Declarations keep their original linkage. llvm-svn: 265043	2016-03-31 20:21:31 +00:00
Chris Bieneman	6099a4e7d4	[CMake] Provide the ability to skip stripping when generating dSYMs For debugging it is useful to be able to generate dSYM files but not strip the executables. This change adds the ability to skip stripping by setting LLVM_EXTERNALIZE_DEBUGINFO_SKIP_STRIP=On. llvm-svn: 265041	2016-03-31 20:03:19 +00:00
Benjamin Kramer	569efd2cfd	[ARM] Expand v1i64 and v2i64 ctpop. The default is legal, which results in 'Cannot select' errors. This is triggered during selfhost due to a recent cost model change. llvm-svn: 265040	2016-03-31 19:42:04 +00:00
Hans Wennborg	e97fb414e8	[X86] Merge adjacent stack adjustments in eliminateCallFramePseudoInstr (PR27140) For code such as: void f(int, int); void g() { f(1, 2); } compiled for 32-bit X86 Linux, Clang would previously generate: subl $12, %esp subl $8, %esp pushl $2 pushl $1 calll f addl $16, %esp addl $12, %esp retl This patch fixes that by merging adjacent stack adjustments in eliminateCallFramePseudoInstr(). Differential Revision: http://reviews.llvm.org/D18627 llvm-svn: 265039	2016-03-31 19:26:24 +00:00
Hans Wennborg	e1a2e90ffa	Change eliminateCallFramePseudoInstr() to return an iterator This will become necessary in a subsequent change to make this method merge adjacent stack adjustments, i.e. it might erase the previous and/or next instruction. It also greatly simplifies the calls to this function from PrologEpilogInserter. Previously, that had a bunch of logic to resume iteration after the call; now it just continues with the returned iterator. Note that this changes the behaviour of PEI a little. Previously, it attempted to re-visit the new instruction created by eliminateCallFramePseudoInstr(). That code was added in r36625, but I can't see any reason for it: the new instructions will obviously not be pseudo instructions, they will not have FrameIndex operands, and we have already accounted for the stack adjustment. Differential Revision: http://reviews.llvm.org/D18627 llvm-svn: 265036	2016-03-31 18:33:38 +00:00
Daniel Dunbar	42881eac30	[lit][googletest] Handle upstream gtest output Summary: Upstream googletest prints "Running main() from gtest_main.cc" to stdout prior to running tests. LLVM removed that print statement in r61540. If a user were to use lit to run tests that use upstream googletest, however, lit reports "Running main()" as an invalid test name. To avoid such a failure, add an extra conditional to `formats/googletest.py`. Also add tests to demonstrate the modified behavior. Reviewers: abdulras, ddunbar Subscribers: ddunbar, llvm-commits, kastiglione Differential Revision: http://reviews.llvm.org/D18606 llvm-svn: 265034	2016-03-31 18:22:55 +00:00
Jacques Pienaar	4badd6aaf3	[lanai] isBrImm should accept any non-constant immediate. isBrImm should accept any non-constant immediate. Previously it was only accepting LanaiMCExpr ones which was wrong. Differential Revision: http://reviews.llvm.org/D18571 llvm-svn: 265032	2016-03-31 17:58:55 +00:00
Ehsan Amiri	99b017ae35	[PPC] basic support for Power 9 direct move instructions http://reviews.llvm.org/D18097 Initial support does not include any patterns to generate these instructions llvm-svn: 265031	2016-03-31 17:47:17 +00:00
Rong Xu	d5a57b5947	[PGO] use emplace_back. NFC. Use emplace_back instead of push_back for simplicity. llvm-svn: 265030	2016-03-31 17:39:33 +00:00
Sanjay Patel	92d5ea5e07	[x86] use SSE/AVX ops for non-zero memsets (PR27100) Move the memset check down to the CPU-with-slow-SSE-unaligned-memops case: this allows fast targets to take advantage of SSE/AVX instructions and prevents slow targets from stepping into a codegen sinkhole while trying to splat a byte into an XMM reg. Follow-on bugs exposed by the current codegen are: https://llvm.org/bugs/show_bug.cgi?id=27141 https://llvm.org/bugs/show_bug.cgi?id=27143 Differential Revision: http://reviews.llvm.org/D18566 llvm-svn: 265029	2016-03-31 17:30:06 +00:00
Valery Pykhtin	ab962acd59	[AMDGPU] enable few disassembler tests that were mistakenly marked as FIXME. llvm-svn: 265028	2016-03-31 17:28:46 +00:00
Hans Wennborg	a7543ba10c	More checks in win32-seh-nested-finally.ll after comment on r264966 llvm-svn: 265027	2016-03-31 16:42:10 +00:00
Ulrich Weigand	6ad762d36f	[PowerPC] Attempt to fix fast-isel-i64offset.ll failure The test case added in r265023 is failing on ninja-x64-msvc-RA-centos6. Update the test to make less specific assumptions on code generation. llvm-svn: 265026	2016-03-31 16:38:57 +00:00
Xinliang David Li	d0b4cbb9dd	Minor code cleanup /NFC llvm-svn: 265025	2016-03-31 16:22:17 +00:00
Stephan Bergmann	480de227f6	Don't use potentially invalidated iterator If the lhs is evaluated before the rhs, FuncletI's operator-> can trigger the assert(isHandleInSync() && "invalid iterator access!"); at include/llvm/ADT/DenseMap.h:1061. (Happens e.g. when compiled with GCC 6.) Differential Revision: http://reviews.llvm.org/D18440 llvm-svn: 265024	2016-03-31 15:42:01 +00:00
Ulrich Weigand	3707ba8030	[PowerPC] Correctly compute 64-bit offsets in fast isel PPCSimplifyAddress contains this code: IntegerType *OffsetTy = ((VT == MVT::i32) ? Type::getInt32Ty(*Context) : Type::getInt64Ty(*Context)); to determine the type to be used for an index register, if one needs to be created. However, the "VT" here is the type of the data being loaded or stored, not the type of an address. This means that if a data element of type i32 is accessed using an index that does not fit into 32 bits, a wrong address is computed here. Note that PPCFastISel is only ever used on 64-bit currently, so the type of an address is actually always MVT::i64. Other parts of the code, even in this same PPCSimplifyAddress routine, already rely on that fact. Thus, this patch changes the code to simply unconditionally use Type::getInt64Ty(*Context) as OffsetTy. llvm-svn: 265023	2016-03-31 15:37:06 +00:00
Nemanja Ivanovic	a621a7f9c3	[PowerPC] Basic support for P9 atomic loads and stores This patch corresponds to review: http://reviews.llvm.org/D18032 This patch provides asm implementation for the following instructions: lwat, ldat, stwat, stdat, ldmx, mcrxrx llvm-svn: 265022	2016-03-31 15:26:37 +00:00
Jun Bum Lim	cf9744367b	[AArch64] Handle missing store pair opportunity Summary: This change will handle missing store pair opportunity where the first store instruction stores zero followed by the non-zero store. For example, this change will convert : str wzr, [x8] str w1, [x8, #4] into: stp wzr, w1, [x8] Reviewers: jmolloy, t.p.northover, mcrosier Subscribers: flyingforyou, aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18570 llvm-svn: 265021	2016-03-31 14:47:24 +00:00
Ulrich Weigand	1931b01a64	[PowerPC] Remove incorrect use of COPY_TO_REGCLASS in fast isel The fast isel pass currently emits a COPY_TO_REGCLASS node to convert from a F4RC to a F8RC register class during conversion of a floating-point number to integer. There is actually no support in the common code instruction printers to emit COPY_TO_REGCLASS nodes, so the PowerPC back-end has special code there to simply ignore COPY_TO_REGCLASS. This is correct if and only if the source and destination registers of COPY_TO_REGCLASS are the same (except for the different register class). But nothing guarantees this to be the case, and if the register allocator does end up allocating source and destination to different registers after all, the back-end simply generates incorrect code. I've included a test case that shows such incorrect code generation. However, it seems that COPY_TO_REGCLASS is actually not intended to be used at the MI layer at all. It is used during SelectionDAG, but always lowered to a plain COPY before emitting MI. Other back-ends' fast isel passes never emit COPY_TO_REGCLASS at all. I suspect it is simply wrong for the PowerPC back-end to emit it here. This patch changes the PowerPC back-end to directly emit COPY instead of COPY_TO_REGCLASS and removes the special handling in the instruction printers. Differential Revision: http://reviews.llvm.org/D18605 llvm-svn: 265020	2016-03-31 14:44:50 +00:00
Daniel Sanders	85fd10bd93	[mips] Range check simm16 Summary: There are too many instructions to exhaustively test so addiu and lwc2 are used as representative examples. It should be noted that many memory instructions that should have simm16 range checking do not because it is also necessary to support the macro of the same name which accepts simm32. The range checks for these occur in the macro expansion. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18437 llvm-svn: 265019	2016-03-31 14:34:00 +00:00
Daniel Sanders	eab3146156	[mips] Range check simm11 and mem_simm11. Summary: ldc2/sdc2 now emit slightly worse diagnostics for MIPS-I. The problem is that they don't trigger the custom parser because all the candidates are disabled by feature bits. On all other subtargets, the diagnostics are accurate but are subject to the usual issues of needing to report multiple ways to correct the code (e.g. smaller offset, enable a CPU feature) but only being able to report one error. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18436 llvm-svn: 265018	2016-03-31 14:23:20 +00:00
Dmitry Polukhin	cd835ad876	[IFUNC] Introduce GlobalIndirectSymbol as a base class for alias and ifunc This patch is a part of http://reviews.llvm.org/D15525 GlobalIndirectSymbol class contains common implementation for both aliases and ifuncs. This patch should be NFC change that just prepare common code for ifunc support. Differential Revision: http://reviews.llvm.org/D18433 llvm-svn: 265016	2016-03-31 14:16:21 +00:00
Sam Kolton	1048fb1818	[AMDGPU] Disassembler: support for DPP Review: http://reviews.llvm.org/D18642 llvm-svn: 265015	2016-03-31 14:15:04 +00:00
Daniel Sanders	dc0602a2c2	[mips] Split mem_msa into range checked mem_simm10 and mem_simm10_lsl[123] Summary: Also, made test_mi10.s formatting consistent with the majority of the MC tests. Reviewers: vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D18435 llvm-svn: 265014	2016-03-31 14:12:01 +00:00
Nirav Dave	83ce54aac2	Prevent X86ISelLowering from merging volatile loads Change isConsecutiveLoads to check that loads are non-volatile as this is a requirement for any load merges. Propagate change to two callers. Reviewers: RKSimon Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D18546 llvm-svn: 265013	2016-03-31 13:40:55 +00:00
Daniel Sanders	2e9f69d933	[mips] Range check simm9 and fix a bug this revealed. Summary: The bug was that microMIPS's [ls]w[lr]e instructions claimed to support a 12-bit offset when it is only 9-bit. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D18434 llvm-svn: 265010	2016-03-31 13:15:23 +00:00
Benjamin Kramer	cad9a8a6bb	[TTI] Let the cost model estimate ctpop costs based on legality PPC has a vector popcount, this lets the vectorizer use the correct cost for it. Tweak X86 test to use an intrinsic that's actually scalarized (we have a somewhat efficient lowering for vector popcount using SSE, the cost model finds that now). llvm-svn: 265005	2016-03-31 10:42:40 +00:00
Zlatko Buljan	6221be8e46	[mips][microMIPS] Implement MFC, MFHC and DMFC* instructions Differential Revision: http://reviews.llvm.org/D17334 llvm-svn: 265002	2016-03-31 08:51:24 +00:00
Jeroen Ketema	52aadc8eb8	Silence warnings in OCaml bindings * LLVMDisposeMessage lives in llvm-c/Core.h, include this file where necessary * LLVMAddTargetData has been removed, follow suit in the bindings Differential Revision: http://reviews.llvm.org/D18633 llvm-svn: 265001	2016-03-31 08:39:42 +00:00
Jonas Paulsson	2ba315218b	Indentation fix in SystemZInstrInfo.cpp llvm-svn: 265000	2016-03-31 08:00:14 +00:00
Sanjoy Das	56df0ec610	[InstCombine] Fix incorrect rule from rL236202 The rule for SMIN introduced in rL236202 doesn't work as advertised: the check for Pred == ICmpInst::ICMP_SGT was missing. llvm-svn: 264996	2016-03-31 05:14:34 +00:00
Sanjoy Das	c9d6d8b106	Delete trailing whitespace llvm-svn: 264995	2016-03-31 05:14:29 +00:00
Sanjoy Das	e12c0e5159	[SCEV] Track NoWrap properties using MatchBinaryOp, NFC This way once we teach MatchBinaryOp to map more things into arithmetic, the non-wrapping add recurrence construction would understand it too. Right now MatchBinaryOp still only understands arithmetic, so this is solely a code-reorganization change. llvm-svn: 264994	2016-03-31 05:14:26 +00:00
Sanjoy Das	118d919a6a	[SCEV] NFC code motion to simplify later change llvm-svn: 264993	2016-03-31 05:14:22 +00:00
Craig Topper	d2aa03a60a	[X86] Use MVT instead of EVT in code called after legalization. llvm-svn: 264992	2016-03-31 04:37:41 +00:00
Davide Italiano	936a2b09f3	[DebugInfo] Subprograms should belong to a CU. Start fixing tests accordingly. There are still about 35 failures before we can enable this check in the IR verifier. llvm-svn: 264990	2016-03-31 03:40:07 +00:00
Hal Finkel	851b33a0b1	[PowerPC] Load two floats directly instead of using one 64-bit integer load When dealing with complex<float>, and similar structures with two single-precision floating-point numbers, especially when such things are being passed around by value, we'll sometimes end up loading both float values by extracting them from one 64-bit integer load. It looks like this: t13: i64,ch = load<LD8[%ref.tmp]> t0, t6, undef:i64 t16: i64 = srl t13, Constant:i32<32> t17: i32 = truncate t16 t18: f32 = bitcast t17 t19: i32 = truncate t13 t20: f32 = bitcast t19 The problem, especially before the P8 where those bitcasts aren't legal (and get expanded via the stack), is that it would have been better to use two floating-point loads directly. Here we add a target-specific DAGCombine to do just that. In short, we turn: ld 3, 0(5) stw 3, -8(1) rldicl 3, 3, 32, 32 stw 3, -4(1) lfs 3, -4(1) lfs 0, -8(1) into: lfs 3, 4(5) lfs 0, 0(5) llvm-svn: 264988	2016-03-31 02:56:05 +00:00
Sean Silva	24d7e2e869	Fix case confusion. The test case was defining and using a function 'notExported()', but the FileCheck checks were checking for the name 'not_exported'. This changes the test to use 'notExported' across the board. Also, the test defined a function 'not_defined()', but doesn't have any checks related to it. For consistency, this name is changed to 'notDefined'. A later commit will add checks for 'notDefined'. Patch by Warren Ristow! llvm-svn: 264984	2016-03-31 01:47:33 +00:00
Sanjoy Das	021de058df	Introduce a @llvm.experimental.guard intrinsic Summary: As discussed on llvm-dev[1]. This change adds the basic boilerplate code around having this intrinsic in LLVM: - Changes in Intrinsics.td, and the IR Verifier - A lowering pass to lower @llvm.experimental.guard to normal control flow - Inliner support [1]: http://lists.llvm.org/pipermail/llvm-dev/2016-February/095523.html Reviewers: reames, atrick, chandlerc, rnk, JosephTremoulet, echristo Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18527 llvm-svn: 264976	2016-03-31 00:18:46 +00:00
Hans Wennborg	be0df2b102	Add some more triples after r264966 llvm-svn: 264972	2016-03-30 23:55:22 +00:00

129751 Commits