llvm-project

Commit Graph

Author	SHA1	Message	Date
Chen Li	e8f9387e0c	[X86ISelLowering] Add additional support for multiplication-to-shift conversion. Summary: This patch adds support of conversion (mul x, 2^N + 1) => (add (shl x, N), x) and (mul x, 2^N - 1) => (sub (shl x, N), x) if the multiplication cannot be converted to LEA + SHL or LEA + LEA. LLVM has already supported this on ARM, and it should also be useful on X86. Note the patch currently only applies to cases where the constant operand is positive, and I am planning to add another patch to support negative cases after this. Reviewers: craig.topper, RKSimon Subscribers: aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D14603 llvm-svn: 255391	2015-12-11 23:39:32 +00:00
Diego Novillo	10cf124bb9	SamplePGO - Reduce memory utilization by 10x. DenseMap is the wrong data structure to use for sample records and call sites. The keys are too large, causing massive core memory growth when reading profiles. Before this patch, a 21Mb input profile was causing the compiler to grow to 3Gb in memory. By switching to std::map, the compiler now grows to 300Mb in memory. There still are some opportunities for memory footprint reduction. I'll be looking at those next. llvm-svn: 255389	2015-12-11 23:21:38 +00:00
Matt Arsenault	fabab4b7dd	SelectionDAG: Match min/max if the scalar operation is legal llvm-svn: 255388	2015-12-11 23:16:47 +00:00
Hal Finkel	cd8664c3c2	Revert r248483, r242546, r242545, and r242409 - absdiff intrinsics After much discussion, ending here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151123/315620.html it has been decided that, instead of having the vectorizer directly generate special absdiff and horizontal-add intrinsics, we'll recognize the relevant reduction patterns during CodeGen. Accordingly, these intrinsics are not needed (the operations they represent can be pattern matched, as is already done in some backends). Thus, we're backing these out in favor of the current development work. r248483 - Codegen: Fix llvm.*absdiff semantic. r242546 - [ARM] Use [SU]ABSDIFF nodes instead of intrinsics for VABD/VABA r242545 - [AArch64] Use [SU]ABSDIFF nodes instead of intrinsics for ABD/ABA r242409 - [Codegen] Add intrinsics 'absdiff' and corresponding SDNodes for absolute difference operation llvm-svn: 255387	2015-12-11 23:11:52 +00:00
Rafael Espindola	515f8df3f1	Avoid buffered reads of /dev/urandom I am seeing disappointing clang performance on a large PowerPC64 Linux box. GetRandomNumberSeed() does a buffered read from /dev/urandom to seed its PRNG. As a result we read an entire page even though we only need 4 bytes. With every clang task reading a page worth of /dev/urandom we end up spending a large amount of time stuck on kernel spinlock. Patch by Anton Blanchard! llvm-svn: 255386	2015-12-11 22:52:32 +00:00
Davide Italiano	62507043c5	[llvm-objdump/MachODump] Reduce code duplication. llvm-svn: 255380	2015-12-11 22:27:59 +00:00
Sanjay Patel	d497ad43da	Add tests for bitcast-bitcast sequences for all scalar/vector permutations As noted in http://reviews.llvm.org/D15392 , we should be able to improve this. llvm-svn: 255370	2015-12-11 20:26:30 +00:00
Xinliang David Li	a86545b0b5	[PGO] Revert r255365: solution incomplete, not handling lambda yet llvm-svn: 255369	2015-12-11 20:23:22 +00:00
Xinliang David Li	c79283ef29	[PGO] Stop using invalid char in instr variable names. Before the patch, -fprofile-instr-generate compile will fail if no integrated-as is specified when the file contains any static functions (the -S output is also invalid). This patch fixed the issue. With the change, the index format version will be bumped up by 1. Backward compatibility is preserved with this change. Differential Revision: http://reviews.llvm.org/D15243 llvm-svn: 255365	2015-12-11 19:53:19 +00:00
Matthias Braun	60d69e2865	CodeGen: Redo analyzePhysRegs() and computeRegisterLiveness() computeRegisterLiveness() was broken in that it reported dead for a register even if a subregister was alive. I assume this was because the results of analyzePhysRegs() are hard to understand with respect to subregisters. This commit: Changes the results of analyzePhysRegs (=struct PhysRegInfo) to be clearly understandable, also renames the fields to avoid silent breakage of third-party code (and improve the grammar). Fix all (two) users of computeRegisterLiveness() in llvm: By reenabling it and removing workarounds for the bug. This fixes http://llvm.org/PR24535 and http://llvm.org/PR25033 Differential Revision: http://reviews.llvm.org/D15320 llvm-svn: 255362	2015-12-11 19:42:09 +00:00
Matt Arsenault	fbd9bbfda3	Start replacing vector_extract/vector_insert with extractelt/insertelt These are redundant pairs of nodes defined for INSERT_VECTOR_ELEMENT/EXTRACT_VECTOR_ELEMENT. insertelement/extractelement are slightly closer to the corresponding C++ node name, and has stricter type checking so prefer it. Update targets to only use these nodes where it is trivial to do so. AArch64, ARM, and Mips all have various type errors on simple replacement, so they will need work to fix. Example from AArch64: def : Pat<(sext_inreg (vector_extract (v16i8 V128:$Rn), VectorIndexB:$idx), i8), (i32 (SMOVvi8to32 V128:$Rn, VectorIndexB:$idx))>; Which is trying to do sext_inreg i8, i8. llvm-svn: 255359	2015-12-11 19:20:16 +00:00
Derek Schuff	5a14306323	[WebAssembly] Fix ADJCALLSTACKDOWN/UP use/defs Summary: ADJCALLSTACK{DOWN,UP} (aka CALLSEQ_{START,END}) MIs are supposed to use and def the stack pointer. Since they do not, all the nodes are being eliminated by DeadMachineInstructionElim, so they aren't in the IR when PrologEpilogInserter/eliminateCallFramePseudo needs them. This change fixes that, but since RegStackify will not stackify across them (and it runs early, before PEI), change LowerCall to only emit them when the call frame size is > 0. That makes the current code work the same way and makes code handled by D15344 also work the same way. We can expand the condition beyond NumBytes > 0 in the future if needed. Reviewers: sunfish, jfb Subscribers: jfb, dschuff, llvm-commits Differential Revision: http://reviews.llvm.org/D15459 llvm-svn: 255356	2015-12-11 18:55:34 +00:00
Chad Rosier	d7634fc91d	Revert r255247, r255265, and r255286 due to serious compile-time regressions. Revert "[DSE] Disable non-local DSE to see if the bots go green." Revert "[DeadStoreElimination] Use range-based loops. NFC." Revert "[DeadStoreElimination] Add support for non-local DSE." llvm-svn: 255354	2015-12-11 18:39:41 +00:00
Manman Ren	abc7c1d1d2	CXX_FAST_TLS calling convention: target independent portion. The access function has a short entry and a short exit, the initialization block is only run the first time. To improve the performance, we want to have a short frame at the entry and exit. We explicitly handle most of the CSRs via copies. Only the CSRs that are not handled via copies will be in CSR_SaveList. Frame lowering and prologue/epilogue insertion will generate a short frame in the entry and exit according to CSR_SaveList. The majority of the CSRs will be handled by register allocator. Register allocator will try to spill and reload them in the initialization block. We add CSRsViaCopy, it will be explicitly handled during lowering. 1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target supports it for the given calling convention and the function has only return exits). We also call TLI->initializeSplitCSR to perform initialization. 2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to virtual registers at beginning of the entry block and copies from virtual registers to CSRsViaCopy at beginning of the exit blocks. 3> we also need to make sure the explicit copies will not be eliminated. rdar://problem/23557469 Differential Revision: http://reviews.llvm.org/D15340 llvm-svn: 255353	2015-12-11 18:24:30 +00:00
Sanjay Patel	4dad27e016	fix typos; NFC llvm-svn: 255352	2015-12-11 18:12:01 +00:00
Frederic Riss	841b1732df	[dsymutil] Ignore absolute symbols in the debug map Quoting from the comment added to the code: // Objective-C on i386 uses artificial absolute symbols to // perform some link time checks. Those symbols have a fixed 0 // address that might conflict with real symbols in the object // file. As I cannot see a way for absolute symbols to find // their way into the debug information, let's just ignore those. llvm-svn: 255350	2015-12-11 17:50:37 +00:00
Hal Finkel	494393b740	AlignmentFromAssumptions and SLPVectorizer preserves AA and GlobalsAA GlobalsAA's assumptions that passes do not escape globals not previously escaped is not violated by AlignmentFromAssumptions and SLPVectorizer. Marking them as such allows GlobalsAA to be preserved until GVN in the LTO pipeline. http://lists.llvm.org/pipermail/llvm-dev/2015-December/092972.html Patch by Vaivaswatha Nagaraj! llvm-svn: 255348	2015-12-11 17:46:01 +00:00
Hal Finkel	cd5f984670	[TableGen] Correct Namespace lookup with AltNames in AsmWriterEmitter AsmWriterEmitter will generate a getRegisterName function with an alternate register name index as its second argument if the target makes use of them. The enum of these values is generated in RegisterInfoEmitter. The getRegisterName generator would assume the namespace could always be found by reading index 1 of the list of AltNameIndices, but this will fail if this list is sorted such that the NoRegAltName is at index 1. Because this list is sorted by record name (in CodeGenTarget::ReadRegAltNameIndices), you only run into problems if your MyTargetRegisterInfo.td defines a single RegAltNameIndex that sorts lexically before NoRegAltName. For example, if a target has something like def AnAltNameIndex : RegAltNameIndex and defines RegAltNameIndices for some registers then, prior to this change, AsmWriterEmitter would generate references to ::AnAltNameIndex and ::NoRegAltName Patch by Alex Bradbury! llvm-svn: 255344	2015-12-11 17:31:27 +00:00
Artur Pilipenko	7ae49ac619	PruneEH pass incorrectly reports that a change was made Reviewed By: reames Differential Revision: http://reviews.llvm.org/D14097 llvm-svn: 255343	2015-12-11 16:30:26 +00:00
James Molloy	1bb6ea5e2d	[Mem2Reg] Respect optnone Mem2Reg shouldn't be optimizing a function that is marked optnone. There is a test checking this that fails when mem2reg is explicitly added to the standard pass pipeline. llvm-svn: 255336	2015-12-11 13:36:59 +00:00
James Molloy	37b82e79b2	[InstCombine] Make MatchBSwap also match bit reversals MatchBSwap has most of the functionality to match bit reversals already. If we switch it from looking at bytes to individual bits and remove a few early exits, we can extend the main recursive function to match any sequence of ORs, ANDs and shifts that assemble a value from different parts of another, base value. Once we have this bit->bit mapping, we can very simply detect if it is appropriate for a bswap or bitreverse. llvm-svn: 255334	2015-12-11 10:04:51 +00:00
Maxim Ostapenko	1dbfca60f8	Revert previous test commit. llvm-svn: 255331	2015-12-11 07:40:25 +00:00
Maxim Ostapenko	e518db35a8	This is a test commit to check my commit access works. llvm-svn: 255330	2015-12-11 07:31:29 +00:00
Xinliang David Li	d922c26c02	[PGO] Read VP raw data without depending on the Value field Before this patch, each function's on-disk VP data is 'pointed' to by the Value field of per-function ProfileData structure, and read relies on this field (relocated with ValueDataDelta field) to read the value data. However this means the Value field needs to be updated during runtime before dumping, which creates undesirable data races. With this patch, the reading of VP data no longer depends on Value field. There is no format change. ValueDataDelta header field becomes obsolete but will be kept for compatibility reasons (will be removed next time the raw format change is needed). llvm-svn: 255329	2015-12-11 06:53:53 +00:00
Hans Wennborg	a8e6b3ecb7	Fix build after r255319. llvm-svn: 255322	2015-12-11 00:58:32 +00:00
Eric Christopher	5e834a5dc4	Fix a spurious if. llvm-svn: 255321	2015-12-11 00:51:59 +00:00
Akira Hatanaka	2992beec00	[LazyValueInfo] Stop inserting overdefined values into ValueCache to reduce memory usage. Previously, LazyValueInfoCache inserted overdefined lattice values into both ValueCache and OverDefinedCache. This wasn't necessary and was causing LazyValueInfo to use an excessive amount of memory in some cases. This patch changes LazyValueInfoCache to insert overdefined values only into OverDefinedCache. The memory usage decreases by 70 to 75% when one of the files in llvm is compiled. rdar://problem/11388615 Differential revision: http://reviews.llvm.org/D15391 llvm-svn: 255320	2015-12-11 00:49:47 +00:00
Kyle Butt	1452b76f1f	[PPC]: Peephole optimize small accesses to aligned globals. Access to aligned globals gives us a chance to peephole optimize nonzero offsets. If a struct is 4 byte aligned, then accesses to bytes 0-3 won't overflow the available displacement. For example: addis 3, 2, b4v@toc@ha addi 4, 3, b4v@toc@l lbz 5, b4v@toc@l(3) ; This is the result of the current peephole lbz 6, 1(4) ; optimizer lbz 7, 2(4) lbz 8, 3(4) If b4v is 4-byte aligned, we can skip using register 4 because we know that b4v@toc@l+{1,2,3} won't overflow 32K, and instead generate: addis 3, 2, b4v@toc@ha lbz 4, b4v@toc@l(3) lbz 5, b4v@toc@l+1(3) lbz 6, b4v@toc@l+2(3) lbz 7, b4v@toc@l+3(3) Saving a register and an addition. Larger alignments allow larger structures/arrays to be optimized. llvm-svn: 255319	2015-12-11 00:47:36 +00:00
Hans Wennborg	e59910cba9	Check in the script for building Win snapshots llvm-svn: 255318	2015-12-11 00:43:42 +00:00
Vedant Kumar	2491dd118f	[ProfileData] clang-format TextInstrProfReader::hasFormat. NFC. llvm-svn: 255317	2015-12-11 00:40:05 +00:00
Cong Hou	59898d8c68	[X86][SSE] Update the cost table for integer-integer conversions on SSE2/SSE4.1. Previously in the conversion cost table there are no entries for integer-integer conversions on SSE2. This will result in imprecise costs for certain vectorized operations. This patch adds those entries for SSE2 and SSE4.1. The cost numbers are counted from the result of running llc on the new test case in this patch. Differential revision: http://reviews.llvm.org/D15132 llvm-svn: 255315	2015-12-11 00:31:39 +00:00
Xinliang David Li	2d4803e81b	Format fix (NFC) llvm-svn: 255313	2015-12-10 23:48:05 +00:00
Eric Christopher	86e031a889	s/need/needs llvm-svn: 255306	2015-12-10 22:29:26 +00:00
Eric Christopher	325e8d06dc	Fix (bitcast (fabs x)), (bitcast (fneg x)) and (bitcast (fcopysign cst, x)) combines for ppc_fp128, since signbit computation is more complicated. Discussion thread: http://lists.llvm.org/pipermail/llvm-dev/2015-November/092863.html Patch by Tim Shen! llvm-svn: 255305	2015-12-10 22:09:06 +00:00
Eric Christopher	2ec6a49fbf	Attempt to fix the ReST compilation to html of the C API docs. llvm-svn: 255304	2015-12-10 22:04:11 +00:00
Eric Christopher	df2e4d2914	More non-ascii quote characters. llvm-svn: 255303	2015-12-10 21:47:38 +00:00
Eric Christopher	dedacf9c73	Clarify some of the wording on adding a new subcomponent to the C API. llvm-svn: 255302	2015-12-10 21:46:24 +00:00
Eric Christopher	b5c2b8dc92	Fix non-ascii quotes. llvm-svn: 255301	2015-12-10 21:38:56 +00:00
Eric Christopher	d9f8ce9977	Add C API guidelines to the developer policy to match discussions on the llvm mailing lists. llvm-svn: 255300	2015-12-10 21:33:53 +00:00
Kyle Butt	28b01a51b3	PPC: Teach FMA mutate to respect register classes. This was causing bad code gen and assembly that won't assemble, as mixed altivec and vsx code would end up with a vsx high register assigned to an altivec instruction, which won't work. Constraining the classes allows the optimization to proceed. llvm-svn: 255299	2015-12-10 21:28:40 +00:00
Chris Bieneman	dbdec57b56	[CMake] Add LLVM_BUILD_INSTRUMENTED option to enable building with -fprofile-instr-generate This is the first step in supporting PGO data generation via CMake. I've marked the option as advanced and experimental until it is fleshed out further. llvm-svn: 255298	2015-12-10 21:19:07 +00:00
Mike Aizatsky	a1a5c69b57	[LibFuzzer] Introducing FUZZER_FLAG_UNSIGNED and using it for seeding. Differential Revision: http://reviews.llvm.org/D15339 done llvm-svn: 255296	2015-12-10 20:41:53 +00:00
JF Bastien	82bf85ffed	EarlyCSE: add tests Summary: As a follow-up to rL255054 I wasn't able to convince myself that the code did what I thought, so I wrote more tests. Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15371 llvm-svn: 255295	2015-12-10 20:24:34 +00:00
Xinliang David Li	c61289aa4c	Add a forward declaration (NFC) llvm-svn: 255292	2015-12-10 20:13:41 +00:00
Cong Hou	5146b2d1da	Delete a duplicate branch in IfConversion.cpp. NFC. llvm-svn: 255291	2015-12-10 19:57:22 +00:00
Simon Pilgrim	06ea4be281	[DAGCombiner] Fix PR25763 - vector comparison constant folding + sign-extension PR25763 demonstrated an issue with D14683 - vector comparison constant folding only works for i1 results, so we need to split off the sign-extension of the result to the required type. Luckily this can be done with the existing type legalization code. llvm-svn: 255289	2015-12-10 19:47:06 +00:00
Chad Rosier	843c7b4309	[DSE] Disable non-local DSE to see if the bots go green. I see a few bots timing out, so I'm speculatively disabling r255247. llvm-svn: 255286	2015-12-10 19:23:02 +00:00
Rafael Espindola	a8547d35e9	Fix another case where the linkage was not set. llvm-svn: 255272	2015-12-10 18:44:26 +00:00
Rong Xu	2611ff8a27	[PGO] Use %t as the temporary profdata filename in the test cases. Using %t rather than %T/<specific_name> as the temporary profdata filename. llvm-svn: 255271	2015-12-10 18:24:44 +00:00
Duncan P. N. Exon Smith	836f0ddb60	Verifier: Avoid quadratic checking of aggregates for bad bitcasts Avoid O(N^2) behaviour when checking for bad bitcasts in `ConstantExpr`s buried inside of aggregate initializers to `GlobalVariable`s. I've: - centralized the "visited" set for recursing through `ConstantExpr`s so that expressions are only visited once per Verifier run, - removed the duplicate logic for the stack visit, and - avoided recursing into other `GlobalValue`s. This recovers roughly a 100x time difference in clang compiles of a particular input file (filled with large cross-referencing tables) that depends on whether `-disable-llvm-verifier` is on. This slowdown was caused by r187506, which introduced these checks. Now, avoiding `-disable-llvm-verifier` only causes a 2x slowdown for this case. (Interestingly, dumping the textual IR for this file starts at least 50GB of global variable initializers (I don't know the total, since I killed the dump)...) llvm-svn: 255269	2015-12-10 17:56:06 +00:00
Chad Rosier	02fe4248a2	[DeadStoreElimination] Use range-based loops. NFC. llvm-svn: 255265	2015-12-10 17:27:18 +00:00
Nathan Slingerland	51abea7442	[ProfileData] Add unit test infrastructure for sample profile reader/writer Summary: Adds support for in-memory round-trip of sample profile data along with basic round trip unit tests. This will also make it easier to include unit tests for future changes to sample profiling. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15211 llvm-svn: 255264	2015-12-10 17:21:42 +00:00
Pirama Arumuga Nainar	1317d5f311	Fix fptosi, fptoui from f16 vectors to i8, i16 vectors Summary: Convert f16 vectors to corresponding f32 vectors before doing the conversion to int. Add tests for v4f16, v8f16. Reviewers: ab, jmolloy Subscribers: llvm-commits, srhines Differential Revision: http://reviews.llvm.org/D14936 llvm-svn: 255263	2015-12-10 17:16:49 +00:00
Sanjay Patel	c83fd9554a	[InstCombine] fold bitcasts around an extractelement (3rd try) This is a redo of r255137 (reverted at r255227) which was a redo of r255124 (reverted at r255126) with a fixed check for a scalar source type and an added test for the failure that caused the revert. Original commit message: Example: bitcast (extractelement (bitcast <2 x float> %X to <2 x i32>), 1) to float ---> extractelement <2 x float> %X, i32 1 This is part of fixing PR25543: https://llvm.org/bugs/show_bug.cgi?id=25543 The next step will be to generalize this fold: trunc ( lshr ( bitcast X) ) -> extractelement (X) Ie, I'm hoping to replace the existing transform of: bitcast ( trunc ( lshr ( bitcast X))) added by: http://reviews.llvm.org/rL112232 with 2 less specific transforms to catch the case in the bug report. Differential Revision: http://reviews.llvm.org/D14879 llvm-svn: 255261	2015-12-10 17:09:28 +00:00
Teresa Johnson	9f2ff9c669	[ThinLTO] Debug message cleanup (NFC) Added some missing spaces between the module identifier and the start of the debug message. Also added a ":" after the module identifier to make this look a little nicer. llvm-svn: 255259	2015-12-10 16:39:07 +00:00
Rafael Espindola	f81c7b03a0	Avoid undefined behavior when vector is empty. Found by ubsan. llvm-svn: 255258	2015-12-10 16:35:06 +00:00
Sanjay Patel	87c6c0797e	remove duplicated comments and don't repeat function names in comments; NFC llvm-svn: 255257	2015-12-10 16:34:21 +00:00
Teresa Johnson	9d5b71b3d2	[ThinLTO] Release files in gold plugin during combined index (take 2) Ensure we release the files even when they don't hold a function index summary section, by restructuring the control flow a little bit. llvm-svn: 255256	2015-12-10 16:11:23 +00:00
Dan Gohman	28818d7840	[WebAssembly] Tighten up several CHECK tests. llvm-svn: 255255	2015-12-10 14:52:34 +00:00
Rafael Espindola	caabe22832	Split lib/Linker in two. A linker normally has two stages: symbol resolution and "moving stuff". In lib/Linker there is the complication of lazy linking some globals, but it was still far more mixed than it needed to be. This splits the linker into a lower level IRMover and the linker proper. The IRMover just takes a list of globals to move and a callback that lets the user control what is lazy linked. The main motivation is that now tools/gold (and soon lld) can use their own symbol resolution to instruct IRMover what to do. llvm-svn: 255254	2015-12-10 14:19:35 +00:00
Dan Gohman	b949b9c01b	[WebAssembly] Make WebAssemblyStoreResults only return true when it has a change. llvm-svn: 255253	2015-12-10 14:17:36 +00:00
Dan Gohman	a87629d6d7	[WebAssembly] Fix WebAssemblyPeephole to set Changed to true when making changes. llvm-svn: 255252	2015-12-10 14:16:34 +00:00
Dan Gohman	acc0941bd1	[WebAssembly] Declare that WebAssemblyPeephole does not modify the CFG. llvm-svn: 255251	2015-12-10 14:12:04 +00:00
Dan Gohman	6d63f96749	[WebAssembly] Remove an unneeded getAnalysisUsage override. llvm-svn: 255250	2015-12-10 14:10:04 +00:00
Chad Rosier	533bc3fcac	[DeadStoreElimination] Add support for non-local DSE. We extend the search for redundant stores to predecessor blocks that unconditionally lead to the block BB with the current store instruction. That also includes single-block loops that unconditionally lead to BB, and if-then-else blocks where then- and else-blocks unconditionally lead to BB. http://reviews.llvm.org/D13363 Patch by Ivan Baev <ibaev@codeaurora.org>! llvm-svn: 255247	2015-12-10 13:51:43 +00:00
Nemanja Ivanovic	ac8d01add0	Bitcasts between FP and INT values using direct moves This patch corresponds to review: http://reviews.llvm.org/D15286 LLVM IR frequently contains bitcast operations between floating point and integer values of the same width. Doing this through memory operations is quite expensive on PPC. This patch allows the use of direct register moves between FPRs and GPRs for lowering bitcasts. llvm-svn: 255246	2015-12-10 13:35:28 +00:00
Amjad Aboud	a9bcf16ebc	Macro debug info support in LLVM IR Introduced DIMacro and DIMacroFile debug info metadata in the LLVM IR to support macros. Differential Revision: http://reviews.llvm.org/D14687 llvm-svn: 255245	2015-12-10 12:56:35 +00:00
Silviu Baranga	86de80db37	[LLE] Use the PredicatedScalarEvolution interface to query SCEVs for dependences Summary: LAA uses the PredicatedScalarEvolution interface, so it can produce forward/backward dependences having SCEVs that are AddRecExprs only after being transformed by PredicatedScalarEvolution. Use PredicatedScalarEvolution to get the expected expressions. Reviewers: anemet Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D15382 llvm-svn: 255241	2015-12-10 11:07:18 +00:00
Jonas Paulsson	e451eeff5c	[PostRA scheduling] Allow a target to do scheduling when it wants post RA. SystemZ needs to do its scheduling after branch relaxation, which can only happen after block placement, and therefore the standard PostRAScheduler point in the pass sequence is too early. TargetMachine::targetSchedulesPostRAScheduling() is a new method that signals on returning true that target will insert the final scheduling pass on its own. Reviewed by Hal Finkel llvm-svn: 255234	2015-12-10 09:10:07 +00:00
Akira Hatanaka	a3c0e8e1ba	Revert r255137. This commit broke apple's internal bot. llvm-svn: 255227	2015-12-10 08:00:52 +00:00
Sanjoy Das	ccd14566e2	Add arg_begin() and arg_end() to CallInst and InvokeInst; NFCI - This simplifies the CallSite class, arg_begin / arg_end are now simple wrapper getters. - In several places, we were creating CallSite instances solely to call arg_begin and arg_end. With this change, that's no longer required. llvm-svn: 255226	2015-12-10 06:39:02 +00:00
Craig Topper	8e44b9a4d1	[X86] Fix a couple cases where bitwise and logical operations were being mixed. NFC llvm-svn: 255224	2015-12-10 06:09:41 +00:00
Alexey Bataev	860435c8e2	[OPENMP] Make -fopenmp to turn on OpenMP support by default. Patch turns on OpenMP support in clang by default after fixing OpenMP buildbots. Differential Revision: http://reviews.llvm.org/D13802 llvm-svn: 255222	2015-12-10 05:45:58 +00:00
Dan Gohman	f170ba08af	[WebAssembly] Implement mixed-type ISD::FCOPYSIGN. ISD::FCOPYSIGN permits its operands to have differing types, and DAGCombiner uses this. Add some def : Pat rules to expand this out into an explicit conversion and a normal copysign operation. llvm-svn: 255220	2015-12-10 04:55:31 +00:00
Dan Gohman	9341c1d4b3	[WebAssembly] Implement fma. It is lowered to a libcall for now, but this is expected to change in the future. llvm-svn: 255219	2015-12-10 04:52:33 +00:00
Tom Stellard	c2d654322b	AMDGPU/SI: Fix warning introduced by r255204 llvm-svn: 255205	2015-12-10 03:10:46 +00:00
Tom Stellard	c93fc11f36	AMDGPU/SI: Emit constant arrays in the .text section Summary: This allows us to remove the END_OF_TEXT_LABEL hack we had been using and simplifies the fixups used to compute the address of constant arrays. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15257 llvm-svn: 255204	2015-12-10 02:13:01 +00:00
Tom Stellard	b3c3bda512	AMDGPU/SI: Add support for sgpr and vgpr inline assembly constraints Summary: The 's' constraint represents sgprs and the 'v' constraint represents vgprs. Reviewers: arsenm, echristo Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15342 llvm-svn: 255203	2015-12-10 02:12:53 +00:00
Dan Gohman	60bddf17c5	[WebAssembly] Fix legalization of f32->f64 EXTLOAD. llvm-svn: 255202	2015-12-10 02:07:53 +00:00
Derek Schuff	6fd28dfe5d	[WebAssembly] Update known test failures We can now select sign_extend_inreg llvm-svn: 255197	2015-12-10 01:09:40 +00:00
Matthias Braun	7d8e41e82c	RegisterPressure: Factor out liveness dead-def detection logic; NFCI Detecting additional dead-defs without a dead flag that are only visible through liveness information should be part of the register operand collection not intertwined with the register pressure update logic. llvm-svn: 255192	2015-12-10 01:04:15 +00:00
Dan Gohman	a5603b835b	[WebAssembly] Also legalize sign_extend_inreg of i32->i64. llvm-svn: 255191	2015-12-10 01:00:19 +00:00
Derek Schuff	71d0eae609	[WebAssembly] Update test failure expectations llvm-svn: 255190	2015-12-10 00:56:18 +00:00
Dan Gohman	dab313e0ed	PeepholeOptimizer: Ignore dead implicit defs Target-specific instructions may have uninteresting physreg clobbers, for target-specific reasons. The peephole pass doesn't need to concern itself with such defs, as long as they're implicit and marked as dead. llvm-svn: 255182	2015-12-10 00:37:51 +00:00
Dan Gohman	a8483755d3	[WebAssembly] Fix legalization of shift operators with illegal types. llvm-svn: 255181	2015-12-10 00:26:26 +00:00
Dan Gohman	7935fa3d1b	[WebAssembly] Fix copy+pastos. llvm-svn: 255180	2015-12-10 00:22:40 +00:00
Dan Gohman	df00a9ebc2	[WebAssembly] Implement anyext. llvm-svn: 255179	2015-12-10 00:17:35 +00:00
Quentin Colombet	5d2f7cfd44	[X86] Enable shrink-wrapping by default, but keep it disabled for stack frames without a frame pointer when unwind may happen. This is a workaround for a bug in the way we emit the CFI directives for frameless unwind information. See PR25614. llvm-svn: 255175	2015-12-09 23:08:18 +00:00
Sanjay Patel	87d2ae23ac	use range-based for loops; NFCI llvm-svn: 255171	2015-12-09 22:45:45 +00:00
Rafael Espindola	ed11bd286f	Synchronize the logic for deciding to link a gv. We were deciding to not link an available_externally gv over a declaration, but then copying over the body anyway. llvm-svn: 255169	2015-12-09 22:44:00 +00:00
Rong Xu	7dd9b1ea75	[PGO] Rename the profdata filename to avoid the conflict b/w tests. Two tests diag_mismatch.ll and diag_no_funcprofdata.ll generate the same profdata filename which can conflict in current test runs. This patch renames them to have different names. llvm-svn: 255158	2015-12-09 21:27:59 +00:00
Justin Bogner	b7389d6714	IR: Make ConstantDataArray::getFP actually return a ConstantDataArray The ConstantDataArray::getFP(LLVMContext &, ArrayRef<uint16_t>) overload has had a typo in it since it was written, where it will create a Vector instead of an Array. This obviously doesn't work at all, but it turns out that until r254991 there weren't actually any callers of this overload. Fix the typo and add some test coverage. llvm-svn: 255157	2015-12-09 21:21:07 +00:00
Teresa Johnson	db51357c11	[ThinLTO] Release files read when creating combined index in gold plugin This wasn't causing an issue since at HEAD we exit the linker completely after creating the combined index. llvm-svn: 255156	2015-12-09 21:11:42 +00:00
Reid Kleckner	54ade23504	[Float2Int] Don't operate on vector instructions This fixes a crash bug. It's also not clear if we'd want to do this transform for vectors. llvm-svn: 255155	2015-12-09 21:08:18 +00:00
David Blaikie	c3826da895	[llvm-dwp] Sink debug_types.dwo emission into the code parsing the type signatures (NFC) This is a preliminary change towards deduplicating type units based on their signatures. Next change will skip emission of types when their signature has already been seen. llvm-svn: 255154	2015-12-09 21:02:33 +00:00
Rafael Espindola	9edc3b8403	Don't assign a temporary string to a StringRef. Should fix the windows debug and asan bots. llvm-svn: 255149	2015-12-09 20:41:10 +00:00
Sanjoy Das	9abfb0b429	Use WeakVH to keep track of calls with operand bundles in CloneCodeInfo `CloneAndPruneIntoFromInst` can DCE instructions after cloning them into the new function, and so an AssertingVH is too strong. This change switches CloneCodeInfo to use a std::vector<WeakVH>. llvm-svn: 255148	2015-12-09 20:33:52 +00:00
Sanjoy Das	1f8fd88873	Delete trailing whitespace; NFC llvm-svn: 255147	2015-12-09 20:33:45 +00:00
Teresa Johnson	af9e93183d	Delay context construction to when/if it is needed in gold plugin (NFC) llvm-svn: 255146	2015-12-09 19:49:40 +00:00
Teresa Johnson	b13dbd633a	clang-format order of gold-plugin includes (NFC) llvm-svn: 255144	2015-12-09 19:45:55 +00:00
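
Note on e8f9387e0c: the rewrite described in that commit message, (mul x, 2^N + 1) => (add (shl x, N), x) and (mul x, 2^N - 1) => (sub (shl x, N), x), can be checked with a small model. This is only an illustrative sketch in Python, not the X86ISelLowering code: the helper names `is_pow2` and `mul_by_shift` and the 32-bit default width are assumptions, and the sketch ignores the commit's condition that the transform applies only when LEA + SHL or LEA + LEA cannot be used.

```python
def is_pow2(v: int) -> bool:
    """Return True if v is a positive power of two."""
    return v > 0 and (v & (v - 1)) == 0


def mul_by_shift(x: int, c: int, bits: int = 32):
    """Model x * c with one shift and one add/sub for a positive constant c.

    Returns the wrapped product when c has the form 2^N + 1 or 2^N - 1
    (with c > 2), and None when neither form applies. The names and the
    32-bit width are hypothetical, chosen for illustration only.
    """
    mask = (1 << bits) - 1
    x &= mask
    if c > 2 and is_pow2(c - 1):
        n = (c - 1).bit_length() - 1      # c == 2^n + 1
        return ((x << n) + x) & mask      # (add (shl x, n), x)
    if c > 2 and is_pow2(c + 1):
        n = (c + 1).bit_length() - 1      # c == 2^n - 1
        return ((x << n) - x) & mask      # (sub (shl x, n), x)
    return None


# 17 == 2^4 + 1 and 7 == 2^3 - 1, so both are rewritten.
print(mul_by_shift(5, 17), mul_by_shift(5, 7))
```

Constants such as 10 match neither form, so the sketch returns None for them and leaves the multiply alone.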

124950 Commits
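
Note on 1452b76f1f: the commit message argues that for a 4-byte-aligned global, b4v@toc@l+{1,2,3} cannot overflow the 32K displacement. The arithmetic can be sketched as below, under the assumption that @l denotes the low 16 bits of the address interpreted as a signed 16-bit value; the helper names `sign_extend16` and `offsets_fit` are hypothetical and do not come from the patch.

```python
def sign_extend16(v: int) -> int:
    """Interpret the low 16 bits of v as a signed 16-bit integer."""
    v &= 0xFFFF
    return v - 0x10000 if v & 0x8000 else v


def offsets_fit(addr: int, align: int) -> bool:
    """Check that low16(addr) + k fits a signed 16-bit displacement
    for every byte offset k in [0, align).

    Assumes @l is the sign-extended low 16 bits of addr; this is a
    model of the argument in the commit message, not LLVM code.
    """
    low = sign_extend16(addr)
    return all(-32768 <= low + k <= 32767 for k in range(align))


# Every 4-byte-aligned low part leaves room for offsets 1, 2 and 3.
print(all(offsets_fit(a, 4) for a in range(0, 0x10000, 4)))
```

The largest 4-byte-aligned signed low part is 0x7FFC (32764), so adding 3 reaches exactly 32767. An unaligned low part such as 0x7FFF has no such headroom, which is why the alignment matters.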