llvm-project

Commit Graph

Author	SHA1	Message	Date
Teresa Johnson	430110cc0b	[ThinLTO] Wrap dbgs() output in DEBUG macro Missed in a couple places. llvm-svn: 254422	2015-12-01 17:12:10 +00:00
Teresa Johnson	d582f5b3f8	[ThinLTO] Remove stale comment (NFC) Stale as of r254036 which added basic profitability check. llvm-svn: 254421	2015-12-01 16:45:23 +00:00
Rafael Espindola	baa3bf8f76	Bring r254336 back: The difference is that now we don't error on out-of-comdat access to internal global values. We copy them instead. This seems to match the expectation of COFF linkers (see pr25686). Original message: Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct calls to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via MapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254418	2015-12-01 15:19:48 +00:00
Chad Rosier	869962f962	[LIR] Push check into helper function. NFC. llvm-svn: 254416	2015-12-01 14:26:35 +00:00
Evgeniy Stepanov	42f3b12274	[safestack] Protect byval function arguments. Detect unsafe byval function arguments and move them to the unsafe stack. llvm-svn: 254353	2015-12-01 00:40:05 +00:00
Evgeniy Stepanov	a4ac3f4bdf	[safestack] Fix handling of array allocas. The current code does not take alloca array size into account and, as a result, considers any access past the first array element to be unsafe. llvm-svn: 254350	2015-12-01 00:06:13 +00:00
Rafael Espindola	e9841a6bb5	This reverts commit r254336 and r254344. They broke a bot and I am debugging why. llvm-svn: 254347	2015-11-30 23:54:19 +00:00
Rafael Espindola	c109200c53	Start deciding earlier what to link. A traditional linker is roughly split in symbol resolution and "copying stuff". The two tasks are badly mixed in lib/Linker. This starts splitting them apart. With this patch there are no direct calls to linkGlobalValueBody or linkGlobalValueProto. Everything is linked via MapValue. This also includes a few fixes: * A GV goes undefined if the comdat is dropped (comdat11.ll). * We error if an internal GV goes undefined (comdat13.ll). * We don't link an unused comdat. The first two match the behavior of an ELF linker. The second one is equivalent to running globaldce on the input. llvm-svn: 254336	2015-11-30 22:01:43 +00:00
Davide Italiano	1aeed6a955	[SimplifyLibCalls] Transform log(exp2(y)) to y*log(2) under fast-math. llvm-svn: 254317	2015-11-30 19:36:35 +00:00
Sanjay Patel	b67076c0f8	fix typos in comments; NFC llvm-svn: 254266	2015-11-29 22:09:34 +00:00
Davide Italiano	0b14f29285	[SimplifyLibCalls] Don't crash if the function doesn't have a name. llvm-svn: 254265	2015-11-29 21:58:56 +00:00
Davide Italiano	e2db58cfb8	[SimplifyLibCalls] Cross out implemented transformations. llvm-svn: 254264	2015-11-29 21:00:43 +00:00
Davide Italiano	b8b7133c94	[SimplifyLibCalls] Transform log(pow(x, y)) -> y*log(x). This one is enabled only under -ffast-math. There are cases where the difference between the value computed and the correct value is huge even for ffast-math, e.g. as Steven pointed out: x = -1, y = -4 log(pow(-1), 4) = 0 4*log(-1) = NaN I checked what GCC does and apparently they do the same optimization (which results in the dramatic difference). Future work might try to make this (slightly) less bad. Differential Revision: http://reviews.llvm.org/D14400 llvm-svn: 254263	2015-11-29 20:58:04 +00:00
Diego Novillo	7ff0a174d1	SamplePGO - Do not use std::to_string in diagnostics. This fixes buildbots on systems where std::to_string is not present. It also tidies the output of the diagnostic to render doubles a bit better (thanks Ben Kramer for help with string streams and format). llvm-svn: 254261	2015-11-29 18:23:26 +00:00
Craig Topper	d896b03e4c	Remove an intermediate lambda. NFC llvm-svn: 254246	2015-11-29 05:38:08 +00:00
Davide Italiano	da3beebad1	[SimplifyLibCalls] Use any_of(). Suggested by David Blaikie! llvm-svn: 254239	2015-11-28 22:27:48 +00:00
Benjamin Kramer	89766e5b1d	[SimplifyLibCalls] Fix inverted condition that led to an uninitialized memory read below. Found by msan! llvm-svn: 254238	2015-11-28 21:43:12 +00:00
Craig Topper	e471cf32a0	Use range-based for loops. NFC llvm-svn: 254222	2015-11-28 08:23:04 +00:00
Diego Novillo	84f06cc835	SamplePGO - Add initial support for inliner annotations. This adds two thresholds to the sample profiler to affect inlining decisions: the concept of global hotness and coldness. Functions that have accumulated more than a certain fraction of samples at runtime, are annotated with the InlineHint attribute. Conversely, functions that accumulate less than a certain fraction of samples, are annotated with the Cold attribute. This is very similar to the hints emitted by Clang when using instrumentation profiles. Notice that this is a very blunt instrument. A function may have globally collected a significant fraction of samples, but that does not necessarily mean that every callsite for that function is hot. Ideally, we would annotate each callsite with the samples collected at that callsite. This way, the inliner can incorporate all these weights into its cost model. Once the inliner offers this functionality, we can change the hints emitted here to a more precise per-callsite annotation. For now, this is providing some measure of speedups with our internal benchmarks. I've observed speedups of up to 23% (though the geo mean is about 3%). I expect these numbers to improve as the inliner gets better annotations. llvm-svn: 254212	2015-11-27 23:14:51 +00:00
Diego Novillo	b579240875	SamplePGO - Fix default threshold for hot callsites. Based on testing of internal benchmarks, I'm lowering this threshold to a value of 0.1%. This means that SamplePGO will respect 99.9% of the original inline decisions when following a profile. The performance difference is noticeable in some tests. With the previous threshold, the speedups over baseline -O2 was about 0.63%. With the new default, the speedups are around 3% on average. The point of this threshold is not to do more aggressive inlining. When an inlined callsite crosses this threshold, SamplePGO will redo the inline decision so that it can better apply the input profile. By respecting most original inline decisions, we can apply more of the input profile because the shape of the code follows the profile more closely. In the next series, I'll be looking at adding some inline hints for the cold callsites and for toplevel functions that are hot/cold as well. llvm-svn: 254211	2015-11-27 23:14:49 +00:00
Rafael Espindola	19b52383c5	Simplify the linking of recursive data. Now the ValueMapper has two callbacks. The first one maps the declaration. The ValueMapper records the mapping and then materializes the body/initializer. llvm-svn: 254209	2015-11-27 20:28:19 +00:00
Adhemerval Zanella	d93c0c4dc4	[sanitizer] [dfsan] Unify aarch64 mapping This patch changes the DFSan instrumentation for aarch64 so that, instead of using the fixed application mask defined by SANITIZER_AARCH64_VMA, it reads the application shadow mask value from compiler-rt. The value is initialized based on runtime VMA detection. Along with this patch, a compiler-rt one will also be added to export the shadow mask variable. llvm-svn: 254196	2015-11-27 12:42:39 +00:00
Davide Italiano	ac0953a2e6	[SimplifyLibCalls] Use range-based loop. NFC. llvm-svn: 254193	2015-11-27 08:05:40 +00:00
Charlie Turner	54336a5a4e	[LoopVectorize] Use MapVector rather than DenseMap for MinBWs. The order in which instructions are truncated in truncateToMinimalBitwidths affects code generation. Switch to a map with a deterministic order, since the iteration order over a DenseMap is not defined. This code is not hot, so the difference in container performance isn't interesting. Many thanks to David Blaikie for making me aware of MapVector! Fixes PR25490. Differential Revision: http://reviews.llvm.org/D14981 llvm-svn: 254179	2015-11-26 20:39:51 +00:00
Rafael Espindola	8934577171	Disallow aliases to available_externally. They are as much trouble as aliases to declarations. They are requiring the code generator to define a symbol with the same value as another symbol, but the second symbol is undefined. If representing this is important for some optimization, we could add support for available_externally aliases. They would be required to point to a declaration (or available_externally definition). llvm-svn: 254170	2015-11-26 19:22:59 +00:00
Benjamin Kramer	fb419e71f4	[SimplifyLibCalls] Don't depend on a called function having a name, it might be an indirect call. Fixes the crasher in PR25651 and related crashers using the same pattern. llvm-svn: 254145	2015-11-26 09:51:17 +00:00
Evgeniy Stepanov	9842d61ca4	[safestack] Fix alignment of dynamic allocas. Fixes PR25588. llvm-svn: 254109	2015-11-25 22:52:30 +00:00
Davide Italiano	dd04fee8a6	[SCCP] More informative message if we don't know how to handle a terminator. llvm-svn: 254093	2015-11-25 21:03:36 +00:00
Sanjoy Das	c521c7bea5	[OperandBundles] Extract duplicated code into a helper function, NFC llvm-svn: 254047	2015-11-25 00:42:24 +00:00
Sanjoy Das	7629346193	[InstCombine] Don't drop operand bundles Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14857 llvm-svn: 254046	2015-11-25 00:42:19 +00:00
Rong Xu	25c106b347	[PGO] Revert revision r254021,r254028,r254035 Revert the above revision due to multiple issues. llvm-svn: 254040	2015-11-24 23:49:08 +00:00
Teresa Johnson	3930361969	[ThinLTO] Add option to limit importing based on instruction count Add a simple initial heuristic to control importing based on the number of instructions recorded in the function's summary. Add option to control the limit, and test using option. llvm-svn: 254036	2015-11-24 22:55:46 +00:00
Diego Novillo	0b6985a3c6	SamplePGO - Add test for hot/cold inlined functions. When the original binary is executed and sampled, the resulting profile contains information on the original inline stack. We currently follow the original inline plan if we notice that the inlined callsite has more than 0 samples to it. A better way is to determine whether the callsite is actually worth inlining. If the callsite accumulates a small fraction of the samples spent in the parent function, then we don't want to bother inlining it (as it means that the callsite is actually cold). This patch introduces a threshold expressed in percentage of samples in relation to the parent function. If the callsite uses less than N% of the total samples used by its parent, the original inline decision is not re-applied. I've set the threshold to the very arbitrary value of 5%. I'm yet to do any actual experiments to see what's a good value. I wanted to separate the basic mechanism from the tuning. llvm-svn: 254034	2015-11-24 22:38:37 +00:00
Rong Xu	4dd22b8d2b	[PGO] Fix build errors in x86_64-darwin Fix buildbot failure for x86_64-darwin due to r254021 llvm-svn: 254028	2015-11-24 21:55:50 +00:00
Rong Xu	1b665ca707	[PGO] MST based PGO instrumentation infrastructure This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees that a minimum number of CFG edges get instrumented. An additional optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 254021	2015-11-24 21:31:25 +00:00
Teresa Johnson	d450da3281	[ThinLTO] Refactor function body scan during importing into helper (NFC) llvm-svn: 254020	2015-11-24 21:15:19 +00:00
Teresa Johnson	130de7af7f	[ThinLTO] Enable iterative importing in FunctionImport pass Analyze imported function bodies and add any new external calls to the worklist for importing. Currently no controls on the importing so this will end up importing everything possible in the call tree below the importing module. Basic profitability checks coming next. Update test to check for iteratively inlined functions. llvm-svn: 254011	2015-11-24 19:55:04 +00:00
Weiming Zhao	45d4cb9a14	[Utils] Put includes in correct order. NFC. Summary: Followed the guidelines in: http://llvm.org/docs/CodingStandards.html#include-style However, I noticed that uppercase named headers come before lowercase ones throughout the codebase. So kept them as is. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide, jmolloy, atrick Subscribers: sanjoy Differential Revision: http://reviews.llvm.org/D14939 llvm-svn: 254005	2015-11-24 18:57:06 +00:00
Sanjay Patel	968e91aea0	[InstCombine] fix propagation of fast-math-flags Noticed while working on D4583: http://reviews.llvm.org/D4583 llvm-svn: 253997	2015-11-24 17:51:20 +00:00
Sanjay Patel	739f2ce93a	use convenience function for copying IR flags; NFCI llvm-svn: 253996	2015-11-24 17:16:33 +00:00
Teresa Johnson	17626654fd	[ThinLTO] Fix FunctionImport alias checking and test Skip imports for weak_any aliases as well. Fix the test to check non-import of weak aliases and functions, and import of normal alias. llvm-svn: 253991	2015-11-24 16:10:43 +00:00
Ismail Donmez	65487e2d7e	Fix build after r253954 llvm-svn: 253969	2015-11-24 09:48:09 +00:00
Mehdi Amini	42418aba58	Add a FunctionImporter helper to perform summary-based cross-module function importing Summary: This is a helper to perform cross-module import for ThinLTO. Right now it naively imports every possible called function. Reviewers: tejohnson Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14914 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253954	2015-11-24 06:07:49 +00:00
Chad Rosier	a15b4b6af2	[LIR] Put includes in correct order. NFC. llvm-svn: 253915	2015-11-23 21:09:13 +00:00
Diego Novillo	243ea6a7d6	SamplePGO - Add coverage tracking for samples. The existing coverage tracker counts the number of records that were used from the input profile. An alternative view of coverage is to check how many available samples were applied. This way, if the profile contains several records with few samples, it doesn't really matter much that they were not applied. The more interesting records to apply are the ones that contribute many samples. llvm-svn: 253912	2015-11-23 20:12:21 +00:00
Andrew Kaylor	0615a0e65d	[WinEH] Fix a case where GVN could incorrectly PRE a load into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253908	2015-11-23 19:51:41 +00:00
Xinliang David Li	c7c1f8581a	[PGO] Introduce alignment macro for instr-prof control data(NFC) llvm-svn: 253893	2015-11-23 18:02:59 +00:00
Diego Novillo	1ca881c4bb	SamplePGO - Clear coverage tracking when clearing per-function data. llvm-svn: 253877	2015-11-23 16:30:17 +00:00
Diego Novillo	39ab68f39b	SamplePGO - Use newly introduced local variable. NFC. llvm-svn: 253868	2015-11-23 15:24:13 +00:00
Davide Italiano	945d05f6a0	[LoopStrengthReduce] Mark dump() definitions as LLVM_DUMP_METHOD. llvm-svn: 253841	2015-11-23 02:47:30 +00:00


13964 Commits