This patch makes the Merge Functions pass faster by calculating and comparing
a hash value which captures the essential structure of a function before
performing a full function comparison.
The hash is calculated by hashing the function signature, then walking the basic
blocks of the function in the same order as the main comparison function. The
opcode of each instruction is hashed in sequence, which means that functions
which compare equal under the existing total order are guaranteed to have the
same hash, because that comparison requires the opcodes of the two functions to
appear in the same order.
The hash function is a static member of the FunctionComparator class because it
is tightly coupled to the exact comparison function used. For example, functions
which are equivalent modulo a single variant callsite might be merged by a more
aggressive MergeFunctions, and the hash function would need to be insensitive to
these differences in order to exploit this.
The hashing function uses a utility class which accumulates the values into an
internal state using a standard bit-mixing function. Note that this is a
different interface from a regular hashing routine, because the values to be
hashed are scattered amongst the properties of an llvm::Function rather than
laid out linearly in memory. This scheme is
fast because only one word of state needs to be kept, and the mixing function is
a few instructions.
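As a rough sketch of the scheme described above (not the patch's actual code:
the accumulator name, seed, and mixing constants are assumptions, and the block
walk is approximated by the function's layout order rather than the
comparator's exact order):

#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/Function.h"
#include "llvm/IR/Instruction.h"
#include <cstdint>

// One word of internal state, updated by a cheap xor/multiply/shift mixer.
class HashAccumulator64 {
  uint64_t Hash = 0x6acaa36bef8325c5ULL; // arbitrary non-zero seed (assumption)
public:
  void add(uint64_t V) {
    Hash ^= V;
    Hash *= 0x100000001b3ULL;
    Hash ^= Hash >> 32;
  }
  uint64_t getHash() const { return Hash; }
};

// Hash the signature, then the opcode of every instruction, walking the
// blocks of the function.
static uint64_t functionHashSketch(const llvm::Function &F) {
  HashAccumulator64 H;
  H.add(F.isVarArg());
  H.add(F.arg_size());
  for (const llvm::BasicBlock &BB : F)
    for (const llvm::Instruction &I : BB)
      H.add(I.getOpcode());
  return H.getHash();
}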
The main runOnModule function first computes the hash of each function, and only
further processes functions which do not have a unique function hash. The hash
is also used to order the sorted function set. If the hashes differ, their
values are used to order the functions; otherwise, the full comparison is done.
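A hedged sketch of how the hash can drive the ordering just described
(FunctionNode, lessThan, and fullCompare are illustrative names, not the pass's
actual types; fullCompare stands in for the FunctionComparator-based total
order):

#include "llvm/IR/Function.h"
#include <cstdint>

// Each candidate carries its precomputed hash alongside the function.
struct FunctionNode {
  llvm::Function *F;
  uint64_t Hash; // computed once, up front, in runOnModule
};

// Defined elsewhere in this sketch; returns <0, 0, >0 like a three-way compare.
int fullCompare(llvm::Function *L, llvm::Function *R);

// Order by hash first; only on a tie do the expensive structural comparison.
bool lessThan(const FunctionNode &L, const FunctionNode &R) {
  if (L.Hash != R.Hash)
    return L.Hash < R.Hash;
  return fullCompare(L.F, R.F) < 0;
}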
Both of these are helpful in speeding up MergeFunctions. Together they result in
speedups of 9% for mysqld (a mostly C application with little redundancy), 46%
for libxul in Firefox, and 117% for Chromium. (These are all LTO builds.) In all
three cases, the new speed of MergeFunctions is about half that of the module
verifier, making it relatively inexpensive even for large LTO builds with
hundreds of thousands of functions. The same functions are merged, so this
change is free performance.
Author: jrkoenig
Reviewers: nlewycky, dschuff, jfb
Subscribers: llvm-commits, aemerson
Differential revision: http://reviews.llvm.org/D11923
llvm-svn: 245140
This enables Clang to correctly handle code such as:
struct __declspec(dllexport) S {
  int x = 42;
};
where it would otherwise error due to trying to generate the default
constructor before the in-class initializer for x has been parsed.
Differential Revision: http://reviews.llvm.org/D11850
llvm-svn: 245139
This seems to work only some of the time. In some situations,
it seems to use a nonsensical type and isn't actually aware of the
memory being accessed, e.g. if the branch condition is an icmp of a pointer,
it checks the addressing mode of i1.
llvm-svn: 245137
True branch instructions do behave as expected with liveness.
Avoid the phrasing "branch decision is based on a value in an SGPR"
because this could be misleading. A VALU compare instruction's
result is still based on an SGPR, even though that condition
may be divergent.
llvm-svn: 245131
Neither of these libraries has been ported to Windows. If they are ever
ported, we can re-enable these tests. But more immediately, we need to add
new data formatters for MSVC's STL implementation. This is tracked in
http://llvm.org/pr24460.
llvm-svn: 245125
Summary: Add a check to emit a diagnostic for an invalid tag when 'concept' is specified, along with associated tests.
Reviewers: rsmith, hubert.reinterpretcast, fraggamuffin, faisalv, aaron.ballman
Subscribers: aaron.ballman, cfe-commits
Differential Revision: http://reviews.llvm.org/D11916
llvm-svn: 245123
Summary:
http://reviews.llvm.org/D11212 made Scalar Evolution able to propagate NSW and NUW flags from instructions to SCEVs for add instructions. This patch expands that to sub, mul and shl instructions.
This change makes LSR able to generate pointer induction variables for loops like these, where the index is 32-bit and the pointer is 64-bit:
for (int i = 0; i < numIterations; ++i)
  sum += ptr[i - offset];

for (int i = 0; i < numIterations; ++i)
  sum += ptr[i * stride];

for (int i = 0; i < numIterations; ++i)
  sum += ptr[3 * (i << 7)];
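For the first loop above, a hedged picture of the pointer-induction-variable
form LSR can now produce (not code from the patch; the float element type and
the names are assumptions):

// Because the 32-bit "i - offset" arithmetic is known not to wrap (nsw), the
// address can be carried in one 64-bit pointer advanced each iteration rather
// than re-derived from a sign-extended index every time.
float sumExample(const float *ptr, int numIterations, int offset) {
  float sum = 0.0f;
  const float *p = ptr - offset; // hoisted base; relies on nsw of (i - offset)
  for (int i = 0; i < numIterations; ++i, ++p)
    sum += *p;                   // single pointer IV, no per-iteration sext/GEP
  return sum;
}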
Reviewers: atrick, sanjoy
Subscribers: sanjoy, majnemer, hfinkel, llvm-commits, meheff, jingyue, eliben
Differential Revision: http://reviews.llvm.org/D11860
llvm-svn: 245118
Although targeting CoreCLR is similar to targeting MSVC, there are
certain important differences that the backend must be aware of
(e.g. differences in stack probes, EH, and library calls).
Differential Revision: http://reviews.llvm.org/D11012
llvm-svn: 245115
We canonicalize V64 vectors to V128 through insert_subvector: the other
FMLA/FMLS/FMUL/FMULX patterns match that already, but this one doesn't,
so we'd fail to match fmls and generate fneg+fmla instead.
The vector equivalents are already tested and functional.
llvm-svn: 245107
Add checkers that detect code-level localizability issues for OS X / iOS:
- A path-sensitive checker that warns about uses of non-localized
  NSStrings passed to UI methods expecting localized strings.
- A syntax checker that warns against not including a comment in
  NSLocalizedString macros.
A patch by Kulpreet Chilana!
(This is the second attempt with the compilation issue on Windows and
the random test failures resolved.)
llvm-svn: 245093
Summary:
It always makes NewBB the entry of the region instead of OldBB. This breaks if there are edges from inside the region to OldBB. OldBB is moved out of the region and hence there are exiting edges to OldBB and the region's exit block, contradicting the single-exit condition for regions.
The only use from Polly is going to be removed, hence I propose to remove the function completely.
Reviewers: grosser
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D11873
llvm-svn: 245092
This modifies the order in which Polly passes are executed.
Assuming a function has two scops (A and B), the order before was:
FunctionPassManager
  ScopDetection
  IndependentBlocks
  TempScopInfo for A and B
  RegionPassManager
    ScopInfo for A
    DependenceInfo for A
    IslScheduleOptimizer for A
    IslAstInfo for A
    CodeGeneration for A
    ScopInfo for B
    DependenceInfo for B
    IslScheduleOptimizer for B
    IslAstInfo for B
    CodeGeneration for B
After this patch:
FunctionPassManager
  ScopDetection
  IndependentBlocks
  RegionPassManager
    TempScopInfo for A
    ScopInfo for A
    DependenceInfo for A
    IslScheduleOptimizer for A
    IslAstInfo for A
    CodeGeneration for A
    TempScopInfo for B
    ScopInfo for B
    DependenceInfo for B
    IslScheduleOptimizer for B
    IslAstInfo for B
    CodeGeneration for B
TempScopInfo for B might store information and references to the IR
that CodeGeneration for A might modify. Changing the order ensures that
the IR is not modified between the analysis of a region and its code
generation.
Reviewers: grosser
Differential Revision: http://reviews.llvm.org/D12014
llvm-svn: 245091
Another step towards isolating all language/AST-specific code into its own files, to further abstract specific implementations of parsing types for a given language.
llvm-svn: 245090
It is flaky due to inability to remove files with open handles. We
could paper over it with rm -f, but then the file would still be
present.
This is more evidence to me that we should roll our own 'rm'
implementation in LLVM.
llvm-svn: 245083
This patch makes the Darwin ARM backend take advantage of TargetParser. It
also teaches TargetParser about ARMV7K for the first time. This makes target
triple parsing more consistent across LLVM.
Differential Revision: http://reviews.llvm.org/D11996
llvm-svn: 245081
Summary:
MSDN says that fastcall, stdcall, thiscall, and vectorcall are all
accepted but ignored on ARM and X64.
https://msdn.microsoft.com/en-us/library/984x0h58.aspx
MSDN also says that cdecl is accepted and typically ignored.
This patch brings ARM in line with how we ignore them for X64.
Reviewers: rnk
Subscribers: compnerd, cfe-commits
Differential Revision: http://reviews.llvm.org/D12034
llvm-svn: 245076
This patch fixes the x86 implementation of allowsMisalignedMemoryAccesses() to correctly
return the 'Fast' output parameter for 32-byte accesses. To test that, an existing load
merging optimization is changed to use the TLI hook. This exposes a shortcoming in the
current logic and results in the regression test update. Changing other direct users of
the isUnalignedMem32Slow() x86 CPU attribute would be a follow-on patch.
Without the fix in allowsMisalignedMemoryAccesses(), we would loop infinitely when targeting
SandyBridge because LowerINSERT_SUBVECTOR() creates 32-byte loads from two 16-byte loads
while PerformLOADCombine() splits them back into 16-byte loads.
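A hedged sketch of the TLI query involved (the hook's parameter list has
changed across LLVM releases, so the arguments and header paths below are
approximate, not the patch's literal code):

#include "llvm/CodeGen/TargetLowering.h"   // header path varies by LLVM version
#include "llvm/Support/MachineValueType.h" // for MVT; also version-dependent

// True when a single unaligned 32-byte (v8f32) access is reported as both
// allowed and fast, i.e. when merging two 16-byte loads is worthwhile.
static bool prefer32ByteLoad(const llvm::TargetLowering &TLI) {
  bool Fast = false;
  return TLI.allowsMisalignedMemoryAccesses(llvm::MVT::v8f32, /*AddrSpace=*/0,
                                            /*Alignment=*/1, &Fast) &&
         Fast;
}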
Differential Revision: http://reviews.llvm.org/D10662
llvm-svn: 245075
Summary:
This involved various fixes:
- Move a test that uses ulimit to Posix.
- Add a few "REQUIRES: shell" lines to tests using backtick subshell
  evaluation.
- By default, the MSVC CRT buffers stdio if the output is a pipe. Some
  tests need that disabled to avoid interleaving test stdio with asan
  output.
- MSVC headers provide _alloca instead of alloca (go figure), so add a
  portability macro to the two alloca tests (a sketch follows this list).
- XFAIL tests that rely on accurate symbols; we need to pass more flags
  to make that work.
- MSVC's printf implementation of %p uses uppercase letters and doesn't
  add 0x, so do that manually.
- Accept "SEGV" or "access-violation" reports in crash tests.
Reviewers: samsonov
Subscribers: tberghammer, danalbert, llvm-commits, srhines
Differential Revision: http://reviews.llvm.org/D12019
llvm-svn: 245073