llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Gottesman	4d078a3d6f	[block-freq] Add BlockFrequency::scale that returns a remainder from the division and make the private scale in BlockFrequency more performant. This change is the first in a series of changes improving LLVM's Block Frequency propogation implementation to not lose probability mass in branchy code when propogating block frequency information from a basic block to its successors. This patch is a simple infrastructure improvement that does not actually modify the block frequency algorithm. The specific changes are: 1. Changes the division algorithm used when scaling block frequencies by branch probabilities to a short division algorithm. This gives us the remainder for free as well as provides a nice speed boost. When I benched the old routine and the new routine on a Sandy Bridge iMac with disabled turbo mode performing 8192 iterations on an array of length 32768, I saw ~600% increase in speed in mean/median performance. 2. Exposes a scale method that returns a remainder. This is important so we can ensure that when we scale a block frequency by some branch probability BP = N/D, the remainder from the division by D can be retrieved and propagated to other children to ensure no probability mass is lost (more to come on this). llvm-svn: 194950	2013-11-17 03:25:24 +00:00
Chandler Carruth	a8df47603a	[PM] Completely remove support for explicit 'require' methods on the AnalysisManager. All this method did was assert something and we have a perfectly good way to trigger that assert from the query path. llvm-svn: 194947	2013-11-17 03:18:05 +00:00
Andrew Trick	10d5be4e6e	Added a size field to the stack map record to handle subregister spills. Implementing this on bigendian platforms could get strange. I added a target hook, getStackSlotRange, per Jakob's recommendation to make this as explicit as possible. llvm-svn: 194942	2013-11-17 01:36:23 +00:00
Hal Finkel	bf45efde2d	Add a loop rerolling pass This adds a loop rerolling pass: the opposite of (partial) loop unrolling. The transformation aims to take loops like this: for (int i = 0; i < 3200; i += 5) { a[i] += alpha * b[i]; a[i + 1] += alpha * b[i + 1]; a[i + 2] += alpha * b[i + 2]; a[i + 3] += alpha * b[i + 3]; a[i + 4] += alpha * b[i + 4]; } and turn them into this: for (int i = 0; i < 3200; ++i) { a[i] += alpha * b[i]; } and loops like this: for (int i = 0; i < 500; ++i) { x[3i] = foo(0); x[3i+1] = foo(0); x[3*i+2] = foo(0); } and turn them into this: for (int i = 0; i < 1500; ++i) { x[i] = foo(0); } There are two motivations for this transformation: 1. Code-size reduction (especially relevant, obviously, when compiling for code size). 2. Providing greater choice to the loop vectorizer (and generic unroller) to choose the unrolling factor (and a better ability to vectorize). The loop vectorizer can take vector lengths and register pressure into account when choosing an unrolling factor, for example, and a pre-unrolled loop limits that choice. This is especially problematic if the manual unrolling was optimized for a machine different from the current target. The current implementation is limited to single basic-block loops only. The rerolling recognition should work regardless of how the loop iterations are intermixed within the loop body (subject to dependency and side-effect constraints), but the significant restriction is that the order of the instructions in each iteration must be identical. This seems sufficient to capture all current use cases. This pass is not currently enabled by default at any optimization level. llvm-svn: 194939	2013-11-16 23:59:05 +00:00
Benjamin Kramer	c6f955763e	ScalarEvolution: Warn if the result of setFlags/clearFlags is unused. This was a source of bugs in the past. llvm-svn: 194929	2013-11-16 16:25:47 +00:00
Benjamin Kramer	5f2768c377	Annotate APInt methods where it's not clear whether they are in place with warn_unused_result. Fix ScalarEvolution bugs uncovered by this. llvm-svn: 194928	2013-11-16 16:25:41 +00:00
Duncan P. N. Exon Smith	38fc2e7a47	Fix filename in header comment llvm-svn: 194924	2013-11-16 15:40:54 +00:00
Jim Grosbach	664d148a92	X86: Encode the 'h' cpu subtype in the MachO header for x86. llvm-svn: 194906	2013-11-16 00:52:57 +00:00
Ana Pazos	d035209bd7	Implemented aarch64 Neon scalar vmulx_lane intrinsics Implemented aarch64 Neon scalar vfma_lane intrinsics Implemented aarch64 Neon scalar vfms_lane intrinsics Implemented legacy vmul_n_f64, vmul_lane_f64, vmul_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. Implemented legacy vfma_lane_f64, vfms_lane_f64, vfma_laneq_f64, vfms_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. llvm-svn: 194888	2013-11-15 23:32:10 +00:00
Juergen Ributzka	dbedae89b9	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 194865	2013-11-15 22:34:48 +00:00
Chad Rosier	0c57c3402e	[AArch64] Fix the scalar NEON ACLE functions so that they return float/double rather than the vector equivalent. llvm-svn: 194853	2013-11-15 21:28:10 +00:00
Rui Ueyama	e448f9e418	Path: Recognize COFF import library file magic. Summary: Make identify_magic to recognize COFF import file. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2165 llvm-svn: 194852	2013-11-15 21:22:02 +00:00
Rui Ueyama	15ba1e20db	Readobj: If NumbersOfSections is 0xffff, it's an COFF import library. 0xffff does not mean that there are 65535 sections in a COFF file but indicates that it's a COFF import library. This patch fixes SEGV error when an import library file is passed to llvm-readobj. llvm-svn: 194844	2013-11-15 20:23:25 +00:00
Bob Wilson	9f3e6b25ee	Avoid illegal integer promotion in fastisel Stop folding constant adds into GEP when the type size doesn't match. Otherwise, the adds' operands are effectively being promoted, changing the conditions of an overflow. Results are different when: sext(a) + sext(b) != sext(a + b) Problem originally found on x86-64, but also fixed issues with ARM and PPC, which used similar code. <rdar://problem/15292280> Patch by Duncan Exon Smith! llvm-svn: 194840	2013-11-15 19:09:27 +00:00
Cameron McInally	ad41f1f693	Add AVX512 unmasked FMA intrinsics and support. llvm-svn: 194824	2013-11-15 17:01:14 +00:00
Daniel Sanders	50b8041066	Fix illegal DAG produced by SelectionDAG::getConstant() for v2i64 type Summary: When getConstant() is called for an expanded vector type, it is split into multiple scalar constants which are then combined using appropriate build_vector and bitcast operations. In addition to the usual big/little endian differences, the case where the element-order of the vector does not have the same endianness as the elements themselves is also accounted for. For example, for v4i32 on big-endian MIPS, the byte-order of the vector is <3210,7654,BA98,FEDC>. For little-endian, it is <0123,4567,89AB,CDEF>. Handling this case turns out to be a nop since getConstant() returns a splatted vector (so reversing the element order doesn't change the value) This fixes a number of cases in MIPS MSA where calling getConstant() during operation legalization introduces illegal types (e.g. to legalize v2i64 UNDEF into a v2i64 BUILD_VECTOR of illegal i64 zeros). It should also handle bigger differences between illegal and legal types such as legalizing v2i64 into v8i16. lowerMSASplatImm() in the MIPS backend no longer needs to avoid calling getConstant() so this function has been updated in the same patch. For the sake of transparency, the steps I've taken since the review are: * Added 'virtual' to isVectorEltOrderLittleEndian() as requested. This revealed that the MIPS tests were falsely passing because a polymorphic function was not actually polymorphic in the reviewed patch. * Fixed the tests that were now failing. This involved deleting the code to handle the MIPS MSA element-order (which was previously doing an byte-order swap instead of an element-order swap). This left isVectorEltOrderLittleEndian() unused and it was deleted. * Fixed build failures caused by rebasing beyond r194467-r194472. These build failures involved the bset, bneg, and bclr instructions added in these commits using lowerMSASplatImm() in a way that was no longer valid after this patch. Some of these were fixed by calling SelectionDAG::getConstant() instead, others were fixed by a new function getBuildVectorSplat() that provided the removed functionality of lowerMSASplatImm() in a more sensible way. Reviewers: bkramer Reviewed By: bkramer CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1973 llvm-svn: 194811	2013-11-15 12:56:49 +00:00
Matt Arsenault	c5559bb14b	Add target hook to prevent folding some bitcasted loads. This is to avoid this transformation in some cases: fold (conv (load x)) -> (load (conv*)x) On architectures that don't natively support some vector loads efficiently casting the load to a smaller vector of larger types and loading is more efficient. Patch by Micah Villmow. llvm-svn: 194783	2013-11-15 04:42:23 +00:00
Peter Zotov	0e38fc8d5e	[llvm-c] Add missing const qualifiers to LLVMCreateTargetMachine llvm-svn: 194770	2013-11-15 02:51:12 +00:00
Peter Zotov	b2c8b8a460	[llvm-c] Simplify signature of LLVMGetTargetFromName LLVMGetTargetFromName was not yet present in an LLVM release, so this does not break compatibility. llvm-svn: 194769	2013-11-15 02:51:01 +00:00
Matt Arsenault	b03bd4d96b	Add addrspacecast instruction. Patch by Michele Scandale! llvm-svn: 194760	2013-11-15 01:34:59 +00:00
Rui Ueyama	08c0b1a1bd	Include raw_ostream.h. Including only Debug.h did not cause a compilation error, but you couldn't do anything (like writing something with <<) to raw_ostreams returned by llvm::dbgs() or llvm::errs() without including raw_ostream.h. So including it from Debug.h should make sense. Differential Revision: http://llvm-reviews.chandlerc.com/D2183 llvm-svn: 194759	2013-11-15 01:25:34 +00:00
Chandler Carruth	d9a328437e	Fix the header comment of the new pass manager stuff to not claim to be the legacy stuff. =] llvm-svn: 194689	2013-11-14 10:55:14 +00:00
Kevin Qin	afc8bdfd57	[AArch64 neon] support poly64 and relevant intrinsic functions. llvm-svn: 194659	2013-11-14 03:27:58 +00:00
Kevin Qin	aec95baf1a	Implement aarch64 neon instruction class SIMD misc. llvm-svn: 194656	2013-11-14 02:44:13 +00:00
Nick Kledzik	dd34f77cbd	Add dyn_cast<> support to YAML I/O's IO class llvm-svn: 194655	2013-11-14 02:38:07 +00:00
Michael Gottesman	fd8aee76eb	Added BlockFrequencyInfo::view for displaying the block frequency propagation graph via graphviz. This is useful for debugging issues in the BlockFrequency implementation since one can easily visualize where probability mass and other errors occur in the propagation. llvm-svn: 194654	2013-11-14 02:27:46 +00:00
Jiangning Liu	bb60ccf355	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194648	2013-11-14 01:57:32 +00:00
Nick Kledzik	1e6033ca33	Add simple support for tags in YAML I/O llvm-svn: 194644	2013-11-14 00:59:59 +00:00
Yuchen Wu	7981f5b86c	llvm-cov: Slightly improved error checking. - readInt() should check all 4 bytes can be read, not just 1. - In the event of false data in the gcno file, it was possible to index into a non-existent index of SmallVector, causing assertion error. llvm-svn: 194639	2013-11-14 00:38:41 +00:00
Yuchen Wu	d738beec44	llvm-cov: Removed StringMap holding GCOVLines. According to the hazy gcov documentation, it appeared to be technically possible for lines within a block to belong to different source files. However, upon further investigation, gcov does not actually support multiple source files for a single block. This change removes a level of separation between blocks and lines by replacing the StringMap of GCOVLines with a SmallVector of ints representing line numbers. This also means that the GCOVLines class is no longer needed. This paves the way for supporting the "-a" option, which will output block information. llvm-svn: 194637	2013-11-14 00:32:00 +00:00
Yuchen Wu	e28da84c96	llvm-cov: Replaced asserts with proper error handling. Unified the interface for read functions. They all return a boolean indicating if the read from file succeeded. Functions that previously returned the read value now store it into a variable that is passed in by reference instead. Callers will need to check the return value to detect if an error occurred. Also added a new test which ensures that no assertions occur when file contains invalid data. llvm-cov should return with error code 1 upon failure. llvm-svn: 194635	2013-11-14 00:07:15 +00:00
Chad Rosier	d3ae5f895e	[AArch64] Add support for legacy AArch32 NEON scalar shift by immediate instructions. This patch does not include the shift right and accumulate instructions. A number of non-overloaded intrinsics have been remove in favor of their overloaded counterparts. llvm-svn: 194598	2013-11-13 20:05:37 +00:00
Benjamin Kramer	505d2408a1	Make sure LLVMLoadLibraryPermanently gets an extern "C" symbol. Otherwise it's impossible to use it. Also don't include C++ headers in a C header. llvm-svn: 194581	2013-11-13 15:35:13 +00:00
Rafael Espindola	fdc88137f4	Remove AllowQuotesInName and friends from MCAsmInfo. Accepting quotes is a property of an assembler, not of an object file. For example, ELF can support any names for sections and symbols, but the gnu assembler only accepts quotes in some contexts and llvm-mc in a few more. LLVM should not produce different symbols based on a guess about which assembler will be reading the code it is printing. llvm-svn: 194575	2013-11-13 14:01:59 +00:00
Diego Novillo	8d6568b56b	SampleProfileLoader pass. Initial setup. This adds a new scalar pass that reads a file with samples generated by 'perf' during runtime. The samples read from the profile are incorporated and emmited as IR metadata reflecting that profile. The profile file is assumed to have been generated by an external profile source. The profile information is converted into IR metadata, which is later used by the analysis routines to estimate block frequencies, edge weights and other related data. External profile information files have no fixed format, each profiler is free to define its own. This includes both the on-disk representation of the profile and the kind of profile information stored in the file. A common kind of profile is based on sampling (e.g., perf), which essentially counts how many times each line of the program has been executed during the run. The SampleProfileLoader pass is organized as a scalar transformation. On startup, it reads the file given in -sample-profile-file to determine what kind of profile it contains. This file is assumed to contain profile information for the whole application. The profile data in the file is read and incorporated into the internal state of the corresponding profiler. To facilitate testing, I've organized the profilers to support two file formats: text and native. The native format is whatever on-disk representation the profiler wants to support, I think this will mostly be bitcode files, but it could be anything the profiler wants to support. To do this, every profiler must implement the SampleProfile::loadNative() function. The text format is mostly meant for debugging. Records are separated by newlines, but each profiler is free to interpret records as it sees fit. Profilers must implement the SampleProfile::loadText() function. Finally, the pass will call SampleProfile::emitAnnotations() for each function in the current translation unit. This function needs to translate the loaded profile into IR metadata, which the analyzer will later be able to use. This patch implements the first steps towards the above design. I've implemented a sample-based flat profiler. The format of the profile is fairly simplistic. Each sampled function contains a list of relative line locations (from the start of the function) together with a count representing how many samples were collected at that line during execution. I generate this profile using perf and a separate converter tool. Currently, I have only implemented a text format for these profiles. I am interested in initial feedback to the whole approach before I send the other parts of the implementation for review. This patch implements: - The SampleProfileLoader pass. - The base ExternalProfile class with the core interface. - A SampleProfile sub-class using the above interface. The profiler generates branch weight metadata on every branch instructions that matches the profiles. - A text loader class to assist the implementation of SampleProfile::loadText(). - Basic unit tests for the pass. Additionally, the patch uses profile information to compute branch weights based on instruction samples. This patch converts instruction samples into branch weights. It does a fairly simplistic conversion: Given a multi-way branch instruction, it calculates the weight of each branch based on the maximum sample count gathered from each target basic block. Note that this assignment of branch weights is somewhat lossy and can be misleading. If a basic block has more than one incoming branch, all the incoming branches will get the same weight. In reality, it may be that only one of them is the most heavily taken branch. I will adjust this assignment in subsequent patches. llvm-svn: 194566	2013-11-13 12:22:21 +00:00
Chandler Carruth	3d7fd3daa3	Add another (perhaps better) video for Sean's talk. (Thanks Marshall!) llvm-svn: 194549	2013-11-13 02:49:38 +00:00
Chandler Carruth	ccb190972e	Fix a null pointer dereference when copying a null polymorphic pointer. This bug only bit the C++98 build bots because all of the actual uses really do move. ;] But not quite ready to do the whole C++11 switch yet, so clean it up. Also add a unit test that catches this immediately. llvm-svn: 194548	2013-11-13 02:48:20 +00:00
Chandler Carruth	a477d2ab57	Give folks a reference to some material on the fundamental design pattern in use here. Addresses review feedback from Sean (thanks!) and others. llvm-svn: 194541	2013-11-13 01:51:36 +00:00
Chandler Carruth	74015a7084	Introduce an AnalysisManager which is like a pass manager but with a lot more smarts in it. This is where most of the interesting logic that used to live in the implicit-scheduling-hackery of the old pass manager will live. Like the previous commits, note that this is a very early prototype! I expect substantial changes before this is ready to use. The core of the design is the following: - We have an AnalysisManager which can be used across a series of passes over a module. - The code setting up a pass pipeline registers the analyses available with the manager. - Individual transform passes can check than an analysis manager provides the analyses they require in order to fail-fast. - There is no implicit registration or scheduling. - Analysis passes are different from other passes: they produce an analysis result that is cached and made available via the analysis manager. - Cached results are invalidated automatically by the pass managers. - When a transform pass requests an analysis result, either the analysis is run to produce the result or a cached result is provided. There are a few aspects of this design that I know will change in subsequent commits: - Currently there is no "preservation" system, that needs to be added. - All of the analysis management should move up to the analysis library. - The analysis management needs to support at least SCC passes. Maybe loop passes. Living in the analysis library will facilitate this. - Need support for analyses which are both module and function passes. - Need support for pro-actively running module analyses to have cached results within a function pass manager. - Need a clear design for "immutable" passes. - Need support for requesting cached results when available and not re-running the pass even if that would be necessary. - Need more thorough testing of all of this infrastructure. There are other aspects that I view as open questions I'm hoping to resolve as I iterate a bit on the infrastructure, and especially as I start writing actual passes against this. - Should we have separate management layers for function, module, and SCC analyses? I think "yes", but I'm not yet ready to switch the code. Adding SCC support will likely resolve this definitively. - How should the 'require' functionality work? Should that be the only way to request results to ensure that passes always require things? - How should preservation work? - Probably some other things I'm forgetting. =] Look forward to more patches in shorter order now that this is in place. llvm-svn: 194538	2013-11-13 01:12:08 +00:00
Aaron Ballman	4337e97029	Removing llvm::huge_vald and llvm::huge_vall because they are not currently used, and HUGE_VALD does not appear to be supported everywhere anyways. llvm-svn: 194535	2013-11-13 00:20:43 +00:00
Aaron Ballman	04999041e8	Replacing HUGE_VALF with llvm::huge_valf in order to work around a warning triggered in MSVC 12. Patch reviewed by Reid Kleckner and Jim Grosbach. llvm-svn: 194533	2013-11-13 00:15:44 +00:00
Rafael Espindola	6cd1b9aec4	Remove always true flag. llvm-svn: 194530	2013-11-12 23:27:08 +00:00
Sebastian Pop	c62c679c1b	delinearization of arrays llvm-svn: 194527	2013-11-12 22:47:20 +00:00
Sebastian Pop	9f8004fb08	remove virtual methods in SCEVApplyRewriter and SCEVParameterRewriter llvm-svn: 194526	2013-11-12 22:47:05 +00:00
Justin Bogner	b10a520c8f	Protect user-supplied runtime library functions in LTO Add user-supplied C runtime and compiler-rt library functions to llvm.compiler.used to protect them from premature optimization by passes like -globalopt and -ipsccp. Calls to (seemingly unused) runtime library functions can be added by -instcombine and instruction lowering. Patch by Duncan Exon Smith, thanks! Fixes <rdar://problem/14740087> llvm-svn: 194514	2013-11-12 21:44:01 +00:00
Weiming Zhao	813432f1ae	Export intrinsics:__builtin_arm_{dmb,dsb} to frontend llvm-svn: 194505	2013-11-12 19:57:43 +00:00
Andrew Trick	eb443d7f23	GraphViz CFGPrinter: wrap long lines. llvm-svn: 194496	2013-11-12 18:06:09 +00:00
Andrew Trick	0926513eb1	whitespace llvm-svn: 194495	2013-11-12 18:06:06 +00:00
Rafael Espindola	e1b88dad8f	Revert "Remove unused variable." This reverts commit r194485. The variable is unused in some macro instantiations, but not others. We should probably fix clang to not warn on this. llvm-svn: 194486	2013-11-12 16:37:31 +00:00
Rafael Espindola	984d3c4587	Remove unused variable. llvm-svn: 194485	2013-11-12 16:31:59 +00:00
Wan Xiaofei	b2c8cdc766	Change data structure to memorize computed result in ScalarEvolution Replace std::map with SmallVector to memorize the cached result since SCEV usually belongs to little Loop/BB Linear scan on SmallVector is faster than std::map. Code reviewer : Andrew Trick. Test result : Pass Unit Test & LLVM Test Suite 401.bzip2 0.425721 0.419981 101.37% 403.gcc 24.53855 24.2667 101.12% 429.mcf 0.060847 0.059944 101.51% 433.milc 0.646009 0.636119 101.55% 444.namd 1.383928 1.370614 100.97% 445.gobmk 5.836575 5.800225 100.63% 450.soplex 1.911257 1.895963 100.81% 456.hmmer 1.039565 1.032534 100.68% 458.sjeng 0.897401 0.885567 101.34% 464.h264ref 3.645908 3.577991 101.90% 470.lbm 0.049456 0.048398 102.19% 471.omnetpp 5.638575 5.60435 100.61% bitmnp01 0.045738 0.045291 100.99% cjpegv2data 0.304359 0.302833 100.50% idctrn01 0.046433 0.045763 101.46% quake2 4.534416 4.4952 100.87% quake 2.688566 2.659208 101.10% xcsoar 12.42545 12.30385 100.99% linpack 0.038739 0.03803 101.86% matrix01 0.053564 0.0528 101.45% nbench 0.402867 0.395803 101.78% tblook01 0.021265 0.021015 101.19% ttsprk01 0.066384 0.065566 101.25% llvm-svn: 194459	2013-11-12 09:40:41 +00:00
Arnaud A. de Grandmaison	f5f040fa1e	CalcSpillWeights: allow overidding the spill weight normalizing function This will enable the PBQP register allocator to provide its own normalizing function. No functionnal change. llvm-svn: 194417	2013-11-11 19:56:14 +00:00
Chad Rosier	d3684a0566	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Arnaud A. de Grandmaison	ea3ac1612c	CalcSpillWeights: give a better describing name to calculateSpillWeights Besides, this relates it more obviously to the VirtRegAuxInfo::calculateSpillWeightAndHint. No functionnal change. llvm-svn: 194404	2013-11-11 19:04:45 +00:00
Chad Rosier	35575e737c	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Peter Zotov	d2cf791ad8	[llvm-c] Remove dead typedef llvm-svn: 194379	2013-11-11 14:47:01 +00:00
Pete Cooper	a8b685cd7b	Don't universally enable initialiser lists on GCC. Thanks for catching this Chandler llvm-svn: 194365	2013-11-11 05:14:42 +00:00
Pete Cooper	020832fb6e	Add LLVM_HAS_INITIALIZER_LISTS for upcoming C++11 support. Use it in ArrayRef llvm-svn: 194362	2013-11-11 03:58:00 +00:00
Arnaud A. de Grandmaison	760c1e0b0a	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194356	2013-11-10 17:46:31 +00:00
Chandler Carruth	90a835d2a0	[PM] Start sketching out the new module and function pass manager. This is still just a skeleton. I'm trying to pull together the experimentation I've done into committable chunks, and this is the first coherent one. Others will follow in hopefully short order that move this more toward a useful initial implementation. I still expect the design to continue evolving in small ways as I work through the different requirements and features needed here though. Keep in mind, all of this is off by default. Currently, this mostly exercises the use of a polymorphic smart pointer and templates to hide the polymorphism for the pass manager from the pass implementation. The next step will be more significant, adding the first framework of analysis support. llvm-svn: 194325	2013-11-09 13:09:08 +00:00
Chandler Carruth	7caea41545	Move the old pass manager infrastructure into a legacy namespace and give the files a legacy prefix in the right directory. Use forwarding headers in the old locations to paper over the name change for most clients during the transitional period. No functionality changed here! This is just clearing some space to reduce renaming churn later on with a new system. Even when the new stuff starts to go in, it is going to be hidden behind a flag and off-by-default as it is still WIP and under development. This patch is specifically designed so that very little out-of-tree code has to change. I'm going to work as hard as I can to keep that the case. Only direct forward declarations of the PassManager class are impacted by this change. llvm-svn: 194324	2013-11-09 12:26:54 +00:00
Filip Pizlo	dfc9b586ae	This exposes the new calling conventions (WebKit_JS and AnyReg) via the C API by adding them to the enumeration in Core.h. llvm-svn: 194323	2013-11-09 06:00:03 +00:00
Chandler Carruth	42fabdead0	Switch to allow implicit construction. In many cases, we're wrapping a derived type and this makes it much easier to write this code. llvm-svn: 194321	2013-11-09 05:55:03 +00:00
Chandler Carruth	64b0556071	Add a polymorphic_ptr<T> smart pointer data type. It's a somewhat silly unique ownership smart pointer which is deep copyable by assuming it can call a T::clone() method to allocate a copy of the owned data. This is mostly useful with containers or other collections of uniquely owned data in C++98 where they might copy. With C++11 we can likely remove this in favor of move-only types and containers wrapped around those types. llvm-svn: 194315	2013-11-09 04:06:02 +00:00
NAKAMURA Takumi	5f847c007b	include/llvm/CodeGen/PBQP: Update @param(s) in comments. [-Wdocumentation] llvm-svn: 194314	2013-11-09 03:54:05 +00:00
NAKAMURA Takumi	866975c26c	Fix whitespace. llvm-svn: 194313	2013-11-09 03:53:55 +00:00
Lang Hames	fb82630a91	Re-apply r194300 with fixes for warnings. llvm-svn: 194311	2013-11-09 03:08:56 +00:00
Nick Lewycky	59886d00ec	Revert r194300 which broke the build. llvm-svn: 194308	2013-11-09 02:01:25 +00:00
Juergen Ributzka	87ed906b2e	[Stackmap] Materialize the jump address within the patchpoint noop slide. This patch moves the jump address materialization inside the noop slide. This enables patching of the materialization itself or its complete removal. This patch also adds the ability to define scratch registers that can be used safely by the code called from the patchpoint intrinsic. At least one scratch register is required, because that one is used for the materialization of the jump address. This patch depends on D2009. Differential Revision: http://llvm-reviews.chandlerc.com/D2074 Reviewed by Andy llvm-svn: 194306	2013-11-09 01:51:33 +00:00
Lang Hames	1662b832d9	Rewrite the PBQP graph data structure. The new graph structure replaces the node and edge linked lists with vectors. Free lists (well, free vectors) are used for fast insertion/deletion. The ultimate aim is to make PBQP graphs cheap to clone. The motivation is that the PBQP solver destructively consumes input graphs while computing a solution, forcing the graph to be fully reconstructed for each round of PBQP. This imposes a high cost on large functions, which often require several rounds of solving/spilling to find a final register allocation. If we can cheaply clone the PBQP graph and incrementally update it between rounds then hopefully we can reduce this cost. Further, once we begin pooling matrix/vector values (future work), we can cache some PBQP solver metadata and share it between cloned graphs, allowing the PBQP solver to re-use some of the computation done in earlier rounds. For now this is just a data structure update. The allocator and solver still use the graph the same way as before, fully reconstructing it between each round. I expect no material change from this update, although it may change the iteration order of the nodes, causing ties in the solver to break in different directions, and this could perturb the generated allocations (hopefully in a completely benign way). Thanks very much to Arnaud Allard de Grandmaison for encouraging me to get back to work on this, and for a lot of discussion and many useful PBQP test cases. llvm-svn: 194300	2013-11-09 00:14:07 +00:00
Juergen Ributzka	9969d3e6e8	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a paticular order into a specified set of registers. Instead it is up tp the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Lang Hames	3078977d28	Add a method to get the object-file appropriate stack map section. Thanks to Eric Christopher for the tips on the appropriate way to do this. llvm-svn: 194282	2013-11-08 22:14:49 +00:00
Arnaud A. de Grandmaison	f7a60a8e01	Revert "CalculateSpillWeights does not need to be a pass" Temporarily revert my previous commit until I understand why it breaks 3 target tests. llvm-svn: 194272	2013-11-08 18:19:19 +00:00
Arnaud A. de Grandmaison	ed812f6590	CalculateSpillWeights does not need to be a pass Based on discussions with Lang Hames and Jakob Stoklund Olesen at the hacker's lab, and in the light of upcoming work on the PBQP register allocator, it was though that CalcSpillWeights does not need to be a pass. This change will enable to customize / tune the spill weight computation depending on the allocator. Update the documentation style while there. No functionnal change. llvm-svn: 194269	2013-11-08 17:56:29 +00:00
Jordan Rose	09e604333e	Add ImmutableSet profiling info for 'bool'. Useful for tri-state maps: true, false, and "no data yet". llvm-svn: 194266	2013-11-08 17:23:49 +00:00
Artyom Skrobov	08b2257f14	Export MCDisassembler's SubtargetInfo, to allow architecture-aware disassembly llvm-svn: 194260	2013-11-08 16:07:43 +00:00
NAKAMURA Takumi	29c3b55897	llvm-c/Support.h: Add a newline at eof. llvm-svn: 194203	2013-11-07 13:54:24 +00:00
Simon Atanasyan	0f756cd70b	Add DT_VERSYM dynamic table entry tag definition. llvm-svn: 194149	2013-11-06 12:23:52 +00:00
Peter Zotov	f7e64feb33	[llvm-c] Add parameter names in Target.h for C99 compliance llvm-svn: 194146	2013-11-06 11:52:40 +00:00
Peter Zotov	7b61b75c21	[llvm-c] Improve TargetMachine bindings Original patch by Chris Wailes llvm-svn: 194143	2013-11-06 10:25:18 +00:00
Peter Zotov	6b5e8b9409	[llvm-c] Correctly check for existence of native AsmParser, AsmPrinter, Disassembler Also, properly name the functions. llvm-svn: 194141	2013-11-06 09:45:53 +00:00
Peter Zotov	04f5981996	[llvm-c] Add functions for initializing native AsmPrinter, AsmParser & Disassembler Original patch by Chris Wailes llvm-svn: 194140	2013-11-06 09:21:35 +00:00
Peter Zotov	34ddbf1a7e	[llvm-c] Expose LLVMLoadLibraryPermanently Original patch by Chris Wailes llvm-svn: 194139	2013-11-06 09:21:31 +00:00
Peter Zotov	285eed6073	[llvm-c] Expose IRReader interface Original patch by Chris Wailes llvm-svn: 194137	2013-11-06 09:21:15 +00:00
Peter Zotov	cd93b370d5	[llvm-c] Implement LLVMPrintValueToString Original patch by Chris Wailes llvm-svn: 194135	2013-11-06 09:21:01 +00:00
Andrew Trick	34e2f0c4ea	Rewrite SCEV's backedge taken count computation. Patch by Michele Scandale! Rewrite of the functions used to compute the backedge taken count of a loop on LT and GT comparisons. I decided to split the handling of LT and GT cases becasue the trick "a > b == -a < -b" in some cases prevents the trip count computation due to the multiplication by -1 on the two operands of the comparison. This issue comes from the conservative computation of value range of SCEVs: taking the negative SCEV of an expression that have a small positive range (e.g. [0,31]), we would have a SCEV with a fullset as value range. Indeed, in the new rewritten function I tried to better handle the maximum backedge taken count computation when MAX/MIN expression are used to handle the cases where no entry guard is found. Some test have been modified in order to check the new value correctly (I manually check them and reasoning on possible overflow the new values seem correct). I finally added a new test case related to the multiplication by -1 issue on GT comparisons. llvm-svn: 194116	2013-11-06 02:08:26 +00:00
Rafael Espindola	03cb49e159	Remove another unused, and IMHO, not very desirable feature of ErrorOr. One of the uses of the IsValid flag is to support default constructing a ErrorOr that is not a Error or a Value. There is not much value in doing that IMHO. If ErrorOr was to have a default constructor, it should be implemented by default constructing the value, but even that looks unnecessary. The other use is to avoid calling destructors on moved objects. This looks wrong. If the data being moved has non trivial treatment of moves (an std::vector for example), it is its destructor that should handle it, not ~ErrorOr. With this change ErrorOr becomes a fairly simple wrapper and should always be better than using an error_code + value in an API. llvm-svn: 194109	2013-11-05 23:41:57 +00:00
Dmitri Gribenko	75e12236cc	Convert comments to documentation comments (// -> ///) Patch by MathOnNapkins llvm-svn: 194093	2013-11-05 21:28:42 +00:00
Rafael Espindola	2b11ad4fe9	Use error_code in GVMaterializer. They just propagate out the bitcode reader error, so we don't need a new enum. llvm-svn: 194091	2013-11-05 19:36:34 +00:00
Jiangning Liu	d7c52676f6	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194085	2013-11-05 17:42:05 +00:00
Peter Zotov	ae0344b07f	[llvm-c] (PR16190) Add LLVMIsA* functions for ConstantDataSequential and subclasses Original patch by David Monniaux llvm-svn: 194074	2013-11-05 12:55:37 +00:00
Alp Toker	e5e3bc0c04	Fix symbol defines in config.h.cmake These were incorrectly pointing to HAVE_LOG despite being checked for correctly in config-ix.cmake. Patch by James Lyon! llvm-svn: 194051	2013-11-05 07:27:18 +00:00
Yuchen Wu	30672d9086	Support for reading run counts in llvm-cov. This patch enables llvm-cov to correctly output the run count stored in the GCDA file. GCOVProfiling currently does not generate this information, so the GCDA run data had to be hacked on from a GCDA file generated by gcc. This is corrected by a subsequent patch. With the run and program data included, both llvm-cov and gcov produced the same output. llvm-svn: 194033	2013-11-05 01:11:58 +00:00
Rafael Espindola	2bad63c341	Fix MSVC build by not putting an error_code directly in a union. llvm-svn: 194032	2013-11-05 01:07:06 +00:00
Rafael Espindola	ca35ffe6a2	Simplify ErrorOr. ErrorOr had quiet a bit of complexity and indirection to be able to hold a user type with the error. That feature is not used anymore. This patch removes it, it will live in svn history if we ever need it again. If we do need it again, IMHO there is one thing that should be done differently: Holding extra info in the error is not a property a function also returning a value or not. The ability to hold extra info should be in the error type and ErrorOr templated over it so that we don't need the funny looking ErrorOr<void>. llvm-svn: 194030	2013-11-05 00:28:01 +00:00
Hal Finkel	081eaef6fa	Add a runtime unrolling parameter to the LoopUnroll pass constructor As with the other loop unrolling parameters (the unrolling threshold, partial unrolling, etc.) runtime unrolling can now also be controlled via the constructor. This will be necessary for moving non-trivial unrolling late in the pass manager (after loop vectorization). No functionality change intended. llvm-svn: 194027	2013-11-05 00:08:03 +00:00
Cameron McInally	d80f7d34de	Add support for AVX512 masked vector blend intrinsics. llvm-svn: 194006	2013-11-04 19:14:56 +00:00
Zoran Jovanovic	8a80aa76c8	Support for microMIPS branch instructions. llvm-svn: 193992	2013-11-04 14:53:22 +00:00
Elena Demikhovsky	46eeaba93b	AVX-512: fixed a typo in builtin name llvm-svn: 193988	2013-11-04 11:48:23 +00:00
Filip Pizlo	c10ca90324	Make the pretty stack trace be an opt-in, rather than opt-out, facility. Enable pretty stack traces by default if you use PrettyStackTraceProgram, so that existing LLVM-based tools will continue to get it without any changes. llvm-svn: 193971	2013-11-04 02:22:25 +00:00

1 2 3 4 5 ...

19189 Commits