llvm-project

Commit Graph

Author	SHA1	Message	Date
Christian Pirker	b5728191c2	ARM big endian function argument passing llvm-svn: 208316	2014-05-08 14:06:24 +00:00
Hal Finkel	e8172d85f9	Fix a spelling error llvm-svn: 208314	2014-05-08 13:42:57 +00:00
Daniel Sanders	cdbbe08b05	[mips] Implement l[wd]c3, and s[wd]c3. Summary: These instructions were added in MIPS-I, and MIPS-II but were removed in MIPS-III. Interestingly, GAS continues to accept them when assembling for MIPS-III. For the moment, these instructions will follow GAS and accept them for MIPS-III and newer but this will be tightened up when the invalid-*.s tests are added. Depends on D3647 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3648 llvm-svn: 208311	2014-05-08 13:02:11 +00:00
James Molloy	c42ea14f74	[ARM64-BE] Teach fast-isel about how to set up sub-word stack arguments for big endian calls. SelectionDAG already knows about this, but fast-isel was ignorant. llvm-svn: 208307	2014-05-08 12:53:50 +00:00
Daniel Sanders	d39320c6b6	[mips] Marked up instructions added in MIPS-II and tested that IAS for -mcpu=mips1 does not accept them Summary: A small number of instructions are rejected with the wrong error message. These have been placed in a separate test for now. There seems to be some parsing quirk that triggers when these instructions are disabled. Depends on D3571 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3647 llvm-svn: 208305	2014-05-08 12:40:48 +00:00
Daniel Sanders	8dcb116a3e	[mips] Implement tlbp, tlbr, tlbwi, and tlbwr Reviewers: vmedic, dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3571 llvm-svn: 208301	2014-05-08 11:51:18 +00:00
Tim Northover	18f8bb84fa	ARM64: make sure FastISel emits SSA MachineInstrs We need to use a temporary register for a 2-step operation like REM. llvm-svn: 208297	2014-05-08 10:30:56 +00:00
Evgeniy Stepanov	9661ec0ec3	[asan] Preserve flags in asm instrumentation. Patch by Yuri Gorshenin. llvm-svn: 208296	2014-05-08 09:55:24 +00:00
Daniel Sanders	2b6b3d1c5d	Use a vector of unique_ptrs to fix a memory leak introduced in r208179. Also removed an inaccurate comment that stated that a DenseMap was used as storage for the ListInit's. It's currently using a FoldingSet. I expect there's a better way to fix this but I haven't found it yet. FoldingSet is incompatible with the Pool template and I'm not sure if FoldingSet can be safely replaced with a DenseMap of computed FoldingSetID's to ListInit's. llvm-svn: 208293	2014-05-08 09:29:28 +00:00
Hal Finkel	6532c20faa	Move late partial-unrolling thresholds into the processor definitions The old method used by X86TTI to determine partial-unrolling thresholds was messy (because it worked by testing target features), and also would not correctly identify the target CPU if certain target features were disabled. After some discussions on IRC with Chandler et al., it was decided that the processor scheduling models were the right containers for this information (because it is often tied to special uop dispatch-buffer sizes). This does represent a small functionality change: - For generic x86-64 (which uses the SB model and, thus, will get some unrolling). - For AMD cores (because they still currently use the SB scheduling model) - For Haswell (based on benchmarking by Louis Gerbarg, it was decided to bump the default threshold to 50; we're working on a test case for this). Otherwise, nothing has changed for any other targets. The logic, however, has been moved into BasicTTI, so other targets may now also opt-in to this functionality simply by setting LoopMicroOpBufferSize in their processor model definitions. llvm-svn: 208289	2014-05-08 09:14:44 +00:00
Tobias Grosser	3080cf16a5	Revert "SCEV: Use I = vector<>.erase(I) to iterate and delete at the same time" as committed in r208282. The original commit was incorrect. llvm-svn: 208286	2014-05-08 07:55:34 +00:00
Hao Liu	1187a3d8db	AArch64/ARM64: Port NEON post-increment load/store with 2/3/4 vectors to ARM64 backend. llvm-svn: 208284	2014-05-08 07:38:13 +00:00
Tobias Grosser	ecfe9d06eb	SCEV: Use I = vector<>.erase(I) to iterate and delete at the same time llvm-svn: 208282	2014-05-08 07:12:44 +00:00
Saleem Abdulrasool	fc6b85b185	ARM: support FK_SecRel_2 relocations on WoA This adds FK_SecRel_2 relocation support to ARM. This enables the building of object files for armv7-windows-msvc which enables CodeView line tables for debugging as opposed to armv7-windows-itanium which currently uses DWARF. llvm-svn: 208273	2014-05-08 01:35:57 +00:00
Richard Smith	c45f3f7433	Simplify and fix incorrect comment. No functionality change. llvm-svn: 208272	2014-05-08 01:08:43 +00:00
Filipe Cabecinhas	095d9d573a	Lower certain build_vectors to insertps instructions Summary: Vectors built with zeros and elements in the same order as another (source) vector are optimized to be built using a single insertps instruction. Also optimize when we move one element in a vector to a different place in that vector while zeroing out some of the other elements. Further optimizations are possible, described in TODO comments. I will be implementing at least some of them in the near future. Added some tests for different cases where this optimization triggers. Reviewers: nadav, delena, craig.topper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3521 llvm-svn: 208271	2014-05-08 00:25:16 +00:00
Lang Hames	d7bfe4bb86	Back out r208257 while I investigate tester failures. llvm-svn: 208267	2014-05-07 23:35:53 +00:00
Duncan P. N. Exon Smith	e60adfdbd0	GlobalValue: Assert symbols with local linkage have default visibility The change to ExtractGV.cpp has no functionality change except to avoid the asserts. Existing testcases already cover this, so I didn't add a new one. llvm-svn: 208264	2014-05-07 23:00:22 +00:00
Duncan P. N. Exon Smith	b80de1012a	IR: Don't allow non-default visibility on local linkage Visibilities of `hidden` and `protected` are meaningless for symbols with local linkage. - Change the assembler to reject non-default visibility on symbols with local linkage. - Change the bitcode reader to auto-upgrade `hidden` and `protected` to `default` when the linkage is local. - Update LangRef. <rdar://problem/16141113> llvm-svn: 208263	2014-05-07 22:57:20 +00:00
Duncan P. N. Exon Smith	b2becfdbae	LTO: Assert visibility of local linkage when merging symbols `ModuleLinker::getLinkageResult()` shouldn't create symbols with local linkage and non-default visibility -- in fact, symbols with local linkage shouldn't be merged at all. Assert to that effect. llvm-svn: 208262	2014-05-07 22:55:46 +00:00
Duncan P. N. Exon Smith	87121f84d2	LTO: Check local linkage first Since visibility is meaningless for symbols with local linkage, check local linkage before visibility when setting symbol attributes. When linkage is `internal` and the visibility is `hidden`, the exposed attribute is now `LTO_SYMBOL_SCOPE_INTERNAL` instead of `LTO_SYMBOL_SCOPE_HIDDEN`. Although the bitfield allows both to be specified, the combination is nonsense anyway. Given changes (in progress) to drop visibility when a symbol has local linkage, this almost has no functionality change: it's mostly a cleanup to clarify the logic. The exception is when something has `appending` linkage. Before this change, such symbols would be advertised as `LTO_SYMBOL_SCOPE_INTERNAL`; now, they'll be given `LTO_SYMBOL_SCOPE_COMMON`. Unfortunately this is really awkward to test. This only changes what we advertise to linkers (before running LTO), not what the final object looks like. In theory I could add `DEBUG` output to `llvm-lto` (and test with "REQUIRES: asserts"), but follow-up commits to disallow `internal hidden` simplify this anyway. <rdar://problem/16141113> llvm-svn: 208261	2014-05-07 22:53:14 +00:00
Lang Hames	fd284c6ce1	[RuntimeDyld] Make RuntimeDyldImpl::resolveExternalSymbols preserve the relocation entries it applies. Prior to this patch, RuntimeDyldImpl::resolveExternalSymbols discarded relocations for external symbols once they had been applied. This causes issues if the client calls MCJIT::finalizeLoadedModules more than once, and updates the location of any symbols in between (e.g. by calling MCJIT::mapSectionAddress). No test case yet: None of our in-tree memory managers support moving sections around. I'll have to hack up a dummy memory manager before I can write a unit test. Fixes <rdar://problem/16764378> llvm-svn: 208257	2014-05-07 22:34:08 +00:00
Hal Finkel	f6475bbc4b	[X86TTI] Remove the unrolling branch limits The loop stream detector (LSD) on modern Intel cores, which optimizes the execution of small loops, has limits on the number of taken branches in addition to uop-count limits (modern AMD cores have similar limits). Unfortunately, at the IR level, estimating the number of branches that will be taken is difficult. For one thing, it strongly depends on later passes (block placement, etc.). The original implementation took a conservative approach and limited the maximal BB DFS depth of the loop. However, fairly-extensive benchmarking by several of us has revealed that this is the wrong approach. In fact, there are zero known cases where the branch limit prevents a detrimental unrolling (but plenty of cases where it does prevent beneficial unrolling). While we could improve the current branch counting logic by incorporating branch probabilities, this further complication seems unjustified without a motivating regression. Instead, unless and until a regression appears, the branch counting will be removed. llvm-svn: 208255	2014-05-07 22:25:18 +00:00
Quentin Colombet	246b6fcd28	[X86] Selectively mark the FMA variants inside a family as isCommutable. Given a FMA family (e.g., 213, 231), not all the variants (i.e., register or memory) are commutable. E.g., for the 213 family (with the syntax src1, src2, src3): fmaXXX213 A, B, reg3/mem3 == fmaXXX213 B, A, reg3/mem3 Now consider the 231 family: fmaXXX231 A, B, reg3 == fmaXXX231 A, reg3, B But fmaXXX231 A, B, mem3 != fmaXXX231 A, mem3, B Indeed, mem3 cannot be the second argument of the memory variant of fmaXXX231. Working on a reduced test case! <rdar://problem/16800495> llvm-svn: 208252	2014-05-07 21:43:35 +00:00
Eric Christopher	b8f9768880	Reformat a couple of functions for clarity. llvm-svn: 208248	2014-05-07 21:05:47 +00:00
Jyotsna Verma	f98a1eca6e	[Hexagon] Add New TSFlags to be used in the upcoming patches. llvm-svn: 208239	2014-05-07 19:07:34 +00:00
Sebastian Pop	b8d56f42b7	avoid segfaulting Quotient and Remainder don't have to be initialized. llvm-svn: 208238	2014-05-07 19:00:37 +00:00
Sebastian Pop	a7d3d6ab9f	do not collect undef terms llvm-svn: 208237	2014-05-07 19:00:32 +00:00
Matt Arsenault	5f2fd4b22a	Fix using wrong result type for setcc. When reducing the bitwidth of a comparison against a constant, the original setcc's result type was used, which was incorrect. No test since I don't think any other in tree targets change the bitwidth of the setcc type depending on the bitwidth of the compared type. llvm-svn: 208236	2014-05-07 18:26:58 +00:00
Sebastian Pop	448712b1a6	split delinearization pass in 3 steps To compute the dimensions of the array in a unique way, we split the delinearization analysis in three steps: - find parametric terms in all memory access functions - compute the array dimensions from the set of terms - compute the delinearized access functions for each dimension The first step is executed on all the memory access functions such that we gather all the patterns in which an array is accessed. The second step reduces all this information in a unique description of the sizes of the array. The third step is delinearizing each memory access function following the common description of the shape of the array computed in step 2. This rewrite of the delinearization pass also solves a problem we had with the previous implementation: because the previous algorithm was by induction on the structure of the SCEV, it would not correctly recognize the shape of the array when the memory access was not following the nesting of the loops: for example, see polly/test/ScopInfo/multidim_only_ivs_3d_reverse.ll ; void foo(long n, long m, long o, double A[n][m][o]) { ; ; for (long i = 0; i < n; i++) ; for (long j = 0; j < m; j++) ; for (long k = 0; k < o; k++) ; A[i][k][j] = 1.0; Starting with this patch we no longer delinearize access functions that do not contain parameters, for example in test/Analysis/DependenceAnalysis/GCD.ll ;; for (long int i = 0; i < 100; i++) ;; for (long int j = 0; j < 100; j++) { ;; A[2*i - 4*j] = i; ;; *B++ = A[6*i + 8*j]; these accesses will not be delinearized as the upper bound of the loops are constants, and their access functions do not contain SCEVUnknown parameters. llvm-svn: 208232	2014-05-07 18:01:20 +00:00
Chandler Carruth	32908d7a35	[x86] Make the 'x86-64' cpu, what I see as and many use as the generic default architecture for reasonable modern x86 processors, actually be modern. This processor model should essentially be "tuned" for modern x86 chips as much as possible without undue penalties on any specific architecture. Previously we weren't even using the nice scheduling models. There are a few other tweaks needed here, but this change at least I have benchmarked across a decent swatch of chips (intel's clovertown, westmere, and sandybridge; amd's istanbul) and seen no significant regressions. If anyone has suggested ways to test this, just let me know. Somewhat alarmingly, no existing tests failed. llvm-svn: 208230	2014-05-07 17:37:03 +00:00
Chandler Carruth	d70cc604af	Tidy up whitespace with clang-format prior to making significant changes. llvm-svn: 208229	2014-05-07 17:36:59 +00:00
Simon Atanasyan	68f6150156	[yaml2obj] Support ELF x86 relocations. llvm-svn: 208228	2014-05-07 17:06:38 +00:00
Chad Rosier	788e5e3d7c	[ARM64][fast-isel] Disable target specific optimizations at -O0. Functionally, this patch disables the dead register elimination pass and the load/store pair optimization pass at -O0. The ILP optimizations don't require the optimization level to be checked because the call to addILPOpts is predicated with the necessary check. The AdvSIMDScalar pass is disabled by default at all optimization levels. This patch leaves that pass disabled by default. Also, move command-line options into ARM64TargetMachine.cpp and add a few additional flags to aid in debugging. This fixes an issue with the -debug-pass=Structure flag where passes were printed, but not actually run (i.e., AdvSIMDScalar pass). llvm-svn: 208223	2014-05-07 16:41:55 +00:00
Daniel Sanders	d240953db2	[mips] Add highly experimental support for MIPS-I, MIPS-II, MIPS-III, and MIPS-V Summary: These processors will only be available for the integrated assembler at first (CodeGen will emit a fatal error saying they are not implemented). The intention is to work through the existing instructions and correctly annotate the ISA they were added in so that we have a sufficiently good base to start MIPS64r6 development. MIPS64r6 removes/re-encodes certain instructions and I believe it is best to define ISA's using set-union's as far as possible rather than using set-subtraction. Reviewers: vmedic Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D3569 llvm-svn: 208221	2014-05-07 16:25:22 +00:00
Justin Bogner	7c093732e8	llvm-cov: Explicitly namespace llvm::make_unique to keep MSVC happy This is a followup to r208171, where a call to make_unique was disambiguated for MSVC. Disambiguate two more calls, and remove the comment about it since this is what we do everywhere. llvm-svn: 208219	2014-05-07 16:01:27 +00:00
Rafael Espindola	de3e36be38	Use range loop. llvm-svn: 208218	2014-05-07 14:53:32 +00:00
Michael Zolotukhin	7d6293a0d3	[InstCombine] Add optimization of redundant insertvalue instructions. rdar://problem/11861387 llvm-svn: 208214	2014-05-07 14:30:18 +00:00
Daniel Sanders	5b864d0cbb	[mips] Add FGR_32/FGR_64/GPR_64 adjectives and use them instead of FGRPredicates/GPRPredicates Summary: No functional change (confirmed by diffing tablegen-erated files). Depends on D3642 Reviewers: vmedic, dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3645 llvm-svn: 208213	2014-05-07 14:25:43 +00:00
Daniel Sanders	3872b47231	[mips] Add INSN_<name> adverbs and start using them instead of AdditionalPredicates overrides Summary: No functional change Depends on D3641 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3642 llvm-svn: 208212	2014-05-07 14:11:46 +00:00
Evgeniy Stepanov	c14fc42137	[msan] Fix -fsanitize=memory -fno-integrated-as. llvm-svn: 208211	2014-05-07 14:10:51 +00:00
Tim Northover	88a51d983e	AArch64/ARM64: optimise vector selects & enable test When performing a scalar comparison that feeds into a vector select, it's actually better to do the comparison on the vector side: the scalar route would be "CMP -> CSEL -> DUP", the vector is "CM -> DUP" since the vector comparisons are all mask based. llvm-svn: 208210	2014-05-07 14:10:27 +00:00
Daniel Sanders	9c1b1bec03	[mips] Add ISA_<name> adverbs and start using them instead of AdditionalPredicates overrides Summary: One small functional change. The recently added PAUSE instruction now has the HasStdEnc predicate which was accidentally removed by a Requires<>. Depends on D3640 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3641 llvm-svn: 208209	2014-05-07 13:57:22 +00:00
Rafael Espindola	566fcfe69b	Remove the UseCFI option from createAsmStreamer. We were already always passing true, this just removes the option. llvm-svn: 208205	2014-05-07 13:00:43 +00:00
Daniel Sanders	13d7209fa9	[mips] Continue splitting Instruction.Predicates into smaller lists and re-join them with !listconcat Summary: Move IsGP64bit into GPRPredicates, and IsFP64bit/NotFP64bit into FGRPredicates No functional change (confirmed by diffing tablegen-erated files). Depends on D3639 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3640 llvm-svn: 208201	2014-05-07 12:48:37 +00:00
James Molloy	d3c401a2d0	[ARM64-BE] Fix fast-isel, and add appropriate RUN lines to appropriate tests. llvm-svn: 208200	2014-05-07 12:33:55 +00:00
James Molloy	36132057da	[ARM64-BE] Fix variable-argument saving. llvm-svn: 208199	2014-05-07 12:33:48 +00:00
James Molloy	4049e4fd77	[ARM64-BE] Implement the lane-twiddling logic at AAPCS boundaries for big endian. The AAPCS states that values passed in registers must have a value as though they had been loaded with "LDR". LDR is equivalent to "LD1.64 vX.1D" - that is, loading scalars to vector registers and loading 1-element vectors is equivalent. The logic implemented here is to ensure that at all call boundaries and during formal argument lowering all vectors are treated as their bitwidth-based floating point scalar counterpart, which is always one of f64 or f128 (v2i32 -> f64, v4i32 -> f128 etc). A BITCAST is inserted so that the appropriate REV will be generated during code generation. llvm-svn: 208198	2014-05-07 12:33:41 +00:00
Daniel Sanders	4cd0782bf2	[mips] Move IsFP64bit/NotFP64bit to the front of the AdditionalPredicates list Summary: This makes it easier to prove a more complicated change in the next commit is non-functional. Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3639 llvm-svn: 208197	2014-05-07 12:27:46 +00:00
James Molloy	30e0e11eb4	[ARM64-BE] Implement the crazy bitcast handling for big endian vectors. Because we've canonicalised on using LD1/ST1, every time we do a bitcast between vector types we must do an equivalent lane reversal. Consider a simple memory load followed by a bitconvert then a store. v0 = load v2i32 v1 = BITCAST v2i32 v0 to v4i16 store v4i16 v1 In big endian mode every memory access has an implicit byte swap. LDR and STR do a 64-bit byte swap, whereas LD1/ST1 do a byte swap per lane - that is, they treat the vector as a sequence of elements to be byte-swapped. The two pairs of instructions are fundamentally incompatible. We've decided to use LD1/ST1 only to simplify compiler implementation. LD1/ST1 perform the equivalent of a sequence of LDR/STR + REV. This makes the original code sequence: v0 = load v2i32 v1 = REV v2i32 (implicit) v2 = BITCAST v2i32 v1 to v4i16 v3 = REV v4i16 v2 (implicit) store v4i16 v3 But this is now broken - the value stored is different to the value loaded due to lane reordering. To fix this, on every BITCAST we must perform two other REVs: v0 = load v2i32 v1 = REV v2i32 (implicit) v2 = REV v2i32 v3 = BITCAST v2i32 v2 to v4i16 v4 = REV v4i16 v5 = REV v4i16 v4 (implicit) store v4i16 v5 This means an extra two instructions, but actually in most cases the two REV instructions can be combined into one. For example: (REV64_2s (REV64_4h X)) === (REV32_4h X) There is also no 128-bit REV instruction. This must be synthesized with an EXT instruction. Most bitconverts require some sort of conversion. The only exceptions are: a) Identity conversions - vNfX <-> vNiX b) Single-lane-to-scalar - v1fX <-> fX or v1iX <-> iX Even though there are hundreds of changed lines, I have a fairly high confidence that they are somewhat correct. The changes to add two REV instructions per bitcast were pretty mechanical, and once I'd done that I threw the resulting .td at a script I wrote which combined the two REVs together (and added an EXT instruction, for f128) based on an instruction description I gave it. This was much less prone to error than doing it all manually, plus my brain would not just have melted but would have vapourised. llvm-svn: 208194	2014-05-07 11:28:53 +00:00
James Molloy	3f0da857b4	[ARM64-BE] Predicate VLDR/VSTR for vectors as little-endian only. We must use LD1/ST1 on big-endian. llvm-svn: 208193	2014-05-07 11:28:45 +00:00
James Molloy	ccc7f982c1	[ARM64-BE] Make big endian (scalar) argument passing work correctly. This completes the port of r204814 (cpirker "AArch64_BE function argument passing for ARM ABI") from AArch64 to ARM64, and fixes a bunch of issues found during later development along the way. The biggest of these was that the alignment fixup logic wasn't replicated into all the places it should have been. llvm-svn: 208192	2014-05-07 11:28:36 +00:00
Stepan Dyatkovskiy	cfd641f123	MergeFunctions Pass, introduced total ordering among values. This is the third patch of the patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). This patch description: When comparing functions we need to compare the values we meet on the left and right sides. It's easy to sort things out for external values: it should just be the same value on the left and right. But for local values (those introduced inside the function body) we have to ensure they were introduced at exactly the same place and play the same role. In short, the patch introduces serial numbering of values and a comparison routine. The latter compares two values by their serial numbers. llvm-svn: 208189	2014-05-07 11:11:39 +00:00
Daniel Sanders	3dc2c016a6	[mips] Split Instruction.Predicates into smaller lists and re-join them with !listconcat Summary: The overall idea is to chop the Predicates list into subsets that are usually overridden independently. This allows subclasses to partially override the predicates of their superclasses without having to re-add all the existing predicates. This patch starts the process by moving HasStdEnc into a new EncodingPredicates list and almost everything else into AdditionalPredicates. It has revealed a couple likely bugs where 'let Predicates' has removed the HasStdEnc predicate. No functional change (confirmed by diffing tablegen-erated files). Depends on D3549, D3506 Reviewers: vmedic Differential Revision: http://reviews.llvm.org/D3550 llvm-svn: 208184	2014-05-07 10:27:09 +00:00
Daniel Sanders	314e80e5f8	[tablegen] Add !listconcat operator with similar semantics to !strconcat Summary: It concatenates two or more lists. In addition to the !strconcat semantics, the lists must have the same element type. My overall aim is to make it easy to append to Instruction.Predicates rather than override it. This can be done by concatenating lists passed as arguments, or by concatenating lists passed in additional fields. Reviewers: dsanders Reviewed By: dsanders Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D3506 llvm-svn: 208183	2014-05-07 10:13:19 +00:00
Daniel Sanders	0e2364149c	[mips] Move HasStdEnc to the front of the predicates lists. Summary: This will make it easier to prove that a more complicated change in the following commit is non-functional. No functional change. Depends on D3506 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3549 llvm-svn: 208179	2014-05-07 09:58:05 +00:00
Zinovy Nis	da925c0d7c	[BUG][REFACTOR] 1) Fix for printing debug locations for absolute paths. 2) Location printing is moved into public method DebugLoc::print() to avoid re-inventing the wheel. Differential Revision: http://reviews.llvm.org/D3513 llvm-svn: 208177	2014-05-07 09:51:22 +00:00
Stepan Dyatkovskiy	d103130ee0	Second patch of the patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). The idea is to introduce a total ordering among the set of functions. This allows building a binary tree and performing function look-up in O(log(N)) time. This patch description: Introduced total ordering among constants, implemented in the cmpConstants method. The method performs a lexicographical comparison between constants represented as hypothetical numbers of the following format: <bitcastability-trait><raw-bit-contents> Please read the cmpConstants declaration comments for more details. llvm-svn: 208173	2014-05-07 09:05:10 +00:00
Timur Iskhodzhanov	2e5d6d3ce3	Work-around MSVS build breakage due to r208148 llvm-svn: 208171	2014-05-07 08:52:13 +00:00
Evgeniy Stepanov	3819f02819	[asan] Add a flag to control asm instrumentation. With this change, asm instrumentation is disabled by default. llvm-svn: 208167	2014-05-07 07:54:11 +00:00
Joerg Sonnenberger	cf86ce136c	Allow using normal .eh_frame based unwinding on ARM. Use the same encodings as x86. Use this exception model for NetBSD. llvm-svn: 208166	2014-05-07 07:49:34 +00:00
David Blaikie	f248c16f5f	PR19562: DebugInfo temporary MDNode leak: Don't include a temporary node to replace with a variable list for methods, since they're always declarations and thus never include variables This field is used for a list of variables to ensure they are not lost during optimization (they're only included when optimizations are enabled). llvm-svn: 208159	2014-05-07 06:08:28 +00:00
Tobias Grosser	924221cb37	[C++11] Add NArySCEV->Operands iterator range llvm-svn: 208158	2014-05-07 06:07:47 +00:00
Saleem Abdulrasool	985dcf18a9	ARM: mark additional instructions as MachineFrameSetup Mark up additional instructions which are part of the function prologue as MachineFrameSetup. These instructions are part of the function prologue, emitted by the PEI pass to set up the stack for use in the activating frame. llvm-svn: 208153	2014-05-07 03:03:31 +00:00
Saleem Abdulrasool	acd0338c61	ARM: fix WoA PEI instruction selection The ARM::BLX instruction is an ARM mode instruction. The Windows on ARM target is limited to Thumb instructions. Correctly use the thumb mode tBLXr instruction. This would manifest as an errant write into the object file as the instruction is 4-bytes in length rather than 2. The result would be a corrupted object file that would eventually result in an executable that would crash at runtime. llvm-svn: 208152	2014-05-07 03:03:27 +00:00
Justin Bogner	cf27e1b996	llvm-cov: Handle missing source files as GCOV does If the source files referenced by a gcno file are missing, gcov outputs a coverage file where every line is simply /EOF/. This also occurs for lines in the coverage that are past the end of a file that is found. This change mimics gcov. llvm-svn: 208149	2014-05-07 02:11:23 +00:00
Justin Bogner	1a18d7caa3	llvm-cov: Implement --no-output In gcov, there's a -n/--no-output option, which disables the writing of any .gcov files, so that it emits only the summary info on stdout. This implements the same behaviour in llvm-cov. llvm-svn: 208148	2014-05-07 02:11:18 +00:00
Argyrios Kyrtzidis	dff3ef80bb	[Support/MemoryBuffer] Remove the assertion that the file size did not shrink. This can happen in practice with the user changing files and we can recover from it. llvm-svn: 208143	2014-05-06 23:30:56 +00:00
Nico Weber	ba8a99cf77	Fix ASan init function detection after clang r208128. llvm-svn: 208141	2014-05-06 23:17:26 +00:00
Rafael Espindola	8d8f100c57	Special case aliases in GlobalValue::getSection. This is similar to the getAlignment patch, but is done just for completeness. It looks like we never call getSection on an alias. All the tests still pass if the if is replaced with an assert. llvm-svn: 208139	2014-05-06 22:44:30 +00:00
Andrew Trick	d0d8cb1d21	Update an embarrassing out-of-date comment. llvm-svn: 208137	2014-05-06 22:18:43 +00:00
Eric Christopher	9c928478f4	Use a range based for loop for the SubtargetFeatures print function. llvm-svn: 208132	2014-05-06 21:20:29 +00:00
David Blaikie	9dabbf6228	Revert "Try simplifying LexicalScopes ownership again." Speculatively reverting due to a suspicious failure on a Windows buildbot. This reverts commit 10c37a012ea11596d44cd9059fe09c959caf30c8. llvm-svn: 208131	2014-05-06 21:07:17 +00:00
Eric Christopher	0e6f41c659	Fix odd formatting that snuck into last patch. llvm-svn: 208130	2014-05-06 21:04:27 +00:00
Joerg Sonnenberger	818e725158	If a function needs a frame pointer, but r11 (aka fp) has not been used, remove it from the list of unspilled registers. Otherwise the following attempt to keep the stack aligned by picking an extra GPR register to spill will not work as it picks up r11. llvm-svn: 208129	2014-05-06 20:43:01 +00:00
Eric Christopher	dc5072d60e	ArrayRef-ize the Feature and Processor tables for SubtargetFeatures. This removes arguments passed everywhere and allows the use of standard iteration over lists. Should be no functional change. llvm-svn: 208127	2014-05-06 20:23:04 +00:00
Reid Kleckner	118e1bf862	Copy the full TailCallKind in CallInst::clone_impl Split from the musttail inliner change. This will be covered by an opt test when the inliner change lands. llvm-svn: 208126	2014-05-06 20:08:20 +00:00
Diego Novillo	dd49157db1	Do not make -pass-remarks additive. Summary: When I initially introduced -pass-remarks, I thought it would be a neat idea to make it additive. So, if one used it as: $ llc -pass-remarks=inliner --pass-remarks=loop.* the compiler would build the regular expression '(inliner)|(loop.*)'. The more I think about it, the more I regret it. This is not how other flags work. The standard semantics are right-to-left overrides. This is how clang interprets -Rpass. And I think the two should be compatible in this respect. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3614 llvm-svn: 208122	2014-05-06 19:14:00 +00:00
Benjamin Kramer	1625bfccbe	TTI: Estimate @llvm.fmuladd cost as fmul + fadd when FMA's aren't legal on the target. llvm-svn: 208115	2014-05-06 18:36:23 +00:00
Andrea Di Biagio	c14ccc9184	[X86] Improve the lowering of BITCAST dag nodes from type f64 to type v2i32 (and vice versa). Before this patch, the backend always emitted a store+load sequence to bitconvert from f64 to i64 the input operand of an ISD::BITCAST dag node that performed a bitconvert from type MVT::f64 to type MVT::v2i32. The resulting i64 node was then used to build a v2i32 vector. With this patch, the backend now produces a cheaper SCALAR_TO_VECTOR from MVT::f64 to MVT::v2f64. That SCALAR_TO_VECTOR is then followed by a "free" bitcast to type MVT::v4i32. The elements of the resulting v4i32 are then extracted to build a v2i32 vector (which is illegal and therefore promoted to MVT::v2i64). This is in general cheaper than emitting a stack store+load sequence to bitconvert the operand from type f64 to type i64. llvm-svn: 208107	2014-05-06 17:09:03 +00:00
Renato Golin	c7aea40ec6	Implementing named register intrinsics This patch implements the infrastructure to use named register constructs in programs that need access to specific registers (bare metal, kernels, etc). So far, only the stack pointer is supported as a technology preview, but as it is, the intrinsic can already support all non-allocatable registers from any architecture. llvm-svn: 208104	2014-05-06 16:51:25 +00:00
Rafael Espindola	52dc5d828f	Special case aliases in GlobalValue::getAlignment. An alias has the address of what it points to, so it also has the same alignment. This allows a few optimizations to see past aliases for free. llvm-svn: 208103	2014-05-06 16:48:58 +00:00
Eric Christopher	a9f3a5cb37	Have the SubtargetFeature help routine just not return a number and fall back to the normal path without a cpu. While doing this fix llc to just exit when we don't have a module to process instead of asserting. llvm-svn: 208102	2014-05-06 16:29:50 +00:00
Rafael Espindola	8fbbfbbec3	Be more strict about not allowing setSection on aliases. llvm-svn: 208095	2014-05-06 14:59:14 +00:00
Rafael Espindola	a7d9c69cc8	Be more strict about not calling setAlignment on global aliases. The fact that GlobalAlias::setAlignment exists at all is a side effect of how the classes are organized, it should never be used. llvm-svn: 208094	2014-05-06 14:51:36 +00:00
Tim Northover	618850b6a5	AArch64/ARM64: implement diagnosis of unpredictable loads & stores llvm-svn: 208091	2014-05-06 14:15:14 +00:00
Tim Northover	15641cd4e1	AArch64/ARM64: make NEON vector list parsing a bit more robust It doesn't change the results, but it seems silly not to diagnose obvious problems early on. llvm-svn: 208083	2014-05-06 12:50:51 +00:00
Tim Northover	339ecf14ee	AArch64/ARM64: add more specific diagnostic for floating imm 0.0. llvm-svn: 208082	2014-05-06 12:50:47 +00:00
Tim Northover	05cbe7c80a	AArch64/ARM64: add more specific diagnostic for invalid vector lanes llvm-svn: 208081	2014-05-06 12:50:44 +00:00
Tim Northover	0f54f309bb	AArch64/ARM64: produce more informative diagnostic assembling some immediates No tests here, they'll be added when the entire neon-diagnostics.s test from AArch64 is enabled. llvm-svn: 208079	2014-05-06 11:18:53 +00:00
Christian Pirker	fdce7cea93	ARM: For thumb fixups store halfwords high first and low second llvm-svn: 208076	2014-05-06 10:05:11 +00:00
Kevin Qin	1353c3405d	[ARM64] Enable alignment control option in front-end for ARM64. This is the modification in llvm part. llvm-svn: 208074	2014-05-06 09:48:52 +00:00
Craig Topper	646f64f04a	Use X86 memory operand enums instead of hardcoding. llvm-svn: 208064	2014-05-06 07:04:32 +00:00
David Blaikie	d3f094a33b	PR19598: Provide the ability to RAUW a declaration with itself, creating a non-temporary copy and using that to RAUW. Also, provide the ability to create temporary and non-temporary declarations, as not all declarations may be replaced by definitions later on. This provides the necessary infrastructure for Clang to fix PR19598, leaking temporary MDNodes in Clang's debug info generation. llvm-svn: 208054	2014-05-06 03:41:57 +00:00
Eric Christopher	7eba3f90ae	Revert "Walk back commits for unused function parameters - they're still being" this reapplies 208012 and 208002. llvm-svn: 208037	2014-05-06 02:37:26 +00:00
Duncan P. N. Exon Smith	87c40fdfdb	blockfreq: Move include to .cpp llvm-svn: 208035	2014-05-06 01:57:42 +00:00
Richard Smith	c167d656e7	Re-commit r208025, reverted in r208030, with a fix for a conformance issue which GCC detects and Clang does not! llvm-svn: 208033	2014-05-06 01:44:26 +00:00
Richard Smith	09bf116939	Revert r208025, which made buildbots unhappy for unknown reasons. llvm-svn: 208030	2014-05-06 01:26:00 +00:00
Reid Kleckner	4a406d32e9	Fix i128 div/mod on mingw64 The Win64 docs are very clear that anything larger than 8 bytes is passed by reference, and GCC MinGW64 honors that for __modti3 and friends. Patch by Jameson Nash! llvm-svn: 208029	2014-05-06 01:20:42 +00:00
Argyrios Kyrtzidis	8c1eafc9b0	[Support/MemoryBuffer] Rename IsVolatile -> IsVolatileSize and add a comment about the use case for the new parameter. llvm-svn: 208026	2014-05-06 01:03:52 +00:00

69418 Commits