llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	29f2e95185	[X86] Use i8 immediate for comparison type on AVX512 packed integer instructions. This matches floating point equivalents. Includes autoupgrade support to convert old code. llvm-svn: 227063	2015-01-25 23:26:02 +00:00
Alex Rosenberg	38babd1eeb	Add the triple for the Sony Playstation®4. Lots more to follow. llvm-svn: 227060	2015-01-25 22:46:59 +00:00
Adrian Prantl	40cb819c6f	Debug info: Fix PR22296 by omitting the DW_AT_location if we lost the physical register that is described in a DBG_VALUE. In the testcase the DBG_VALUE describing "p5" becomes unavailable because the register its address is in is clobbered and we (currently) aren't smart enough to realize that the value is rematerialized immediately after the DBG_VALUE and/or is actually a stack slot. llvm-svn: 227056	2015-01-25 19:04:08 +00:00
Bill Schmidt	258ac3cc56	[PowerPC] Revert ppc64le-aggregates.ll test changes from r227053 It appears we have different behavior with and without -mcpu=pwr8 even with ppc64le defaulting to POWER8. The failure appears as follows: /home/bb/cmake-llvm-x86_64-linux/llvm-project/llvm/test/CodeGen/PowerPC/ppc64le-aggregates.ll:268:14: error: expected string not found in input ; CHECK-DAG: lfs 1, 0([[REG]]) ^ <stdin>:497:11: note: scanning from here ld 3, .LC1@toc@l(3) ^ <stdin>:497:11: note: with variable "REG" equal to "3" ld 3, .LC1@toc@l(3) ^ <stdin>:514:2: note: possible intended match here lfs 1, 0(4) ^ Reverting this particular test case change. Nemanja, please have a look at the reason for the failure. llvm-svn: 227055	2015-01-25 18:18:54 +00:00
Bill Schmidt	279cabb450	[PowerPC] Reset the baseline for ppc64le to be equivalent to pwr8 Test by Nemanja Ivanovic. Since ppc64le implies POWER8 as a minimum, it makes sense that the same features are included. Since the pwr8 processor model will likely be getting new features until the implementation is complete, I created a new list to add these updates to. This will include them in both pwr8 and ppc64le. Furthermore, it seems that it would make sense to compose the feature lists for other processor models (pwr3 and up). Per discussion in the review, I will make this change in a subsequent patch. In order to test the changes, I've added an additional run step to test cases that specify -march=ppc64le -mcpu=pwr8 to omit the -mcpu option. Since the feature lists are the same, the behaviour should be unchanged. llvm-svn: 227053	2015-01-25 18:05:42 +00:00
Simon Atanasyan	4685c839d4	[docs] Add link to the MIPS 64-bit ELF object file specification llvm-svn: 227050	2015-01-25 16:20:30 +00:00
NAKAMURA Takumi	4834009898	Instantiate Registry<GCStrategy> in LLVMCore, to let it available on Win32 DLL. llvm-svn: 227046	2015-01-25 15:05:36 +00:00
Simon Atanasyan	5d19c67a68	[ELFYAML] Support mips64 relocation record format in yaml2obj/obj2yaml MIPS64 ELF file has a very specific relocation record format. Each record might specify up to three relocation operations. So the `r_info` field in fact consists of three relocation type sub-fields and optional code of "special" symbols. http://techpubs.sgi.com/library/manuals/4000/007-4658-001/pdf/007-4658-001.pdf page 40 The patch implements support of the MIPS64 relocation record format in yaml2obj/obj2yaml tools by introducing new optional Relocation fields: Type2, Type3, and SpecSym. These fields are recognized only if the object/YAML file relates to the MIPS64 target. Differential Revision: http://reviews.llvm.org/D7136 llvm-svn: 227044	2015-01-25 13:29:25 +00:00
Elena Demikhovsky	1a603b3f13	AVX-512: Changes in operations on masks registers for KNL and SKX - Added KSHIFTB/D/Q for skx - Added KORTESTB/D/Q for skx - Fixed store operation for v8i1 type for KNL - Store size of v8i1, v4i1 and v2i1 are changed to 8 bits llvm-svn: 227043	2015-01-25 12:47:15 +00:00
NAKAMURA Takumi	c17696022d	Orc/IRCompileLayer.h: Avoid non-static initializer. llvm-svn: 227042	2015-01-25 11:41:56 +00:00
NAKAMURA Takumi	2fb9a523ab	OrcJIT: Avoid non-static initializers. llvm-svn: 227041	2015-01-25 11:41:49 +00:00
NAKAMURA Takumi	94a6bef31f	Orc/LLVMBuild.txt: Prune redundant "Target" in libdeps. llvm-svn: 227040	2015-01-25 11:41:41 +00:00
Craig Topper	ca8e179bc2	[X86] Give scalar VRNDSCALE instructions priority in AVX512 mode. llvm-svn: 227039	2015-01-25 08:49:22 +00:00
Craig Topper	1d60952401	Simplify a multiclass. No functional change. llvm-svn: 227038	2015-01-25 08:49:19 +00:00
Craig Topper	e3155c96ee	Remove tab characters. NFC llvm-svn: 227036	2015-01-25 08:45:32 +00:00
Elena Demikhovsky	a3232f764e	Implemented cost model for masked load/store operations. llvm-svn: 227035	2015-01-25 08:44:46 +00:00
Craig Topper	53a846764c	[X86] Replace i32i8imm on SSE/AVX instructions with i32u8imm which will make the assembler bounds check them. It will also make them print as unsigned. llvm-svn: 227032	2015-01-25 02:21:16 +00:00
Craig Topper	fc946a0e6f	[X86] Use u8imm in several places that used i32i8imm that don't require an i32 type. llvm-svn: 227031	2015-01-25 02:21:13 +00:00
Craig Topper	e7f6cf437c	Remove tab characters. NFC. llvm-svn: 227030	2015-01-25 02:21:11 +00:00
Chandler Carruth	db0809fff3	[PM] Remove the restricted visibility from the instcombine worklist. Now that library consumers access the instcombine pass directly, they also (transitively) access the worklist. Also, it would need to be used directly in order to have a useful utility if we ever want that. This should fix some warnings since I moved this code. Sorry for the trouble. llvm-svn: 227025	2015-01-25 00:30:05 +00:00
Lang Hames	b5be1addce	Remove a few more redundant ExecutionEngine regression tests. llvm-svn: 227021	2015-01-24 22:41:13 +00:00
Charlie Turner	d415cc8198	Fixup debug information references. llvm-svn: 227020	2015-01-24 21:51:21 +00:00
Charlie Turner	6cba064414	Update references to lines of code count. The number of lines of code in Kaleidoscope has risen from the previously reported 700 to 986 according to the cloc tool. This tools was run on the toy.cpp file from Chapter 8. llvm-svn: 227019	2015-01-24 21:51:17 +00:00
Justin Bogner	db7afdbb01	InstrProf: Add operator!= to coverage counters I'll use this in clang shortly. Also makes the operator definition style more consistent in this class. llvm-svn: 227018	2015-01-24 21:13:23 +00:00
Justin Bogner	3c0f124a74	llvm-cov: Only combine segments if they overlap exactly If two coverage segments cover the same area we need to combine them, as per r218432. OTOH, just because they start at the same place doesn't mean they cover the same area. This fixes the check to be more exact about this. This is pretty hard to test right now. The frontend doesn't currently emit regions that start at the same place but don't overlap, but some upcoming work changes this. llvm-svn: 227017	2015-01-24 20:58:52 +00:00
Patrik Hagglund	d03dad8b4b	Revert r227013 "Add visibility attribute for InstCombinePass (r226987)." Buildbot breakage. http://lab.llvm.org:8011/builders/clang-hexagon-elf/builds/21749 llvm-svn: 227016	2015-01-24 20:35:36 +00:00
Saleem Abdulrasool	fc07c72674	CodeGen: drive-by formatting clean ups Minor tweaks to whitespace formatting that I noticed was off. NFC. llvm-svn: 227014	2015-01-24 20:19:45 +00:00
Patrik Hagglund	87aada2d3c	Add visibility attribute for InstCombinePass (r226987). Warning by gcc: 'llvm::InstCombinePass' declared with greater visibility than the type of its field 'llvm::InstCombinePass::Worklist' [-Wattributes] llvm-svn: 227013	2015-01-24 20:06:53 +00:00
Benjamin Kramer	4d50b6c306	DebugInfo: Fix use after return found by asan. llvm-svn: 227012	2015-01-24 19:55:23 +00:00
Lang Hames	069ff67b1a	[Orc] Add TransformUtils to Orc's dependency list. Patch by Jan Vesely. Thanks Jan! llvm-svn: 227011	2015-01-24 19:00:09 +00:00
Lang Hames	73e55395e8	Remove a number of redundant ExecutionEngine regression tests. These tests used to test the legacy JIT but since that has been removed they're just redundantly testing MCJIT. Remove them and just leave their counterparts in test/ExecutionEngine/MCJIT. llvm-svn: 227010	2015-01-24 18:49:51 +00:00
Alexei Starovoitov	8a3d0c1557	bpf: add missing lit.local.cfg llvm-svn: 227009	2015-01-24 18:20:52 +00:00
Alexei Starovoitov	e4c8c807bb	BPF backend Summary: V8->V9: - cleanup tests V7->V8: - addressed feedback from David: - switched to range-based 'for' loops - fixed formatting of tests V6->V7: - rebased and adjusted AsmPrinter args - CamelCased .td, fixed formatting, cleaned up names, removed unused patterns - diffstat: 3 files changed, 203 insertions(+), 227 deletions(-) V5->V6: - addressed feedback from Chandler: - reinstated full verbose standard banner in all files - fixed variables that were not in CamelCase - fixed names of #ifdef in header files - removed redundant braces in if/else chains with single statements - fixed comments - removed trailing empty line - dropped debug annotations from tests - diffstat of these changes: 46 files changed, 456 insertions(+), 469 deletions(-) V4->V5: - fix setLoadExtAction() interface - clang-formated all where it made sense V3->V4: - added CODE_OWNERS entry for BPF backend V2->V3: - fix metadata in tests V1->V2: - addressed feedback from Tom and Matt - removed top level change to configure (now everything via 'experimental-backend') - reworked error reporting via DiagnosticInfo (similar to R600) - added few more tests - added cmake build - added Triple::bpf - tested on linux and darwin V1 cover letter: --------------------- recently linux gained "universal in-kernel virtual machine" which is called eBPF or extended BPF. The name comes from "Berkeley Packet Filter", since new instruction set is based on it. This patch adds a new backend that emits extended BPF instruction set. The concept and development are covered by the following articles: http://lwn.net/Articles/599755/ http://lwn.net/Articles/575531/ http://lwn.net/Articles/603983/ http://lwn.net/Articles/606089/ http://lwn.net/Articles/612878/ One of use cases: dtrace/systemtap alternative. bpf syscall manpage: https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/commit/?id=b4fc1a460f3017e958e6a8ea560ea0afd91bf6fe instruction set description and differences vs classic BPF: http://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git/tree/Documentation/networking/filter.txt Short summary of instruction set: - 64-bit registers R0 - return value from in-kernel function, and exit value for BPF program R1 - R5 - arguments from BPF program to in-kernel function R6 - R9 - callee saved registers that in-kernel function will preserve R10 - read-only frame pointer to access stack - two-operand instructions like +, -, *, mov, load/store - implicit prologue/epilogue (invisible stack pointer) - no floating point, no simd Short history of extended BPF in kernel: interpreter in 3.15, x64 JIT in 3.16, arm64 JIT, verifier, bpf syscall in 3.18, more to come in the future. It's a very small and simple backend. There is no support for global variables, arbitrary function calls, floating point, varargs, exceptions, indirect jumps, arbitrary pointer arithmetic, alloca, etc. From C front-end point of view it's very restricted. It's done on purpose, since kernel rejects all programs that it cannot prove safe. It rejects programs with loops and with memory accesses via arbitrary pointers. When kernel accepts the program it is guaranteed that program will terminate and will not crash the kernel. This patch implements all 'must have' bits. There are several things on TODO list, so this is not the end of development. Most of the code is a boiler plate code, copy-pasted from other backends. Only odd things are lack or < and <= instructions, specialized load_byte intrinsics and 'compare and goto' as single instruction. Current instruction set is fixed, but more instructions can be added in the future. Signed-off-by: Alexei Starovoitov <alexei.starovoitov@gmail.com> Subscribers: majnemer, chandlerc, echristo, joerg, pete, rengolin, kristof.beyls, arsenm, t.p.northover, tstellarAMD, aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D6494 llvm-svn: 227008	2015-01-24 17:51:26 +00:00
Daniel Sanders	9a4f2c55df	[mips] Fix 'jumpy' debug line info around calls. Summary: At the moment, address calculation is taking the debug line info from the address node (e.g. TargetGlobalAddress). When a function is called multiple times, this results in output of the form: .loc $first_call_location .. address calculation .. .. function call .. .. address calculation .. .loc $second_call_location .. function call .. .loc $first_call_location .. address calculation .. .loc $third_call_location .. function call .. This patch makes address calculations for function calls take the debug line info for the call node and results in output of the form: .loc $first_call_location .. address calculation .. .. function call .. .loc $second_call_location .. address calculation .. .. function call .. .loc $third_call_location .. address calculation .. .. function call .. All other address calculations continue to use the address node. Test Plan: Fixes test/DebugInfo/multiline.ll on a mips host. Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D7050 llvm-svn: 227005	2015-01-24 14:35:11 +00:00
Sylvestre Ledru	450f97dcb9	Update of the gold-plugin.cpp code to match Chandler's changes (r226981) llvm-svn: 227004	2015-01-24 13:59:08 +00:00
Daniel Sanders	5a9225b262	[mips] Fix assertion on i128 addition/subtraction on MIPS64 Summary: In addition to the included tests, this fixes test/CodeGen/Generic/i128-addsub.ll on a mips64 host. Reviewers: atanasyan, sagar, vmedic Reviewed By: vmedic Subscribers: sdkie, llvm-commits Differential Revision: http://reviews.llvm.org/D6610 llvm-svn: 227003	2015-01-24 12:58:10 +00:00
Andrea Di Biagio	8381475a75	[DAG] Fix wrong canonicalization performed on shuffle nodes. This fixes a regression introduced by r226816. When replacing a splat shuffle node with a constant build_vector, make sure that the new build_vector has a valid number of elements. Thanks to Patrik Hagglund for reporting this problem and providing a small reproducible. llvm-svn: 227002	2015-01-24 11:54:29 +00:00
Chandler Carruth	9dea5cdb8e	[PM] General doxygen and comment cleanup for this pass. llvm-svn: 227001	2015-01-24 11:44:32 +00:00
Chandler Carruth	7253bba458	[PM] Reformat this code with clang-format so that I can use clang-format when refactoring for the new pass manager without introducing too many formatting changes into meaning full diffs. llvm-svn: 227000	2015-01-24 11:33:55 +00:00
Chandler Carruth	43e590e51f	[PM] Port LowerExpectIntrinsic to the new pass manager. This just lifts the logic into a static helper function, sinks the legacy pass to be a trivial wrapper of that helper fuction, and adds a trivial wrapper for the new PM as well. Not much to see here. I switched a test case to run in both modes, but we have to strip the dead prototypes separately as that pass isn't in the new pass manager (yet). llvm-svn: 226999	2015-01-24 11:13:02 +00:00
Chandler Carruth	c3bf5bd8cf	[PM] Change LowerExpectIntrinsic to actually return true when it has changed the IR. This is particularly easy as we can just look for the existence of any expect intrinsic at all to know whether we've changed the IR. llvm-svn: 226998	2015-01-24 11:12:57 +00:00
Chandler Carruth	6eb60eb5c9	[PM] Use a more appropriate name for the statistics variable in lower-expect, as we don't have 'if's in the IR and we use it for switches as well. llvm-svn: 226997	2015-01-24 10:57:25 +00:00
Chandler Carruth	d12741e0a9	[PM] Switch tihs code to use a range based for loop over the function. We can't switch the loop over the instructions because it needs to early-increment the iterator. llvm-svn: 226996	2015-01-24 10:57:19 +00:00
Chandler Carruth	3f5e7b1fb6	[PM] Use a SmallVector instead of std::vector to avoid heap allocations for small switches, and avoid using a complex loop to set up the weights. We know what the baseline weights will be so we can just resize the vector to contain all that value and clobber the one slot that is likely. This seems much more direct than the previous code that tested at every iteration, and started off by zeroing the vector. llvm-svn: 226995	2015-01-24 10:47:13 +00:00
Chandler Carruth	0012c778a4	[PM] Pull the two helpers for this pass into static functions. There are no members for them to use. Also, make them accept references as there is no possibility of a null pointer. llvm-svn: 226994	2015-01-24 10:39:24 +00:00
Chandler Carruth	579c5c45c2	[PM] Add a basic doxygen comment for this pass. llvm-svn: 226993	2015-01-24 10:32:53 +00:00
Chandler Carruth	0ea746bf9f	[PM] Clean up the formatting of the LowerExpectIntrinsic pass prior to refactoring its code. llvm-svn: 226992	2015-01-24 10:30:14 +00:00
Chandler Carruth	72793727cc	[PM] Move the LowerExpectIntrinsic pass to the Scalar library. It was already in the Scalar header and referenced extensively as being in this library, the source file was just in the utils directory for some reason. No actual functionality changed. I noticed as it didn't make sense to add a pass header to the utils headers. llvm-svn: 226991	2015-01-24 10:18:47 +00:00
Yunzhong Gao	a8cf495a15	If we see UTF-8 BOM sequence at the beginning of a response file, we shall remove these bytes before parsing. Phabricator Revision: http://reviews.llvm.org/D7156 llvm-svn: 226988	2015-01-24 04:23:08 +00:00
Chandler Carruth	83ba269e4b	[PM] Port instcombine to the new pass manager! This is exciting as this is a much more involved port. This is a complex, existing transformation pass. All of the core logic is shared between both old and new pass managers. Only the access to the analyses is separate because the actual techniques are separate. This also uses a bunch of different and interesting analyses and is the first time where we need to use an analysis across an IR layer. This also paves the way to expose instcombine utility functions. I've got a static function that implements the core pass logic over a function which might be mildly interesting, but more interesting is likely exposing a routine which just uses instructions already in the worklist and combines until empty. I've switched one of my favorite instcombine tests to run with both as well to make sure this keeps working. llvm-svn: 226987	2015-01-24 04:19:17 +00:00
Filipe Cabecinhas	de968ecb05	[Bitcode] Diagnose errors instead of asserting from bad input Eventually we can make some of these pass the error along to the caller. Reports a fatal error if: We find an invalid abbrev record We try to get an invalid abbrev number We can't fill the current word due to an EOF Fixed an invalid bitcode test to check for output with FileCheck Bugs found with afl-fuzz llvm-svn: 226986	2015-01-24 04:15:05 +00:00
Chandler Carruth	c0291865ed	[PM] Rework how the TargetLibraryInfo pass integrates with the new pass manager to support the actual uses of it. =] When I ported instcombine to the new pass manager I discover that it didn't work because TLI wasn't available in the right places. This is a somewhat surprising and/or subtle aspect of the new pass manager design that came up before but I think is useful to be reminded of: While the new pass manager allows a function pass to query a module analysis, it requires that the module analysis is already run and cached prior to the function pass manager starting up, possibly with a 'require<foo>' style utility in the pass pipeline. This is an intentional hurdle because using a module analysis from a function pass requires that the module analysis is run prior to entering the function pass manager. Otherwise the other functions in the module could be in who-knows-what state, etc. A somewhat surprising consequence of this design decision (at least to me) is that you have to design a function pass that leverages a module analysis to do so as an optional feature. Even if that means your function pass does no work in the absence of the module analysis, you have to handle that possibility and remain conservatively correct. This is a natural consequence of things being able to invalidate the module analysis and us being unable to re-run it. And it's a generally good thing because it lets us reorder passes arbitrarily without breaking correctness, etc. This ends up causing problems in one case. What if we have a module analysis that is definitionally impossible to invalidate. In the places this might come up, the analysis is usually also definitionally trivial to run even while other transformation passes run on the module, regardless of the state of anything. And so, it follows that it is natural to have a hard requirement on such analyses from a function pass. It turns out, that TargetLibraryInfo is just such an analysis, and InstCombine has a hard requirement on it. The approach I've taken here is to produce an analysis that models this flexibility by making it both a module and a function analysis. This exposes the fact that it is in fact safe to compute at any point. We can even make it a valid CGSCC analysis at some point if that is useful. However, we don't want to have a copy of the actual target library info state for each function! This state is specific to the triple. The somewhat direct and blunt approach here is to turn TLI into a pimpl, with the state and mutators in the implementation class and the query routines primarily in the wrapper. Then the analysis can lazily construct and cache the implementations, keyed on the triple, and on-demand produce wrappers of them for each function. One minor annoyance is that we will end up with a wrapper for each function in the module. While this is a bit wasteful (one pointer per function) it seems tolerable. And it has the advantage of ensuring that we pay the absolute minimum synchronization cost to access this information should we end up with a nice parallel function pass manager in the future. We could look into trying to mark when analysis results are especially cheap to recompute and more eagerly GC-ing the cached results, or we could look at supporting a variant of analyses whose results are specifically not cached and expected to just be used and discarded by the consumer. Either way, these seem like incremental enhancements that should happen when we start profiling the memory and CPU usage of the new pass manager and not before. The other minor annoyance is that if we end up using the TLI in both a module pass and a function pass, those will be produced by two separate analyses, and thus will point to separate copies of the implementation state. While a minor issue, I dislike this and would like to find a way to cleanly allow a single analysis instance to be used across multiple IR unit managers. But I don't have a good solution to this today, and I don't want to hold up all of the work waiting to come up with one. This too seems like a reasonable thing to incrementally improve later. llvm-svn: 226981	2015-01-24 02:06:09 +00:00
Richard Smith	baa128ea5a	Bring the modules buildbot back to life after r226940. llvm-svn: 226980	2015-01-24 01:55:52 +00:00
Kuba Brecka	a62bbccad0	Reverting r226937: lit: Make MCJIT's supported arch check case insensitive The r226937 commit causes ASan lit tests to be all skipped on OS X. llvm-svn: 226979	2015-01-24 01:42:44 +00:00
Quentin Colombet	29f553398f	[AArch64][LoadStoreOptimizer] Form LDPSW when possible. This patch adds the missing LD[U]RSW variants to the load store optimizer, so that we generate LDPSW when possible. <rdar://problem/19583480> llvm-svn: 226978	2015-01-24 01:25:54 +00:00
Lang Hames	f4ff3435e7	[Orc] Add some missing headers to the CompileOnDemandLayer.h llvm-svn: 226975	2015-01-24 00:45:11 +00:00
Bruno Cardoso Lopes	ddcc2e31a7	[x86] Fix a comment llvm-svn: 226974	2015-01-24 00:22:04 +00:00
Lang Hames	17e6caee63	[Orc] Add orcjit to the dependencies list in the Makefile for lli. This should fix a few more broken bots. llvm-svn: 226973	2015-01-24 00:01:29 +00:00
Tom Stellard	edd188c459	R600/SI: Emit .hsa.version section for amdhsa OS llvm-svn: 226970	2015-01-23 23:59:08 +00:00
Reid Kleckner	3d4638b391	Fix assertion when C++ EH filters are present in functions using SEH Should fix PR22305. llvm-svn: 226969	2015-01-23 23:51:25 +00:00
Adrian Prantl	70f2a736db	Address more review comments for DIExpression::iterator. - input_iterator - define an operator-> - make constructors private were possible llvm-svn: 226967	2015-01-23 23:40:47 +00:00
Justin Bogner	cf3063b4a3	InstrProf: debug dumps should go to dbgs(), not outs() llvm-svn: 226964	2015-01-23 23:28:30 +00:00
Justin Bogner	0b9858dca5	llvm-cov: Don't use llvm::outs() in library code Nothing in lib/ should be using llvm::outs() directly. Thread it in from the caller instead. llvm-svn: 226961	2015-01-23 23:09:27 +00:00
Justin Bogner	000b5223e4	llvm-cov: Use range-for (NFC) llvm-svn: 226960	2015-01-23 22:57:02 +00:00
Reid Kleckner	f4ebbc6825	mips: Fix "XPASS" test results by removing 'not' commands These tests are asserting and crashing for me, and 'not' sees that as a non-zero exit code instead of a signal code for obscure Windows reasons. This causes the test to pass, giving me an unclean 'ninja check'. The test is already XFAILd, so just run the test without 'not' and let lit handle the failure. llvm-svn: 226958	2015-01-23 22:55:31 +00:00
Bruno Cardoso Lopes	56567f9135	[x86] Combine x86mmx/i64 to v2i64 conversion to use scalar_to_vector Handle the poor codegen for i64/x86xmm->v2i64 (%mm -> %xmm) moves. Instead of using stack store/load pair to do the job, use scalar_to_vector directly, which in the MMX case can use movq2dq. This was the current behavior prior to improvements for vector legalization of extloads in r213897. This commit fixes the regression and as a side-effect also remove some unnecessary shuffles. In the new attached testcase, we go from: pshufw $-18, (%rdi), %mm0 movq %mm0, -8(%rsp) movq -8(%rsp), %xmm0 pshufd $-44, %xmm0, %xmm0 movd %xmm0, %eax ... To: pshufw $-18, (%rdi), %mm0 movq2dq %mm0, %xmm0 movd %xmm0, %eax ... Differential Revision: http://reviews.llvm.org/D7126 rdar://problem/19413324 llvm-svn: 226953	2015-01-23 22:44:16 +00:00
Justin Bogner	011c742535	llvm-cov: clang-format the GCOV files (NFC) llvm-svn: 226952	2015-01-23 22:38:01 +00:00
Reid Kleckner	7b23b43bfe	Fix the MSVC build with the new Orc JIT APIs llvm-svn: 226949	2015-01-23 22:25:47 +00:00
Michael J. Spencer	58e419593c	[YAMLIO] Dirty hack: Force integral conversion to allow strong typedefs to convert. llvm-svn: 226948	2015-01-23 22:24:57 +00:00
Lang Hames	28452d85c8	[Orc] Remove a bunch of constructors from ObjectLinkingLayer. These constructors were causing trouble for MSVC and older GCCs. This should fix more of the build failures from r226940. llvm-svn: 226946	2015-01-23 22:11:07 +00:00
Tom Stellard	20f6c0732f	R600/SI: Move i64 -> v2i32 load promotion into AMDGPUDAGToDAGISel::Select() We used to do this promotion during DAG legalization, but this caused an infinite loop in ExpandUnalignedLoad() because it assumed that i64 loads were legal if i64 was a legal type. It also seems better to report i64 loads as legal, since they actually are and we were just promoting them to simplify our tablegen files. llvm-svn: 226945	2015-01-23 22:05:45 +00:00
Michael J. Spencer	e368a62676	[Object][ELF] Test unknown type. llvm-svn: 226943	2015-01-23 21:58:09 +00:00
Michael J. Spencer	731cae3839	[YAMLIO] Add support for numeric values in enums. llvm-svn: 226942	2015-01-23 21:57:50 +00:00
Lang Hames	d235df0aaa	[Orc] LLVMLinkInOrcMCJITReplacement shouldn't be in the anonymous namespace. This should fix some of the builder errors from r226940. llvm-svn: 226941	2015-01-23 21:49:12 +00:00
Lang Hames	93de2a12a3	[Orc] New JIT APIs. This patch adds a new set of JIT APIs to LLVM. The aim of these new APIs is to cleanly support a wider range of JIT use cases in LLVM, and encourage the development and contribution of re-usable infrastructure for LLVM JIT use-cases. These APIs are intended to live alongside the MCJIT APIs, and should not affect existing clients. Included in this patch: 1) New headers in include/llvm/ExecutionEngine/Orc that provide a set of components for building JIT infrastructure. Implementation code for these headers lives in lib/ExecutionEngine/Orc. 2) A prototype re-implementation of MCJIT (OrcMCJITReplacement) built out of the new components. 3) Minor changes to RTDyldMemoryManager needed to support the new components. These changes should not impact existing clients. 4) A new flag for lli, -use-orcmcjit, which will cause lli to use the OrcMCJITReplacement class as its underlying execution engine, rather than MCJIT itself. Tests to follow shortly. Special thanks to Michael Ilseman, Pete Cooper, David Blaikie, Eric Christopher, Justin Bogner, and Jim Grosbach for extensive feedback and discussion. llvm-svn: 226940	2015-01-23 21:25:00 +00:00
Adrian Prantl	4bd0a0c3ed	Move the accessor functions from DIExpression::iterator into a wrapper DIExpression::Operand, so we can write range-based for loops. Thanks to David Blaikie for the idea. llvm-svn: 226939	2015-01-23 21:24:41 +00:00
Reid Kleckner	5f5036a073	lit: Make MCJIT's supported arch check case insensitive Should make the tests run when using CMake on systems where 'uname -p' reports "amd64", such as FreeBSD. Should fix PR21559. llvm-svn: 226937	2015-01-23 21:11:40 +00:00
Kevin Enderby	479ee6135d	Fix the problem with llvm-objdump and -archive-headers in printing the archive header size field. This problem showed up with the clang-cmake-armv7-a15-full bot. Thanks to Renato Golin for his help. llvm-svn: 226936	2015-01-23 21:02:44 +00:00
Alexei Starovoitov	4ea2f606a8	[mips] fix spelling of 'disassembler' trivial first commit llvm-svn: 226935	2015-01-23 21:00:08 +00:00
Hans Wennborg	ae9c971a2f	LowerSwitch: replace unreachable default with popular case destination SimplifyCFG currently does this transformation, but I'm planning to remove that to allow other passes, such as this one, to exploit the unreachable default. This patch takes care to keep track of what case values are unreachable even after the transformation, allowing for more efficient lowering. Differential Revision: http://reviews.llvm.org/D6697 llvm-svn: 226934	2015-01-23 20:43:51 +00:00
Colin LeMahieu	bc2f47a76e	[Objdump] Output information about common symbols in a way closer to GNU objdump. llvm-svn: 226932	2015-01-23 20:06:24 +00:00
Ramkumar Ramachandra	c3c8b27616	[emacs] llvm-mode: fix parens, font-lock i* In llvm-mode, with electric-pair-mode turned on, typing a literal '[' would print out '[[', and '(' would print a '(('. This was a very annoying bug caused by overzealous syntax-table entries: the parens are already part of the '(' and ')' class by default. Fix this. While at it, notice that i32, i64, i1 etc. are not font-locked despite a clear intent to do so. The issue is that regexp-opt doesn't accept regular expressions. So, spell out the common literal integers with different widths. Differential Revision: http://reviews.llvm.org/D7036 llvm-svn: 226931	2015-01-23 19:45:35 +00:00
Kevin Enderby	69fe98da14	Add the option, -data-in-code, to llvm-objdump used with -macho to print the Mach-O data in code table. llvm-svn: 226921	2015-01-23 18:52:17 +00:00
Reid Kleckner	5cc1569c54	Classify functions by EH personality type rather than using the triple This mostly reverts commit r222062 and replaces it with a new enum. At some point this enum will grow at least for other MSVC EH personalities. Also beefs up the way we were sniffing the personality function. Previously we would emit the Itanium LSDA despite using __C_specific_handler. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D6987 llvm-svn: 226920	2015-01-23 18:49:01 +00:00
Adrian Prantl	577feba44b	Debug Info / PR22309: Allow union types to be emitted as unsigned constants. llvm-svn: 226919	2015-01-23 18:01:39 +00:00
Eric Christopher	a1c6e0c8ce	Remove some local variables in place of just querying for them in the couple of asserts. llvm-svn: 226917	2015-01-23 17:22:44 +00:00
Toma Tabacu	c405c82214	[mips] Add new error message and improve testing for parsing the .module directive. Summary: We used to silently ignore any empty .module's and we used to give an error saying that we found an "unexpected token at start of statement" when the value of the option wasn't an identifier (e.g. if it was a number). We now give an error saying that we "expected .module option identifier" in both of those cases. I also fixed the other tests in mips-abi-bad.s, which all seemed to be broken. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7095 llvm-svn: 226905	2015-01-23 10:40:19 +00:00
Jyoti Allur	f1d7050a25	This patch fixes issue with lowering below mentioned pattern :- _foo: smull r0, r1, r1, r0 smull r2, r3, r3, r2 adds r0, r2, r0 adc r1, r3, r1 bx lr to _foo: smull r0, r1, r1, r0 smlal r0, r1, r3, r2 bx lr llvm-svn: 226904	2015-01-23 09:10:03 +00:00
Craig Topper	0271d10d35	[x86] Change u8imm operands to always print as unsigned. This makes shuffle masks and the like make way more sense. llvm-svn: 226902	2015-01-23 08:00:59 +00:00
Mehdi Amini	5059813c2d	DAGCombine: always constant fold FMA when target disable FP exceptions Summary: When trying to constant fold an FMA in the DAG, getNode() fails to fold the FMA if an operand is not finite. In this case this patch allows the constant folding if !TLI->hasFloatingPointExceptions() Reviewers: resistor Reviewed By: resistor Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D6912 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 226901	2015-01-23 07:07:20 +00:00
Lang Hames	ac92bfc505	[ADT] Add move operations to SmallVector<T,N> from SmallVectorImpl<T>. This makes it possible to move between SmallVectors of different sizes. Thanks to Dave Blaikie and Duncan Smith for patch feedback. llvm-svn: 226899	2015-01-23 06:25:17 +00:00
Craig Topper	f9c6598a9c	Fix 80 column violation llvm-svn: 226898	2015-01-23 06:18:35 +00:00
Craig Topper	46469aa4da	[X86] Add IntrNoMem to the AVX512 conflict intrinsics. llvm-svn: 226897	2015-01-23 06:11:45 +00:00
Rafael Espindola	5fa925ebf6	Add STB_GNU_UNIQUE to the ELF writer. This lets llvm-mc assemble files produced by gcc. llvm-svn: 226895	2015-01-23 04:44:35 +00:00
NAKAMURA Takumi	7ca79f7f96	Prune an out-of-date \param since r226476. [-Wdocumentation] llvm-svn: 226890	2015-01-23 01:05:12 +00:00
NAKAMURA Takumi	2bbc90cca5	Reformat. llvm-svn: 226888	2015-01-23 01:02:07 +00:00
NAKAMURA Takumi	f6eee4ad67	MipsAsmParser.cpp: Suppress a warning introduced in r226657. [-Wunused-variable] llvm-svn: 226887	2015-01-23 01:01:52 +00:00
Jan Vesely	5f715d36a7	R600: Try to use lower types for 64bit division if possible v2: add and enable tests for SI Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 226881	2015-01-22 23:42:43 +00:00
Jan Vesely	6269e3ca2f	SelectionDAG: Add KnownBits and SignBits computation for EXTRACT_ELEMENT v2: use getZExtValue add missing break codestyle v3: add few more comments Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 226880	2015-01-22 23:42:41 +00:00
Jan Vesely	f7987ca5a7	R600: Simplify LowerUDIVREM optimizations can handle removing the Hi part operations. The generated code is identical for R600, ~10% icount reduction for SI v2: rebase Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 226879	2015-01-22 23:42:39 +00:00
Duncan P. N. Exon Smith	68ab023ef7	IR: Change GenericDwarfNode::getHeader() to StringRef Simplify the API to use a `StringRef` directly rather than exposing the `MDString` bits underneath. llvm-svn: 226876	2015-01-22 23:10:55 +00:00
Duncan P. N. Exon Smith	e8b5e49ffd	IR: DwarfNode => DebugNode, NFC These things are potentially used for non-DWARF data (see the discussion in PR22235), so take the `Dwarf` out of the name. Since the new name gives fewer clues, update the doxygen to properly describe what they are. llvm-svn: 226874	2015-01-22 22:47:44 +00:00
Simon Pilgrim	7e6d573e87	[X86][AVX] Added (V)MOVDDUP / (V)MOVSLDUP / (V)MOVSHDUP memory folding + tests. Minor tweak now that D7042 is complete, we can enable stack folding for (V)MOVDDUP and do proper testing. Added missing AVX ymm folding patterns and fixed alignment for AVX VMOVSLDUP / VMOVSHDUP. llvm-svn: 226873	2015-01-22 22:39:59 +00:00
Simon Pilgrim	c976e8eef4	Line endings fixes. NFC. llvm-svn: 226872	2015-01-22 22:27:37 +00:00
Simon Pilgrim	507b37fcbb	[X86][SSE] Simplified PSUBUS tests Removed loops from PSUBUS tests - ensures folding is tested. Also renamed SSE2 tests SSSE3 to match cpu. This is a follow up commit agreed in http://reviews.llvm.org/D7094 llvm-svn: 226871	2015-01-22 22:19:58 +00:00
Lang Hames	bd4ccc89e6	[Object] Fix a bug in a condition introduced in r226217 - visibility can't be both hidden and default. Bug found by inspection by Rafael Espindola. No test: As discussed in the commit message for r226217 we don't have a good way to test this yet. llvm-svn: 226869	2015-01-22 22:04:47 +00:00
Chandler Carruth	df8b223dea	[PM] Actually add the new pass manager support for the assumption cache. I had already factored this analysis specifically to enable doing this, but hadn't actually committed the necessary wiring to get at this from the new pass manager. This also nicely shows how the separate cache object can be directly managed by the new pass manager. This analysis didn't have any direct tests and so I've added a printer pass and a boring test case. I chose to print the i1 value which is being assumed rather than the call to llvm.assume as that seems much more useful for testing... but suggestions on an even better printing strategy welcome. My main goal was to make sure things actually work. =] llvm-svn: 226868	2015-01-22 21:53:09 +00:00
Benjamin Kramer	cb36becbeb	Remove dead leak detector parts that fell out of use in r224703. llvm-svn: 226867	2015-01-22 21:43:01 +00:00
Duncan P. N. Exon Smith	8d536973a2	IR: Update references to temporaries before deleting During `MDNode::deleteTemporary()`, call `replaceAllUsesWith(nullptr)` to update all tracking references to `nullptr`. This fixes PR22280, where inverted destruction order between tracking references and the temporaries themselves caused a use-after-free in `LLParser`. An alternative fix would be to add an assertion that there are no users, and continue to fix inverted destruction order in clients (like `LLParser`), but instead I decided to make getting-teardown-right easy. (If someone disagrees let me know.) llvm-svn: 226866	2015-01-22 21:36:45 +00:00
Chris Bieneman	799ef37d02	Refactoring cl::parser construction and initialization. Summary: Some parsers need references back to the option they are members of. This is used for handling the argument string as well as by the various pass name parsers for making pass names into flags. Making parsers that need to refer back to the option have a reference to the option eliminates some of the members of various parsers, and enables further code cleanup. Reviewers: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7131 llvm-svn: 226864	2015-01-22 21:01:12 +00:00
Rafael Espindola	19b538450c	Don't use -z,defs on FreeBSD. Looks like environ is defined only in the main binary. llvm-svn: 226862	2015-01-22 20:57:30 +00:00
Ramkumar Ramachandra	312beb174f	[emacs] Use c-mode-common-hook, derive from "gnu" Make it clear that the "llvm.org" style is deriving from "gnu" style, and use the c-mode-common-hook instead of c-mode-hook and c++-mode-hook. Differential Revision: http://reviews.llvm.org/D7035 llvm-svn: 226861	2015-01-22 20:56:25 +00:00
Ramkumar Ramachandra	75a4f35b26	Intrinsics: introduce llvm_any_ty aka ValueType Any Specifically, gc.result benefits from this greatly. Instead of: gc.result.int.* gc.result.float.* gc.result.ptr.* ... We now have a gc.result.* that can specialize to literally any type. Differential Revision: http://reviews.llvm.org/D7020 llvm-svn: 226857	2015-01-22 20:14:38 +00:00
Reid Kleckner	f12b33454f	Revert "Don't remove a landing pad if the invoke requires a table entry." This reverts commit r176827. Björn Steinbrink pointed out that this didn't actually fix the bug (PR15555) it was attempting to fix. With this reverted, we can now remove landingpad cleanups that immediately resume unwinding, converting the invoke to a call. llvm-svn: 226850	2015-01-22 19:29:46 +00:00
Kevin Enderby	a7bdc7e671	Add the option, -indirect-symbols, used with -macho to print the Mach-O indirect symbol table to llvm-objdump. llvm-svn: 226848	2015-01-22 18:55:27 +00:00
Sanjay Patel	37c41c1d2c	merge consecutive stores of extracted vector elements (PR21711) This is a 2nd try at the same optimization as http://reviews.llvm.org/D6698. That patch was checked in at r224611, but reverted at r225031 because it caused a failure outside of the regression tests. The cause of the crash was not recognizing consecutive stores that have mixed source values (loads and vector element extracts), so this patch adds a check to bail out if any store value is not coming from a vector element extract. This patch also refactors the shared logic of the constant source and vector extracted elements source cases into a helper function. Differential Revision: http://reviews.llvm.org/D6850 llvm-svn: 226845	2015-01-22 18:21:26 +00:00
Adrian Prantl	d4255baede	Fix the condition in this assertion, and also make it into an unreachable. llvm-svn: 226843	2015-01-22 17:52:08 +00:00
David Blaikie	e7d473461e	Revert "PR21408: Workaround the appearance of duplicate variables due to problems when inlining two calls to the same function from the same call site." The underlying bug has been fixed in r226736 so there's no need to workaround this anymore. This reverts commit r220923. llvm-svn: 226842	2015-01-22 17:49:59 +00:00
Tim Northover	7cd58934a8	AArch64: decode all MRS/MSR forms early to avoid saving FeatureBits. Currently, we're adding a uint64_t describing the current subtarget so that matching can check whether the specified register is valid. However, we want to move to a bitset for those bits (x86 has more than 64 of them). This can't live in a union so it's probably better to do the checks early (especially as there are only 3 of them). llvm-svn: 226841	2015-01-22 17:23:04 +00:00
Adrian Prantl	97bdc75461	Run clang-format on parts of DebugInfo.h llvm-svn: 226838	2015-01-22 16:55:27 +00:00
Adrian Prantl	054449538b	Document DIExpression. llvm-svn: 226837	2015-01-22 16:55:24 +00:00
Adrian Prantl	0d7d8e4512	Rewrite DIExpression::printInternal() to use the iterator interface. NFC. llvm-svn: 226836	2015-01-22 16:55:22 +00:00
Adrian Prantl	2585a98d38	Rename DIExpressionIterator to DIExpression::iterator. Addresses review feedback from Duncan. llvm-svn: 226835	2015-01-22 16:55:20 +00:00
Adrian Prantl	438b250d8a	Fix a comment. llvm-svn: 226834	2015-01-22 16:55:16 +00:00
Rafael Espindola	5a67ed1038	[pr21886] Change MCJIT/ELF to support MSVC C++ mangled symbol. The ELF format is used on Windows by the MCJIT engine. Thus, on Windows, the ELFObjectWriter can encounter symbols mangled using the MS Visual Studio C++ name mangling. Symbols mangled using the MSVC C++ name mangling can legally have "@@@" as a substring. The EFLObjectWriter should not interpret the "@@@" substring as specifying GNU-style symbol versioning. The ELFObjectWriter therefore check for the MSVC C++ name mangling prefix which is either "?", "@?", "imp_?" or "imp_?@". llvm-svn: 226830	2015-01-22 14:20:45 +00:00
Rafael Espindola	bb5d09f0e3	Pass -Wl,-z,defs when building shared libraries, but not with the sanitizers. llvm-svn: 226828	2015-01-22 14:06:51 +00:00
Aaron Ballman	9ada6cd0f6	Silencing a -Wsign-compare warning (all uses of this constant are within unsigned expressions anyway); NFC. llvm-svn: 226826	2015-01-22 13:57:41 +00:00
Michael Kuperstein	25e34d11f3	[DAGCombine] Produce better code for constant splats This solves PR22276. Splats of constants would sometimes produce redundant shuffles, sometimes ridiculously so (see the PR for details). Fold these shuffles into BUILD_VECTORs early on instead. Differential Revision: http://reviews.llvm.org/D7093 Fixed recommit of r226811. llvm-svn: 226816	2015-01-22 13:07:28 +00:00
Alexander Potapenko	a007905e4e	Mark \|TLI\| variables used to suppress -Wunused-variable warnings. (These vars are only used in assertions) llvm-svn: 226815	2015-01-22 13:03:33 +00:00
Michael Kuperstein	ff74032018	Revert r226811, MSVC accepts code sane compilers don't. llvm-svn: 226814	2015-01-22 12:48:07 +00:00
Michael Kuperstein	84fad3e5c9	[DAGCombine] Produce better code for constant splats This solves PR22276. Splats of constants would sometimes produce redundant shuffles, sometimes ridiculously so (see the PR for details). Fold these shuffles into BUILD_VECTORs early on instead. Differential Revision: http://reviews.llvm.org/D7093 llvm-svn: 226811	2015-01-22 12:37:23 +00:00
Timur Iskhodzhanov	b4b6b74079	[ASan/Win] Move the shadow to 0x30000000 llvm-svn: 226809	2015-01-22 12:24:21 +00:00
Elena Demikhovsky	150d9f3187	Fixed a bug in type legalizer for masked load/store intrinsics. The problem occurs when after vectorization we have type <2 x i32>. This type is promoted to <2 x i64> and then requires additional efforts for expanding loads and truncating stores. I added EXPAND / TRUNCATE attributes to the masked load/store SDNodes. The code now contains additional shuffles. I've prepared changes in the cost estimation for masked memory operations, it will be submitted separately. llvm-svn: 226808	2015-01-22 12:07:59 +00:00
Elena Demikhovsky	94cfbbab33	Fixed a comment llvm-svn: 226806	2015-01-22 10:01:36 +00:00
Elena Demikhovsky	9c26462a27	Fixed a bug in narrowing store operation. Type MVT::i1 became legal in KNL, but store operation can't be narrowed to this type, since the size of VT (1 bit) is not equal to its actual store size(8 bits). Added a test provided by David (dag@cray.com) llvm-svn: 226805	2015-01-22 09:39:08 +00:00
Sanjoy Das	351db05308	[NFC] Introduce a 'struct Range' for IRCE Use the struct instead of a std::pair<Value , Value >. This makes a Range an obviously immutable object, and we can now assert that a range is well-typed (Begin->getType() == End->getType()) on its construction. llvm-svn: 226804	2015-01-22 09:32:02 +00:00
Craig Topper	e0c8e8f6a7	Revert r226798. Guess I missed the patterns. llvm-svn: 226802	2015-01-22 09:01:20 +00:00
Craig Topper	ffef4cf1e1	Use u8imm instead of i32i8imm on a couple instructions that have no patterns and thus no reason to use a larger operand size. llvm-svn: 226798	2015-01-22 08:53:11 +00:00
Craig Topper	9b39e54001	[X86] Remove some unused multiclasses from AVX512 instruction file. llvm-svn: 226797	2015-01-22 08:53:08 +00:00
Sanjoy Das	d1fb13ce4c	Fix crashes in IRCE caused by mismatched types There are places where the inductive range check elimination pass depends on two llvm::Values or llvm::SCEVs to be of the same llvm::Type when they do not need to be. This patch relaxes those restrictions (by bailing out of the optimization if the types mismatch), and adds test cases to trigger those paths. These issues were found by bootstrapping clang with IRCE running in the -O3 pass ordering. Differential Revision: http://reviews.llvm.org/D7082 llvm-svn: 226793	2015-01-22 08:29:18 +00:00
Erik Eckstein	96cfb9c655	SLPVectorizer: add a second limit for the number of alias checks. Even with the current limit on the number of alias checks, the containing loop has quadratic complexity. This begins to hurt for blocks containing > 1K load/store instructions. This commit introduces a limit for the loop count. It reduces the runtime for such very large blocks. llvm-svn: 226792	2015-01-22 08:20:51 +00:00
Elena Demikhovsky	079b2d8c0c	Fixed a bug in masked load/store in reversed loop. Added a test. The bug was submitted to bugzilla: http://llvm.org/bugs/show_bug.cgi?id=22225 llvm-svn: 226791	2015-01-22 08:20:06 +00:00
Chandler Carruth	a917458203	[PM] Rename InstCombine.h to InstCombineInternal.h in preparation for creating a non-internal header file for the InstCombine pass. I thought about calling this InstCombiner.h or in some way more clearly associating it with the InstCombiner clas that it is primarily defining, but there are several other utility interfaces defined within this for InstCombine. If, in the course of refactoring, those end up moving elsewhere or going away, it might make more sense to make this the combiner's header alone. Naturally, this is a bikeshed to a certain degree, so feel free to lobby for a different shade of paint if this name just doesn't suit you. llvm-svn: 226783	2015-01-22 05:25:13 +00:00
Chandler Carruth	cd8522ef44	[canonicalize] Teach InstCombine to canonicalize loads which are only ever stored to always use a legal integer type if one is available. Regardless of whether this particular type is good or bad, it ensures we don't get weird differences in generated code (and resulting performance) from "equivalent" patterns that happen to end up using a slightly different type. After some discussion on llvmdev it seems everyone generally likes this canonicalization. However, there may be some parts of LLVM that handle it poorly and need to be fixed. I have at least verified that this doesn't impede GVN and instcombine's store-to-load forwarding powers in any obvious cases. Subtle cases are exactly what we need te flush out if they remain. Also note that this IR pattern should already be hitting LLVM from Clang at least because it is exactly the IR which would be produced if you used memcpy to copy a pointer or floating point between memory instead of a variable. llvm-svn: 226781	2015-01-22 05:08:12 +00:00
Saleem Abdulrasool	10ed0babd3	ARM: fail less catastrophically on invalid Windows input Windows supports a restricted set of relocations (compared to ARM ELF). In some cases, we may end up generating an unsupported relocation. This can occur with bad input to the assembler in particular (the frontend should never generate code that cannot be compiled). Generate an error rather than just aborting. The change in the API is driven by the desire to provide a slightly more helpful message for debugging purposes. llvm-svn: 226779	2015-01-22 04:03:32 +00:00
Chandler Carruth	fa11d837a0	[canonicalize] Move a helper function further up the file so it can be used earlier. NFC. llvm-svn: 226777	2015-01-22 03:34:54 +00:00
Duncan P. N. Exon Smith	a695258ed4	DIBuilder: Make header iterator constructor explicit, NFC llvm-svn: 226775	2015-01-22 03:20:09 +00:00
Duncan P. N. Exon Smith	893a534895	DIBuilder: Extract header_begin() and header_end(), NFC Use begin/end functions so that users don't need to know how these weird things work. llvm-svn: 226774	2015-01-22 03:17:43 +00:00
Duncan P. N. Exon Smith	e7354b52ca	DIBuilder: Stop abusing DIExpressionIterator::operator*(), NFC This code was confusing, since it created a `DIExpressionIterator` from an invalid start point (although it wasn't wrong: it never actually iterated). Now that the underlying iterator has `getNumber()`, just use it directly. llvm-svn: 226773	2015-01-22 03:13:35 +00:00
Duncan P. N. Exon Smith	c6ed894973	DIBuilder: Extract DIHeaderFieldIterator::getNumber(), NFC Reduce code duplication between `DIBuilder` and `DIExpressionIterator` by implementing a `getNumber()` directly in the iterator. llvm-svn: 226772	2015-01-22 03:11:13 +00:00
Duncan P. N. Exon Smith	2e4db3d00c	DIBuilder: Create a getHeaderIterator() helper, NFC Extract this so it can be reused. llvm-svn: 226770	2015-01-22 03:00:01 +00:00
Chris Bieneman	42fb49cc52	Making deleted copy constructors and operators to be private for better diagnostics when deleted is not available. llvm-svn: 226769	2015-01-22 02:51:33 +00:00
Reid Kleckner	5eb6ade35f	SEH: Finish writing the catch-all test case llvm-svn: 226768	2015-01-22 02:31:09 +00:00
Reid Kleckner	f690f50519	Win64 SEH: Emit the constant 1 for catch-all into xdata llvm-svn: 226767	2015-01-22 02:27:44 +00:00
Chris Bieneman	e71fb5ce05	Assigning and copying command line option objects shouldn't be allowed. Summary: The default copy and assignment operators for these objects probably don't actually do what the clients intend, so they should be deleted. Places using the assignment operator to set the value of an option should cast to the option's data type first to call into the override for operator=. Places using the copy constructor just need to be changed to not copy (i.e. passing by const reference instead of value). Reviewers: dexonsmith, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7114 llvm-svn: 226762	2015-01-22 01:49:59 +00:00
Sanjoy Das	cb47366366	Make ScalarEvolution less aggressive with respect to no-wrap flags. ScalarEvolution currently lowers a subtraction recurrence to an add recurrence with the same no-wrap flags as the subtraction. This is incorrect because `sub nsw X, Y` is not the same as `add nsw X, -Y` and `sub nuw X, Y` is not the same as `add nuw X, -Y`. This patch fixes the issue, and adds two test cases demonstrating the bug. Differential Revision: http://reviews.llvm.org/D7081 llvm-svn: 226755	2015-01-22 00:48:47 +00:00
Paul Robinson	343e496473	Explicitly describe '///' versus '//' comment delimiters. llvm-svn: 226750	2015-01-22 00:19:56 +00:00
Adrian Prantl	531641a0c6	Make DwarfExpression use the new DIExpressionIterator. NFC. llvm-svn: 226748	2015-01-22 00:00:59 +00:00
Adrian Prantl	9260ccaeb4	Rewrite DIExpression::Verify() using an iterator. NFC. Addresses review comments for r226627. llvm-svn: 226747	2015-01-22 00:00:52 +00:00
Chandler Carruth	2135b97d8f	[canonicalization] Refactor how we create new stores into a helper function. This is a bit tidier anyways and will make a subsquent patch simpler as I want to add another case to this combine. llvm-svn: 226746	2015-01-21 23:45:01 +00:00
Simon Pilgrim	5fa0fb23ca	[X86][SSE] Missing SSE/AVX1 memory folding integer instructions Added most of the missing integer vector folding patterns for SSE (to SSE42) and AVX1. The most useful of these are probably the i32/i64 extraction, i8/i16/i32/i64 insertions, zero/sign extension, unsigned saturation subtractions, i64 subtractions and the variable mask blends (pblendvb) - others include CLMUL, SSE42 string comparisons and bit tests. Differential Revision: http://reviews.llvm.org/D7094 llvm-svn: 226745	2015-01-21 23:43:30 +00:00
Tim Northover	3007ba0ab3	DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N)) It can help with argument juggling on some targets, and is generally a good idea. llvm-svn: 226740	2015-01-21 23:17:19 +00:00
David Blaikie	df706288fb	DebugInfo: Use distinct inlinedAt MDLocations to avoid separate inlined calls being coalesced When two calls from the same MDLocation are inlined they currently get treated as one inlined function call (creating difficulty debugging, duplicate variables, etc). Clang worked around this by including column information on inline calls which doesn't address LTO inlining or calls to the same function from the same line and column (such as through a macro). It also didn't address ctor and member function calls. By making the inlinedAt locations distinct, every call site has an explicitly distinct location that cannot be coalesced with any other call. This can produce linearly (2x in the worst case where every call is inlined and the call instruction has a non-call instruction at the same location) more debug locations. Any increase beyond that are in cases where the Clang workaround was insufficient and the new scheme is creating necessary distinct nodes that were being erroneously coalesced previously. After this change to LLVM the incomplete workarounds in Clang. That should reduce the number of debug locations (in a build without column info, the default on Darwin, not the default on Linux) by not creating pseudo-distinct locations for every call to an inline function. (oh, and I made the inlined-at chain rebuilding iterative instead of recursive because I was having trouble wrapping my head around it the way it was - open to discussion on the right design for that function (including going back to a recursive solution)) llvm-svn: 226736	2015-01-21 22:57:29 +00:00
Matt Arsenault	b45c78bc2c	R600: Add checks for urem/srem by a constant Make sure this uses the faster expansion using magic constants to avoid the full division path. llvm-svn: 226734	2015-01-21 22:56:15 +00:00
Matthias Braun	c1988f384c	LiveIntervalAnalysis: Mark subregister defs as undef when we determined they are only reading a dead superregister value This was not necessary before as this case can only be detected when the liveness analysis is at subregister level. llvm-svn: 226733	2015-01-21 22:55:13 +00:00
Chris Bieneman	9e13af7ac3	Adding a new cl::HideUnrelatedOptions API to allow clang to migrate off cl::getRegisteredOptions. Summary: cl::getRegisteredOptions really exposes some of the innards of how command line parsing is implemented. Exposing new APIs that allow us to disentangle client code from implementation details will allow us to make more extensive changes to command line parsing. Reviewers: chandlerc, dexonsmith, beanz Reviewed By: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7100 llvm-svn: 226729	2015-01-21 22:45:52 +00:00
Simon Pilgrim	b16b09b154	[X86][SSE] Added support for SSE3 lane duplication shuffle instructions This patch adds shuffle matching for the SSE3 MOVDDUP, MOVSLDUP and MOVSHDUP instructions. The big use of these being that they avoid many single source shuffles from needing to use (pre-AVX) dual source instructions such as SHUFPD/SHUFPS: causing extra moves and preventing load folds. Adding these instructions uncovered an issue in XFormVExtractWithShuffleIntoLoad which crashed on single operand shuffle instructions (now fixed). It also involved fixing getTargetShuffleMask to correctly identify theses instructions as unary shuffles. Also adds a missing tablegen pattern for MOVDDUP. Differential Revision: http://reviews.llvm.org/D7042 llvm-svn: 226716	2015-01-21 22:44:35 +00:00
Matt Arsenault	d9987c7b0d	R600: Add missing tests for i64 srem llvm-svn: 226713	2015-01-21 22:43:19 +00:00
Jonathan Roelofs	229eb4ca5c	Fix load-store optimizer on thumbv4t Thumbv4t does not have lo->lo copies other than MOVS, and that can't be predicated. So emit MOVS when needed and bail if there's a predicate. http://reviews.llvm.org/D6592 llvm-svn: 226711	2015-01-21 22:39:43 +00:00
George Burgess IV	a1255d3a74	Added test to cover the CFLAA bitset indexing bug. llvm-svn: 226710	2015-01-21 22:39:35 +00:00
David Majnemer	4c0a6e918a	InstCombine: Don't strip bitcasts off of callsites marked 'thunk' The return type of a thunk is meaningless, we just want the arguments and return value to be forwarded. llvm-svn: 226708	2015-01-21 22:32:04 +00:00
Simon Pilgrim	47af023ada	[X86][SSE] movddup shuffle mask decodes Patch to provide shuffle decodes and asm comments for the SSE3/AVX1 movddup double duplication instructions. llvm-svn: 226705	2015-01-21 22:02:30 +00:00
Adrian Prantl	abf5553ec7	simplify expression llvm-svn: 226701	2015-01-21 21:23:35 +00:00
Adrian Prantl	53d382fcc7	Fix a compile issue on MSVC and call finalize(). llvm-svn: 226694	2015-01-21 19:25:35 +00:00
Matthias Braun	311730ac78	LiveIntervalAnalysis: Factor out code to update liveness on vreg def removal This cleans up code and is more in line with the general philosophy of modifying LiveIntervals through LiveIntervalAnalysis instead of changing them directly. This also fixes a case where SplitEditor::removeBackCopies() would miss the subregister ranges. llvm-svn: 226690	2015-01-21 19:02:30 +00:00
Matthias Braun	e1b7da7134	LiveIntervalAnalysis: document removePhysRegDefAt() function. llvm-svn: 226689	2015-01-21 19:02:26 +00:00
Matthias Braun	cfb8ad29b5	LiveIntervalAnalysis: Factor out code to update liveness on physreg def removal This cleans up code and is more in line with the general philosophy of modifying LiveIntervals through LiveIntervalAnalysis instead of changing them directly. llvm-svn: 226687	2015-01-21 18:50:21 +00:00
Matthias Braun	1002baf7b9	LiveIntervalAnalysis: Remove unused pruneValue() variant. llvm-svn: 226686	2015-01-21 18:45:57 +00:00
Adrian Prantl	1292e24d0e	Let subprograms with instructions without parent scopes fail the verification. Tested via a unit test. Follow-up to r226616. llvm-svn: 226684	2015-01-21 18:32:56 +00:00
Matt Arsenault	b00554886f	R600/SI: Custom lower fround This fixes it for SI. It also removes the pattern used previously for Evergreen for f32. I'm not sure if the the new R600 output is better or not, but it uses 1 fewer instructions if BFI is available. llvm-svn: 226682	2015-01-21 18:18:25 +00:00
Colin LeMahieu	94269db8ba	[Hexagon] Converting multiply and accumulate with immediate intrinsics to patterns. llvm-svn: 226681	2015-01-21 18:13:15 +00:00
Ahmed Bougacha	8f09e9f7c5	[X86] Declare SSE4.1/AVX2 vector extloads covered by PMOV[SZ]X legal. Now that we can fully specify extload legality, we can declare them legal for the PMOVSX/PMOVZX instructions. This for instance enables a DAGCombine to fire on code such as (and (<zextload-equivalent> ...), <redundant mask>) to turn it into: (zextload ...) as seen in the testcase changes. There is one regression, in widen_load-2.ll: we're no longer able to do store-to-load forwarding with illegal extload memory types. This will be addressed separately. Differential Revision: http://reviews.llvm.org/D6533 llvm-svn: 226676	2015-01-21 17:07:06 +00:00
Eric Fiselier	dea770aeb3	[lit] Format JSONMetricValue strings better. llvm-svn: 226672	2015-01-21 16:38:31 +00:00
George Burgess IV	3c898c2119	Fixed a bug with how we determine bitset indices. llvm-svn: 226671	2015-01-21 16:37:21 +00:00
Yaron Keren	3f02c14cc7	Add missing include guards to WindowsSupport.h. llvm-svn: 226669	2015-01-21 16:20:38 +00:00
Tim Northover	cf3d80fedb	Revert "DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N))" It hadn't gone through review yet, but was still on my local copy. This reverts commit r226663 llvm-svn: 226665	2015-01-21 15:48:52 +00:00
Tim Northover	b9184f2b1a	AArch64: add backend option to reserve x18 (platform register) AAPCS64 says that it's up to the platform to specify whether x18 is reserved, and a first step on that way is to add a flag controlling it. From: Andrew Turner <andrew@fubar.geek.nz> llvm-svn: 226664	2015-01-21 15:43:31 +00:00
Tim Northover	85cd2791c9	DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N)) llvm-svn: 226663	2015-01-21 15:43:28 +00:00
Michael Kuperstein	ada9fa1ca9	[x32] Fast ISel should use LEA64_32r instead of LEA32r to adjust addresses in x32 mode. llvm-svn: 226661	2015-01-21 14:44:05 +00:00
Alexander Potapenko	4ac461c35f	Use a smaller pragma unroll threshold to reduce test execution time. When opt is compiled with AddressSanitizer it takes more than 30 seconds to unroll the loop in unroll_1M(). llvm-svn: 226660	2015-01-21 13:52:02 +00:00
Evgeniy Stepanov	79ca0fd1a0	[msan] Update origin for the entire destination range on memory store. Previously we always stored 4 bytes of origin at the destination address even for 8-byte (and longer) stores. This should fix rare missing, or incorrect, origin stacks in MSan reports. llvm-svn: 226658	2015-01-21 13:21:31 +00:00
Jozef Kolek	5cfebdde2b	[mips][microMIPS] MicroMIPS 16-bit unconditional branch instruction B Implement microMIPS 16-bit unconditional branch instruction B. Implemented 16-bit microMIPS unconditional instruction has real name B16, and B is an alias which expands to either B16 or BEQ according to the rules: b 256 --> b16 256 # R_MICROMIPS_PC10_S1 b 12256 --> beq $zero, $zero, 12256 # R_MICROMIPS_PC16_S1 b label --> beq $zero, $zero, label # R_MICROMIPS_PC16_S1 Differential Revision: http://reviews.llvm.org/D3514 llvm-svn: 226657	2015-01-21 12:39:30 +00:00
Jozef Kolek	2c6d73207e	[mips][microMIPS] Implement ADDIUPC instruction Differential Revision: http://reviews.llvm.org/D6582 llvm-svn: 226656	2015-01-21 12:10:11 +00:00
Chandler Carruth	df5747a900	[PM] Refactor the InstCombiner interface to use an external worklist. Because in its primary function pass the combiner is run repeatedly over the same function until doing so produces no changes, it is essentially to not re-allocate the worklist. However, as a utility, the more common pattern would be to put a limited set of instructions in the worklist rather than the entire function body. That is also the more likely pattern when used by the new pass manager. The result is a very light weight combiner that does the visiting with a separable worklist. This can then be wrapped up in a helper function for users that want a combiner utility, or as I have here it can be wrapped up in a pass which manages the iterations used when combining an entire function's instructions. Hopefully this removes some of the worst of the interface warts that became apparant with the last patch here. However, there is clearly more work. I've again left some FIXMEs for the most egregious. The ones that stick out to me are the exposure of the worklist and IR builder as public members, and the use of pointers rather than references. However, fixing these is likely to be much more mechanical and less interesting so I didn't want to touch them in this patch. llvm-svn: 226655	2015-01-21 11:38:17 +00:00
Chandler Carruth	ba4c5179a0	[PM] Simplify (ha! ha!) the way that instcombine calls the SimplifyLibCalls utility by sinking it into the specific call part of the combiner. This will avoid us needing to do any contortions to build this object in a subsequent refactoring I'm doing and seems generally better factored. We don't need this utility everywhere and it carries no interesting state so we might as well build it on demand. llvm-svn: 226654	2015-01-21 11:23:40 +00:00
Vladimir Medic	435cf8a415	[Mips][Disassembler]When disassembler meets load/store from coprocessor 2 instructions for mips r6 it crashes as the access to operands array is out of range. This patch adds dedicated decoder method that properly handles decoding of these instructions. llvm-svn: 226652	2015-01-21 10:47:36 +00:00
Craig Topper	42b326ea12	[x86] Remove some unnecessary and slightly confusing typecasts from some patterns. I think it actually went i32->iPtr->i32 in some of these cases. llvm-svn: 226647	2015-01-21 08:43:57 +00:00
Craig Topper	7ff6ab30a9	[X86] Convert all the i8imm used by AVX512 and MMX instructions to u8imm. llvm-svn: 226646	2015-01-21 08:43:49 +00:00
Craig Topper	620b50cc23	[X86] Convert all the i8imm used by SSE and AVX instructions to u8imm. This makes the assembler check their size and removes a hack from the disassembler to avoid sign extending the immediate. llvm-svn: 226645	2015-01-21 08:15:54 +00:00
Craig Topper	f38dea1cfa	[x86] Add assembly parser bounds checking to the immediate value for cmpss/cmpsd/cmpps/cmppd. llvm-svn: 226642	2015-01-21 06:07:53 +00:00
Chandler Carruth	9280382ac6	[PM] Replace an abuse of inheritance to override a single function with a more direct approach: a type-erased glorified function pointer. Now we can pass a function pointer into this for the easy case and we can even pass a lambda into it in the interesting case in the instruction combiner. I'll be using this shortly to simplify the interfaces to InstCombiner, but this helps pave the way and seems like a better design for the libcall simplifier utility. llvm-svn: 226640	2015-01-21 02:11:59 +00:00
Adrian Prantl	34bcbeed03	Make DIExpression::Verify() stricter by checking that the number of elements and the ordering is sane and cleanup the accessors. llvm-svn: 226627	2015-01-21 00:59:20 +00:00
Simon Pilgrim	f5dcc1cbe6	[X86][AVX] Simplified diff between AVX1 and SSE42 fp stack folding tests. NFC. Changed the AVX1 tests register spill tail call to return a xmm like the SSE42 version - makes doing diffs between them a lot easier without affecting the spills themselves. llvm-svn: 226623	2015-01-21 00:02:13 +00:00
Simon Pilgrim	0177fa3d8c	[X86][SSE] Added SSE/AVX1 integer stack folding tests. Some folding patterns + tests are missing (marked as TODO) - these will be added in a future patch for review. llvm-svn: 226622	2015-01-20 23:54:17 +00:00
Simon Pilgrim	ffddc01b00	[X86][SSE] Added SSE fp stack folding tests. Some folding patterns + tests are missing (marked as TODO) - these will be added in a future patch for review. llvm-svn: 226621	2015-01-20 23:50:18 +00:00
Simon Pilgrim	0d68d98bd5	[X86][AVX] Renamed AVX1 fp stack folding tests. NFC. The SSE42 version of the AVX1 float stack folding tests will be added shortly, this renames the AVX1 file so that the files will be near each other in a directory listing to help ensure they are kept in sync. llvm-svn: 226620	2015-01-20 23:45:50 +00:00
Chandler Carruth	1edb9d63e9	[PM] Separate the InstCombiner from its pass. This creates a small internal pass which runs the InstCombiner over a function. This is the hard part of porting InstCombine to the new pass manager, as at this point none of the code in InstCombine has access to a Pass object any longer. The resulting interface for the InstCombiner is pretty terrible. I'm not planning on leaving it that way. The key thing missing is that we need to separate the worklist from the combiner a touch more. Once that's done, it should be possible for any part of LLVM to just create a worklist with instructions, populate it, and then combine it until empty. The pass will just be the (obvious and important) special case of doing that for an entire function body. For now, this is the first increment of factoring to make all of this work. llvm-svn: 226618	2015-01-20 22:44:35 +00:00
Adrian Prantl	de200dfad2	DebugLocs without a scope should fail the verification. Follow-up to r226588. llvm-svn: 226616	2015-01-20 22:37:25 +00:00
Rafael Espindola	8c8ad37f88	Don't pass -Wl,z,defs for now. It broke the msan build. llvm-svn: 226613	2015-01-20 22:08:20 +00:00
Kevin Enderby	98da6136d0	For llvm-objdump, hook up existing options to work when using -macho (the Mach-O parser). llvm-svn: 226612	2015-01-20 21:47:46 +00:00
Rafael Espindola	cb274c0c47	Use -Wl,defs when linking. ELF linkers by default allow shared libraries to contain undefined references and it is up to the dynamic linker to look for them. On COFF and MachO, that is not the case. This creates a situation where a .so might build on an ELF system, but the build of the corresponding .dylib or .dll will fail. This patch changes the cmake build to use -Wl,-z,defs when linking and updates the dependencies so that -DBUILD_SHARED_LIBS=ON build still works. llvm-svn: 226611	2015-01-20 21:23:15 +00:00
Chandler Carruth	b3d03df3ac	[PM] Reformat this code with clang-format so that subsequent changes don't get muddied up by formatting changes. Some of these don't really seem like improvements to me, but they also don't seem any worse and I care much more about not formatting them manually than I do about the particular formatting. =] llvm-svn: 226610	2015-01-20 21:10:35 +00:00
Colin LeMahieu	988c68f2a7	[Hexagon] Adding intrinsics for doubleword ALU operations. llvm-svn: 226606	2015-01-20 20:45:05 +00:00
Colin LeMahieu	734001c0a6	[Hexagon] Removing unnecessary clutter in intrinsic tests. llvm-svn: 226602	2015-01-20 19:46:07 +00:00
Daniel Jasper	6b77455f81	Prevent binary-tree deterioration in sparse switch statements. This addresses part of llvm.org/PR22262. Specifically, it prevents considering the densities of sub-ranges that have fewer than TLI.getMinimumJumpTableEntries() elements. Those densities won't help jump tables. This is not a complete solution but works around the most pressing issue. Review: http://reviews.llvm.org/D7070 llvm-svn: 226600	2015-01-20 19:43:33 +00:00
Ramkumar Ramachandra	be10ece5ed	[GC] Verify-pass void vararg functions in gc.statepoint With the appropriate Verifier changes, exactracting the result out of a statepoint wrapping a vararg function crashes. However, a void vararg function works fine: commit this first step. Differential Revision: http://reviews.llvm.org/D7071 llvm-svn: 226599	2015-01-20 19:42:46 +00:00
Adrian Prantl	565cc18d8f	Reapply: Teach SROA how to update debug info for fragmented variables. This reapplies r225379. ChangeLog: - The assertion that this commit previously ran into about the inability to handle indirect variables has since been removed and the backend can handle this now. - Testcases were upgrade to the new MDLocation format. - Instead of keeping a DebugDeclares map, we now use llvm::FindAllocaDbgDeclare(). Original commit message follows. Debug info: Teach SROA how to update debug info for fragmented variables. This allows us to generate debug info for extremely advanced code such as typedef struct { long int a; int b;} S; int foo(S s) { return s.b; } which at -O1 on x86_64 is codegen'd into define i32 @foo(i64 %s.coerce0, i32 %s.coerce1) #0 { ret i32 %s.coerce1, !dbg !24 } with this patch we emit the following debug info for this TAG_formal_parameter [3] AT_location( 0x00000000 0x0000000000000000 - 0x0000000000000006: rdi, piece 0x00000008, rsi, piece 0x00000004 0x0000000000000006 - 0x0000000000000008: rdi, piece 0x00000008, rax, piece 0x00000004 ) AT_name( "s" ) AT_decl_file( "/Volumes/Data/llvm/_build.ninja.release/test.c" ) Thanks to chandlerc, dblaikie, and echristo for their feedback on all previous iterations of this patch! llvm-svn: 226598	2015-01-20 19:42:22 +00:00
Tom Stellard	e99fb65d87	R600/SI: Add subtarget feature to enable VGPR spilling for all shader types This is disabled by default, but can be enabled with the subtarget feature: 'vgpr-spilling' llvm-svn: 226597	2015-01-20 19:33:04 +00:00
Tom Stellard	021053f500	R600/SI: Fix simple-loop.ll test llvm-svn: 226596	2015-01-20 19:33:02 +00:00
Jozef Kolek	0d49117769	Reverted revision 226577. llvm-svn: 226595	2015-01-20 19:29:28 +00:00
Chandler Carruth	3a62216a8a	[PM] Clean up a bunch of the doxygen / API docs on the InstCombiner pass prior to refactoring it. llvm-svn: 226594	2015-01-20 19:27:58 +00:00
Manman Ren	dab999d54f	[llvm link] Destroy ConstantArrays in LLVMContext if they are not used. ConstantArrays constructed during linking can cause quadratic memory explosion. An example is the ConstantArrays constructed when linking in GlobalVariables with appending linkage. Releasing all unused constants can cause a 20% LTO compile-time slowdown for a large application. So this commit releases unused ConstantArrays only. rdar://19040716. It reduces memory footprint from 20+G to 6+G. llvm-svn: 226592	2015-01-20 19:24:59 +00:00
Tom Stellard	3a70d07f51	R600/SI: Remove stray debugging code from r226586 llvm-svn: 226591	2015-01-20 19:24:31 +00:00
Chandler Carruth	c277b3f2b4	[PM] Don't spend time making self moves no-ops. They're allowed to leave the object in a moved-from state, and its simpler to write the code that way. llvm-svn: 226589	2015-01-20 18:54:16 +00:00
Adrian Prantl	f88b2c8c74	Add an assertion and prefer a crash over an infinite loop. llvm-svn: 226588	2015-01-20 18:03:37 +00:00
Tom Stellard	95292bbfcd	R600/SI: Use external symbols for scratch buffer We were passing the scratch buffer address to the shaders via user sgprs, but now we use external symbols and have the driver patch the shader using reloc information. llvm-svn: 226586	2015-01-20 17:49:47 +00:00
Tom Stellard	8255af45cb	R600/SI: Add kill flag when copying scratch offset to a register This allows us to re-use the same register for the scratch offset when accessing large private arrays. llvm-svn: 226585	2015-01-20 17:49:45 +00:00
Tom Stellard	8058069529	R600/SI: Don't store scratch buffer frame index in MUBUF offset field We don't have a good way of legalizing this if the frame index offset is more than the 12-bits, which is size of MUBUF's offset field, so now we store the frame index in the vaddr field. llvm-svn: 226584	2015-01-20 17:49:43 +00:00
Tom Stellard	1106b1c662	R600/SI: Update SIInstrInfo:verifyInstruction() after r225662 Now that we have our own custom register operand types, we need to handle them in the verifiier. llvm-svn: 226583	2015-01-20 17:49:41 +00:00
Aaron Ballman	6fa2141dca	Silencing a -Wunused-variable warning in non-asserts builds; NFC. llvm-svn: 226581	2015-01-20 17:10:45 +00:00
Duncan P. N. Exon Smith	3653474f57	Revert "IR: Specify underlying type instead of r226570, NFC" This reverts commit r226571. GCC really doesn't like it [1]. [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/20260 llvm-svn: 226579	2015-01-20 17:04:56 +00:00
Jozef Kolek	45f7f9c1ab	[mips][microMIPS] MicroMIPS 16-bit unconditional branch instruction B Implement microMIPS 16-bit unconditional branch instruction B. Implemented 16-bit microMIPS unconditional instruction has real name B16, and B is an alias which expands to either B16 or BEQ according to the rules: b 256 --> b16 256 # R_MICROMIPS_PC10_S1 b 12256 --> beq $zero, $zero, 12256 # R_MICROMIPS_PC16_S1 b label --> beq $zero, $zero, label # R_MICROMIPS_PC16_S1 Differential Revision: http://reviews.llvm.org/D3514 llvm-svn: 226577	2015-01-20 16:45:27 +00:00
Kai Nacke	e7a647886a	[mips] Add registers and ALL check prefix to octeon test case. No functional change. Reviewed by D. Sanders llvm-svn: 226574	2015-01-20 16:14:02 +00:00
Kai Nacke	63072f81b3	[mips] Add octeon branch instructions bbit0/bbit032/bbit1/bbit132 This commits adds the octeon branch instructions bbit0/bbit032/bbit1/bbit132. It also includes patterns for instruction selection and test cases. Reviewed by D. Sanders llvm-svn: 226573	2015-01-20 16:10:51 +00:00
Duncan P. N. Exon Smith	9ebd858f6c	IR: Specify underlying type instead of r226570, NFC llvm-svn: 226571	2015-01-20 16:03:09 +00:00
Duncan P. N. Exon Smith	7e80640aa1	IR: Store StorageType as an unsigned bitfield Use `unsigned` instead of `StorageType` for the bitfield to prevent MSVC from treating the top bit of the bitfield as a sign bit. llvm-svn: 226570	2015-01-20 15:51:14 +00:00
Evgeniy Stepanov	c5b974e6d2	[msan] Optimize -msan-check-constant-shadow. The new code does not create new basic blocks in the case when shadow is a compile-time constant; it generates either an unconditional __msan_warning call or nothing instead. llvm-svn: 226569	2015-01-20 15:21:35 +00:00
Mohit K. Bhakkad	46ad7f7ec5	[MSan][LLVM][MIPS] Shadow and Origin offsets for MIPS Reviewers: kcc, samsonov, petarj, eugenis Differential Revision: http://reviews.llvm.org/D6146 llvm-svn: 226565	2015-01-20 13:05:42 +00:00
Craig Topper	9f4d485610	[x86] Add some mayLoad/hasSideEffects flags. Remove one that was already covered by a pattern. llvm-svn: 226562	2015-01-20 12:15:30 +00:00
Chandler Carruth	aaf0b4cd57	[PM] Port LoopInfo to the new pass manager, adding both a LoopAnalysis pass and a LoopPrinterPass with the expected associated wiring. I've added a RUN line to the only test case (!!!) we have that actually prints loops. Everything seems to be working. This is somewhat exciting as this is the first analysis using another analysis to go in for the new pass manager. =D I also believe it is the last analysis necessary for porting instcombine, but of course I may yet discover more. llvm-svn: 226560	2015-01-20 10:58:50 +00:00
Chandler Carruth	e7427f2f86	[PM] Make the LoopInfoBase and LoopInfo objects movable so that they can be used as results in the new pass manager. llvm-svn: 226559	2015-01-20 10:58:38 +00:00
Chandler Carruth	ea6de1f81d	[PM] Fix a moderately scary typo in the deleted copy constructor I noticed when adding move semantics to LoopInfo. Hopefully not relevant, but still scary. =] llvm-svn: 226556	2015-01-20 10:20:52 +00:00
Chandler Carruth	88f5f742ca	[PM] Use range-based for and auto to clean up some of the LoopInfo code. No functionality changed. llvm-svn: 226555	2015-01-20 10:02:49 +00:00
Daniel Jasper	d106b734cf	Factor out a splitSwitchCase() function so that it can be reused. This is in preparation for a fix to llvm.org/PR22262. One of the ideas here is to first find a good jump table range first and then split before and after it. Thereby, we don't need to use the split-based-on-density heuristic at all, which can make the "binary tree" deteriorate in various cases. Also some minor cleanups. No functional changes. llvm-svn: 226551	2015-01-20 08:57:44 +00:00
Chandler Carruth	5175b9a7b9	[PM] Move the LoopInfo analysis pointer into the InstCombiner class along with the other analyses. The most obvious reason why is because eventually I need to separate out the pass layer from the rest of the instcombiner. However, it is also probably a compile time win as every query through the pass manager layer is pretty slow these days. llvm-svn: 226550	2015-01-20 08:35:24 +00:00
Karthik Bhat	0b0f4660fa	Fix Operandreorder logic in SLPVectorizer to generate longer vectorizable chain. This patch fixes 2 issues in reorderInputsAccordingToOpcode 1) AllSameOpcodeLeft and AllSameOpcodeRight was being calculated incorrectly resulting in code not being vectorized in few cases. 2) Adds logic to reorder operands if we get longer chain of consecutive loads enabling vectorization. Handled the same for cases were we have AltOpcode. Thanks Michael for inputs and review. Review: http://reviews.llvm.org/D6677 llvm-svn: 226547	2015-01-20 06:11:00 +00:00
David Majnemer	3087b22e1a	Bitcode: Don't create comdats when autoupgrading macho bitcode Don't infer COMDAT groups from older bitcode if the target is macho, it doesn't have COMDATs. llvm-svn: 226546	2015-01-20 05:58:07 +00:00
Duncan P. N. Exon Smith	aa687a3d4c	Reapply "IR: Simplify DIBuilder's HeaderBuilder API, NFC" This reverts commit r226542, effectively reapplying r226540. This time, initialize `IsEmpty` in the copy and move constructors as well. llvm-svn: 226545	2015-01-20 05:02:42 +00:00
Duncan P. N. Exon Smith	5f39dfd429	Revert "IR: Simplify DIBuilder's HeaderBuilder API, NFC" This reverts commit r226540, since I hit an unexpected bot failure [1]. I'll investigate. [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/20244 llvm-svn: 226542	2015-01-20 03:01:27 +00:00
Duncan P. N. Exon Smith	03e0583a2d	IR: Move MDNode clone() methods from ValueMapper to MDNode, NFC Now that the clone methods used by `MapMetadata()` don't do any remapping (and return a temporary), they make more sense as member functions on `MDNode` (and subclasses). llvm-svn: 226541	2015-01-20 02:56:57 +00:00

... 3 4 5 6 7 ...

112389 Commits