llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	37f1f12226	[SROA] Fix PR25873, which Andrea Di Biagio analyzed the daylights out of, and I misdiagnosed for months and months. Andrea has had a patch for this forever, but I just couldn't see how it was fixing the root cause of the problem. It didn't make sense to me, even though the patch was perfectly good and the analysis of the actual failure event was fantastic. Well, I came back to it today because the patch has sat for far too long and needs attention and decided I wouldn't let it go until I really understood what was going on. After quite some time in the debugger, I finally realized that in fact I had just missed an important case with my previous attempt to fix PR22093 in r225149. Not only do we need to handle loads that won't be split, but stores-of-loads that we won't split. We do actually have enough logic in the presplitting to form new slices for split stores.... unless we decided not to split them! I'm so sorry that it took me this long to come to the realization that this is the issue. It seems so obvious in hind sight (of course). Anyways, the fix becomes much smaller and more focused. The fact that we're left doing integer smashing is related to the FIXME in my original commit: fundamentally, we're not aggressive about pre-splitting for loads and stores to the same alloca. If we want to get aggressive about this, it'll need both what Andrea had put into the proposed fix, but also a lot more logic to essentially iteratively pre-split the alloca until we can't do any more. As I said in that commit log, its really unclear that this is the right call. Instead, the integer blending and letting targets lower this to narrower stores seems slightly better. But we definitely shouldn't really go down that path just to fix this bug. Again, tons of thanks are owed to Andrea and others at Sony for working on this bug. I really should have seen what was going on here and re-directed them sooner. =//// llvm-svn: 263121	2016-03-10 15:31:17 +00:00
David L Kreitzer	14f0077f38	Unified the handling of returns in the X87 stackifier so that the stackifier runs successfully on routines containing IRETs. This fixes PR26410. Differential Revision: http://reviews.llvm.org/D17643 llvm-svn: 263120	2016-03-10 15:14:02 +00:00
NAKAMURA Takumi	f8fc7e124e	Fixup for r263114. llvm::AnalysisBase<CallGraphAnalysis> should be declared as extern. llvm-svn: 263119	2016-03-10 15:13:00 +00:00
Saleem Abdulrasool	8b30f9854e	ARM: correct __builtin_longjmp on WoA WoA uses r11 as the FP even though it is a pure thumb-2 environment in contrast to AAPCS which states r7. This adjusts __builtin_longjmp to not clobber r7 and to properly restore the frame pointer on execution. llvm-svn: 263118	2016-03-10 15:11:09 +00:00
Chandler Carruth	cf3f4f25ca	[CG] Back out my pointless move ctor and add the explicit template instantiation needed for the mingw dll build bot. llvm-svn: 263114	2016-03-10 14:33:10 +00:00
Chandler Carruth	d94a5962cc	[SROA] Clean up some really weird code, no functionality changed. We already have the instruction extracted into 'I', just cast that to a store the way we do for loads. Also, we don't enter the if unless SI is non-null, so don't test it again for null. I'm pretty sure the entire test there can be nuked, but this is just the trivial cleanup. llvm-svn: 263112	2016-03-10 14:16:18 +00:00
Elena Demikhovsky	cd9967d160	AVX-512: Fixed a bug in i1 vector zero extending. (Skylake-avx512) (failed on instruction selection phase) Differential Revision: http://reviews.llvm.org/D17924 llvm-svn: 263111	2016-03-10 13:44:22 +00:00
Chandler Carruth	3d1506ed37	[CG] Try adding an explicit move constructor to see if that helps the one build bot that is crashing on this code. llvm-svn: 263110	2016-03-10 13:43:06 +00:00
Valery Pykhtin	a4db224d54	[AMDGPU] Fix SMEM instructions encoding/operand namings Differential Revision: http://reviews.llvm.org/D17651 llvm-svn: 263108	2016-03-10 13:06:08 +00:00
Simon Pilgrim	13d4056795	[X86][AVX] Improve target shuffle combining of BLEND+zero The BLEND+zero combine was failing to combine equivalent BLEND masks. Follow up to D17483 and D17858 llvm-svn: 263105	2016-03-10 11:50:15 +00:00
Chandler Carruth	4c660f7087	[CG] Add a new pass manager printer pass for the old call graph and actually finish wiring up the old call graph. There were bugs in the old call graph that hadn't been caught because it wasn't being tested. It wasn't being tested because it wasn't in the pipeline system and we didn't have a printing pass to run in tests. This fixes all of that. As for why I'm still keeping the old call graph alive its so that I can port GlobalsAA to the new pass manager with out forking it to work with the lazy call graph. That's clearly the right eventual design, but it seems pragmatic to defer that until its necessary. The old call graph works just fine for GlobalsAA. llvm-svn: 263104	2016-03-10 11:24:11 +00:00
Chandler Carruth	b95def7491	[LCG] Spell the printing pass pipeline name for the lazy call graph 'lcg' instead of just 'cg'. This makes it consistent with the analysis name of 'lcg'. No functionality changed. llvm-svn: 263103	2016-03-10 11:24:06 +00:00
Simon Pilgrim	16d11785a5	[X86][SSE] Basic combining of unary target shuffles of binary target shuffles. This patch reorders the combining of target shuffle masks so that when a unary shuffle takes a binary shuffle as its input but only references one of its inputs it can correctly combine into a unary shuffle mask. This is starting to encroach on the purpose of resolveTargetShuffleInputs, but I don't want to remove it until we definitely know we won't need it for full binary shuffle combining. There is a lot more work before we can properly support binary target shuffle masks but this was an easy case to add support for. Differential Revision: http://reviews.llvm.org/D17858 llvm-svn: 263102	2016-03-10 11:23:51 +00:00
Chandler Carruth	1ecd740cf0	[CG] Actually hoist up the generic CallGraphPrinter pass from a weird location in the opt tool to live along side the analysis in LLVM's libraries. No functionality changed here, but this will allow me to port the printer to the new pass manager as well. llvm-svn: 263101	2016-03-10 11:08:44 +00:00
Chandler Carruth	5f432292a6	[CG] Rename the DOT printing pass to actually reference "DOT". There is another pass by the generic name 'CallGraphPrinter' which is actually just a call graph printer tucked away inside the opt tool. I'd like to bring it out and make it follow the same patterns as the rest of the CallGraph code, but doing so would end up conflicting with the name of the DOT printing pass. So this makes the DOT printing pass name be more precise. No functionality changed here. llvm-svn: 263100	2016-03-10 11:04:40 +00:00
Elena Demikhovsky	38f78a2b92	AVX-512: Fixed a bug in shuffle for v64i8 type Operation SCALAR_TO_VECTOR for v64i8 and v32i16 should be lowered if BW feature is "on". Differential Revision: http://reviews.llvm.org/D17994 llvm-svn: 263097	2016-03-10 08:32:09 +00:00
Vedant Kumar	ae22c58737	[opt] Fix description of the -disable-verify flag llvm-svn: 263096	2016-03-10 06:58:53 +00:00
Mark Lacey	125bb29c65	Add an LLVM_BUILTIN_DEBUGTRAP macro. Summary: This provides a macro that expands to __builtin_debugtrap() for clang, and __debugbreak() for MSVC. It intentionally expands to nothing for compilers that do not support a similar mechanism that halts the debugger without otherwise crashing the process. Differential Revision: http://reviews.llvm.org/D18002 llvm-svn: 263095	2016-03-10 05:15:03 +00:00
Roman Levenstein	2792b3f02f	Add support for a preserve_most calling convention to the AArch64 backend. This change adds a support for a preserve_most calling convention to the AArch64 backend, similar to how it was done for X86-64. There is also a subsequent patch on top of this one to add a tail-calls support for this calling convention. Differential Revision: http://reviews.llvm.org/D18016 llvm-svn: 263092	2016-03-10 04:35:09 +00:00
Vedant Kumar	37a1d6207f	[opt] Only create Verifier passes when requested opt adds Verifier passes in AddOptimizationPasses even if -disable-verify is on. Fix it so that the extra verification occurs either when (1) -disable-verifier is off, or (2) -verify-each is on. Thanks to David Jones for pointing out this behavior! llvm-svn: 263090	2016-03-10 03:40:14 +00:00
Michael Zolotukhin	b88fbe08fc	[SLP] Add -slp-min-reg-size command line option. MinVecRegSize is currently hardcoded to 128; this patch adds a cl::opt to allow changing it. I tried not to change any existing behavior for the default case. Differential revision: http://reviews.llvm.org/D13278 llvm-svn: 263089	2016-03-10 02:49:47 +00:00
Mehdi Amini	237e606a42	Add an entry in the Release Notes for LLVMContext::discardValueNames() From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263088	2016-03-10 02:18:17 +00:00
Mehdi Amini	09b4a8daa3	Add a flag to the LLVMContext to disable name for Value other than GlobalValue Summary: This is intended to be a performance flag, on the same level as clang cc1 option "--disable-free". LLVM will never initialize it by default, it will be up to the client creating the LLVMContext to request this behavior. Clang will do it by default in Release build (just like --disable-free). "opt" and "llc" can opt-in using -disable-named-value command line option. When performing LTO on llvm-tblgen, the initial merging of IR peaks at 92MB without this patch, and 86MB after this patch,setNameImpl() drops from 6.5MB to 0.5MB. The total link time goes from ~29.5s to ~27.8s. Compared to a compile-time flag (like the IRBuilder one), it performs very close. I profiled on SROA and obtain these results: 420ms with IRBuilder that preserve name 372ms with IRBuilder that strip name 375ms with IRBuilder that preserve name, and a runtime flag to strip Reviewers: chandlerc, dexonsmith, bogner Subscribers: joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D17946 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 263086	2016-03-10 01:28:54 +00:00
Chandler Carruth	7776377e62	[gvn] Fix more indenting and formatting in regions of code that will need to be changed for porting to the new pass manager. Also sink the comment on the ValueTable class back to that class instead of it dangling on an anonymous namespace. No functionality changed. llvm-svn: 263084	2016-03-10 00:58:20 +00:00
Chandler Carruth	169c84f1cc	[gvn] Reformat a chunk of the GVN code that is strangely indented prior to restructuring it for porting to the new pass manager. No functionality changed. llvm-svn: 263083	2016-03-10 00:58:18 +00:00
Chandler Carruth	61440d225b	[PM] Port memdep to the new pass manager. This is a fairly straightforward port to the new pass manager with one exception. It removes a very questionable use of releaseMemory() in the old pass to invalidate its caches between runs on a function. I don't think this is really guaranteed to be safe. I've just used the more direct port to the new PM to address this by nuking the results object each time the pass runs. While this could cause some minor malloc traffic increase, I don't expect the compile time performance hit to be noticable, and it makes the correctness and other aspects of the pass much easier to reason about. In some cases, it may make things faster by making the sets and maps smaller with better locality. Indeed, the measurements collected by Bruno (thanks!!!) show mostly compile time improvements. There is sadly very limited testing at this point as there are only two tests of memdep, and both rely on GVN. I'll be porting GVN next and that will exercise this heavily though. Differential Revision: http://reviews.llvm.org/D17962 llvm-svn: 263082	2016-03-10 00:55:30 +00:00
Philip Reames	d9f4a3d18c	[BasicAA/MDA] Sink aliasing rules for malloc and calloc into BasicAA MemoryDependenceAnalysis had a hard-coded exception to the general aliasing rules for malloc and calloc. The reasoning that applied there is equally valid in BasicAA and clarifies the remaining logic in MDA. In principal, this can expose slightly more optimization opportunities, but since essentially all of our aliasing aware memory optimization passes go through MDA, this will likely be NFC in practice. Differential Revision: http://reviews.llvm.org/D15912 llvm-svn: 263075	2016-03-09 23:19:56 +00:00
Philip Reames	ac115ed72f	[CGP] Duplicate addressing computation in cold paths if required to sink addressing mode This patch teaches CGP to duplicate addressing mode computations into cold paths (detected via explicit cold attribute on calls) if required to let addressing mode be safely sunk into the basic block containing each load and store. In general, duplicating code into cold blocks may result in code growth, but should not effect performance. In this case, it's better to duplicate some code than to put extra pressure on the register allocator by making it keep the address through the entirely of the fast path. This patch only handles addressing computations, but in principal, we could implement a more general cold cold scheduling heuristic which tries to reduce register pressure in the fast path by duplicating code into the cold path. Getting the profitability of the general case right seemed likely to be challenging, so I stuck to the existing case (addressing computation) we already had. Differential Revision: http://reviews.llvm.org/D17652 llvm-svn: 263074	2016-03-09 23:13:12 +00:00
Philip Reames	e0a5454df4	Fix the build I screwed up rebasing 263072. This change fixes the build and passes all make check. llvm-svn: 263073	2016-03-09 23:07:53 +00:00
Philip Reames	b54c8e6eea	[LICM] Store promotion when memory is thread local This patch teaches LICM's implementation of store promotion to exploit the fact that the memory location being accessed might be provable thread local. The fact it's thread local weakens the requirements for where we can insert stores since no other thread can observe the write. This allows us perform store promotion even in cases where the store is not guaranteed to execute in the loop. Two key assumption worth drawing out is that this assumes a) no-capture is strong enough to imply no-escape, and b) standard allocation functions like malloc, calloc, and operator new return values which can be assumed not to have previously escaped. In future work, it would be nice to generalize this so that it works without directly seeing the allocation site. I believe that the nocapture return attribute should be suitable for this purpose, but haven't investigated carefully. It's also likely that we could support unescaped allocas with similar reasoning, but since SROA and Mem2Reg should destroy those, they're less interesting than they first might seem. Differential Revision: http://reviews.llvm.org/D16783 llvm-svn: 263072	2016-03-09 22:59:30 +00:00
Sanjay Patel	9f6c4d50b4	[x86] fix cost model inaccuracy for vector memory ops The irony of this patch is that one CPU that is affected is AMD Jaguar, and Jaguar has a completely double-pumped AVX implementation. But getting the cost model to reflect that is a much bigger problem. The small goal here is simply to improve on the lie that !AVX2 == SandyBridge. Differential Revision: http://reviews.llvm.org/D18000 llvm-svn: 263069	2016-03-09 22:23:33 +00:00
Derek Schuff	3e89580571	[WebAssembly] Update known gcc test failures llvm-svn: 263068	2016-03-09 22:14:33 +00:00
Sanjay Patel	4a8dd89128	[x86, AVX] optimize masked loads with constant masks Instead of a variable-blend instruction, form a blend with immediate because those are always cheaper. Differential Revision: http://reviews.llvm.org/D17899 llvm-svn: 263067	2016-03-09 22:12:08 +00:00
Philip Reames	8f12eba78d	[ValueTracking] Extract isKnownPositive [NFCI] Extract out a generic interface from a recently landed patch and document a TODO in case compile time becomes a problem. llvm-svn: 263062	2016-03-09 21:31:47 +00:00
Philip Reames	ec8a8b5437	[InstCombine] (icmp sgt smin(PosA, B) 0) -> (icmp sgt B 0) When checking whether an smin is positive, we can move the comparison to one of the inputs if the other is known positive. If the known positive one is the min, then the other can't be negative. If the other is the min, then we compute the min. Differential Revision: http://reviews.llvm.org/D17873 llvm-svn: 263059	2016-03-09 21:05:07 +00:00
Adam Nemet	660748ca8c	[LLE] Add missing check for unit stride I somehow missed this. The case in GCC (global_alloc) was similar to the new testcase except it had an array of structs rather than a two dimensional array. Fixes RP26885. llvm-svn: 263058	2016-03-09 20:47:55 +00:00
Evandro Menezes	669aaccb89	[AArch64] Minor reformatting (NFC). llvm-svn: 263054	2016-03-09 19:56:38 +00:00
Hemant Kulkarni	206ba84413	[llvm-readobj] Enable GNU style section group print Differential Revision: http://reviews.llvm.org/D17822 llvm-svn: 263050	2016-03-09 19:16:13 +00:00
Matthias Braun	c31032d607	InstCombine: Restrict computeKnownBits() on all Values to OptLevel > 2 As part of r251146 InstCombine was extended to call computeKnownBits on every value in the function to determine whether it happens to be constant. This increases typical compiletime by 1-3% (5% in irgen+opt time) in my measurements. On the other hand this case did not trigger once in the whole llvm-testsuite. This patch introduces the notion of ExpensiveCombines which are only enabled for OptLevel > 2. I removed the check in InstructionSimplify as that is called from various places where the OptLevel is not known but given the rarity of the situation I think a check in InstCombine is enough. Differential Revision: http://reviews.llvm.org/D16835 llvm-svn: 263047	2016-03-09 18:47:11 +00:00
Matthias Braun	0b5d5b881f	MachineRegisterInfo: Correct comment llvm-svn: 263046	2016-03-09 18:47:05 +00:00
Chris Dewhurst	52adb575e6	This change adds co-processor condition branching and conditional traps to the Sparc back-end. This will allow inline assembler code to utilize these features, but no automatic lowering is provided, except for the previously provided @llvm.trap, which lowers to "ta 5". The change also separates out the different assembly language syntaxes for V8 and V9 Sparc. Previously, only V9 Sparc assembly syntax was provided. The change also corrects the selection order of trap disassembly, allowing, e.g. "ta %g0 + 15" to be rendered, more readably, as "ta 15", ignoring the %g0 register. This is per the sparc v8 and v9 manuals. Check-in includes many extra unit tests to check this works correctly on both V8 and V9 Sparc processors. Code Reviewed at http://reviews.llvm.org/D17960. llvm-svn: 263044	2016-03-09 18:20:21 +00:00
Sanjay Patel	14f598e5df	add a test RUN to show unexpected behavior llvm-svn: 263037	2016-03-09 17:53:28 +00:00
Kit Barton	a1d6a6f1de	[PPC] backend changes to generate xvabs[s,d]p and xvnabs[s,d]p instructions This has to be committed before the FE changes Phabricator: http://reviews.llvm.org/D17837 llvm-svn: 263035	2016-03-09 17:48:01 +00:00
Adrian Prantl	d6cc53f3c4	Don't crash when compiling inline assembler containing .file directives. Removing the assertion is safe to do because any module level inline assembly is always emitted first via AsmPrinter::doInitialization(). http://reviews.llvm.org/D16101 rdar://22690666 llvm-svn: 263033	2016-03-09 17:32:56 +00:00
Chad Rosier	e4e15ba046	[AArch64] Move helper functions into TII, so they can be reused elsewhere. NFC. llvm-svn: 263032	2016-03-09 17:29:48 +00:00
Hans Wennborg	9e63d61336	ReleaseNotes: update 'you may prefer' link to 3.8 llvm-svn: 263030	2016-03-09 17:25:34 +00:00
Valery Pykhtin	d6331cee2f	[AMDGPU] add AMDGPU target support to ELFObjectFile.h header Differential Revision: http://reviews.llvm.org/D17144 llvm-svn: 263026	2016-03-09 17:08:19 +00:00
Chad Rosier	0da267dd1d	[AArch64] Minor cleanup/remove redundant code. NFC. llvm-svn: 263024	2016-03-09 16:46:48 +00:00
Tom Stellard	9f2e00de7b	SelectionDAG: Fix a crash on inline asm when output register supports multiple types Summary: The code in SelectionDAG did not handle the case where the register type and output types were different, but had the same size. Reviewers: arsenm, echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17940 llvm-svn: 263022	2016-03-09 16:02:52 +00:00
Chad Rosier	c27a18f39f	[TII] Allow getMemOpBaseRegImmOfs() to accept negative offsets. NFC. http://reviews.llvm.org/D17967 llvm-svn: 263021	2016-03-09 16:00:35 +00:00
Teresa Johnson	e50b23c67f	Fix build error due to unsigned compare >= 0 in r263008 (NFC) Fixes error from building with clang: /usr/local/google/home/tejohnson/llvm/llvm_15/lib/Target/AMDGPU/InstPrinter/AMDGPUInstPrinter.cpp:407:12: error: comparison of unsigned expression >= 0 is always true [-Werror,-Wtautological-compare] if ((Imm >= 0x000) && (Imm <= 0x0ff)) { ~~~ ^ ~~~~~ llvm-svn: 263014	2016-03-09 14:58:23 +00:00
Petar Jovanovic	921c2b4eb3	Reland r262337 "calculate builtin_object_size if arg is a removable pointer" Original commit message: calculate builtin_object_size if argument is a removable pointer This patch fixes calculating correct value for builtin_object_size function when pointer is used only in builtin_object_size function call and never after that. Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D17337 Reland the original change with a small modification (first do a null check and then do the cast) to satisfy ubsan. llvm-svn: 263011	2016-03-09 14:12:47 +00:00
Silviu Baranga	ecf1b4c24d	Update comments following the addition of PredicatedScalarEvolution. NFC. We changed several functions in LoopAccessAnalysis to use PSE instead of taking SE and a SCEV predicate as arguments, but didn't update the comments. This also fixes a comment in ScalarEvolution, where we refered to Preds when the argument name was A. llvm-svn: 263009	2016-03-09 12:39:06 +00:00
Sam Kolton	dfa29f7c5b	[AMDGPU] Assembler: Support DPP instructions. Supprot DPP syntax as used in SP3 (except several operands syntax). Added dpp-specific operands in td-files. Added DPP flag to TSFlags to determine if instruction is dpp in InstPrinter. Support for VOP2 DPP instructions in td-files. Some tests for DPP instructions. ToDo: - VOP2bInst: - vcc is considered as operand - AsmMatcher doesn't apply mnemonic aliases when parsing operands - v_mac_f32 - v_nop - disable instructions with 64-bit operands - change dpp_ctrl assembler representation to conform sp3 Review: http://reviews.llvm.org/D17804 llvm-svn: 263008	2016-03-09 12:29:31 +00:00
Nikolay Haustov	9b7577ed22	[AMDGPU] Assembler: Support abs() syntax. Support legacy SP3 abs(v1) syntax. InstPrinter still uses \|v1\|. Add tests. Differential Revision: http://reviews.llvm.org/D17887 llvm-svn: 263006	2016-03-09 11:03:21 +00:00
Nikolay Haustov	8e3f099497	[AMDGPU] Assembler: Fix s_setpc_b64 s_setpc_b64 has just one 64-bit source which is the address of instruction to jump to. Differential Revision: http://reviews.llvm.org/D17888 llvm-svn: 263005	2016-03-09 10:56:19 +00:00
Richard Trieu	af02b1ee0f	Fix uninitialized member bool. Detected by ASan. llvm-svn: 262999	2016-03-09 06:31:25 +00:00
Adam Nemet	34785ecff1	[LoopDataPrefetch] Add stats and debug output llvm-svn: 262998	2016-03-09 05:33:21 +00:00
Adam Nemet	46adc28236	[LAA] Improve comment for isStridedPtr llvm-svn: 262997	2016-03-09 05:33:19 +00:00
Dan Gohman	ddfa1a6c18	[WebAssembly] Update comments about irreducible control flow. llvm-svn: 262995	2016-03-09 04:17:36 +00:00
Sean Silva	05e5cbf4f2	Use lto_bool_t instead of a raw `bool` (fixup for r262977). Hopefully this should bring llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast back to life. llvm-svn: 262994	2016-03-09 04:05:28 +00:00
Mehdi Amini	60ef0f341a	Fix ThinLTO test: depends on the X86 backend From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262993	2016-03-09 04:04:40 +00:00
Mehdi Amini	3ed41d6aa4	void foo() is not a valid C prototype, one has to write void foo(void) Remove a warning introduced in r262977 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262990	2016-03-09 02:36:09 +00:00
Sanjoy Das	2eac48de9e	Return StringRef instead of a naked char*; NFC llvm-svn: 262989	2016-03-09 02:34:19 +00:00
Sanjoy Das	f13900f8ac	[IRCE] Reflow comments; NFC llvm-svn: 262988	2016-03-09 02:34:15 +00:00
Mehdi Amini	0e83a809a6	Fix library dependency for llvm-lto after r262977 It is a transitive dependency, so static build are OK but not build with individual DSO for each LLVM library. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262987	2016-03-09 02:34:13 +00:00
Dan Gohman	d7a2eea619	[WebAssembly] Implement irreducible control flow. This implements a very simple conservative transformation that doesn't require more than linear code size growth. There's room for much more optimization in this space. llvm-svn: 262982	2016-03-09 02:01:14 +00:00
Mehdi Amini	d2d989609f	Fix GOLD plugin build after r262976 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262981	2016-03-09 01:55:15 +00:00
Sanjoy Das	84216672da	Remove trailing newline from test case; NFC llvm-svn: 262980	2016-03-09 01:51:44 +00:00
Sanjoy Das	97d19bd95f	[SCEV] Slightly generalize getRangeViaFactoring Building on the previous change, this generalizes ScalarEvolution::getRangeViaFactoring to work with {Ext(C?A:B)+k0,+,Ext(C?A:B)+k1} where Ext can be a zero extend, sign extend or truncate operation, and k0 and k1 are constants. llvm-svn: 262979	2016-03-09 01:51:02 +00:00
Sanjoy Das	d3488c6060	[SCEV] Slightly generalize getRangeViaFactoring This change generalizes ScalarEvolution::getRangeViaFactoring to work with {Ext(C?A:B),+,Ext(C?A:B)} where Ext can be a zero extend, sign extend or truncate operation. llvm-svn: 262978	2016-03-09 01:50:57 +00:00
Mehdi Amini	7c4a1a8d48	libLTO: add a ThinLTOCodeGenerator on the model of LTOCodeGenerator. This is intended to provide a parallel (threaded) ThinLTO scheme for linker plugin use through the libLTO C API. The intent of this patch is to provide a first implementation as a proof-of-concept and allows linker to start supporting ThinLTO by definiing the libLTO C API. Some part of the libLTO API are left unimplemented yet. Following patches will add support for these. The current implementation can link all clang/llvm binaries. Differential Revision: http://reviews.llvm.org/D17066 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262977	2016-03-09 01:37:22 +00:00
Mehdi Amini	bd04e8fed6	FunctionIndex is not optional for renameModuleForThinLTO(), make it a reference (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262976	2016-03-09 01:37:14 +00:00
Zachary Turner	a99000dd31	[llvm-pdbdump] Dump line table information. This patch adds the -lines command line option which will dump source/line information for each compiland and source file. llvm-svn: 262962	2016-03-08 21:42:24 +00:00
Sanjay Patel	8d950ce18c	fix typo; NFC llvm-svn: 262961	2016-03-08 21:41:13 +00:00
Sanjay Patel	b8d071bc8a	use range-based for loop; NFCI llvm-svn: 262956	2016-03-08 20:53:48 +00:00
Sanjay Patel	f831fdb56a	fix variable name; NFC llvm-svn: 262953	2016-03-08 19:07:42 +00:00
Sanjay Patel	5c96723622	use range-based loop; NFCI llvm-svn: 262952	2016-03-08 19:06:12 +00:00
Hans Wennborg	af845d79ad	Add self to CODE_OWNERS Apparently this makes my email address easier to find. llvm-svn: 262951	2016-03-08 19:01:15 +00:00
Saleem Abdulrasool	2d5e95c00d	cmake: include what you use Add a missing include. This is important in the case HandleLLVMOptions is included prior to the missing CheckCXXSourceCompiles or CheckCXXCompilerFlag which includes CheckCXXSourceCompiles. llvm-svn: 262949	2016-03-08 18:56:00 +00:00
Chris Bieneman	74c98f0e8d	[CMake] Refactor add_llvm_implicit_projects to be reusable This adds llvm_add_implicit_projects which takes a project name and is wrapped by add_llvm_implicit_projects. llvm-svn: 262948	2016-03-08 18:43:28 +00:00
Chad Rosier	2a70624403	[AArch64] Disable the MI scheduler to turn bots green after r262942. llvm-svn: 262944	2016-03-08 17:33:34 +00:00
Quentin Colombet	4340b55593	Revert r262759 and r262760. The fix consisting in using the library call for atomic compare and swap when the instruction is not safe to use may be incorrect. Indeed the library call may not exist on all platform. In other words, we need a better fix! llvm-svn: 262943	2016-03-08 17:29:11 +00:00
Chad Rosier	e40b9513a9	[AArch64] Add MMOs to unscaled pairs. Test to be committed in follow up commit, per discussion in D17097. http://reviews.llvm.org/D17097 llvm-svn: 262942	2016-03-08 17:16:38 +00:00
Sanjay Patel	eaf06851d0	rangify, fix function names; NFCI llvm-svn: 262940	2016-03-08 17:12:32 +00:00
Krzysztof Parzyszek	cd99e364e3	Invoke DAG postprocessing in the post-RA scheduler This was inadvertently omitted from r262774, which added the mutation interface. llvm-svn: 262939	2016-03-08 16:54:20 +00:00
Sanjay Patel	5b8d741632	don't repeat function names in documentation comments; NFC llvm-svn: 262937	2016-03-08 16:26:39 +00:00
Artyom Skrobov	5ddea6a8e9	[ARM] Simplify ARMInstr*.td by getting rid of identity PatFrags (NFC) Reviewers: t.p.northover, grosbach, resistor Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D17636 llvm-svn: 262936	2016-03-08 16:23:54 +00:00
Hans Wennborg	e00b6e7249	Revert r262599 "[X86][SSE] Improve vector ZERO_EXTEND by combining to ZERO_EXTEND_VECTOR_INREG" This caused PR26870. llvm-svn: 262935	2016-03-08 16:21:41 +00:00
Manuel Klimek	43a43079a6	Fix problem with uninitilialized bool found by asan. llvm-svn: 262934	2016-03-08 16:17:48 +00:00
Krzysztof Parzyszek	1a1d78b86f	Add DAG mutation interface to the DFA packetizer llvm-svn: 262930	2016-03-08 15:33:51 +00:00
Igor Breger	999ac754f2	AVX512: Add extract_subvector patterns v8i1->v4i1 , v4i1->v2i1. Differential Revision: http://reviews.llvm.org/D17953 llvm-svn: 262929	2016-03-08 15:21:25 +00:00
Benjamin Kramer	39988a03a5	[gold] Avoid assertion failures when taking a pointer to an empty vector. llvm-svn: 262926	2016-03-08 14:02:46 +00:00
Filipe Cabecinhas	a7e63b1e67	[llvm-config] Get rid of code related to the Makefile builds Summary: I left --build-system for backwards compat, in case there are scripts using it. Feel free to ask for its removal too. Reviewers: chapuni, tstellarAMD Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17886 llvm-svn: 262924	2016-03-08 11:49:24 +00:00
Simon Pilgrim	d8ac7c9f2d	[X86] Regenerated vector float extension tests llvm-svn: 262919	2016-03-08 09:17:12 +00:00
Junmo Park	3452d33ae2	Remove pr25342 test-case. This commit removes pr25342 for reverting r262670 clearly. llvm-svn: 262918	2016-03-08 07:42:12 +00:00
Junmo Park	974eb0a96d	Revert "[InstCombine] Combine A->B->A BitCast" This reverts commit r262670 due to compile failure. llvm-svn: 262916	2016-03-08 07:09:46 +00:00
Justin Bogner	6e2b99516a	SelectionDAG: Appease the bots that don't like my union Should fix the breakage in r262902. llvm-svn: 262908	2016-03-08 03:51:58 +00:00
Peter Collingbourne	3866cc5f69	Fix evaluation order. Spotted by Alexander Riccio! llvm-svn: 262907	2016-03-08 03:50:36 +00:00
Kit Barton	ba532dc816	[Power9] Implement new vsx instructions: load, store instructions for vector and scalar We follow the comments mentioned in http://reviews.llvm.org/D16842#344378 to implement this new patch. This patch implements the following vsx instructions: Vector load/store: lxv lxvx lxvb16x lxvl lxvll lxvh8x lxvwsx stxv stxvb16x stxvh8x stxvl stxvll stxvx Scalar load/store: lxsd lxssp lxsibzx lxsihzx stxsd stxssp stxsibx stxsihx 21 instructions Phabricator: http://reviews.llvm.org/D16919 llvm-svn: 262906	2016-03-08 03:49:13 +00:00
Dan Gohman	1402606477	[WebAssembly] Update for spec change from tableswitch to br_table. Also note that the operand order changed; the default label is now listed after the regular labels. llvm-svn: 262903	2016-03-08 03:18:12 +00:00
Justin Bogner	671febc0f7	Re-apply "SelectionDAG: Store SDNode operands in an ArrayRecycler" This re-applies r262886 with a fix for 32 bit platforms that have 8 byte pointer alignment, effectively reverting r262892. Original Message: Currently some SDNode operands are malloc'd, some are stored inline in subclasses of SDNode, and some are thrown into a BumpPtrAllocator. This scheme is complex, inconsistent, and makes refactoring SDNodes fairly difficult. Instead, we can allocate all of the operands using an ArrayRecycler that wraps a BumpPtrAllocator. This keeps the cache locality when iterating operands, improves locality when iterating SDNodes without looking at operands, and vastly simplifies the ownership semantics. It also means we stop overallocating SDNodes by 2-3x and will make it simpler to fix the rampant undefined behaviour we have in how we mutate SDNodes from one kind to another (See llvm.org/pr26808). This is NFC other than the changes in memory behaviour, and I ran some LNT tests to make sure this didn't hurt compile time. Not many tests changed: there were a couple of 1-2% regressions reported, but there were more improvements (of up to 4%) than regressions. llvm-svn: 262902	2016-03-08 03:14:29 +00:00
Quentin Colombet	5e63e78ca9	[MIR] Change the token name for '<' and '>' to be consitent with the LLVM IR parser. Thanks to Ahmed Bougacha for noticing! llvm-svn: 262899	2016-03-08 02:00:43 +00:00
Quentin Colombet	dca821683c	[AArch64][GlobalISel] Add a test case for the IRTranslator. llvm-svn: 262898	2016-03-08 01:48:08 +00:00
Quentin Colombet	f574ab292b	[AArch64] Initialize GlobalISel as part of the target initialization. llvm-svn: 262897	2016-03-08 01:45:36 +00:00
Quentin Colombet	39293d3aaa	[GlobalISel] Introduce initializer method to support start/stop-after features. llvm-svn: 262896	2016-03-08 01:38:55 +00:00
Quentin Colombet	050b211820	[MIR] Teach the parser/printer that generic virtual registers do not need a register class. llvm-svn: 262893	2016-03-08 01:17:03 +00:00
Justin Bogner	7e6f09c28f	Revert "SelectionDAG: Store SDNode operands in an ArrayRecycler" Looks like the largest SDNode is different between 32 and 64 bit now, so this is breaking 32 bit bots. Reverting while I figure out a fix. This reverts r262886. llvm-svn: 262892	2016-03-08 01:07:03 +00:00
Richard Smith	c2a2830e94	A couple more UB fixes for C++14 sized deallocation. llvm-svn: 262891	2016-03-08 00:59:44 +00:00
Quentin Colombet	287c6bb571	[MIR] Teach the parser how to parse complex types of generic machine instructions. By complex types, I mean aggregate or vector types. llvm-svn: 262890	2016-03-08 00:57:31 +00:00
Justin Bogner	6543a9385f	SelectionDAG: Store SDNode operands in an ArrayRecycler Currently some SDNode operands are malloc'd, some are stored inline in subclasses of SDNode, and some are thrown into a BumpPtrAllocator. This scheme is complex, inconsistent, and makes refactoring SDNodes fairly difficult. Instead, we can allocate all of the operands using an ArrayRecycler that wraps a BumpPtrAllocator. This keeps the cache locality when iterating operands, improves locality when iterating SDNodes without looking at operands, and vastly simplifies the ownership semantics. It also means we stop overallocating SDNodes by 2-3x and will make it simpler to fix the rampant undefined behaviour we have in how we mutate SDNodes from one kind to another (See llvm.org/pr26808). This is NFC other than the changes in memory behaviour, and I ran some LNT tests to make sure this didn't hurt compile time. Not many tests changed: there were a couple of 1-2% regressions reported, but there were more improvements (of up to 4%) than regressions. llvm-svn: 262886	2016-03-08 00:39:51 +00:00
Quentin Colombet	d655483944	[MIR] Teach the printer how to print complex types for generic machine instructions. Before this change, we would get the type definition in the middle of the instruction. E.g., %0(48) = G_ADD %struct_alias = type { i32, i16 } %edi, %edi Now, we have just the expected type name: %0(48) = G_ADD %struct_alias %edi, %edi llvm-svn: 262885	2016-03-08 00:38:01 +00:00
Quentin Colombet	dafed5d7d8	[AsmParser] Expose an API to parse a string starting with a type. Without actually parsing a type it is difficult to perdict where the type definition ends. In other words, instead of expecting the user of the parser API to hand over only the relevant bits of the string being parsed, take the whole string, parse the type, and get back the number of characters that have been read. This will be used by the MIR testing infrastructure. llvm-svn: 262884	2016-03-08 00:37:07 +00:00
Easwaran Raman	b1bd398ceb	Revert revisions 262636, 262643, 262679, and 262682. llvm-svn: 262883	2016-03-08 00:36:35 +00:00
Quentin Colombet	12350a8e13	[MIR] Print the type of generic machine instructions. llvm-svn: 262880	2016-03-08 00:29:15 +00:00
Quentin Colombet	851996778f	[MIR] Teach the mir parser about types on generic machine instructions. llvm-svn: 262879	2016-03-08 00:20:48 +00:00
Quentin Colombet	9d1bc8bd16	[lit] Teach lit about global-isel requirement. llvm-svn: 262878	2016-03-08 00:03:40 +00:00
Quentin Colombet	447f852aa9	[llvm-config] Teach llvm-config about global-isel. llvm-config can know tell whether or not a build has been configured to support global-isel. Use '--has-global-isel' for that. llvm-svn: 262877	2016-03-08 00:02:50 +00:00
Anna Zaks	c1efa64c63	[tsan] Add support for pointer typed atomic stores, loads, and cmpxchg TSan instrumentation functions for atomic stores, loads, and cmpxchg work on integer value types. This patch adds casts before calling TSan instrumentation functions in cases where the value is a pointer. Differential Revision: http://reviews.llvm.org/D17833 llvm-svn: 262876	2016-03-07 23:16:23 +00:00
Sanjay Patel	8c84f74f3a	[x86] add test to show missing optimization This should make it clearer how this proposed patch: http://reviews.llvm.org/D11393 ...will change codegen. llvm-svn: 262875	2016-03-07 23:13:06 +00:00
Sanjay Patel	55c0dd4b26	[x86] simplify test and tighten checks I noticed this test as part of: http://reviews.llvm.org/D11393 ...which is confusing enough as-is. Let's show the exact codegen, so the changes will be more obvious. llvm-svn: 262874	2016-03-07 22:53:23 +00:00
Quentin Colombet	41bea872dd	[MachineInstr] Get rid of some GlobalISel ifdefs. Now the type API is always available, but when global-isel is not built the implementation does nothing. Note: The implementation free of ifdefs is WIP and tracked here in PR26576. llvm-svn: 262873	2016-03-07 22:47:23 +00:00
Amaury Sechet	b813e4d4ae	Remove unused import in Orc C API Summary: It is not used. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17251 llvm-svn: 262870	2016-03-07 22:40:07 +00:00
Quentin Colombet	774b1efa62	[IR] Provide an API to skip the details of a structured type when printed. The mir infrastructure will need this for generic instructions and currently this feature was only available through the anonymous TypePrinter class. llvm-svn: 262869	2016-03-07 22:32:42 +00:00
Quentin Colombet	81e72b4d4e	[AsmParser] Add a function to parse a standalone type. This is useful for MIR serialization. Indeed generic machine instructions must have a type and we don't want to duplicate the logic in the MIParser. llvm-svn: 262868	2016-03-07 22:09:05 +00:00
Quentin Colombet	4e14a497a3	[MIR] Teach the MIPrinter about size for generic virtual registers. llvm-svn: 262867	2016-03-07 21:57:52 +00:00
Matt Arsenault	de2d6a3033	Fix broken example for bitreverse documentation llvm-svn: 262865	2016-03-07 21:54:52 +00:00
Matt Arsenault	c89f2919a4	AMDGPU: Match more med3 integer patterns llvm-svn: 262864	2016-03-07 21:54:48 +00:00
Quentin Colombet	2a831fb826	[MIR] Teach the parser how to handle the size of generic virtual registers. llvm-svn: 262862	2016-03-07 21:48:43 +00:00
Quentin Colombet	1bd7504ef3	[MachineRegisterInfo] Add a method to set the size of a virtual register a posteriori. This is required for mir testing. llvm-svn: 262861	2016-03-07 21:41:39 +00:00
Amaury Sechet	5984dfe7c7	Small formating change in Core.cpp . NFC llvm-svn: 262860	2016-03-07 21:39:20 +00:00
Quentin Colombet	70a9670d80	[MachineRegisterInfo] Get rid of the global-isel ifdefs. One additional pointer is not a big deal size-wise and it makes the code much nicer! llvm-svn: 262856	2016-03-07 21:22:09 +00:00
Matt Arsenault	56356c8a9c	AMDGPU: Remove a fixme for ptrrtoint handling llvm-svn: 262854	2016-03-07 21:12:46 +00:00
Matt Arsenault	81d06015c6	AMDGPU: Move function only used by R600 llvm-svn: 262853	2016-03-07 21:10:13 +00:00
Matt Arsenault	ceb2c06cbd	DAGCombiner: Check legality before creating extract_vector_elt Problem not hit by any in tree target. llvm-svn: 262852	2016-03-07 21:10:09 +00:00
Justin Bogner	bbab368e13	SelectionDAG: Remove some unused AtomicSDNode constructors. NFC llvm-svn: 262849	2016-03-07 20:15:12 +00:00
Adam Nemet	bb3680bd85	[LoopDataPrefetch] If prefetch distance is not set, skip pass This lets select sub-targets enable this pass. The patch implements the idea from the recent llvm-dev thread: http://thread.gmane.org/gmane.comp.compilers.llvm.devel/94925 The goal is to enable the LoopDataPrefetch pass for the Cyclone sub-target only within Aarch64. Positive and negative tests will be included in an upcoming patch that enables selective prefetching of large-strided accesses on Cyclone. llvm-svn: 262844	2016-03-07 18:35:42 +00:00
Marina Yatsina	5f5de9f89b	[ms-inline-asm][AVX512] Add ability to use k registers in MS inline asm + fix bag with curly braces Until now curly braces could only be used in MS inline assembly to mark block start/end. All curly braces were removed completely at a very early stage. This approach caused bugs like: "m{o}v eax, ebx" turned into "mov eax, ebx" without any error. In addition, AVX-512 added special operands (e.g., k registers), which are also surrounded by curly braces that mark them as such. Now, we need to keep the curly braces and identify at a later stage if they are marking block start/end (if so, ignore them), or surrounding special AVX-512 operands (if so, parse them as such). This patch fixes the bug described above and enables the use of AVX-512 special operands. This commit is the the llvm part of the patch. The clang part of the review is: http://reviews.llvm.org/D17766 The llvm part of the review is: http://reviews.llvm.org/D17767 Differential Revision: http://reviews.llvm.org/D17767 llvm-svn: 262843	2016-03-07 18:11:16 +00:00
Adam Nemet	4896c7a82a	[ScopedNoAliasAA] Make test basic.ll less confusing Summary: This testcase had me confused. It made me believe that you can use alias scopes and alias scopes list interchangeably with alias.scope and noalias. Both langref and the other testcase use scope lists so I went looking. Turns out using scope directly only happens to work by chance. When ScopedNoAliasAAResult::mayAliasInScopes traverses this as a scope list: !1 = !{!1, !0, !"some scope"} , the first entry is in fact a scope but only because the scope is happened to be defined self-referentially to make it unique globally. The remaining elements in the tuple (!0, !"some scope") are considered as scopes but AliasScopeNode::getDomain will just bail on those without any error. This change avoids this ambiguity in the test but I've also been wondering if we should issue some sort of a diagnostics. Reviewers: dexonsmith, hfinkel Subscribers: mssimpso, llvm-commits Differential Revision: http://reviews.llvm.org/D16670 llvm-svn: 262841	2016-03-07 17:49:10 +00:00
Adam Nemet	81113ef68c	Revert "Enable LoopLoadElimination by default" This reverts commit r262250. It causes SPEC2006/gcc to generate wrong result (166.s) in AArch64 when running with ref data set. The error happens with "-Ofast -flto -fuse-ld=gold" or "-O3 -fno-strict-aliasing". llvm-svn: 262839	2016-03-07 17:38:02 +00:00
Chandler Carruth	af8321ecf7	[memdep] Switch to range based for loops. llvm-svn: 262831	2016-03-07 15:12:57 +00:00
Chandler Carruth	9ca96384f3	[DFSan] Remove an overly aggressive assert reported in PR26068. This code has been successfully used to bootstrap libc++ in a no-asserts mode for a very long time, so the code that follows cannot be completely incorrect. I've added a test that shows the current behavior for this kind of code with DFSan. If it is desirable for DFSan to do something special when processing an invoke of a variadic function, it can be added, but we shouldn't keep an assert that we've been ignoring due to release builds anyways. llvm-svn: 262829	2016-03-07 14:05:09 +00:00
Chandler Carruth	b32febe48e	[memdep] Switch a function to return true on success instead of false. This is much more clear and less surprising IMO. It also makes things more consistent with the increasingly large chunk of LLVM code that assumes true-on-success. llvm-svn: 262826	2016-03-07 12:45:07 +00:00
Chandler Carruth	40e21f2a20	[memdep] Cleanup the implementation doxygen comments and remove duplicated comments. In several cases these had diverged making them especially nice to canonicalize. I checked to make sure we weren't losing important information of course. llvm-svn: 262825	2016-03-07 12:30:06 +00:00
Chandler Carruth	78954164a9	[memdep] Finish cleaning up all of the comments' doxygen. llvm-svn: 262824	2016-03-07 11:27:56 +00:00
Chandler Carruth	1fac9df95c	[memdep] Switch from a hacky use of PointerIntPair and poorly chosen arbitrary integers cast to Instruction pointers to a sum type over Instruction * and a PointerEmbeddedInt. No functionality changed. Differential Revision: http://reviews.llvm.org/D15845 llvm-svn: 262823	2016-03-07 11:04:46 +00:00
Chandler Carruth	3d79dd9b06	[memdep] Update the comments' doxygen style and place them more clearly. Just cleaning this up, no functionality changed. Next up will be moving it to use the sum type instead of arbitrary "pointer"-like enums. llvm-svn: 262822	2016-03-07 10:35:02 +00:00
Chandler Carruth	60fb1b4bd2	[memdep] Run clang-format over the header before porting it to the new pass manager. The port will involve substantial edits here, and would likely introduce bad formatting if formatted in isolation, so just get all the formatting up to snuff. I'll also go through and try to freshen the doxygen here as well as modernizing some of the code. llvm-svn: 262821	2016-03-07 10:19:30 +00:00
Craig Topper	267bdb2094	[CodeGen] Add space-optimized EmitMergeInputChains1_2 to the DAG isel matching tables. Shaves about 5100 bytes from the X86 matcher table. NFC llvm-svn: 262815	2016-03-07 07:29:12 +00:00
Mehdi Amini	b923d641d0	Add a new insert_as() method to DenseMap and use it for ConstantUniqueMap Just like the existing find_as() method, the new insert_as() accepts an extra parameter which is used as a key to find the bucket in the map. When creating a Constant, we want to check the map before actually creating the object. In this case we have to perform two queries to the map, and this extra parameter can save recomputing the hash value for the second query. This is a reapply of r260458, that was reverted because it was suspected to be the cause of instability of an internal bot, but wasn't confirmed. Differential Revision: http://reviews.llvm.org/D16268 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262812	2016-03-07 00:51:00 +00:00
Mehdi Amini	67dfe09da4	Bitcode reader: Inline readAbbreviatedField in readRecord and move the enclosing loop in each case (NFC) Summary: This make readRecord 20% faster, measured on an LTO build Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17911 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 262811	2016-03-07 00:38:09 +00:00
NAKAMURA Takumi	2de1b320a4	Revert r130657, "Windows/DynamicLibrary.inc: Clean up ELM_Callback. We may check the decl instead of the versions of individual libraries." We may assume the type of 1st argument as PCSTR in PENUMLOADED_MODULES_CALLBACK. PSTR was in the ancient mingw32. llvm-svn: 262810	2016-03-07 00:13:09 +00:00
Simon Pilgrim	253ca348b2	[X86][AVX512] Fixed VPERMT2* shuffle mask decoding and enabled target shuffle combining. Patch to add support for target shuffle combining of X86ISD::VPERMV3 nodes, including support for detecting unary shuffles. This uncovered several issues with the X86ISD::VPERMV3 shuffle mask decoding of non-64 bit shuffle mask elements - the bit masking wasn't being correctly computed. Removed non-constant pool mask decode path as we have no way of testing it right now. Differential Revision: http://reviews.llvm.org/D17916 llvm-svn: 262809	2016-03-06 21:54:52 +00:00
Valery Pykhtin	dc11054f20	[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler. Engages code from r262804. Differential Revision: http://reviews.llvm.org/D17151 llvm-svn: 262808	2016-03-06 20:25:36 +00:00
Valery Pykhtin	50cd3c4ec7	fix sanitizer-ppc64be-linux failure for r262804 error: moving a local object in a return statement prevents copy elision [-Werror,-Wpessimizing-move] http://lab.llvm.org:8011/builders/sanitizer-ppc64be-linux/builds/930 llvm-svn: 262805	2016-03-06 15:13:54 +00:00
Valery Pykhtin	499a5c6323	[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields Differential Revision: http://reviews.llvm.org/D17150 llvm-svn: 262804	2016-03-06 13:27:13 +00:00
Igor Breger	4d94d4d5f7	AVX512BW: Support llvm intrinsic masked vector load/store for i8/i16 element types on SKX Differential Revision: http://reviews.llvm.org/D17913 llvm-svn: 262803	2016-03-06 12:38:58 +00:00
Wilfred Hughes	c0531a4a21	Fix typo. llvm-svn: 262802	2016-03-06 12:37:34 +00:00
Valery Pykhtin	0c6293da68	[AMDGPU] SOPxx instructions operand naming fixed in td files. dst -> sdst ssrcN -> srcN Differential Revision: http://reviews.llvm.org/D17646 llvm-svn: 262801	2016-03-06 10:31:44 +00:00
Craig Topper	581c0087b9	[X86] Use high bits of return value from getEncoding instead of predicate functions to populate the REX and VEX prefix bits that extend register encodings. NFC llvm-svn: 262800	2016-03-06 08:12:47 +00:00
Craig Topper	faab5c68d4	[X86] Remove unnecessary masking. The assert above it already guaranteed it. NFC llvm-svn: 262799	2016-03-06 08:12:44 +00:00
Craig Topper	5e038cf589	[X86] Use uint8_t instead of unsigned char as it shortens the code and more explicitly reflects the desired size. llvm-svn: 262798	2016-03-06 08:12:42 +00:00
Igor Breger	f1bd761e00	AVX512: Remove VSHRI kmask patterns from TD file. It is incorrect to use kshiftw to implement VSHRI v4i1 , bits 15-4 is undef so the upper bits of v4i1 may not be zeroed. v4i1 should be zero_extend to v16i1 ( or any natively supported vector). Differential Revision: http://reviews.llvm.org/D17763 llvm-svn: 262797	2016-03-06 07:46:03 +00:00
Saleem Abdulrasool	11bf1ac297	unitests: add some ARM TargetParser tests The ARM TargetParser would construct invalid StringRefs. This would cause asserts to trigger. Add some tests in LLVM to ensure that we dont regress on this in the future. Although there is a test for this in clang, this ensures that the changes would get caught in the same repository. llvm-svn: 262790	2016-03-06 04:50:55 +00:00
Alexander Kornienko	45c9a5beee	[docs] Updated docs to work with Doxygen 1.8.11 llvm-svn: 262786	2016-03-06 03:50:08 +00:00
Simon Pilgrim	40e1a71cdd	[X86][AVX] Improved VPERMILPS variable shuffle mask decoding. Added support for decoding VPERMILPS variable shuffle masks that aren't in the constant pool. Added target shuffle mask decoding for SCALAR_TO_VECTOR+VZEXT_MOVL cases - these can happen for v2i64 constant re-materialization Followup to D17681 llvm-svn: 262784	2016-03-05 22:53:31 +00:00
Simon Pilgrim	aa99331bad	[X86] AMD Bobcat CPU (btver1) doesn't support XSAVE btver1 is a SSSE3/SSE4a only CPU - it doesn't have AVX and doesn't support XSAVE. Differential Revision: http://reviews.llvm.org/D17683 llvm-svn: 262782	2016-03-05 22:00:50 +00:00
Saleem Abdulrasool	4208381016	Support: catch invalid accesses It is possible to invoke these methods on an invalid input resulting in an invalid substring construction. It seems that we do not have unit tests for these methods. Tests to ensure that the invalid call is caught to follow in clang. Resolves PR26839. llvm-svn: 262778	2016-03-05 20:00:44 +00:00
Saleem Abdulrasool	fa8c6ed3fa	ExecutionEngine: tweak debug log Add a newline to separate the log message. NFC. llvm-svn: 262777	2016-03-05 20:00:41 +00:00
Yaron Keren	ce608690e1	Replace GlobalScopeAsm[GlobalScopeAsm.size()-1] with GlobalScopeAsm.back(), NFC. llvm-svn: 262775	2016-03-05 16:02:09 +00:00
Krzysztof Parzyszek	5c61d11a6d	Add DAG mutation interface to the post-RA scheduler Differential Revision: http://reviews.llvm.org/D17868 llvm-svn: 262774	2016-03-05 15:45:23 +00:00
Chandler Carruth	47dbdd9c31	[aa-eval] Enhance the comments to better describe the overview of why this pass exists. This is based on feedback received when moving this comment from the source file to a new header file. Differential Revision: http://reviews.llvm.org/D17476 llvm-svn: 262769	2016-03-05 08:20:15 +00:00
Matthias Braun	4797ec95e4	RegisterCoalescer: Remap subregister lanemasks before exchanging operands Rematerializing and merging into a bigger register class at the same time, requires the subregister range lanemasks getting remapped to the new register class. This fixes http://llvm.org/PR26805 llvm-svn: 262768	2016-03-05 04:36:13 +00:00
Matthias Braun	8de09aa0c5	RegisterCoalescer: Need to check DstReg+SrcReg for missing undef flags copy coalescing with enabled subregister liveness can reveal undef uses, previously this was only checked for the SrcReg in updateRegDefsUses() but we need to check DstReg as well. llvm-svn: 262767	2016-03-05 04:36:10 +00:00
Matthias Braun	2cbfd9fff5	RegisterPressure: Small cleanup llvm-svn: 262766	2016-03-05 04:36:08 +00:00
Quentin Colombet	2a7676b442	[X86] Fix the lowering of setjmp intrinsic on i386. When the lowering of the setjmp intrinsic requires a global base pointer to be set, make sure such pointer gets defined by the CGBR pass. This fixes PR26742. llvm-svn: 262762	2016-03-05 00:31:04 +00:00
Quentin Colombet	fb5be7a37f	Add missing triple in my previous commit! llvm-svn: 262760	2016-03-04 23:36:32 +00:00
Quentin Colombet	13b524597d	[X86] Do not use cmpxchgXXb when we need the base pointer (RBX). cmpxchgXXb uses RBX as one of its implicit argument. I.e., when we use that instruction we need to clobber RBX. This is generally fine, expect when RBX is a reserved register because in that case, the register allocator will not track its value and will not save and restore it when interferences occur. rdar://problem/24851412 llvm-svn: 262759	2016-03-04 23:29:39 +00:00
Sanjay Patel	216b275994	[x86] add tests for masked loads with constant masks llvm-svn: 262758	2016-03-04 23:28:07 +00:00
Mike Aizatsky	243fe2b3a0	[libfuzzer] adding std:string to allowed adaptable argument. llvm-svn: 262757	2016-03-04 23:18:01 +00:00
David Majnemer	71a1c2c619	Fix build breakage llvm-svn: 262756	2016-03-04 23:02:15 +00:00
David Majnemer	d2f767d2f6	[X86] Support cleaning more than 2**16 bytes of stack The x86 ret instruction has a 16 bit immediate indicating how many bytes to pop off of the stack beyond the return address. There is a problem when extremely large structs are passed by value: we might not be able to fit the number of bytes to pop into the return instruction. To fix this, expand RET_FLAG a little later and use a special sequence to clean the stack: pop %ecx ; return address is now in %ecx add $n, %esp ; clean the stack push %ecx ; bring the return address back on the stack ret ; pop the return address and jmp to it's value llvm-svn: 262755	2016-03-04 22:56:17 +00:00
Kostya Serebryany	5c3701c621	[libFuzzer] log less when re-loading files; fix a silly bug: when running single files actually run all of them, not just the first one llvm-svn: 262754	2016-03-04 22:35:40 +00:00
Philip Reames	a0c9f6e736	[LVI] Fix a bug which prevented use of !range metadata within a query The diff is relatively large since I took a chance to rearrange the code I had to touch in a more obvious way, but the key bit is merely using the !range metadata when we can't analyze the instruction further. The previous !range metadata code was essentially just dead since no binary operator or cast will have !range metadata (per Verifier) and it was otherwise dropped on the floor. llvm-svn: 262751	2016-03-04 22:27:39 +00:00
Rong Xu	ecdc98fdae	[PGO] Add a commandline option to control number of the VP annotation metadata. llvm-svn: 262750	2016-03-04 22:08:44 +00:00
Michael Kuperstein	b89f0fa2a2	[DAGCombine] Fix divrem combine not to assume div/rem type is simple. The divrem combine assumed the type of the div/rem is simple, which isn't necessarily true. This probably worked fine until r250825, since it only saw legal types, but now breaks when it runs as a pre-type-legalization combine. This fixes PR26835. Differential Revision: http://reviews.llvm.org/D17878 llvm-svn: 262746	2016-03-04 21:23:29 +00:00
Teresa Johnson	5d07531d02	Fix new gold test to specify emulation mode. The thinlto_linkonceresolution.ll gold linker test introduced in r262727 included a target triple, but didn't set the emulation mode, which is necessary since the default linker target may be different. Patch by H.J. Lu llvm-svn: 262745	2016-03-04 21:19:08 +00:00
Dan Gohman	e6b81362e9	[WebAssembly] Add another possible code-size optimization to README.txt llvm-svn: 262740	2016-03-04 20:09:57 +00:00
Renato Golin	175c6d6d95	[ARM] Merging 64-bit divmod lib calls into one When div+rem calls on the same arguments are found, the ARM back-end merges the two calls into one __aeabi_divmod call for up to 32-bits values. However, for 64-bit values, which also have a lib call (__aeabi_ldivmod), it wasn't merging the calls, and thus calling ldivmod twice and spilling the temporary results, which generated pretty bad code. This patch legalises 64-bit lib calls for divmod, so that now all the spilling and the second call are gone. It also relaxes the DivRem combiner a bit on the legal type check, since it was already checking for isLegalOrCustom on every value, so the extra check for isTypeLegal was redundant. Second attempt, creating TLI.isOperationCustom like isOperationExpand, to make sure we only emit valid types or the ones that were explicitly marked as custom. Now, passing check-all and test-suite on x86, ARM and AArch64. This patch fixes PR17193 (and a long time FIXME in the tests). llvm-svn: 262738	2016-03-04 19:19:36 +00:00
Tom Stellard	649b5db557	AMDGPU/SI: Add support for spiling SGPRs to scratch buffer Summary: This is necessary for when we run out of VGPRs and can no longer use v_{read,write}_lane for spilling SGPRs. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17592 llvm-svn: 262732	2016-03-04 18:31:18 +00:00
Teresa Johnson	3b8f6126ac	Fix bot failure from r262721: unintented change in gold-plugin save-temps The split code gen task ID should not be appended to save-temps output file when the parallelism factor is 1 (not actually splitting). llvm-svn: 262731	2016-03-04 18:16:00 +00:00
Sanjoy Das	fefc4d50ed	[Statepoint docs] Delete trailing whitespace llvm-svn: 262730	2016-03-04 18:14:09 +00:00
Tom Stellard	ebef6f9771	AMDGPU/SI: Enable frame index scavenging during PrologEpilogueInserter Summary: This allows us to use virtual registers when we need extra registers for inserting spill instructions in SIRegisterInfo:eliminateFrameIndex(). Once all the frame indices have been eliminated, the PrologEpilogueInserter does an extra pass over the program to replace all virtual registers with physical ones. This allows us to make more efficient use of our emergency spill slots, so we only need to create one. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D17591 llvm-svn: 262728	2016-03-04 18:02:01 +00:00
Teresa Johnson	a17f2cd1a3	[ThinLTO] Ensure prevailing linkonce emitted as weak in ThinLTO backends Summary: Since IR files are all compiled into separate independent object files in ThinLTO mode, the prevailing linkonce symbols must be emitted in its object file even if it is no longer referenced there, e.g. if no references remain in the module after inlining, since it may be referenced by another ThinLTO compiled object file. This is done by changing LDPR_PREVAILING_DEF_IRONLY* symbols to LDPR_PREVAILING_DEF, which converts the prevailing linkonce to weak. We also don't need the other prevailing IRONLY handling for internalization, which is not currently performed for ThinLTO. Test case included. Reviewers: davidxl, rafael Subscribers: rafael, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16173 llvm-svn: 262727	2016-03-04 17:48:35 +00:00
Krzysztof Parzyszek	51155fc0d1	[Hexagon] Fix lowering of calls with the return type of i1 This fixes an assertion in test/CodeGen/Hexagon/ifcvt-edge-weight.ll when run with -debug-only=isel llvm-svn: 262726	2016-03-04 17:38:05 +00:00
Zoran Jovanovic	a68b67d1ed	[mips][microMIPS] Prevent usage of OR16_MMR6 instruction when code for microMIPS is generated. Author: milena.vujosevic.janicic Reviewers: dsanders Differential Revision: http://reviews.llvm.org/D17373 llvm-svn: 262725	2016-03-04 17:34:31 +00:00
Teresa Johnson	7cffaf3ad0	[ThinLTO] Launch importing backends in parallel threads from gold plugin Summary: Launch ThinLTO backends (LTO and codegen pipelines with importing) in parallel using a ThreadPool, after creating the combined index. The number of threads is controlled by the existing -jobs gold plugin option, or the hardware concurrency if not specified. The old behavior of exiting after creating the combined index can be invoked via a new thinlto-index-only plugin option. This commit involves just the ThinLTO-specific pieces of D15390, the NFC and other restructuring pieces were committed independently: r262677: Add hardware_concurrency interface to llvm::thread (NFC) r262719: Change split code gen to use ThreadPool r262721: Refactor gold-plugin codegen to prepare for ThinLTO threads (NFC) Reviewers: pcc, joker.eph, rafael Subscribers: rafael, davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D15390 llvm-svn: 262724	2016-03-04 17:06:02 +00:00
Teresa Johnson	a9f65554b0	Refactor gold-plugin codegen to prepare for ThinLTO threads (NFC) This is the NFC part remaining from D15390, which refactors the current codegen() into a CodeGen class with various modular methods and other helper functions that will be used by the follow-on ThinLTO piece. llvm-svn: 262721	2016-03-04 16:36:06 +00:00
Teresa Johnson	d84c7decb6	Change split code gen to use ThreadPool Part of D15390. llvm-svn: 262719	2016-03-04 15:39:13 +00:00
Simon Pilgrim	3c7e94208a	[X86][AVX512] Added some basic X86ISD::VPERMV3 shuffle combining tests None of these actually combine yet as we haven't enabled X86ISD::VPERMV3 for target shuffle combining llvm-svn: 262718	2016-03-04 15:19:42 +00:00

... 2 3 4 5 6 ...

128643 Commits