llvm-project

Commit Graph

Author	SHA1	Message	Date
Zachary Turner	bbc5b46a10	Python 3 - Use universal_newlines when calling subprocess.check_output By default in Python 3, check_output() returns a program's output as an encoded byte sequence. This means it returns a Py3 `bytes` object, which cannot be compared to a string since it's a different fundamental type. Although it might not be correct from a purist standpoint, from a practical one we can assume that all output is encoded in the default locale, in which case using universal_newlines=True will decode it according to the current locale. Anyway, universal_newlines also has the nice behavior that it converts \r\n to \n on Windows platforms so this makes parsing code easier, should we need that. So it seems like a win/win. llvm-svn: 252025	2015-11-04 01:03:47 +00:00
Jim Ingham	1c58d5ac90	Actually implement Section::GetSectionData. llvm-svn: 252024	2015-11-04 01:02:43 +00:00
Jim Ingham	6d9880a6b4	Try a little harder to provide a legit CWD to argdumper if the user hasn't provided one. llvm-svn: 252023	2015-11-04 01:02:06 +00:00
Alexey Samsonov	5365a01dc7	[LLVMSymbolize] Reduce indentation by using helper function. NFC. llvm-svn: 252022	2015-11-04 00:30:26 +00:00
Alexey Samsonov	884adda0fb	[LLVMSymbolize] Properly propagate object parsing errors from the library. llvm-svn: 252021	2015-11-04 00:30:24 +00:00
Alexey Samsonov	b0742319fc	[llvm-symbolizer] Improve the test for missing input file. llvm-svn: 252020	2015-11-04 00:30:19 +00:00
Adam Nemet	7c94c9bf07	Fix unused variable warning from r252017 llvm-svn: 252019	2015-11-04 00:10:33 +00:00
Enrico Granata	b766292951	Fix an issue where LLDB would truncate summaries for string types without producing any evidence thereof llvm-svn: 252018	2015-11-04 00:02:08 +00:00
Adam Nemet	e54a4fa95d	LLE 6/6: Add LoopLoadElimination pass Summary: The goal of this pass is to perform store-to-load forwarding across the backedge of a loop. E.g.: for (i) A[i + 1] = A[i] + B[i] => T = A[0] for (i) T = T + B[i] A[i + 1] = T The pass relies on loop dependence analysis via LoopAccessAnalisys to find opportunities of loop-carried dependences with a distance of one between a store and a load. Since it's using LoopAccessAnalysis, it was easy to also add support for versioning away may-aliasing intervening stores that would otherwise prevent this transformation. This optimization is also performed by Load-PRE in GVN without the option of multi-versioning. As was discussed with Daniel Berlin in http://reviews.llvm.org/D9548, this is inferior to a more loop-aware solution applied here. Hopefully, we will be able to remove some complexity from GVN/MemorySSA as a consequence. In the long run, we may want to extend this pass (or create a new one if there is little overlap) to also eliminate loop-indepedent redundant loads and store that require versioning due to may-aliasing intervening stores/loads. I have some motivating cases for store elimination. My plan right now is to wait for MemorySSA to come online first rather than using memdep for this. The main motiviation for this pass is the 456.hmmer loop in SPECint2006 where after distributing the original loop and vectorizing the top part, we are left with the critical path exposed in the bottom loop. Being able to promote the memory dependence into a register depedence (even though the HW does perform store-to-load fowarding as well) results in a major gain (~20%). This gain also transfers over to x86: it's around 8-10%. Right now the pass is off by default and can be enabled with -enable-loop-load-elim. On the LNT testsuite, there are two performance changes (negative number -> improvement): 1. -28% in Polybench/linear-algebra/solvers/dynprog: the length of the critical paths is reduced 2. +2% in Polybench/stencils/adi: Unfortunately, I couldn't reproduce this outside of LNT The pass is scheduled after the loop vectorizer (which is after loop distribution). The rational is to try to reuse LAA state, rather than recomputing it. The order between LV and LLE is not critical because normally LV does not touch scalar st->ld forwarding cases where vectorizing would inhibit the CPU's st->ld forwarding to kick in. LoopLoadElimination requires LAA to provide the full set of dependences (including forward dependences). LAA is known to omit loop-independent dependences in certain situations. The big comment before removeDependencesFromMultipleStores explains why this should not occur for the cases that we're interested in. Reviewers: dberlin, hfinkel Subscribers: junbuml, dberlin, mssimpso, rengolin, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13259 llvm-svn: 252017	2015-11-03 23:50:08 +00:00
Adam Nemet	397f5829c7	[LAA] LLE 5/6: Add predicate functions Dependence::isForward/isBackward, NFC Summary: Will be used by the LoopLoadElimination pass. Reviewers: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13258 llvm-svn: 252016	2015-11-03 23:50:03 +00:00
Adam Nemet	ed653d6774	[LAA] LLE 4/6: APIs to access the dependent instructions for a dependence, NFC Summary: The functions use LAI and MemoryDepChecker classes so they need to be defined after those definitions outside of the Dependence class. Will be used by the LoopLoadElimination pass. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13257 llvm-svn: 252015	2015-11-03 23:49:58 +00:00
Peter Collingbourne	94d778697a	CodeGen, Target: Move Mach-O-specific symbol name logic to Mach-O lowering. A profile of an LTO link of Chrome revealed that we were spending some ~30-50% of execution time in the function Constant::getRelocationInfo(), which is called from TargetLoweringObjectFile::getKindForGlobal() and in turn from TargetMachine::getNameWithPrefix(). It turns out that we only need the result of getKindForGlobal() when targeting Mach-O, so this change moves the relevant part of the logic to TargetLoweringObjectFileMachO. NFCI. Differential Revision: http://reviews.llvm.org/D14168 llvm-svn: 252014	2015-11-03 23:40:03 +00:00
Greg Clayton	26014690aa	All instance variables start with "m_". Fix "options" to be "m_options". llvm-svn: 252013	2015-11-03 23:23:59 +00:00
Greg Clayton	dee40e72ba	Fix __fp16 types so we can display them and use them in expressions. I am not adding a test case for this since I don't know how portable the __fp16 type is between compilers and I don't want to break the test suite. <rdar://problem/22375079> llvm-svn: 252012	2015-11-03 23:23:22 +00:00
Rafael Espindola	5fbe008eb7	Simplify the logic to avoid the Closed set. IMHO this makes the code easier to read: at each iteration we add a section to a PT_LOAD and increase its size. llvm-svn: 252011	2015-11-03 23:19:42 +00:00
Matt Arsenault	aac9b49325	AMDGPU: Make flat_scratch name consistent The printed name and the parsed assembler names weren't the same. I'm not sure which name SC prints these as, but I think it's this one. llvm-svn: 252010	2015-11-03 22:50:34 +00:00
Matt Arsenault	967c2f5dee	AMDGPU: Fix asserts on invalid register ranges If the requested SGPR was not actually aligned, it was accepted and rounded down instead of rejected. Also fix an assert if the range is an invalid size. llvm-svn: 252009	2015-11-03 22:50:32 +00:00
Matt Arsenault	3473c72aab	AMDGPU: Fix off by one error in register parsing If trying to use one past the end, this would assert. llvm-svn: 252008	2015-11-03 22:50:27 +00:00
Ryan Brown	0fbd187d79	Fix build for go parser unittest. llvm-svn: 252007	2015-11-03 22:46:37 +00:00
Michael J. Spencer	2c6f213bcd	[elf2] Use value-initialization instead of memset. llvm-svn: 252006	2015-11-03 22:43:11 +00:00
Greg Clayton	fc1e77a986	Fix a deadlock when connecting to a remote GDB server that might not support all packets that lldb-server or debugserver supports. The issue was the m_last_stop_packet_mutex mutex was being held by another thread and it was deadlocking getting the thread list. We now try to lock the m_last_stop_packet_mutex, and only continue if we successfully lock it. Else we fall back to qfThreadInfo/qsThreadInfo. <rdar://problem/22140023> llvm-svn: 252005	2015-11-03 22:42:04 +00:00
Derek Schuff	cd9488d521	Address nit llvm-svn: 252004	2015-11-03 22:40:45 +00:00
Derek Schuff	b44d4d350e	Align whitespace llvm-svn: 252003	2015-11-03 22:40:43 +00:00
Derek Schuff	6b5c6da760	[WebAssembly] Support wasm select operator Summary: Add support for wasm's select operator, and lower LLVM's select DAG node to it. Reviewers: sunfish Subscribers: dschuff, llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D14295 llvm-svn: 252002	2015-11-03 22:40:40 +00:00
Greg Clayton	0644900004	With the new modules debugging, we have seen cases where clang is not emitting full definitions for types that are member variables of classes. If we try to make a class with a member where the type of the class in a forward declaration, clang will assert and crash and bring down the IDE. This is not acceptable so we need to work around it. We work around it by making sure that if we have a member that is an instance (not a pointer or reference) of a class/struct/union, that it is a complete type. If it isn't then we emit an error to let the user know to file a bug against the compiler, and then we make the class complete, but empty. We also do this for base classes elsewhere. We use the DWARF to help layout the type, so we will get all instance variables correct, but we just won't have visibility into this instance variable. llvm-svn: 252001	2015-11-03 22:40:07 +00:00
Matt Arsenault	e8ed13d946	AMDGPU: s[102:103] is unavailable on VI llvm-svn: 252000	2015-11-03 22:39:52 +00:00
Matt Arsenault	192b282bf3	AMDGPU: Define correct number of SGPRs There are actually 104 so 2 were missing. More assembler tests with high register number tuples will be included in later patches. llvm-svn: 251999	2015-11-03 22:39:50 +00:00
Michael J. Spencer	d77f0d2526	[elf2] Implement R_X86_64_TPOFF32. This does not support TPOFF32 relocations to local symbols as the address calculations are separate. Support for this will be a separate patch. llvm-svn: 251998	2015-11-03 22:39:09 +00:00
Tamas Berghammer	3bd1def3ec	Revert change comitted in accidentally as r251992 llvm-svn: 251997	2015-11-03 22:31:50 +00:00
Matt Arsenault	6c0674112a	AMDGPU: Make findUsedSGPR more readable Add more comments etc. llvm-svn: 251996	2015-11-03 22:30:15 +00:00
Matt Arsenault	782c03bb7e	AMDGPU: Initialize SIFixSGPRCopies so -print-after works llvm-svn: 251995	2015-11-03 22:30:13 +00:00
Matt Arsenault	d9d659aa23	AMDGPU: Alphabetize includes llvm-svn: 251994	2015-11-03 22:30:08 +00:00
Tamas Berghammer	5c28ae12ce	Use std::list::splice in TaskPool to avoid an allocation Using std::list::splice to move an element from one list to an other avoids the allocation of a new element and a move of the data. llvm-svn: 251993	2015-11-03 22:29:20 +00:00
Tamas Berghammer	276743f060	wip llvm-svn: 251992	2015-11-03 22:29:16 +00:00
Fiona Glaser	a8b653a372	InstCombine: fix sinking of convergent calls llvm-svn: 251991	2015-11-03 22:23:39 +00:00
Simon Pilgrim	191ac7c679	[SelectionDAG] Use existing constant nodes instead of recreating them. NFC. llvm-svn: 251990	2015-11-03 22:21:38 +00:00
Alexey Samsonov	d6aa820262	[LLVMSymbolize] Factor out the logic for printing structs from DIContext. NFC. Introduce DIPrinter which takes care of rendering DILineInfo and friends. This allows LLVMSymbolizer class to return a structured data instead of plain std::strings. llvm-svn: 251989	2015-11-03 22:20:52 +00:00
Rafael Espindola	61362ce876	Handle 0 sized sections like any other section. This is a case where there is inconsistency among ELF linkers: * The spec says nothing special about empty sections. * BFD ld removes them. * Gold handles them like regular sections. We were outputting them but sometimes ignoring them. This would create odd looking outputs where a rw section could be in a ro segment for example. The bfd way of doing things is also strange for the case where a symbol points to the empty section. Now we match gold and what seems to be the intention of the spec. llvm-svn: 251988	2015-11-03 22:08:08 +00:00
Rafael Espindola	d7a267b84b	Remove redundant = nullptr. llvm-svn: 251987	2015-11-03 22:01:20 +00:00
Simon Pilgrim	b0d860a394	[X86][AVX] Tweaked shuffle stack folding tests To avoid alternative lowerings. llvm-svn: 251986	2015-11-03 21:58:35 +00:00
Adam Nemet	a2df750fb3	[LAA] LLE 3/6: Rename InterestingDependence to Dependences, NFC Summary: We now collect all types of dependences including lexically forward deps not just "interesting" ones. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13256 llvm-svn: 251985	2015-11-03 21:39:52 +00:00
Simon Pilgrim	df993479c9	[X86][AVX512] Fixed shuffle test name to match shuffle llvm-svn: 251984	2015-11-03 21:39:30 +00:00
Zachary Turner	46f1784bac	Python 3 - Fix checking of string types in unittest2 module. This patch actually introduces a dependency from unittest2 to six. This should be ok since both packages are in our own repo, and we assume a sys.path of the top-level script that can find the third party packages. So unittest2 should be able to find six. llvm-svn: 251983	2015-11-03 21:37:42 +00:00
Zachary Turner	bac6e4f75b	Introduce seven.cmp_ and use it instead of cmp llvm-svn: 251982	2015-11-03 21:37:27 +00:00
Alexey Samsonov	6881249895	[LLVMSymbolize] Move demangling away from printing routines. NFC. Make printDILineInfo and friends responsible for just rendering the contents of the structures, demangling should actually be performed earlier, when we have the information about the originating SymbolizableModule at hand. llvm-svn: 251981	2015-11-03 21:36:13 +00:00
Rafael Espindola	443f50a833	Create .bss only when needed. This is a small complication, but produces nicer output and is a step to handling zero size sections uniformly. llvm-svn: 251980	2015-11-03 21:35:14 +00:00
Ramkumar Ramachandra	d5fa22620b	Squelch a silly warning regarding an extra 'default' in 'case' Let the editor also clean up whitespace for that file. Reviewers: clayborg Subscribers: lldb-commits Differential Revision: http://reviews.llvm.org/D13816 llvm-svn: 251979	2015-11-03 21:29:52 +00:00
Zachary Turner	234860133b	Python 3 - Fix some issues in unittest2. unittest2 was using print statements in a few places, and also using the `cmp` function (which is removed in Python 3). Again, we need to stop using unittest2 and using unittest instead, but this seems like an easier route for now. llvm-svn: 251978	2015-11-03 21:02:00 +00:00
Zachary Turner	9618980a19	Python 3: Modernize exception raising syntax. Old-style: `raise foo, bar` New-style: `raise foo(bar)` These two statements are equivalent, but the former is an error in Python 3. llvm-svn: 251977	2015-11-03 21:01:45 +00:00
Davide Italiano	c8a7913f23	[SimplifyLibCalls] Add a new transformation: pow(exp(x), y) -> exp(x*y) This one is enabled only under -ffast-math (due to rounding/overflows) but allows us to emit shorter code. Before (on FreeBSD x86-64): 4007f0: 50 push %rax 4007f1: f2 0f 11 0c 24 movsd %xmm1,(%rsp) 4007f6: e8 75 fd ff ff callq 400570 <exp2@plt> 4007fb: f2 0f 10 0c 24 movsd (%rsp),%xmm1 400800: 58 pop %rax 400801: e9 7a fd ff ff jmpq 400580 <pow@plt> 400806: 66 2e 0f 1f 84 00 00 nopw %cs:0x0(%rax,%rax,1) 40080d: 00 00 00 After: 4007b0: f2 0f 59 c1 mulsd %xmm1,%xmm0 4007b4: e9 87 fd ff ff jmpq 400540 <exp2@plt> 4007b9: 0f 1f 80 00 00 00 00 nopl 0x0(%rax) Differential Revision: http://reviews.llvm.org/D14045 llvm-svn: 251976	2015-11-03 20:32:23 +00:00

... 4 5 6 7 8 ...

214867 Commits All Branches Search

214867 Commits

All Branches