llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	6beaa8adb8	Remove unused argument from AddFeature. llvm-svn: 208002	2014-05-05 21:40:44 +00:00
Rafael Espindola	9c8c96f08a	Use a range loop. llvm-svn: 207996	2014-05-05 20:06:41 +00:00
Filipe Cabecinhas	fe59062b75	Revert "Optimize shufflevector that copies an i64/f64 and zeros the rest." This reverts commit 207992. I misread the phab number on the LGTM. llvm-svn: 207993	2014-05-05 19:40:36 +00:00
Filipe Cabecinhas	263d98c19f	Optimize shufflevector that copies an i64/f64 and zeros the rest. Summary: Also ran clang-format on the function. The code added is the last else if block. Reviewers: nadav, craig.topper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3518 llvm-svn: 207992	2014-05-05 19:36:28 +00:00
Marek Olsak	82d3b11e85	R600/SI: allow 5 more input SGPRs to a shader Our OpenGL driver needs 22 SGPRs (16 user SGPRs + 6 streamout non-user SGPRs). Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 207990	2014-05-05 19:30:54 +00:00
Yi Jiang	a4821fc9fb	Always set alignment of vectorized LD/ST in SLP-Vectorizer. <rdar://problem/16812145> llvm-svn: 207983	2014-05-05 17:59:14 +00:00
Duncan P. N. Exon Smith	1789fb6493	LTO: -internalize sets visibility to default Visibility is meaningless when the linkage is local. Change `-internalize` to reset the visibility to `default`. <rdar://problem/16141113> llvm-svn: 207979	2014-05-05 17:40:44 +00:00
Kaelyn Takata	a39d2a0050	Select bdver2 instead of bdver1 if TBM support is present on models < 0x10. Tested that the right -target-cpu is set in the clang -cc1 command line when running "clang -march=native -E -v - </dev/null" on both an FX-8150 and an FX-8350. Both are family 15h; the FX-8150 (Bulldozer processor) reports a model number of 1, and the FX-8350 (Piledriver processor) reports a model number of 2. llvm-svn: 207973	2014-05-05 16:32:10 +00:00
Timur Iskhodzhanov	9dbc206303	[ASan/Win] Fix issue 305 -- don't instrument .CRT initializer/terminator callbacks See https://code.google.com/p/address-sanitizer/issues/detail?id=305 Reviewed at http://reviews.llvm.org/D3607 llvm-svn: 207968	2014-05-05 14:28:38 +00:00
Rafael Espindola	9475117f5d	Trivial simplification. No functionality change. llvm-svn: 207967	2014-05-05 14:18:16 +00:00
Saleem Abdulrasool	e8a7afef86	CodeGen: correct memset emittance for WoA Windows on ARM does not conform to AEABI. However, memset would be emitted using the AEABI signature, resulting in inverted parameters. Handle this special case appropriately. llvm-svn: 207943	2014-05-04 23:13:21 +00:00
Saleem Abdulrasool	729c7a08fb	MC: support FK_SecRel_4 for Windows on ARM Add handling for FK_SecRel_4 (4-byte section relative relocations). These are used by the generation of DWARF debug information (the abbreviations use section relative relocations). This will also be used in generation of CodeView line tables. llvm-svn: 207941	2014-05-04 23:13:15 +00:00
Benjamin Kramer	9130cb8547	LoopUnroll: If we're doing partial unrolling, use the PartialThreshold to limit unrolling. Otherwise we use the same threshold as for complete unrolling, which is way too high. This made us unroll any loop smaller than 150 instructions by 8 times, but only if someone specified -march=core2 or better, which happens to be the default on darwin. llvm-svn: 207940	2014-05-04 19:12:38 +00:00
Arnold Schwaighofer	cd566c423a	SLPVectorizer: Bring back the insertelement patch (r205965) with fixes We can't assume a vectorized tree is rooted in an instruction. The IRBuilder could have constant folded it. When we rebuild the build_vector (the series of InsertElement instructions) use the last original InsertElement instruction. The vectorized tree root is guaranteed to be before it. Also, we can't assume that the n-th InsertElement inserts the n-th element into a vector. This reverts r207746 which reverted the revert of the revert of r205018 or so. Fixes the test case in PR19621. llvm-svn: 207939	2014-05-04 17:10:15 +00:00
Elena Demikhovsky	e73333a50f	AVX-512: minor change in rndscale intrinsic llvm-svn: 207937	2014-05-04 13:35:37 +00:00
Chandler Carruth	312dddfb81	[LCG] Add the last (and most complex) of the edge insertion mutation operations on the call graph. This one forms a cycle, and while not as complex as removing an internal edge from an SCC, it involves a reasonable amount of work to find all of the nodes newly connected in a cycle. Also somewhat alarming is the worst case complexity here: it might have to walk roughly the entire SCC inverse DAG to insert a single edge. This is carefully documented in the API (I hope). llvm-svn: 207935	2014-05-04 09:38:32 +00:00
Saleem Abdulrasool	3c82b499a0	X86: further range-loopify AsmPrinter Use more range loops in the X86AsmPrinter. NFC. llvm-svn: 207928	2014-05-04 01:54:17 +00:00
Saleem Abdulrasool	b942035bae	X86: remove X86COFFMachineModuleInfo Remove dead code. This is vestigial after r98384. llvm-svn: 207927	2014-05-04 01:54:12 +00:00
Saleem Abdulrasool	82b69fa105	X86: repair export compatibility with MinGW/cygwin Both MinGW and cygwin (i686) construct export directives without the global leader prefix. This is mostly due to the fact that they use GNU ld which does not correctly handle the export directive. This apparently has been broken for a while. However, this was recently reported as being broken by mingwandroid and diorcety of the msys2 project. Remove the global leader prefix if targeting MinGW or cygwin, otherwise, retain the global leader prefix. Add an explicit test for cygwin's behaviour of export directives. llvm-svn: 207926	2014-05-04 00:03:48 +00:00
Saleem Abdulrasool	75e68cbd12	X86: refactor export directive generation Create a helper function to generate the export directive. This was previously duplicated inline to handle export directives for variables and functions. This also enables the use of range-based iterators for the generation of the directive rather than the traditional loops. NFC. llvm-svn: 207925	2014-05-04 00:03:41 +00:00
David Majnemer	cf63a79818	IR: Cleanup AttributeSet::get for AttrBuilder We don't modify the AttrBuilder in AttributeSet::get, make the reference argument const. llvm-svn: 207924	2014-05-03 23:00:35 +00:00
Juergen Ributzka	d35c114d15	[TBAA] Fix handling of mixed TBAA (path-aware and non-path-aware TBAA). This fix simply ensures that both metadata nodes are path-aware before performing path-aware alias analysis. This issue isn't normally triggered in LLVM, because we perform an autoupgrade of the TBAA metadata to the new format when reading in LL or BC files. This issue only appears when a client creates the IR manually and mixes old and new TBAA metadata format. This fixes <rdar://problem/16760860>. llvm-svn: 207923	2014-05-03 22:32:52 +00:00
Rafael Espindola	3d082fa507	Fix pr19645. The fix itself is fairly simple: move getAccessVariant to MCValue so that we replace the old weak expression evaluation with the far more general EvaluateAsRelocatable. This then requires that EvaluateAsRelocatable stop when it finds a non trivial reference kind. And that in turn requires the ELF writer to look harder for weak references. Last but not least, this found a case where we were being bug by bug compatible with gas and accepting an invalid input. I reported pr19647 to track it. llvm-svn: 207920	2014-05-03 19:57:04 +00:00
Joey Gouly	b0afd1b929	[ARM64] Correctly select ANDWri in FastISel. http://reviews.llvm.org/D3598 llvm-svn: 207917	2014-05-03 17:27:06 +00:00
Benjamin Kramer	64425fe875	SLPVectorizer: Lazily allocate the map for block numbering. There is no point in creating it if we're not going to vectorize anything. Creating the map is expensive as it creates large values. No functionality change. llvm-svn: 207916	2014-05-03 15:50:37 +00:00
Rafael Espindola	80df4bb10f	Rename member variable to try to fix the bots. llvm-svn: 207915	2014-05-03 15:28:13 +00:00
Simon Atanasyan	1e3edf98cb	[ELFYAML] Group ELF header flags to target specific blocks. Handle flags which are corresponding to the current target read from the ELF file. This fix cannot be tested until obj2yaml supports the ELF format. llvm-svn: 207905	2014-05-03 11:39:50 +00:00
Simon Atanasyan	9a922c4ffd	[ELFYAML] Add more SHT_xxx flags to the YAML section type mapping. llvm-svn: 207904	2014-05-03 11:39:44 +00:00
Karthik Bhat	ddd0cb5ecf	Vectorize intrinsic math function calls in SLPVectorizer. This patch adds support to recognize and vectorize intrinsic math functions in SLPVectorizer. Review: http://reviews.llvm.org/D3560 and http://reviews.llvm.org/D3559 llvm-svn: 207901	2014-05-03 09:59:54 +00:00
David Blaikie	658a20b04d	Try simplifying LexicalScopes ownership again. Committed initially in r207724-r207726 and reverted due to compiler-rt crashes in r207732. Instead, fix this harder with unordered_map and store the LexicalScopes by value in the map. This did necessitate moving the definition of LexicalScope above the definition of LexicalScopes. Let's see how the buildbots/compilers tolerate unordered_map::emplace + std::piecewise_construct + std::forward_as_tuple... llvm-svn: 207876	2014-05-02 22:21:05 +00:00
Benjamin Kramer	6dd9f8feb3	Satisfy GCC's urgent need for parentheses around ‘&&’ within ‘||’. llvm-svn: 207871	2014-05-02 21:28:49 +00:00
Rafael Espindola	bf8bf54bfc	Aliases are always definitions. Delete dead code. llvm-svn: 207869	2014-05-02 21:10:48 +00:00
Eric Christopher	6c26beb770	Clean up constructor logic and member access for LoopVectorizeHints. There are public functions that mutate various members as well as another private member already, so make all the members private to avoid the discontinuity and add accessors for the values. Should be no functional change. llvm-svn: 207868	2014-05-02 20:40:04 +00:00
Justin Bogner	c475e1bc77	llvm-cov: Fix handling of line zero appearing in a line table Reading line tables in llvm-cov was pretty broken, but would happen to work as long as no line in the table was 0. It's not clear to me whether a line of zero should show up in these tables, but deciding to read a string in the middle of the line table is certainly the wrong thing to do if it does. I've also added some comments, as trying to figure out what this block of code was doing was fairly unpleasant. llvm-svn: 207866	2014-05-02 20:01:24 +00:00
Nico Weber	4b2acde21a	Teach GlobalDCE how to remove empty global_ctor entries. This moves most of GlobalOpt's constructor optimization code out of GlobalOpt into Transforms/Utils/CDtorUtils.{h,cpp}. The public interface is a single function OptimizeGlobalCtorsList() that takes a predicate returning which constructors to remove. GlobalOpt calls this with a function that statically evaluates all constructors, just like it did before. This part of the change is behavior-preserving. Also add a call to this from GlobalDCE with a filter that removes global constructors that contain a "ret" instruction and nothing else – this fixes PR19590. llvm-svn: 207856	2014-05-02 18:35:25 +00:00
Akira Hatanaka	f76388dd7e	[GVN] Pass the phi-translated address of a load instead of the untranslated address to AnalyzeLoadFromClobberingLoad. This fixes a bug in load-PRE where PRE is applied to a load that is not partially redundant. <rdar://problem/16638765>. llvm-svn: 207853	2014-05-02 17:59:17 +00:00
Saleem Abdulrasool	734bca04ff	MC: place .file records into the correct section .file records are supposed to have a section identifier of 65534 (IMAGE_SCN_DEBUG) rather than 0. This is spelt out clearly within the PE/COFF specification. Fix this minor oversight with the implementation for support for .file records. llvm-svn: 207851	2014-05-02 17:45:24 +00:00
Tim Northover	820e041a3c	DAGCombine: prevent formation of illegal ConstantFP nodes. llvm-svn: 207850	2014-05-02 17:25:02 +00:00
Benjamin Kramer	6004573ecf	Add a description for AMD's bdver4 (aka Excavator). This is just bdver3 + AVX2 + BMI2. llvm-svn: 207847	2014-05-02 15:47:07 +00:00
Tom Stellard	10b1502733	R600/SI: Add processor type for Mullins. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Samuel Li <samuel.li@amd.com> Signed-off-by: Alex Deucher <alexander.deucher@amd.com> llvm-svn: 207846	2014-05-02 15:41:49 +00:00
Tom Stellard	3dbf1f8df0	R600: Expand vector sin and cos. v2: move code to AMDGPUISelLowering.cpp squash with tests (both EG and SI) Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207845	2014-05-02 15:41:47 +00:00
Tom Stellard	605e116e8e	R600: Expand TruncStore i64 -> {i16,i8} llvm-svn: 207844	2014-05-02 15:41:46 +00:00
Tom Stellard	eba61071d7	R600/SI: Only create one instruction when spilling/restoring register v3 The register spiller assumes that only one new instruction is created when spilling and restoring registers, so we need to emit pseudo instructions for vector register spills and lower them after register allocation. v2: - Fix calculation of lane index - Extend VGPR liveness to end of program. v3: - Use SIMM16 field of S_NOP to specify multiple NOPs. https://bugs.freedesktop.org/show_bug.cgi?id=75005 llvm-svn: 207843	2014-05-02 15:41:42 +00:00
Tim Northover	d7360900a8	AArch64/ARM64: add patterns for post-indexed ST1 ops. llvm-svn: 207840	2014-05-02 14:54:27 +00:00
Tim Northover	523b5a43fb	ARM64: refactor NEON post-indexed loads & stores (MC). Previously, LLVM had no knowledge that these instructions actually modified their address register: fine if they never end up in CodeGen, but when I'd rather like to write some patterns for them it becomes a disaster. The change is mostly straightforward, I think the most significant design decision was to always put the address write-back first. This allows loads and stores to be accessed more uniformly, for example permitting the continued sharing of the InstAlias definitions. I also discovered that the custom Decode logic is no longer needed, so I removed it. No tests, because there should be no functionality change. llvm-svn: 207839	2014-05-02 14:54:21 +00:00
Tim Northover	d0b07e133b	AArch64/ARM64: support indexed loads/stores on vector types. While post-indexed LD1/ST1 instructions do exist for vector loads, this patch makes use of the more flexible addressing-modes in LDR/STR instructions. llvm-svn: 207838	2014-05-02 14:54:15 +00:00
Benjamin Kramer	42d262f410	Allow SelectionDAG::FoldConstantArithmetic to work when it's called with a vector VT but scalar values. llvm-svn: 207835	2014-05-02 12:35:22 +00:00
Nick Lewycky	718ada97bc	Fold strlen(expr ? "str1" : "str2") to x ? len1 : len2. This fires about 330 times in a bootstrap of clang. llvm-svn: 207828	2014-05-02 04:11:45 +00:00
Juergen Ributzka	37fc0a8ae8	[Stackmaps] Pacify windows buildbot. llvm-svn: 207807	2014-05-01 22:39:26 +00:00
Juergen Ributzka	673a762b80	[Stackmaps] Add command line option to specify the stackmap version. llvm-svn: 207805	2014-05-01 22:21:30 +00:00


69256 Commits