llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Kuperstein	047b1a0400	[DAGCombine] Slightly improve lowering of BUILD_VECTOR into a shuffle. This handles the case of a BUILD_VECTOR being constructed out of elements extracted from a vector twice the size of the result vector. Previously this was always scalarized. Now, we try to construct a shuffle node that feeds on extract_subvectors. This fixes PR15872 and provides a partial fix for PR21711. Differential Revision: http://reviews.llvm.org/D6678 llvm-svn: 224429	2014-12-17 12:32:17 +00:00
Toma Tabacu	9941195a9f	[mips] Always clobber $1 for MIPS inline asm. Summary: Because GCC doesn't use $1 for code generation, inline assembly code can use $1 without having to add it to the clobbers list. LLVM, on the other hand, does not shy away from using $1, and this can cause conflicts with inline assembly which assumes GCC-like code generation. A solution to this problem is to make Clang automatically clobber $1 for all MIPS inline assembly. This is not the optimal solution, but it seems like a necessary compromise, for now. Reviewers: dsanders Reviewed By: dsanders Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D6638 llvm-svn: 224428	2014-12-17 12:02:58 +00:00
Vladimir Medic	636fefe252	MipsABIInfo class is used in different libraries. Moving the files to MCTargetDesc folder(LLVMMipsDesc library) prevents linkage errors. There are no functional changes. llvm-svn: 224427	2014-12-17 11:49:56 +00:00
Yaron Keren	af92a37c23	Teach compile_commands.json test that windows-gnu is the new name for mingw32. llvm-svn: 224426	2014-12-17 11:04:07 +00:00
Toma Tabacu	a23f13c3b0	[mips] Set GCC-compatible MIPS assembler options before inline asm blocks. Summary: When generating MIPS assembly, LLVM always overrides the default assembler options by emitting the '.set noreorder', '.set nomacro' and '.set noat' directives, while GCC uses the default options if an assembly-level function contains inline assembly code. This becomes a problem when the code generated by LLVM is interleaved with inline assembly which assumes GCC-like assembler options (from Linux, for example). This patch fixes these conflicts by setting the appropriate assembler options at the beginning of an inline asm block and popping them at the end. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6637 llvm-svn: 224425	2014-12-17 10:56:16 +00:00
Suyog Sarda	43fae93da8	Revert 224119 "This patch recognizes (+ (+ v0, v1) (+ v2, v3)), reorders them for bundling into vector of loads, and vectorizes it." This was re-ordering floating point data types resulting in mismatch in output. llvm-svn: 224424	2014-12-17 10:34:27 +00:00
Evgeniy Stepanov	372deb091e	[msan] Stop calling pthread_getspecific in signal handlers. pthread_getspecific is not async-signal-safe. MsanThread pointer is now stored in a TLS variable, and the TSD slot is used only for its destructor, and never from a signal handler. This should fix intermittent CHECK failures in MsanTSDSet. llvm-svn: 224423	2014-12-17 10:30:06 +00:00
Dmitry Vyukov	508dd9b94c	tsan: add disabled test case for issue 87 llvm-svn: 224422	2014-12-17 10:19:20 +00:00
Yaron Keren	f630971635	Teach lit.cfg to recognize -windows-gnu in addition to -mingw32. llvm-svn: 224421	2014-12-17 09:55:15 +00:00
Peter Collingbourne	1f89ffdf4d	irgen: fix canAvoid* Patch by Andrew Wilkins! canAvoidElementLoad and canAvoidLoad were incorrectly eliding loads when an index expression is used as another array index expression. This led to a panic. See comments on https://github.com/go-llvm/llgo/issues/175 Test Plan: lit test added Differential Revision: http://reviews.llvm.org/D6676 llvm-svn: 224420	2014-12-17 09:45:05 +00:00
Daniel Jasper	0580ff0ec6	clang-format: Fix incorrect calculation of token lengths. This led, e.g., to breaking JavaScript regex literals too early. llvm-svn: 224419	2014-12-17 09:11:08 +00:00
Elena Demikhovsky	028e966a54	Added 5 more tests related to sink store revision 224247 - by Ella Bolshinsky http://reviews.llvm.org/D6420 llvm-svn: 224418	2014-12-17 08:12:59 +00:00
Erik Eckstein	a451b9b0b5	Strength reduce intrinsics with overflow into regular arithmetic operations if possible. Some intrinsics, like s/uadd.with.overflow and umul.with.overflow, are already strength reduced. This change adds other arithmetic intrinsics: s/usub.with.overflow, smul.with.overflow. It completes the work on PR20194. llvm-svn: 224417	2014-12-17 07:29:19 +00:00
Duncan P. N. Exon Smith	92731d26bc	Revert "Linker: Drop superseded subprograms" This reverts commit r224389. Based on feedback from the bots, the assertion seems to be going off more often, not less (previously I was just seeing it in an internal bootstrap, now it's happening in public builds too). http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_build/936/ http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-bootstrap/builds/5325 Reverting in order to investigate. llvm-svn: 224416	2014-12-17 07:27:31 +00:00
Justin Hibbits	0c0d5deff1	Add parsing of 'foo@local'. Summary: Currently, it supports generating, but not parsing, this expression. Test added as well. Test Plan: New test added, no regressions due to this. Reviewers: hfinkel Reviewed By: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6672 llvm-svn: 224415	2014-12-17 06:23:35 +00:00
Rafael Espindola	5f06030989	Remove a debugging assert. Sorry for the noise, I have no idea how it survived to the final version. llvm-svn: 224414	2014-12-17 03:38:04 +00:00
Rafael Espindola	839353bca0	Remove unused includes and out of date comment. NFC. llvm-svn: 224413	2014-12-17 03:07:20 +00:00
Rafael Espindola	81adfb5c2e	Fix the windows build. llvm-svn: 224412	2014-12-17 02:42:20 +00:00
David Majnemer	4d2de1b03f	Sema: Don't dyn_cast a null pointer in CheckUsingDeclQualifier This code was written with the intent that a pointer could be null but we dyn_cast'd it anyway. Change the dyn_cast to a dyn_cast_or_null. This fixes PR21933. llvm-svn: 224411	2014-12-17 02:41:36 +00:00
Rafael Espindola	97935a9123	Refactor and simplify the code reading /proc/cpuinfo. NFC. llvm-svn: 224410	2014-12-17 02:32:44 +00:00
Matthias Braun	f4a72cd06e	RegisterCoalescer: Sprinkle some const modifiers. llvm-svn: 224409	2014-12-17 02:18:13 +00:00
Duncan P. N. Exon Smith	f9abf4fb0c	llvm-lto: Add testing coverage for local contexts Add coverage in `llvm-lto` for the API exposed by libLTO to create modules in local contexts. The goal here isn't to test the symbol-related API extensively, just to confirm that these modules work at all. (I'll be shifting code around soon that should be NFC and I realized there was no test coverage.) llvm-svn: 224408	2014-12-17 02:00:38 +00:00
Nick Lewycky	52ee5e446b	Delete debugging cruft that crept in with r223802. llvm-svn: 224407	2014-12-17 01:56:51 +00:00
Alexey Samsonov	b2dcac0bb7	[ASan] Re-structure the allocator code. NFC. Introduce "Allocator" object, which contains all the bits and pieces ASan allocation machinery actually uses: allocator from sanitizer_common, quarantine, fallback allocator and quarantine caches, fallback mutex. This step is a preparation to adding more state to this object. We want to reduce dependency of Allocator on commandline flags and be able to "safely" modify its behavior (such as the size of the redzone) at runtime. llvm-svn: 224406	2014-12-17 01:55:03 +00:00
David Majnemer	65c52ae8ca	InstSimplify: shl nsw/nuw undef, %V -> undef We can always choose a value for undef which might cause %V to shift out an important bit except for one case, when %V is zero. However, shl behaves like an identity function when the right hand side is zero. llvm-svn: 224405	2014-12-17 01:54:33 +00:00
Nick Lewycky	ee0a3a7a2f	Make ValueEnumerator::print use OS for metadata too. Noticed by inspection. llvm-svn: 224404	2014-12-17 01:52:08 +00:00
David Majnemer	6ca445e0dd	Parse: Consume tokens more carefully in CheckForLParenAfterColonColon We would consume the lparen even if it wasn't followed by an identifier or a star-identifier pair. This fixes PR21815. llvm-svn: 224403	2014-12-17 01:39:22 +00:00
Quentin Colombet	fc2201e922	[CodeGenPrepare] Reapply r224351 with a fix for the assertion failure: The type promotion helper does not support vector type, so when make such it does not kick in in such cases. Original commit message: [CodeGenPrepare] Move sign/zero extensions near loads using type promotion. This patch extends the optimization in CodeGenPrepare that moves a sign/zero extension near a load when the target can combine them. The optimization may promote any operations between the extension and the load to make that possible. Although this optimization may be beneficial for all targets, in particular AArch64, this is enabled for X86 only as I have not benchmarked it for other targets yet. Context Most targets feature extended loads, i.e., loads that perform a zero or sign extension for free. In that context it is interesting to expose such pattern in CodeGenPrepare so that the instruction selection pass can form such loads. Sometimes, this pattern is blocked because of instructions between the load and the extension. When those instructions are promotable to the extended type, we can expose this pattern. Motivating Example Let us consider an example: define void @foo(i8* %addr1, i32* %addr2, i8 %a, i32 %b) { %ld = load i8* %addr1 %zextld = zext i8 %ld to i32 %ld2 = load i32* %addr2 %add = add nsw i32 %ld2, %zextld %sextadd = sext i32 %add to i64 %zexta = zext i8 %a to i32 %addza = add nsw i32 %zexta, %zextld %sextaddza = sext i32 %addza to i64 %addb = add nsw i32 %b, %zextld %sextaddb = sext i32 %addb to i64 call void @dummy(i64 %sextadd, i64 %sextaddza, i64 %sextaddb) ret void } As it is, this IR generates the following assembly on x86_64: [...] 
movzbl (%rdi), %eax # zero-extended load movl (%rsi), %esi # plain load addl %eax, %esi # 32-bit add movslq %esi, %rdi # sign extend the result of add movzbl %dl, %edx # zero extend the first argument addl %eax, %edx # 32-bit add movslq %edx, %rsi # sign extend the result of add addl %eax, %ecx # 32-bit add movslq %ecx, %rdx # sign extend the result of add [...] The throughput of this sequence is 7.45 cycles on Ivy Bridge according to IACA. Now, by promoting the additions to form more extended loads we would generate: [...] movzbl (%rdi), %eax # zero-extended load movslq (%rsi), %rdi # sign-extended load addq %rax, %rdi # 64-bit add movzbl %dl, %esi # zero extend the first argument addq %rax, %rsi # 64-bit add movslq %ecx, %rdx # sign extend the second argument addq %rax, %rdx # 64-bit add [...] The throughput of this sequence is 6.15 cycles on Ivy Bridge according to IACA. This kind of sequence happens a lot in code using 32-bit indexes on 64-bit architectures. Note: The throughput numbers are similar on Sandy Bridge and Haswell. Proposed Solution To avoid the penalty of all these sign/zero extensions, we merge them in the loads at the beginning of the chain of computation by promoting all the chain of computation on the extended type. The promotion is done if and only if we do not introduce new extensions, i.e., if we do not degrade the code quality. To achieve this, we extend the existing “move ext to load” optimization with the promotion mechanism introduced to match larger patterns for addressing mode (r200947). The idea of this extension is to perform the following transformation: ext(promotableInst1(...(promotableInstN(load)))) => promotedInst1(...(promotedInstN(ext(load)))) The promotion mechanism in that optimization is enabled by a new TargetLowering switch, which is off by default. In other words, by default, the optimization performs the “move ext to load” optimization as it was before this patch. 
Performance Configuration: x86_64: Ivy Bridge fixed at 2900MHz running OS X 10.10. Tested Optimization Levels: O3/Os Tests: llvm-testsuite + externals. Results: - No regression beside noise. - Improvements: CINT2006/473.astar: ~2% Benchmarks/PAQ8p: ~2% Misc/perlin: ~3% The results are consistent for both O3 and Os. <rdar://problem/18310086> llvm-svn: 224402	2014-12-17 01:36:17 +00:00
Richard Smith	b9be608f2d	Add missing testcase from r224388. llvm-svn: 224401	2014-12-17 01:08:39 +00:00
Kevin Enderby	57538299e8	Add printing the LC_ENCRYPTION_INFO_64 load command with llvm-objdump’s -private-headers and add tests for the two AArch64 binaries. llvm-svn: 224400	2014-12-17 01:01:30 +00:00
David Blaikie	8b979f01c6	PR21875: codegen for non-type template parameters of nullptr_t type llvm-svn: 224399	2014-12-17 00:43:22 +00:00
Anna Zaks	87d404d458	[CallGraph] Make sure the edges are not missed due to re-declarations A patch by Daniel DeFreez! We were previously dropping edges on re-declarations. Store the canonical declarations in the graph to ensure that different references to the same function end up reflected with the same call graph node. (Note, this might lead to performance fluctuation because call graph is used to determine the function analysis order.) llvm-svn: 224398	2014-12-17 00:34:07 +00:00
Reid Kleckner	04b69f89aa	Revert "[CodeGenPrepare] Move sign/zero extensions near loads using type promotion." This reverts commit r224351. It causes assertion failures when building ICU. llvm-svn: 224397	2014-12-17 00:29:23 +00:00
Alexey Samsonov	2c31cc3cf1	Rename asan_allocator2.cc to asan_allocator.cc llvm-svn: 224396	2014-12-17 00:26:50 +00:00
Alexey Samsonov	91bb25f515	[ASan] Introduce SetCanPoisonMemory() function. SetCanPoisonMemory()/CanPoisonMemory() functions are now used instead of "poison_heap" flag to determine if ASan is allowed to poison the shadow memory. This allows hot-patching this value at runtime (e.g. during ASan activation) without introducing a data race. llvm-svn: 224395	2014-12-17 00:01:02 +00:00
David Blaikie	0317bc9e55	PR21909: Don't try (and crash) to generate debug info for explicit instantiations of explicit specializations. llvm-svn: 224394	2014-12-16 23:49:18 +00:00
Hans Wennborg	224cb82a39	SelectionDAG switch lowering: use 'unsigned' to count destination popularity SwitchInst::getNumCases() returns unsigned, so using uint64_t to count cases seems unnecessary. Also fix a missing CHECK in the test case. llvm-svn: 224393	2014-12-16 23:41:59 +00:00
Jim Ingham	5e09c8c32c	Add the ability to tag one or more breakpoints with a name. These names can then be used in place of breakpoint id's or breakpoint id ranges in all the commands that operate on breakpoints. <rdar://problem/10103959> llvm-svn: 224392	2014-12-16 23:40:14 +00:00
Colin LeMahieu	aa1bade7b4	[Hexagon] Updating doubleword shift usages to new versions. llvm-svn: 224391	2014-12-16 23:36:15 +00:00
Kevin Enderby	0804f467f2	Add printing the LC_ENCRYPTION_INFO load command with llvm-objdump’s -private-headers. llvm-svn: 224390	2014-12-16 23:25:52 +00:00
Duncan P. N. Exon Smith	8759026893	Linker: Drop superseded subprograms When a function gets replaced by `ModuleLinker`, drop superseded subprograms. This ensures that the "first" subprogram pointing at a function is the same one that `!dbg` references point at. This is a stop-gap fix for PR21910. Notably, this fixes Release+Asserts bootstraps that are currently asserting out in `LexicalScopes::initialize()` due to the explicit instantiations in `lib/IR/Dominators.cpp` eventually getting replaced by -argpromotion. llvm-svn: 224389	2014-12-16 23:23:41 +00:00
Richard Smith	d52186ff5a	DR1684: a constexpr member function need not be a member of a literal class type. llvm-svn: 224388	2014-12-16 23:12:52 +00:00
David Blaikie	5413abf88f	Fix test cases given Clang's improved location information. llvm-svn: 224387	2014-12-16 23:07:55 +00:00
Kaelyn Takata	938204aa02	Try typo correction on all initialization arguments and be less pessimistic about when to do so. This also fixes PR21905 as the initialization argument was no longer viewed as being type dependent due to the TypoExpr being type-cast. llvm-svn: 224386	2014-12-16 23:07:00 +00:00
David Blaikie	bf22a4eaee	DebugInfo: Generalize debug info location handling This is a more scalable (fixed in mostly one place, rather than many places that will need constant improvement/maintenance) solution to several commits I've made recently to increase source fidelity for subexpressions. This resetting had to be done at the DebugLoc level (not the SourceLocation level) to preserve scoping information (if the resetting was done with CGDebugInfo::EmitLocation, it would've caused the tail end of an expression's codegen to end up in a potentially different scope than the start, even though it was at the same source location). The drawback to this is that it might leave CGDebugInfo out of sync. Ideally CGDebugInfo shouldn't have a duplicate sense of the current SourceLocation, but for now it seems it does... - I don't think I'm going to tackle removing that just now. I expect this'll probably cause some more buildbot fallout & I'll investigate that as it comes up. Also these sort of improvements might be starting to show a weakness/bug in LLVM's line table handling: we don't correctly emit is_stmt for statements, we just put it on every line table entry. This means one statement split over multiple lines appears as multiple 'statements' and two statements on one line (without column info) are treated as one statement. I don't think we have any IR representation of statements that would help us distinguish these cases and identify the beginning of each statement - so that might be something we need to add (possibly to the lexical scope chain - a scope for each statement). This does cause some problems for GDB and possibly other DWARF consumers. llvm-svn: 224385	2014-12-16 22:49:17 +00:00
Sanjay Patel	494a625fee	fix typo, add spaces; NFC llvm-svn: 224384	2014-12-16 22:48:42 +00:00
Simon Pilgrim	bf1e079005	[X86][SSE] Vector double -> float conversion memory folding (cvtpd2ps) Added a missing memory folding relationship for the (V)CVTPD2PS instruction - we can safely fold these for stack reloads. Differential Revision: http://reviews.llvm.org/D6663 llvm-svn: 224383	2014-12-16 22:30:10 +00:00
Rafael Espindola	9573a9cf9d	Make the assert a bit stronger. We should get no declarations in here. llvm-svn: 224382	2014-12-16 22:29:43 +00:00
Colin LeMahieu	7fc90fc7e9	[Hexagon] Removing old XTYPE/BIT instructions and replacing usages. llvm-svn: 224381	2014-12-16 22:17:09 +00:00
Nick Lewycky	4d59b77883	Look at whether TransformTypos returned a different Expr instead of looking at the number of uncorrected typos before and after. Correcting one typo may produce an expression with another TypoExpr in it, leading to matching counts even though a typo was corrected. Fixes PR21925! llvm-svn: 224380	2014-12-16 22:02:06 +00:00
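
Each row in the listing above carries four tab-separated fields (Author, SHA1, Message, Date), and most messages end with an `llvm-svn:` revision. The following is a minimal sketch of how such rows could be read back into records. It assumes the layout shown here (a 10-character abbreviated SHA1, a trailing `YYYY-MM-DD HH:MM:SS +00:00` date, and messages that may spill over several physical lines). The helper name `parse_commit_rows` and the dictionary keys are hypothetical and not part of any LLVM or Git tooling.

```python
import re

# Assumption: a row starts with "<author>\t<10 hex chars>\t" ...
ROW_START = re.compile(r"^[^\t]+\t[0-9a-f]{10}\t")
# ... and ends with "\t<date> <time> <utc offset>".
ROW_END = re.compile(r"\t\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2} [+-]\d{2}:\d{2}$")
SVN_REV = re.compile(r"llvm-svn: (\d+)")


def parse_commit_rows(text):
    """Parse tab-separated commit rows into dicts (hypothetical helper)."""
    rows = []
    pending = []  # physical lines of the row currently being assembled
    for line in text.splitlines():
        if ROW_START.match(line):
            pending = [line]          # a new row begins here
        elif pending:
            pending.append(line)      # continuation of a multi-line message
        else:
            continue                  # header, blank line, or page residue
        if ROW_END.search(line.rstrip()):
            record = "\n".join(pending).rstrip()
            pending = []
            author, sha, rest = record.split("\t", 2)
            message, date = rest.rsplit("\t", 1)
            match = SVN_REV.search(message)
            rows.append({
                "author": author,
                "sha": sha,
                "message": message,
                "date": date,
                # None when the message carries no llvm-svn trailer
                "svn_rev": int(match.group(1)) if match else None,
            })
    return rows
```

Lines that never reach a date terminator are dropped rather than guessed at, which keeps the sketch from merging unrelated page text into a commit message.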

188866 Commits All Branches Search
