llvm-project

Commit Graph

Author	SHA1	Message	Date
Quentin Colombet	72f6d59841	[RAFast] Don't dereference MBB::end When RAFast sees liveins in on a basic block, it uses that information to initialize the availability of the registers. The called method uses an instruction as one of its argument and in the liveins case, RAFast was dereferencing MBB::begin which can be MBB::end for empty basic block. Change the API of definePhysReg to use MachineBasicBlock::iterator instead of MachineInstr so that we don't dereference an invalid iterator while making the call. rdar://problem/36952401 llvm-svn: 323710	2018-01-29 23:42:37 +00:00
Tom Stellard	3ae38d271e	AMDGPU: Move ADDRIndirect complex pattern into R600Instructions.td Summary: This is only used by R600. Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, mgorny, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D37114 llvm-svn: 323709	2018-01-29 23:29:26 +00:00
Craig Topper	571231a7fe	[X86] Use VMOVDQA64 for aligned vXi32 stores. I meant to do this with the unaligned stores in r322820, but looks like I missed it. llvm-svn: 323708	2018-01-29 23:27:23 +00:00
Davide Italiano	b0edeb90ab	[test-suite] UNXfail several tests that now pass locally. Another step towards enabling unexpected successes as failures by default. llvm-svn: 323707	2018-01-29 23:24:50 +00:00
Marek Olsak	48057b554c	AMDGPU: Allow a SGPR for the conditional KILL operand Patch by: Bas Nieuwenhuizen Just use the _e64 variant if needed. This should be possible as per def : Pat < (int_amdgcn_kill (i1 (setcc f32:$src, InlineFPImm<f32>:$imm, cond:$cond))), (SI_KILL_F32_COND_IMM_PSEUDO $src, (bitcast_fpimm_to_i32 $imm), (cond_as_i32imm $cond)) > ; I don't think we can get an immediate for the other operand for which we need the second 32-bit word. https://reviews.llvm.org/D42302 llvm-svn: 323706	2018-01-29 23:19:10 +00:00
Rafael Espindola	93955d0d34	Fix some regular expressions in llvm-mode.el. In some cases it was using "\" unnecessarily. In another case it needed an additional "\" to properly indicate a numbered sub-match. Make comment-start buffer-local in llvm-mode.el llvm-mode was setting comment-start globally. However, it is better to only set it locally in the current buffer. Don't use purecopy in llvm-mode.el There's no reason to use purecopy in llvm-mode.el. purecopy is only needed for files that are dumped in emacs. Add a version header to llvm-mode.el Adding a version header to llvm-mode.el allows it to be installed by the Emacs package manager. There are not many requirements on the version number; however it is useful to users to bump it when something significant changes. Here I've chosen just to start at 1.0. Patch by Tom Tromey! llvm-svn: 323705	2018-01-29 22:56:41 +00:00
Sanjay Patel	d023a9b777	[DSE] add test for PR36129; NFC We can miscompile because we're not checking if the memory might be modified between the seemingly redundant store ops. llvm-svn: 323704	2018-01-29 22:50:08 +00:00
Eric Liu	b57da098e9	[clangd] Fix r323658 test failure on windows. llvm-svn: 323703	2018-01-29 22:28:08 +00:00
Matthias Braun	bd4bc3f186	LiveInterval: Print weight in print() function. llvm-svn: 323702	2018-01-29 22:03:00 +00:00
Paul Robinson	d0c89f851b	Stop tracking .debug_line_str in DWARFUnit. NFC. llvm-svn: 323701	2018-01-29 22:02:56 +00:00
Craig Topper	a8f87a36f1	[X86] Add FeaturePOPCNTFalseDeps to skylake server CPU to match skylake client. llvm-svn: 323700	2018-01-29 21:56:48 +00:00
Rui Ueyama	b6d3a93594	Warn on nonexistent comdat sections in an /order file. I didn't implement the feature in the original patch because I didn't come up with an idea to do that easily and efficiently. Turned out that that is actually easy to implement. In this patch, we collect comdat sections before gc is run and warn on nonexistent symbols in an order file. Differential Revision: https://reviews.llvm.org/D42658 llvm-svn: 323699	2018-01-29 21:50:53 +00:00
Rui Ueyama	0c55bda4fb	Update a help message. llvm-svn: 323698	2018-01-29 21:48:40 +00:00
George Karpenkov	9ed25edc66	[analyzer] Use stable filenames in analyzer testing infrastructure Makes finding the right file in test results easier. Differential Revision: https://reviews.llvm.org/D42445 llvm-svn: 323697	2018-01-29 21:45:07 +00:00
George Karpenkov	21428afae8	[analyzer] [NFC] Remove unused method visitItemsInWorkList Differential Revision: https://reviews.llvm.org/D42562 llvm-svn: 323696	2018-01-29 21:44:49 +00:00
Brian M. Rzycki	994e889022	[JumpThreading][NFC] Rename LoadInst variables Summary: The JumpThreading pass has several locations where the variable name LI refers to a LoadInst type. This is confusing and inhibits the ability to use LI for LoopInfo as a member of the JumpThreading class. Minor formatting and comments were also altered to reflect this change. Reviewers: dberlin, kuba, spop, sebpop Reviewed by: sebpop Subscribers: sebpop, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D42601 llvm-svn: 323695	2018-01-29 21:29:44 +00:00
Marshall Clow	6f73cf45df	Minor update to the main libcxx page; add a link to the deprecation page llvm-svn: 323694	2018-01-29 21:28:46 +00:00
Simon Pilgrim	02bdac53e7	[X86] Emit 11-byte or 15-byte NOPs on recent AMD targets, else default to 10-byte NOPs (PR22965) We currently emit up to 15-byte NOPs on all targets (apart from Silvermont), which stalls performance on some targets with decoders that struggle with 2 or 3 more '66' prefixes. This patch flags recent AMD targets (btver1/znver1) to still emit 15-byte NOPs and bdver* targets to emit 11-byte NOPs. All other targets now emit 10-byte NOPs apart from SilverMont CPUs which still emit 7-byte NOPS. Differential Revision: https://reviews.llvm.org/D42616 llvm-svn: 323693	2018-01-29 21:24:31 +00:00
Daniel Sanders	08464524c3	[ARM][GISel] PR35965 Constrain RegClasses of nested instructions built from Dst Pattern Summary: Apparently, we missed on constraining register classes of VReg-operands of all the instructions built from a destination pattern but the root (top-level) one. The issue exposed itself while selecting G_FPTOSI for armv7: the corresponding pattern generates VTOSIZS wrapped into COPY_TO_REGCLASS, so top-level COPY_TO_REGCLASS gets properly constrained, while nested VTOSIZS (or rather its destination virtual register to be exact) does not. Fixing this by issuing GIR_ConstrainSelectedInstOperands for every nested GIR_BuildMI. https://bugs.llvm.org/show_bug.cgi?id=35965 rdar://problem/36886530 Patch by Roman Tereshin Reviewers: dsanders, qcolombet, rovka, bogner, aditya_nandakumar, volkan Reviewed By: dsanders, qcolombet, rovka Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42565 llvm-svn: 323692	2018-01-29 21:09:12 +00:00
Paul Robinson	bf750c80e9	[DWARFv5] Re-enable dumping a line table with no CU. r323476 added support for DW_FORM_line_strp, and incorrectly made that depend on having a DWARFUnit available. We shouldn't be tracking .debug_line_str in DWARFUnit after all. After this patch, I can do an NFC follow up and undo a bunch of the "plumbing" part of r323476. Differential Revision: https://reviews.llvm.org/D42609 llvm-svn: 323691	2018-01-29 20:57:43 +00:00
Amaury Sechet	015184b79e	[X86] Avoid using high register trick for test instruction Summary: It seems its main effect is to create additional copies when values are in registers that do not support this trick, which increases register pressure and makes the code bigger. The main noteworthy regression I was able to observe was a pattern of the type (setcc (trunc (and X, C)), 0) where C is such that it would benefit from the hi register trick. To prevent this, a new pattern is added to materialize such a pattern using a 32-bit test. This has the added benefit of working with any constant that is materializable as a 32-bit immediate, not just the ones that can leverage the high register trick, as demonstrated by the test case in test-shrink.ll using the constant 2049. Reviewers: craig.topper, niravd, spatel, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D42646 llvm-svn: 323690	2018-01-29 20:54:33 +00:00
Daniel Sanders	1cc575666f	[globalisel][legalizer] Change identity() to changeTo() to clarify that it changes things. NFC Prior to committing r323681, we decided to change pick() to identity() since it wasn't clear from the name what pick() did. However, identity() isn't a very good name either since it implies that no changes are made. For some reason, naming it changeTo() didn't occur to me until just after the commit. This should resolve the lack of clarity that pick() had while still implying that it changes the MIR. llvm-svn: 323689	2018-01-29 20:46:16 +00:00
Shoaib Meenai	63b6a0e5bc	[CodeGen] Simplify conditional. NFC Rafael pointed out that `hasInternalLinkage() || hasPrivateLinkage()` is equivalent to `hasLocalLinkage()` in post-commit review. I'm intentionally not updating the comment, partly because I like it being explicit, and partly because "global symbols with local linkage" sounds like an oxymoron. llvm-svn: 323688	2018-01-29 20:28:04 +00:00
Amaury Sechet	4cbca08d71	[X86] Add test case to ensure testw is generated when optimizing for size. NFC llvm-svn: 323687	2018-01-29 20:22:46 +00:00
Evandro Menezes	1589d6e6a3	[AArch64] Change the filename of the Exynos M1 scheduling defs After request by Matthias Braun in https://reviews.llvm.org/D42387. llvm-svn: 323686	2018-01-29 20:22:24 +00:00
Kostya Kortchinsky	c15da10101	[scudo] Add support for Fuchsia OS. Summary: Built & tested in Fuchsia's build system. Reviewers: alekseyshl, cryptoad, aarongreen Reviewed By: cryptoad Subscribers: srhines, mgorny Differential Revision: https://reviews.llvm.org/D42610 llvm-svn: 323685	2018-01-29 20:17:29 +00:00
Ben Hamilton	5dd4018bfc	[clang-format] Add more tests for ObjC protocol list formatting behavior Summary: The existing unit tests in FormatTestObjC.cpp didn't fully cover all the cases for protocol conformance list formatting. This extends the unit tests to more cases of protocol conformance list formatting, especially how the behavior changes when `BinPackParameters` changes from `true` (the default) to `false`. Test Plan: make -j12 FormatTests && \ ./tools/clang/unittests/Format/FormatTests --gtest_filter=FormatTestObjC.\* Reviewers: krasimir, jolesiak, stephanemoore Reviewed By: krasimir Subscribers: benhamilton, klimek, cfe-commits, hokein, Wizard Differential Revision: https://reviews.llvm.org/D42649 llvm-svn: 323684	2018-01-29 20:01:49 +00:00
Jun Bum Lim	fc7d56d949	Revert "AArch64: Omit callframe setup/destroy when not necessary" This reverts commit r322917 due to multiple performance regressions in spec2006 and spec2017. XFAILed llvm/test/CodeGen/AArch64/big-callframe.ll which initially motivated this change. llvm-svn: 323683	2018-01-29 19:56:42 +00:00
Rui Ueyama	d73479ba12	Remove trailing space. llvm-svn: 323682	2018-01-29 19:55:55 +00:00
Daniel Sanders	79cb839fcd	[globalisel][legalizer] Adapt LegalizerInfo to support inter-type dependencies and other things. Summary: As discussed in D42244, we have difficulty describing the legality of some operations. We're not able to specify relationships between types. For example, declaring the following setAction({..., 0, s32}, Legal) setAction({..., 0, s64}, Legal) setAction({..., 1, s32}, Legal) setAction({..., 1, s64}, Legal) currently declares these type combinations as legal: {s32, s32} {s64, s32} {s32, s64} {s64, s64} but we currently have no means to say that, for example, {s64, s32} is not legal. Some operations such as G_INSERT/G_EXTRACT/G_MERGE_VALUES/ G_UNMERGE_VALUES have relationships between the types that are currently described incorrectly. Additionally, G_LOAD/G_STORE currently have no means to legalize non-atomics differently to atomics. The necessary information is in the MMO but we have no way to use this in the legalizer. Similarly, there is currently no way for the register type and the memory type to differ so there is no way to cleanly represent extending-load/truncating-store in a way that can't be broken by optimizers (resulting in illegal MIR). It's also difficult to control the legalization strategy. We've added support for legalizing non-power of 2 types but there's still some hardcoded assumptions about the strategy. The main one I've noticed is that type0 is always legalized before type1 which is not a good strategy for `type0 = G_EXTRACT type1, ...` if you need to widen the container. It will converge on the same result eventually but it will take a much longer route when legalizing type0 than if you legalize type1 first. Lastly, the definition of legality and the legalization strategy is kept separate which is not ideal. It's helpful to be able to look at a one piece of code and see both what is legal and the method the legalizer will use to make illegal MIR more legal. 
This patch adds a layer onto the LegalizerInfo (to be removed when all targets have been migrated) which resolves all these issues. Here are the rules for shift and division: for (unsigned BinOp : {G_LSHR, G_ASHR, G_SDIV, G_UDIV}) getActionDefinitions(BinOp) .legalFor({s32, s64}) // If type0 is s32/s64 then it's Legal .clampScalar(0, s32, s64) // If type0 is <s32 then WidenScalar to s32 // If type0 is >s64 then NarrowScalar to s64 .widenScalarToPow2(0) // Round type0 scalars up to powers of 2 .unsupported(); // Otherwise, it's unsupported This describes everything needed to both define legality and describe how to make illegal things legal. Here's an example of a complex rule: getActionDefinitions(G_INSERT) .unsupportedIf([=](const LegalityQuery &Query) { // If type0 is no bigger than type1 then it's unsupported return Query.Types[0].getSizeInBits() <= Query.Types[1].getSizeInBits(); }) .legalIf([=](const LegalityQuery &Query) { // If type0 is s32/s64/p0 and type1 is a power of 2 other than 2 or 4 then it's legal // We don't need to worry about large type1's because unsupportedIf caught that. const LLT &Ty0 = Query.Types[0]; const LLT &Ty1 = Query.Types[1]; if (Ty0 != s32 && Ty0 != s64 && Ty0 != p0) return false; return isPowerOf2_32(Ty1.getSizeInBits()) && (Ty1.getSizeInBits() == 1 || Ty1.getSizeInBits() >= 8); }) .clampScalar(0, s32, s64) .widenScalarToPow2(0) .maxScalarIf(typeInSet(0, {s32}), 1, s16) // If type0 is s32 and type1 is bigger than s16 then NarrowScalar type1 to s16 .maxScalarIf(typeInSet(0, {s64}), 1, s32) // If type0 is s64 and type1 is bigger than s32 then NarrowScalar type1 to s32 .widenScalarToPow2(1) // Round type1 scalars up to powers of 2 .unsupported(); This uses a lambda to say that G_INSERT is unsupported when type0 is no bigger than type1 (in practice, this would be a default rule for G_INSERT). It also uses one to describe the legal cases.
This particular predicate is equivalent to: .legalFor({{s32, s1}, {s32, s8}, {s32, s16}, {s64, s1}, {s64, s8}, {s64, s16}, {s64, s32}}) In terms of performance, I saw a slight (~6%) performance improvement when AArch64 was around 30% ported but it's pretty much break even right now. I'm going to take a look at constexpr as a means to reduce the initialization cost. Future work: * Make it possible for opcodes to share rulesets. There's no need for G_LSHR/G_ASHR/G_SDIV/G_UDIV to have separate rule and ruleset objects. There's no technical barrier to this, it just hasn't been done yet. * Replace the type-index numbers with an enum to get .clampScalar(Type0, s32, s64) * Better names for things like .maxScalarIf() (clampMaxScalar?) and the vector rules. * Improve initialization cost using constexpr Possible future work: * It's possible to make these rulesets change the MIR directly instead of returning a description of how to change the MIR. This should remove a little overhead caused by parsing the description and routing to the right code, but the real motivation is that it removes the need for LegalizeAction::Custom. With Custom removed, there's no longer a requirement that Custom legalization change the opcode to something that's considered legal. Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar, volkan, reames, bogner Reviewed By: bogner Subscribers: hintonda, bogner, aemerson, mgorny, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D42251 llvm-svn: 323681	2018-01-29 19:54:49 +00:00
Rafael Espindola	e899a0b824	Improve testcase. We now test that pic and static produce different results for bar. The function names were demangled. The attributes are written inline. llvm-svn: 323680	2018-01-29 19:37:27 +00:00
Erich Keane	c18cce405e	[NFC] Fixup comment with function name, actually incorrect name! llvm-svn: 323679	2018-01-29 19:33:20 +00:00
Krasimir Georgiev	a83d3c576a	[clang-format] Disable some text proto delimiters and functions for google style Summary: This disables some of the most commonly used text proto delimiters and functions for google style until we resolve several style options for that style. In particular, whether there should be a space surrounding braces ``msg { sub { key : value } }`` and the extent of packing of submessages on the same line. Reviewers: sammccall Reviewed By: sammccall Subscribers: klimek, cfe-commits Differential Revision: https://reviews.llvm.org/D42651 llvm-svn: 323678	2018-01-29 19:28:05 +00:00
Jan Vesely	0ecb5e511e	math.h: Set HAVE_HW_FMA32 based on compiler provided macro Fixes sin/cos piglits on non-FMA capable asics. Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=35983 Reviewer: Tom Stellard Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 323677	2018-01-29 19:05:08 +00:00
Geoff Berry	d1be91127c	[MachineVerifier] Add check that renamable operands aren't reserved registers. Summary: Reviewers: qcolombet, MatzeB Subscribers: arsenm, sdardis, nhaehnle, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D42449 llvm-svn: 323676	2018-01-29 18:57:07 +00:00
Geoff Berry	d37dc77b6e	[AMDGPU][X86][Mips] Make sure renamable bit not set for reserved regs Summary: Fix a few places that were modifying code after register allocation to set the renamable bit correctly to avoid failing the validation added in D42449. llvm-svn: 323675	2018-01-29 18:47:48 +00:00
Rafael Espindola	d5f76ad37f	Move getPlatformFlags to ELFObjectFileBase and simplify. This removes a few std::error_code results that were ignored on every call. llvm-svn: 323674	2018-01-29 18:27:30 +00:00
Michal Gorny	c061175fcd	[Host] Respect LLVM_LIBDIR_SUFFIX when looking for LLDB plugins on Linux Fix the Linux plugin lookup path to include appropriate libdir suffix for the system. To accomplish this, store the value of LLVM_LIBDIR_SUFFIX in lldb/Host/Config.h as LLDB_LIBDIR_SUFFIX, and use this variable when defining the plugin path. Differential Revision: https://reviews.llvm.org/D42317 llvm-svn: 323673	2018-01-29 18:25:06 +00:00
Craig Topper	eb13ebdb99	[X86] Don't create SHRUNKBLEND when the condition is used by the true or false operand of the vselect. Fixes PR34592. Differential Revision: https://reviews.llvm.org/D42628 llvm-svn: 323672	2018-01-29 17:56:57 +00:00
Craig Topper	63db1c117a	[X86] Add test case for pr34592 llvm-svn: 323671	2018-01-29 17:56:55 +00:00
Wolfgang Pieb	9f23426cb0	[DWARF] Recommitting a test reverted in r323560. Moved to x86 directory with explicit triple. ELF support is required for type units. llvm-svn: 323670	2018-01-29 17:49:10 +00:00
Daniel Sanders	9ade5592d9	[globalisel] Make LegalizerInfo::LegalizeAction available outside of LegalizerInfo. NFC Summary: The improvements to the LegalizerInfo discussed in D42244 require that LegalizerInfo::LegalizeAction be available for use in other classes. As such, it needs to be moved out of LegalizerInfo. This has been done separately to the next patch to minimize the noise in that patch. llvm-svn: 323669	2018-01-29 17:37:29 +00:00
Ben Hamilton	6845dec917	[clang-format] Fix bug where -dump-config failed on ObjC header Summary: `clang-format -dump-config path/to/file.h` never passed anything for the Code parameter to clang::format::getStyle(). This meant the logic to guess Objective-C from the contents of a .h file never worked, because LibFormat didn't have the code to work with. With this fix, we now correctly read in the contents of the file if possible with -dump-config. I had to update the lit config for test/Format/ because the default config ignores .h files. Test Plan: make -j12 check-clang Reviewers: jolesiak, krasimir Reviewed By: jolesiak, krasimir Subscribers: Wizard, klimek, cfe-commits, djasper Differential Revision: https://reviews.llvm.org/D42395 llvm-svn: 323668	2018-01-29 17:36:43 +00:00
Jonas Devlieghere	073971b243	[AccelTable] Workaround for MSVC bug Microsoft Visual Studio rejects the static constexpr list of atoms even though it's valid C++. This provides a workaround to unbreak the bots. llvm-svn: 323667	2018-01-29 17:28:51 +00:00
Tony Jiang	7e86d05950	Add myself to CREDITS.txt llvm-svn: 323666	2018-01-29 17:02:34 +00:00
Jonathan Roelofs	4a205fd038	[scan-build] Add an option to skip overriding CC and CXX make vars Autoconf and some other systems tend to add essential compilation options to CC (e.g. -std=gnu99). When running such an auto-generated makefile, scan-build does not need to change CC and CXX as they are already set to use ccc-analyzer by a configure script. Implement a new option --keep-cc as was proposed in this discussion: http://lists.llvm.org/pipermail/cfe-dev/2013-September/031832.html Patch by Paul Fertser! llvm-svn: 323665	2018-01-29 16:49:34 +00:00
Jonathan Roelofs	9cf6302cce	[analyzer] Fix -x language argument for C preprocessed sources clang's -x option doesn't accept c-cpp-output as a language (even though 463eb6ab was merged, the driver still doesn't handle that). This bug prevents testing C language projects when ccache is used. Fixes #25851. Investigation and patch by Dave Rigby. llvm-svn: 323664	2018-01-29 16:37:53 +00:00
Amaury Sechet	9827d8ed15	Add test case for truncated and promotion to test. NFC llvm-svn: 323663	2018-01-29 16:13:01 +00:00
Alexey Bataev	9c5c103283	[SLP] Fix for PR32086: Count InsertElementInstr of the same elements as shuffle. Summary: If the same value is going to be vectorized several times in the same tree entry, this entry is considered to be a gather entry and the cost of this gather is counted as the cost of InsertElementInstrs for each gathered value. But we can consider these elements as ShuffleInstr with SK_PermuteSingle shuffle kind. Reviewers: spatel, RKSimon, mkuper, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38697 llvm-svn: 323662	2018-01-29 16:08:52 +00:00
Alexey Bataev	10f5c9e765	[SLP] Add a test with extract for PR32086, NFC. llvm-svn: 323661	2018-01-29 15:56:52 +00:00
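
Note on r323681 (79cb839fcd): the message states that the G_INSERT legalIf predicate is equivalent to an explicit legalFor list. The sketch below is one way to check that claim by brute force. It is not LLVM code: types are modelled only by their bit width, the names isLegalInsert and legalInsertPairs are hypothetical, and the p0 case is left out because a pointer's width depends on the target.

```cpp
#include <cassert>
#include <initializer_list>
#include <set>
#include <utility>

// Assumption: a scalar type is represented only by its size in bits.
using Bits = unsigned;

static bool isPowerOf2(Bits V) { return V != 0 && (V & (V - 1)) == 0; }

// Mirrors the order of the quoted rules: unsupportedIf is consulted first,
// then legalIf. The p0 alternative is deliberately omitted (see lead-in).
bool isLegalInsert(Bits Ty0, Bits Ty1) {
  if (Ty0 <= Ty1)
    return false; // unsupportedIf: type0 must be strictly wider than type1
  if (Ty0 != 32 && Ty0 != 64)
    return false; // legalIf: type0 must be s32 or s64
  return isPowerOf2(Ty1) && (Ty1 == 1 || Ty1 >= 8);
}

// Enumerates every (type0, type1) width pair the predicate accepts.
std::set<std::pair<Bits, Bits>> legalInsertPairs() {
  std::set<std::pair<Bits, Bits>> Result;
  for (Bits Ty0 : {32u, 64u})
    for (Bits Ty1 = 1; Ty1 <= 128; ++Ty1)
      if (isLegalInsert(Ty0, Ty1))
        Result.insert({Ty0, Ty1});
  return Result;
}

// Small self-check against two entries of the list quoted in the message.
inline void checkQuotedEntries() {
  assert(isLegalInsert(32, 16));
  assert(isLegalInsert(64, 32));
}
```

Under these assumptions, legalInsertPairs() yields the seven pairs named in the commit message: {s32, s1}, {s32, s8}, {s32, s16}, {s64, s1}, {s64, s8}, {s64, s16}, {s64, s32}.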

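Note on r323710 (72f6d59841): the fix changes an API to take an iterator instead of an instruction so that begin() of an empty block is never dereferenced. The sketch below illustrates that general pattern with standard containers. It is not the RAFast code: Block and describeInsertPoint are hypothetical stand-ins for MachineBasicBlock and definePhysReg.

```cpp
#include <cassert>
#include <list>
#include <string>

// Assumption: a basic block is modelled as a list of instruction names.
using Block = std::list<std::string>;

// Accepts a position rather than an instruction reference, so the caller
// never has to dereference begin(), which equals end() for an empty block.
std::string describeInsertPoint(const Block &B, Block::const_iterator It) {
  if (It == B.end())
    return "<end of block>"; // nothing to dereference
  return *It;                // safe: It points at a real element
}

// Small self-check for the empty-block case.
inline void checkEmptyBlock() {
  Block Empty;
  assert(describeInsertPoint(Empty, Empty.begin()) == "<end of block>");
}
```

Passing B.begin() works for both empty and non-empty blocks; a signature taking `const std::string &` would have forced the caller to write `*B.begin()`, which is undefined behaviour when the block is empty.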

281596 Commits