llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Bieneman	e49730d4ba	Remove autoconf support Summary: This patch is provided in preparation for removing autoconf on 1/26. The proposal to remove autoconf on 1/26 was discussed on the llvm-dev thread here: http://lists.llvm.org/pipermail/llvm-dev/2016-January/093875.html "I felt a great disturbance in the [build system], as if millions of [makefiles] suddenly cried out in terror and were suddenly silenced. I fear something [amazing] has happened." - Obi Wan Kenobi Reviewers: chandlerc, grosbach, bob.wilson, tstellarAMD, echristo, whitequark Subscribers: chfast, simoncook, emaste, jholewinski, tberghammer, jfb, danalbert, srhines, arsenm, dschuff, jyknight, dsanders, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D16471 llvm-svn: 258861	2016-01-26 21:29:08 +00:00
Derek Schuff	e7305cc4b3	[WebAssembly] Remove check for FrameIndex operands in WebAssemblyPeephole This pass runs after FrameIndex elimination, so it should never see FI operands. NFC llvm-svn: 258860	2016-01-26 21:08:27 +00:00
Sanjay Patel	3e1701da29	[x86] add materializeVectorConstant() helper function; NFC LowerBUILD_VECTOR is still over 300 lines long, but it's a start... llvm-svn: 258858	2016-01-26 21:05:00 +00:00
Hemant Kulkarni	444766848e	Fix comparison warning (r258845) llvm-svn: 258856	2016-01-26 20:38:15 +00:00
Hemant Kulkarni	ae1acb0969	Fixes build break introduced by r258845 llvm-svn: 258854	2016-01-26 20:28:15 +00:00
JF Bastien	43436716aa	WebAssembly NFC: update error message I forgot to update this one in my previous patch. llvm-svn: 258853	2016-01-26 20:24:51 +00:00
JF Bastien	1a6c7608b1	WebAssembly: don't optimize memcpy/memmove/memset to frame index r258781 optimized memcpy/memmove/memset so the intrinsic call can return its first argument, but missed the frame index case. Teach it to ignore that case so C code doesn't assert out in these cases. llvm-svn: 258851	2016-01-26 20:22:42 +00:00
Cong Hou	26d04ef9d9	Add a missing test case for r258847. llvm-svn: 258848	2016-01-26 20:09:38 +00:00
Cong Hou	551a57f797	Allow X86::COND_NE_OR_P and X86::COND_NP_OR_E to be reversed. Currently, AnalyzeBranch() fails on non-equality comparisons between floating points on X86 (see https://llvm.org/bugs/show_bug.cgi?id=23875). This is because this function can modify the branch by reversing the conditional jump and removing the unconditional jump if there is a proper fall-through. However, in the case of a non-equality comparison between floating points, this can turn the branch "unanalyzable". Consider the following case: jne .BB1 jp .BB1 jmp .BB2 .BB1: ... .BB2: ... AnalyzeBranch() will reverse "jp .BB1" to "jnp .BB2" and then "jmp .BB2" will be removed: jne .BB1 jnp .BB2 .BB1: ... .BB2: ... However, AnalyzeBranch() cannot analyze this branch anymore as there are two conditional jumps with different targets. This may disable some optimizations like block-placement: in this case the fall-through behavior is enforced even if the fall-through block is very cold, which is suboptimal. Actually this optimization is also done in the block-placement pass, which means we can remove this optimization from AnalyzeBranch(). However, currently X86::COND_NE_OR_P and X86::COND_NP_OR_E are not reversible: there are no defined negation conditions for them. In order to reverse them, this patch defines two new CondCodes, X86::COND_E_AND_NP and X86::COND_P_AND_NE. It also defines how to synthesize instructions for them. Here only the second conditional jump is reversed. This is valid as we only need them to do this "unconditional jump removal" optimization. Differential Revision: http://reviews.llvm.org/D11393 llvm-svn: 258847	2016-01-26 20:08:01 +00:00
Davide Italiano	74237155e5	[llvm-nm] Roll several conditions into a single if. NFCI. llvm-svn: 258846	2016-01-26 19:57:42 +00:00
Hemant Kulkarni	ab4a46fa1c	[llvm-readobj] Add -elf-section-groups option Adds a way to inspect SHT_GROUP sections in ELF objects. Displays signature, member sections of these sections. Differential revision: http://reviews.llvm.org/D16555 llvm-svn: 258845	2016-01-26 19:46:39 +00:00
Chad Rosier	b46d0f9a71	[ScheduleDAGInstrs] Simplify logic to improve readability. NFC. The call to isInvariantLoad() already returns false for non-load instructions. llvm-svn: 258841	2016-01-26 19:33:57 +00:00
Sanjay Patel	59066a0803	tidy up; NFC llvm-svn: 258838	2016-01-26 19:30:14 +00:00
Davide Italiano	e4db187961	[llvm-nm] Simplify. No functional changes intended. llvm-svn: 258837	2016-01-26 19:28:51 +00:00
Sanjay Patel	70fa79fdf2	[x86] simplify getOnesVector() ; NFCI Let DAG.getConstant() handle the splatting; there's no need to repeat that logic here. llvm-svn: 258833	2016-01-26 18:49:36 +00:00
Eugene Zelenko	6ac3f739ca	Fix Clang-tidy modernize-use-nullptr and modernize-use-override warnings; other minor fixes. Differential revision: reviews.llvm.org/D16568 llvm-svn: 258831	2016-01-26 18:48:36 +00:00
Aditya Nandakumar	3d0c46d489	Reassociate: Reprocess RedoInsts after each inst Previously the RedoInsts was processed at the end of the block. However it was possible that it left behind some instructions that were not canonicalized. This should guarantee that any previous instruction in the basic block is canonicalized before we process a new instruction. llvm-svn: 258830	2016-01-26 18:42:36 +00:00
Sanjay Patel	10e82d1ddb	[x86, AVX] tighten checks llvm-svn: 258828	2016-01-26 18:22:50 +00:00
Benjamin Kramer	c50b89070c	Update wasm target for r258819. llvm-svn: 258827	2016-01-26 18:21:38 +00:00
Kevin Enderby	40fdbf87d2	Update the comments for the macho-invalid-zero-ncmds test and fix llvm-objdump when printing the Mach Header to print the unknown cputype and cpusubtype fields as decimal instead of not printing them at all. And change the test to check for that. llvm-svn: 258826	2016-01-26 18:20:49 +00:00
Sanjay Patel	9eed9a956f	fix formatting; NFC llvm-svn: 258825	2016-01-26 18:14:37 +00:00
Sanjay Patel	f1ac8ba45b	don't repeat names in documentation comments; NFC llvm-svn: 258820	2016-01-26 17:06:13 +00:00
Benjamin Kramer	f57c1977c1	Reflect the MC/MCDisassembler split on the include/ level. No functional change, just moving code around. llvm-svn: 258818	2016-01-26 16:44:37 +00:00
Sanjay Patel	980b280f50	[LibCallSimplifier] fold memset(malloc(x), 0, x) --> calloc(1, x) This is a step towards solving PR25892: https://llvm.org/bugs/show_bug.cgi?id=25892 It won't handle the reported case. As noted by the 'TODO' comments in the patch, we need to relax the hasOneUse() constraint and also match patterns that include memset_chk() and the llvm.memset() intrinsic in addition to memset(). Differential Revision: http://reviews.llvm.org/D16337 llvm-svn: 258816	2016-01-26 16:17:24 +00:00
Matthew Simpson	61d5a18469	Revert "Reapply commit r258404 with fix" This commit exposes a crash in computeKnownBits on the Chromium buildbots. Reverting to investigate. Reference: https://llvm.org/bugs/show_bug.cgi?id=26307 llvm-svn: 258812	2016-01-26 15:45:49 +00:00
Igor Laevsky	03a670c0ec	Re-submit r256008 "Improve DWARFDebugFrame::parse to also handle __eh_frame." Originally this change was causing failures on windows buildbots. But those problems were fixed in r258806. llvm-svn: 258811	2016-01-26 15:09:42 +00:00
Dan Gohman	fb619e9686	[WebAssembly] Fix a typo in a comment. llvm-svn: 258810	2016-01-26 14:55:17 +00:00
Igor Laevsky	0e1605a3b4	[DebugInfo] Fix DWARFDebugFrame instruction operand ordering We can't rely on the evaluation order of function arguments. Differential Revision: http://reviews.llvm.org/D16509 llvm-svn: 258806	2016-01-26 13:31:11 +00:00
Simon Pilgrim	46696ef93c	[X86][SSE] Add zero element and general 64-bit VZEXT_LOAD support to EltsFromConsecutiveLoads This patch adds support for trailing zero elements to VZEXT_LOAD loads (and checks that no zero elts occur within the consecutive load). It also generalizes the 64-bit VZEXT_LOAD load matching to work for loads other than 2x32-bit loads. After this patch it will also be easier to add support for other basic load patterns like 32-bit VZEXT_LOAD loads, PMOVZX and subvector load insertion. Differential Revision: http://reviews.llvm.org/D16217 llvm-svn: 258798	2016-01-26 09:30:08 +00:00
Craig Topper	b9c932f26e	[X86] Mark LDS/LES as not being allowed in 64-bit mode. Their opcodes are used as part of the VEX prefix in 64-bit mode. Clearly the disassembler implicitly decoded them as AVX instructions in 64-bit mode, but I think the AsmParser would have encoded them. llvm-svn: 258793	2016-01-26 06:10:15 +00:00
Matt Arsenault	bee7575e1a	AMDGPU: Move AMDGPU intrinsics only used by R600 llvm-svn: 258790	2016-01-26 04:49:24 +00:00
Matt Arsenault	382d945d16	AMDGPU: Tidy minor td file issues Make comments and indentation more consistent. Rearrange a few things to be in a more consistent order, such as organizing subtarget features from those describing an actual device property, and those used as options. llvm-svn: 258789	2016-01-26 04:49:22 +00:00
Matt Arsenault	c5f6152911	AMDGPU: Make v32i8/v64i8 illegal types Old intrinsics were forcing these, but they have now all been removed. This fixes large i8 vector operations generally being broken. llvm-svn: 258788	2016-01-26 04:43:48 +00:00
Matt Arsenault	018179fc46	AMDGPU: Remove old sample intrinsics I did my best to try to update all the uses in tests that just happened to use the old ones to the newer intrinsics. I'm not sure I got all of the immediate operand conversions correct, since the value seems to have been ignored by the old pattern but I don't think it really matters. llvm-svn: 258787	2016-01-26 04:38:08 +00:00
Matt Arsenault	051d6f9fde	AMDGPU: Add new amdgcn intrinsics for cube instructions More cleanup to try to get all intrinsics using the correct amdgcn prefix that are as close to the instruction as possible. llvm-svn: 258786	2016-01-26 04:29:56 +00:00
Matt Arsenault	9a10cea7fb	AMDGPU: Implement read_register and write_register intrinsics Some of the special intrinsics that now correspond to an instruction also have special setting of some registers, e.g. llvm.SI.sendmsg sets m0 as well as use s_sendmsg. Using these explicit register intrinsics may be a better option. Reading the exec mask and others may be useful for debugging. For this I'm not sure this is entirely correct because we would want this to be convergent, although it's possible this is already treated sufficiently conservatively. llvm-svn: 258785	2016-01-26 04:29:24 +00:00
Matt Arsenault	cee02ccc1d	AMDGPU: Note mesa version in release notes llvm-svn: 258784	2016-01-26 04:29:15 +00:00
Matt Arsenault	0c3e2338fe	AMDGPU: Restore AMDGPU prefixed rsq intrinsic for now Also move into backend intrinsics to discourage use of the old name. llvm-svn: 258783	2016-01-26 04:14:16 +00:00
Dan Gohman	bdf08d5da6	[WebAssembly] Optimize memcpy/memmove/memset calls. These calls return their first argument, but because LLVM uses an intrinsic with a void return type, they can't use the returned attribute. Generalize the store results pass to optimize these calls too. llvm-svn: 258781	2016-01-26 04:01:11 +00:00
Dan Gohman	be6f196bff	[WebAssembly] Remove a completed entry from the README.txt. llvm-svn: 258780	2016-01-26 03:43:48 +00:00
Dan Gohman	bb3722430f	[WebAssembly] Implement unaligned loads and stores. Differential Revision: http://reviews.llvm.org/D16534 llvm-svn: 258779	2016-01-26 03:39:31 +00:00
Haicheng Wu	f1c00a22be	[LIR] Add support for structs and hand unrolled loops This is a recommit of r258620 which caused PR26293. The original message: Now LIR can turn the following code into memset: typedef struct foo { int a; int b; } foo_t; void bar(foo_t *f, unsigned n) { for (unsigned i = 0; i < n; ++i) { f[i].a = 0; f[i].b = 0; } } void test(foo_t *f, unsigned n) { for (unsigned i = 0; i < n; i += 2) { f[i] = 0; f[i+1] = 0; } } llvm-svn: 258777	2016-01-26 02:27:47 +00:00
Reid Kleckner	0b5220d0aa	Use binary search for intrinsic ID lookups This improves compile time of Function.cpp from 57s to 37s for me locally. Intrinsic IDs are cached on the Function object, so this shouldn't regress performance. llvm-svn: 258774	2016-01-26 02:06:41 +00:00
Matthias Braun	db320773e1	LiveIntervalAnalysis: Improve some comments As recommended by Justin. llvm-svn: 258771	2016-01-26 01:40:48 +00:00
Reid Kleckner	86ff2689a5	Sort intrinsics by LLVM intrinsic name, rather than tablegen def name Step one towards using a simple binary search to lookup intrinsic IDs instead of our crazy table generated switch+memcmp+startswith code that makes Function.cpp take about a minute to compile. See PR24785 and PR11951 for why we should do this. The X86 backend contains tables that need to be sorted on intrinsic ID, so reorder those. llvm-svn: 258757	2016-01-26 00:55:00 +00:00
Matthias Braun	242b8bb58d	LiveIntervalAnalysis: Cleanup handleMove{Down|Up}() functions, NFC These two functions are hard to reason about. This commit makes the code more comprehensible: - Use four distinct variables (OldIdxIn, OldIdxOut, NewIdxIn, NewIdxOut) with a fixed value instead of a changing iterator I that points to different things during the function. - Remove the early explanation before the function in favor of more detailed comments inside the function. Should have more/clearer comments now stating which conditions are tested and which invariants hold at different points in the functions. The behaviour of the code was not changed. I hope that this will make it easier to review the changes in http://reviews.llvm.org/D9067 which I will adapt next. Differential Revision: http://reviews.llvm.org/D16379 llvm-svn: 258756	2016-01-26 00:43:50 +00:00
Dan Gohman	75452734e4	Followup to 258750; update more tests to use .p2align . llvm-svn: 258755	2016-01-26 00:35:07 +00:00
Dan Gohman	14d8436c66	Followup to 258750; update all MC tests to use .p2align . llvm-svn: 258754	2016-01-26 00:27:59 +00:00
Dan Gohman	968b090552	Followup to 258750; update this test to use .p2align . llvm-svn: 258752	2016-01-26 00:17:24 +00:00
Dan Gohman	61d15ae4f5	[MC] Use .p2align instead of .align For historic reasons, the behavior of .align differs between targets. Fortunately, there are alternatives, .p2align and .balign, which make the interpretation of the parameter explicit, and which behave consistently across targets. This patch teaches MC to use .p2align instead of .align, so that people reading code for multiple architectures don't have to remember which way each platform does its .align directive. Differential Revision: http://reviews.llvm.org/D16549 llvm-svn: 258750	2016-01-26 00:03:25 +00:00


126644 Commits