llvm-project

Commit Graph

Author	SHA1	Message	Date
Gerolf Hoflehner	5e1207e54c	MachineCombiner Pass for selecting faster instruction sequence - target independent framework When the DAGcombiner selects instruction sequences it could increase the critical path or resource len. For example, on arm64 there are multiply-accumulate instructions (madd, msub). If e.g. the equivalent multiply-add sequence is not on the crictial path it makes sense to select it instead of the combined, single accumulate instruction (madd/msub). The reason is that the conversion from add+mul to the madd could lengthen the critical path by the latency of the multiply. But the DAGCombiner would always combine and select the madd/msub instruction. This patch uses machine trace metrics to estimate critical path length and resource length of an original instruction sequence vs a combined instruction sequence and picks the faster code based on its estimates. This patch only commits the target independent framework that evaluates and selects code sequences. The machine instruction combiner is turned off for all targets and expected to evolve over time by gradually handling DAGCombiner pattern in the target specific code. This framework lays the groundwork for fixing rdar://16319955 llvm-svn: 214666	2014-08-03 21:35:39 +00:00
Tobias Grosser	f57d63f906	Do allow negative offsets in the outermost array dimension There is no needed for neither 1-dimensional nor higher dimensional arrays to require positive offsets in the outermost array dimension. We originally introduced this assumption with the support for delinearizing multi-dimensional arrays. llvm-svn: 214665	2014-08-03 21:07:30 +00:00
Saleem Abdulrasool	4544c16eab	MC: virtualise EmitWindowsUnwindTables This makes EmitWindowsUnwindTables a virtual function and lowers the implementation of the function to the X86WinCOFFStreamer. This method is a target specific operation. This enables making the behaviour target dependent by isolating it entirely to the target specific streamer. llvm-svn: 214664	2014-08-03 18:51:26 +00:00
Saleem Abdulrasool	b3be7371d5	MC: rename Win64EHFrameInfo to WinEH::FrameInfo The frame information stored in this structure is driven by the requirements for Windows NT unwinding rather than Windows 64 specifically. As a result, this type can be shared across multiple architectures (ARM, AXP, MIPS, PPC, SH). Rename this class in preparation for adding support for supporting unwinding information for Windows on ARM. Take the opportunity to constify the members as everything except the ChainedParent is read-only. This required some adjustment to the label handling. llvm-svn: 214663	2014-08-03 18:51:17 +00:00
Simon Atanasyan	3ab94b910b	[Mips] Add the `mips64-linux-gnu` target to the test case to check `in128` type handling. llvm-svn: 214662	2014-08-03 16:11:05 +00:00
Matt Arsenault	9215b17eb7	R600/SI: Fix extra whitespace in asm str This slipped in in r214467, so something like V_MOV_B32_e32 v0, ... is now printed with 2 spaces between the instruction name and first operand. llvm-svn: 214660	2014-08-03 05:27:14 +00:00
Johannes Doerfert	a63b2579c6	Fix the modifiable access creation + Remove the class IslGenerator which duplicates the functionality of IslExprBuilder. + Use the IslExprBuilder to create code for memory access relations. + Also handle array types during access creation. + Enable scev codegen for one of the transformed memory access tests, thus access creation without canonical induction variables available. + Update one test case to the new output. llvm-svn: 214659	2014-08-03 01:51:59 +00:00
Johannes Doerfert	ed87831113	Allow the IslExprBuilder to generate access operations llvm-svn: 214658	2014-08-03 01:50:50 +00:00
Johannes Doerfert	b5d1c322f2	Update the jscop tests and port them to isl codegen. The updated tests use a different context than the old ones did. Other than that only their path and the code generation we use changed. llvm-svn: 214657	2014-08-03 01:48:49 +00:00
NAKAMURA Takumi	0f9447d1ce	Tools.cpp: Avoid std::to_string() on -fbuild-session-timestamp to appease mingw32 builder. llvm-svn: 214656	2014-08-03 01:11:44 +00:00
Manman Ren	062f58d550	[SimplifyCFG] fix accessing deleted PHINodes in switch-to-table conversion. When we have a covered lookup table, make sure we don't delete PHINodes that are cached in PHIs. rdar://17887153 llvm-svn: 214642	2014-08-02 23:41:54 +00:00
Simon Atanasyan	0670abdd2e	[Mips] Replace assembler code by YAML to make the 'gotsym.test' test target independent. llvm-svn: 214641	2014-08-02 20:18:31 +00:00
Joerg Sonnenberger	c03105ba8e	tlbia support llvm-svn: 214640	2014-08-02 20:16:29 +00:00
Joerg Sonnenberger	e8a167ce8f	mfdcr / mtdcr support llvm-svn: 214639	2014-08-02 20:00:26 +00:00
Erik Eckstein	26a1bf7d84	fix bug 20513 - Crash in SLP Vectorizer llvm-svn: 214638	2014-08-02 19:39:42 +00:00
James Molloy	6b999ae682	Update test to use a more modern AArch64 triple, as requested by Renato. llvm-svn: 214637	2014-08-02 17:15:11 +00:00
Joerg Sonnenberger	99ab590ac9	Don't use additional arguments for dss and friends to satisfy DSS_Form, when let can do the same thing. Keep the 64bit variants as codegen-only. While they have a different register class, the encoding is the same for 32bit and 64bit mode. Having both present would otherwise confuse the disassembler. llvm-svn: 214636	2014-08-02 15:09:41 +00:00
Joerg Sonnenberger	466a31eb65	vcfsx and dss instructions require immediates, variables are not valid. llvm-svn: 214635	2014-08-02 15:07:21 +00:00
James Molloy	ce45be0465	[AArch64] Teach DAGCombiner that converting two consecutive loads into a vector load is not a good transform when paired loads are available. The combiner was creating Q-register loads and stores, which then had to be spilled because there are no callee-save Q registers! llvm-svn: 214634	2014-08-02 14:51:24 +00:00
Tobias Grosser	8c112d838c	Mark a GPGPU test case as XFAIL This area of code is currently not very much tested. It will hopefully be superseeded by Yabin's GSoC project. llvm-svn: 214633	2014-08-02 13:37:32 +00:00
Tobias Grosser	5b5fd4e27c	No need to run -mem2reg twice llvm-svn: 214632	2014-08-02 13:37:25 +00:00
Chandler Carruth	16c13cad35	[x86] Remove the FIXME that was implemented in r214628. Managed to forget to update the comment here... =/ llvm-svn: 214630	2014-08-02 11:34:23 +00:00
Chandler Carruth	bec57b406d	[x86] Give this test a bare metal triple so it doesn't use the weird Darwin x86 asm comment prefix designed to work around GAS on that platform. That makes the comment-matching of the test much more stable. llvm-svn: 214629	2014-08-02 11:17:41 +00:00
Chandler Carruth	4c57955fe3	[x86] Largely complete the use of PSHUFB in the new vector shuffle lowering with a small addition to it and adding PSHUFB combining. There is one obvious place in the new vector shuffle lowering where we should form PSHUFBs directly: when without them we will unpack a vector of i8s across two different registers and do a potentially 4-way blend as i16s only to re-pack them into i8s afterward. This is the crazy expensive fallback path for i8 shuffles and we can just directly use pshufb here as it will always be cheaper (the unpack and pack are two instructions so even a single shuffle between them hits our three instruction limit for forming PSHUFB). However, this doesn't generate very good code in many cases, and it leaves a bunch of common patterns not using PSHUFB. So this patch also adds support for extracting a shuffle mask from PSHUFB in the X86 lowering code, and uses it to handle PSHUFBs in the recursive shuffle combining. This allows us to combine through them, combine multiple ones together, and generally produce sufficiently high quality code. Extracting the PSHUFB mask is annoyingly complex because it could be either pre-legalization or post-legalization. At least this doesn't have to deal with re-materialized constants. =] I've added decode routines to handle the different patterns that show up at this level and we dispatch through them as appropriate. The two primary test cases are updated. For the v16 test case there is still a lot of room for improvement. Since I was going through it systematically I left behind a bunch of FIXME lines that I'm hoping to turn into ALL lines by the end of this. llvm-svn: 214628	2014-08-02 10:39:15 +00:00
Chandler Carruth	d10b29240c	[x86] Switch to using the variable we extracted this operand into. Spotted this missed refactoring by inspection when reading code, and it doesn't changethe functionality at all. llvm-svn: 214627	2014-08-02 10:29:36 +00:00
Chandler Carruth	5219d4eff6	[x86] Fix a few typos in my comments spotted in passing. llvm-svn: 214626	2014-08-02 10:29:34 +00:00
Chandler Carruth	34f9a987e9	[x86] Teach the target shuffle mask extraction to recognize unary forms of normally binary shuffle instructions like PUNPCKL and MOVLHPS. This detects cases where a single register is used for both operands making the shuffle behave in a unary way. We detect this and adjust the mask to use the unary form which allows the existing DAG combine for shuffle instructions to actually work at all. As a consequence, this uncovered a number of obvious bugs in the existing DAG combine which are fixed. It also now canonicalizes several shuffles even with the existing lowering. These typically are trying to match the shuffle to the domain of the input where before we only really modeled them with the floating point variants. All of the cases which change to an integer shuffle here have something in the integer domain, so there are no more or fewer domain crosses here AFAICT. Technically, it might be better to go from a GPR directly to the floating point domain, but detecting floating point outputs despite integer inputs is a lot more code and seems unlikely to be worthwhile in practice. If folks are seeing domain-crossing regressions here though, let me know and I can hack something up to fix it. Also as a consequence, a bunch of missed opportunities to form pshufb now can be formed. Notably, splats of i8s now form pshufb. Interestingly, this improves the existing splat lowering too. We go from 3 instructions to 1. Yes, we may tie up a register, but it seems very likely to be worth it, especially if splatting the 0th byte (the common case) as then we can use a zeroed register as the mask. llvm-svn: 214625	2014-08-02 10:27:38 +00:00
Chandler Carruth	2ad69eea8d	[x86] Teach my pshufb comment printer to handle VPSHUFB forms as well as PSHUFB forms. This will be important to update some AVX tests when I add PSHUFB combining. llvm-svn: 214624	2014-08-02 10:08:17 +00:00
Chandler Carruth	18066974d4	[SDAG] Refactor the code which deletes nodes in the DAG combiner to do so using a single helper which adds operands back onto the worklist. Several places didn't rigorously do this but a couple already did. Factoring them together and doing it rigorously is important to delete things recursively early on in the combiner and get a chance to see accurate hasOneUse values. While no existing test cases change, an upcoming patch to add DAG combining logic for PSHUFB requires this to work correctly. llvm-svn: 214623	2014-08-02 10:02:07 +00:00
Owen Anderson	9d5a8c2813	Fix issues with ISD::FNEG and ISD::FMA SDNodes where they would not be constant-folded during DAGCombine in certain circumstances. Unfortunately, the circumstances required to trigger the issue seem to require a pretty specific interaction of DAGCombines, and I haven't been able to find a testcase that reproduces on X86, ARM, or AArch64. The functionality added here is replicated in essentially every other DAG combine, so it seems pretty obviously correct. llvm-svn: 214622	2014-08-02 08:45:33 +00:00
Alexander Kornienko	da2734d4d0	Changed tool-template to use CommonOptionsParser. Reviewers: pcc, klimek Reviewed By: klimek Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4765 llvm-svn: 214621	2014-08-02 08:24:10 +00:00
NAKAMURA Takumi	70cac04c7f	libclang/Makefile: Update LINK_COMPONENTS take #3 . Sorry for the noise. llvm-svn: 214620	2014-08-02 07:24:04 +00:00
NAKAMURA Takumi	3654ce35ae	libclang/Makefile: Update LINK_COMPONENTS take #2 . llvm-svn: 214619	2014-08-02 07:16:14 +00:00
Zachary Turner	fb903ab7d4	Make the swig generation script use the correct python executable. It was hardcoding the value "python", which will end up at best getting a different python executable (if the user has overridden the value of PYTHON_EXECUTABLE), and at worst encountering an error (if there is no copy of python on the system path). This patch changes the script to use sys.executable so that it runs the sub-script with the same executable that it was run with. llvm-svn: 214618	2014-08-02 07:11:22 +00:00
NAKAMURA Takumi	090b78c7d2	libclang/Makefile: Restore some components in LINK_COMPONENTS. Clang's Makefile(s) are not transitive on clang libs. llvm-svn: 214617	2014-08-02 07:05:38 +00:00
NAKAMURA Takumi	bef5d81a50	libclang: Update LINK_COMPONENTS. llvm-svn: 214616	2014-08-02 06:58:39 +00:00
Justin Bogner	0950d79f60	CodeGen: Remove commented out code These two lines have been commented out for over 4 years. They aren't helping anyone. llvm-svn: 214615	2014-08-02 06:47:07 +00:00
Akira Hatanaka	dc08c30df9	[ARM] In dynamic-no-pic mode, ARM's post-RA pseudo expansion was incorrectly expanding pseudo LOAD_STATCK_GUARD using instructions that are normally used in pic mode. This patch fixes the bug. <rdar://problem/17886592> llvm-svn: 214614	2014-08-02 05:40:40 +00:00
Lang Hames	70735351ca	[MCJIT] Fix an overly-aggressive check in RuntimeDyldMachOARM. This should fix the MachO_ARM_PIC_relocations.s test failures on some 32-bit testers. llvm-svn: 214613	2014-08-02 03:00:49 +00:00
Matt Arsenault	4de324442b	R600: Cleanup fneg tests llvm-svn: 214612	2014-08-02 02:26:51 +00:00
Michael Gottesman	55fcf34705	Add a small utility called bisect that enables commandline bisecting on a counter. This is something that I have found to be very useful in my work and I wanted to contribute it back to the community since several people in the past have asked me for something along these lines. (Jakob, I know this has been a while coming ; )] The way you use this is you create a script that takes in as its first argument a count. The script passes into LLVM the count via a command line flag that disables a pass after LLVM has run after the pass has run for count number of times. Then the script invokes a test of some sort and indicates whether LLVM successfully compiled the test via the scripts exit status. Then you invoke bisect as follows: bisect --start=<start_num> --end=<end_num> ./script.sh "%(count)s" And bisect will continually call ./script.sh with various counts using the exit status to determine success and failure. llvm-svn: 214610	2014-08-02 01:39:08 +00:00
Eric Fiselier	c85f00a062	[lit] Add --show-xfail flag to LIT. Summary: This patch add a --show-xfail flag. If this flag is specified then each xfail test will be printed to output. When it is not given xfail tests are ignored. Ignoring xfail tests is the current behavior. This flag is meant to mirror the --show-unsupported flag that was recently added. Reviewers: ddunbar, EricWF Reviewed By: EricWF Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4750 llvm-svn: 214609	2014-08-02 01:29:52 +00:00
Matt Arsenault	a80c8770f9	R600/SI: Fix formatting. Avoid weird line wrapping of BuildMI dest register. llvm-svn: 214608	2014-08-02 01:10:28 +00:00
Alexander Kornienko	228dda5ac5	Use CommonOptionsParser in clang-query. This fixes its support of the fixed compilation database and makes it behave consistently with other clang tools. Reviewers: klimek, pcc Reviewed By: pcc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4763 llvm-svn: 214607	2014-08-02 01:02:33 +00:00
Richard Smith	6d0e97afcf	In the case of mangling collisions, make an attempt to note both definitions involved. llvm-svn: 214606	2014-08-02 00:50:16 +00:00
Chandler Carruth	063f425ea7	[x86] Make some questionable tests not spew assembly to stdout, which makes a mess of the lit output when they ultimately fail. The 2012-10-02-DAGCycle test is really frustrating because the only explanation for what it is testing is a rdar link. I would really rather that rdar links (which are not public or part of the open source project) were not committed to the source code. Regardless, the actual problem must be described as the rdar link is completely opaque. The fact that this test didn't check for any particular output further exacerbates the inability of any other developer to debug failures. The mem-promote-integers test has nice comments and seems to be a great test for our lowering... except that we don't actually check that any of the generated code is correct or matches some pattern. We just avoid crashing. It would be great to go back and populate this test with the actual expectations. llvm-svn: 214605	2014-08-02 00:50:10 +00:00
Alexey Samsonov	d9ad5cec0c	[ASan] Use metadata to pass source-level information from Clang to ASan. Instead of creating global variables for source locations and global names, just create metadata nodes and strings. They will be transformed into actual globals in the instrumentation pass (if necessary). This approach is more flexible: 1) we don't have to ensure that our custom globals survive all the optimizations 2) if globals are discarded for some reason, we will simply ignore metadata for them and won't have to erase corresponding globals 3) metadata for source locations can be reused for other purposes: e.g. we may attach source location metadata to alloca instructions and provide better descriptions for stack variables in ASan error reports. No functionality change. llvm-svn: 214604	2014-08-02 00:35:50 +00:00
Jim Ingham	bb006ce291	After you attach, give the process plugin a chance to report back (through DidAttach) the architecture of the binary you attached to. <rdar://problem/17891396> llvm-svn: 214603	2014-08-02 00:33:35 +00:00
Chandler Carruth	ee1a1fc900	[SDAG] Allow the legalizer to delete an illegally typed intermediate introduced during legalization. This pattern is based on other patterns in the legalizer that I changed in the same way. Now, the legalizer eagerly collects its garbage when necessary so that we can survive leaving such nodes around for it. Instead, we add an assert to make sure the node will be correctly handled by that layer. llvm-svn: 214602	2014-08-02 00:24:54 +00:00
Chandler Carruth	3707dda904	[SDAG] Let the DAG combiner take care of dead nodes rather than manually deleting them. This already seems to work, as no tests fail without this. llvm-svn: 214601	2014-08-02 00:19:10 +00:00

1 2 3 4 5 ...

180017 Commits All Branches Search

180017 Commits

All Branches