llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	5b5fd4e27c	No need to run -mem2reg twice llvm-svn: 214632	2014-08-02 13:37:25 +00:00
Chandler Carruth	16c13cad35	[x86] Remove the FIXME that was implemented in r214628. Managed to forget to update the comment here... =/ llvm-svn: 214630	2014-08-02 11:34:23 +00:00
Chandler Carruth	bec57b406d	[x86] Give this test a bare metal triple so it doesn't use the weird Darwin x86 asm comment prefix designed to work around GAS on that platform. That makes the comment-matching of the test much more stable. llvm-svn: 214629	2014-08-02 11:17:41 +00:00
Chandler Carruth	4c57955fe3	[x86] Largely complete the use of PSHUFB in the new vector shuffle lowering with a small addition to it and adding PSHUFB combining. There is one obvious place in the new vector shuffle lowering where we should form PSHUFBs directly: when without them we will unpack a vector of i8s across two different registers and do a potentially 4-way blend as i16s only to re-pack them into i8s afterward. This is the crazy expensive fallback path for i8 shuffles and we can just directly use pshufb here as it will always be cheaper (the unpack and pack are two instructions so even a single shuffle between them hits our three instruction limit for forming PSHUFB). However, this doesn't generate very good code in many cases, and it leaves a bunch of common patterns not using PSHUFB. So this patch also adds support for extracting a shuffle mask from PSHUFB in the X86 lowering code, and uses it to handle PSHUFBs in the recursive shuffle combining. This allows us to combine through them, combine multiple ones together, and generally produce sufficiently high quality code. Extracting the PSHUFB mask is annoyingly complex because it could be either pre-legalization or post-legalization. At least this doesn't have to deal with re-materialized constants. =] I've added decode routines to handle the different patterns that show up at this level and we dispatch through them as appropriate. The two primary test cases are updated. For the v16 test case there is still a lot of room for improvement. Since I was going through it systematically I left behind a bunch of FIXME lines that I'm hoping to turn into ALL lines by the end of this. llvm-svn: 214628	2014-08-02 10:39:15 +00:00
Chandler Carruth	d10b29240c	[x86] Switch to using the variable we extracted this operand into. Spotted this missed refactoring by inspection when reading code, and it doesn't change the functionality at all. llvm-svn: 214627	2014-08-02 10:29:36 +00:00
Chandler Carruth	5219d4eff6	[x86] Fix a few typos in my comments spotted in passing. llvm-svn: 214626	2014-08-02 10:29:34 +00:00
Chandler Carruth	34f9a987e9	[x86] Teach the target shuffle mask extraction to recognize unary forms of normally binary shuffle instructions like PUNPCKL and MOVLHPS. This detects cases where a single register is used for both operands making the shuffle behave in a unary way. We detect this and adjust the mask to use the unary form which allows the existing DAG combine for shuffle instructions to actually work at all. As a consequence, this uncovered a number of obvious bugs in the existing DAG combine which are fixed. It also now canonicalizes several shuffles even with the existing lowering. These typically are trying to match the shuffle to the domain of the input where before we only really modeled them with the floating point variants. All of the cases which change to an integer shuffle here have something in the integer domain, so there are no more or fewer domain crosses here AFAICT. Technically, it might be better to go from a GPR directly to the floating point domain, but detecting floating point outputs despite integer inputs is a lot more code and seems unlikely to be worthwhile in practice. If folks are seeing domain-crossing regressions here though, let me know and I can hack something up to fix it. Also as a consequence, a bunch of missed opportunities to form pshufb now can be formed. Notably, splats of i8s now form pshufb. Interestingly, this improves the existing splat lowering too. We go from 3 instructions to 1. Yes, we may tie up a register, but it seems very likely to be worth it, especially if splatting the 0th byte (the common case) as then we can use a zeroed register as the mask. llvm-svn: 214625	2014-08-02 10:27:38 +00:00
Chandler Carruth	2ad69eea8d	[x86] Teach my pshufb comment printer to handle VPSHUFB forms as well as PSHUFB forms. This will be important to update some AVX tests when I add PSHUFB combining. llvm-svn: 214624	2014-08-02 10:08:17 +00:00
Chandler Carruth	18066974d4	[SDAG] Refactor the code which deletes nodes in the DAG combiner to do so using a single helper which adds operands back onto the worklist. Several places didn't rigorously do this but a couple already did. Factoring them together and doing it rigorously is important to delete things recursively early on in the combiner and get a chance to see accurate hasOneUse values. While no existing test cases change, an upcoming patch to add DAG combining logic for PSHUFB requires this to work correctly. llvm-svn: 214623	2014-08-02 10:02:07 +00:00
Owen Anderson	9d5a8c2813	Fix issues with ISD::FNEG and ISD::FMA SDNodes where they would not be constant-folded during DAGCombine in certain circumstances. Unfortunately, the circumstances required to trigger the issue seem to require a pretty specific interaction of DAGCombines, and I haven't been able to find a testcase that reproduces on X86, ARM, or AArch64. The functionality added here is replicated in essentially every other DAG combine, so it seems pretty obviously correct. llvm-svn: 214622	2014-08-02 08:45:33 +00:00
Alexander Kornienko	da2734d4d0	Changed tool-template to use CommonOptionsParser. Reviewers: pcc, klimek Reviewed By: klimek Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4765 llvm-svn: 214621	2014-08-02 08:24:10 +00:00
NAKAMURA Takumi	70cac04c7f	libclang/Makefile: Update LINK_COMPONENTS take #3 . Sorry for the noise. llvm-svn: 214620	2014-08-02 07:24:04 +00:00
NAKAMURA Takumi	3654ce35ae	libclang/Makefile: Update LINK_COMPONENTS take #2 . llvm-svn: 214619	2014-08-02 07:16:14 +00:00
Zachary Turner	fb903ab7d4	Make the swig generation script use the correct python executable. It was hardcoding the value "python", which will end up at best getting a different python executable (if the user has overridden the value of PYTHON_EXECUTABLE), and at worst encountering an error (if there is no copy of python on the system path). This patch changes the script to use sys.executable so that it runs the sub-script with the same executable that it was run with. llvm-svn: 214618	2014-08-02 07:11:22 +00:00
NAKAMURA Takumi	090b78c7d2	libclang/Makefile: Restore some components in LINK_COMPONENTS. Clang's Makefile(s) are not transitive on clang libs. llvm-svn: 214617	2014-08-02 07:05:38 +00:00
NAKAMURA Takumi	bef5d81a50	libclang: Update LINK_COMPONENTS. llvm-svn: 214616	2014-08-02 06:58:39 +00:00
Justin Bogner	0950d79f60	CodeGen: Remove commented out code These two lines have been commented out for over 4 years. They aren't helping anyone. llvm-svn: 214615	2014-08-02 06:47:07 +00:00
Akira Hatanaka	dc08c30df9	[ARM] In dynamic-no-pic mode, ARM's post-RA pseudo expansion was incorrectly expanding pseudo LOAD_STACK_GUARD using instructions that are normally used in pic mode. This patch fixes the bug. <rdar://problem/17886592> llvm-svn: 214614	2014-08-02 05:40:40 +00:00
Lang Hames	70735351ca	[MCJIT] Fix an overly-aggressive check in RuntimeDyldMachOARM. This should fix the MachO_ARM_PIC_relocations.s test failures on some 32-bit testers. llvm-svn: 214613	2014-08-02 03:00:49 +00:00
Matt Arsenault	4de324442b	R600: Cleanup fneg tests llvm-svn: 214612	2014-08-02 02:26:51 +00:00
Michael Gottesman	55fcf34705	Add a small utility called bisect that enables commandline bisecting on a counter. This is something that I have found to be very useful in my work and I wanted to contribute it back to the community since several people in the past have asked me for something along these lines. (Jakob, I know this has been a while coming ;) The way you use this is you create a script that takes in as its first argument a count. The script passes the count into LLVM via a command line flag that disables a pass after the pass has run count times. Then the script invokes a test of some sort and indicates whether LLVM successfully compiled the test via the script's exit status. Then you invoke bisect as follows: bisect --start=<start_num> --end=<end_num> ./script.sh "%(count)s" And bisect will continually call ./script.sh with various counts using the exit status to determine success and failure. llvm-svn: 214610	2014-08-02 01:39:08 +00:00
Eric Fiselier	c85f00a062	[lit] Add --show-xfail flag to LIT. Summary: This patch adds a --show-xfail flag. If this flag is specified then each xfail test will be printed to output. When it is not given xfail tests are ignored. Ignoring xfail tests is the current behavior. This flag is meant to mirror the --show-unsupported flag that was recently added. Reviewers: ddunbar, EricWF Reviewed By: EricWF Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4750 llvm-svn: 214609	2014-08-02 01:29:52 +00:00
Matt Arsenault	a80c8770f9	R600/SI: Fix formatting. Avoid weird line wrapping of BuildMI dest register. llvm-svn: 214608	2014-08-02 01:10:28 +00:00
Alexander Kornienko	228dda5ac5	Use CommonOptionsParser in clang-query. This fixes its support of the fixed compilation database and makes it behave consistently with other clang tools. Reviewers: klimek, pcc Reviewed By: pcc Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4763 llvm-svn: 214607	2014-08-02 01:02:33 +00:00
Richard Smith	6d0e97afcf	In the case of mangling collisions, make an attempt to note both definitions involved. llvm-svn: 214606	2014-08-02 00:50:16 +00:00
Chandler Carruth	063f425ea7	[x86] Make some questionable tests not spew assembly to stdout, which makes a mess of the lit output when they ultimately fail. The 2012-10-02-DAGCycle test is really frustrating because the only explanation for what it is testing is a rdar link. I would really rather that rdar links (which are not public or part of the open source project) were not committed to the source code. Regardless, the actual problem must be described as the rdar link is completely opaque. The fact that this test didn't check for any particular output further exacerbates the inability of any other developer to debug failures. The mem-promote-integers test has nice comments and seems to be a great test for our lowering... except that we don't actually check that any of the generated code is correct or matches some pattern. We just avoid crashing. It would be great to go back and populate this test with the actual expectations. llvm-svn: 214605	2014-08-02 00:50:10 +00:00
Alexey Samsonov	d9ad5cec0c	[ASan] Use metadata to pass source-level information from Clang to ASan. Instead of creating global variables for source locations and global names, just create metadata nodes and strings. They will be transformed into actual globals in the instrumentation pass (if necessary). This approach is more flexible: 1) we don't have to ensure that our custom globals survive all the optimizations 2) if globals are discarded for some reason, we will simply ignore metadata for them and won't have to erase corresponding globals 3) metadata for source locations can be reused for other purposes: e.g. we may attach source location metadata to alloca instructions and provide better descriptions for stack variables in ASan error reports. No functionality change. llvm-svn: 214604	2014-08-02 00:35:50 +00:00
Jim Ingham	bb006ce291	After you attach, give the process plugin a chance to report back (through DidAttach) the architecture of the binary you attached to. <rdar://problem/17891396> llvm-svn: 214603	2014-08-02 00:33:35 +00:00
Chandler Carruth	ee1a1fc900	[SDAG] Allow the legalizer to delete an illegally typed intermediate introduced during legalization. This pattern is based on other patterns in the legalizer that I changed in the same way. Now, the legalizer eagerly collects its garbage when necessary so that we can survive leaving such nodes around for it. Instead, we add an assert to make sure the node will be correctly handled by that layer. llvm-svn: 214602	2014-08-02 00:24:54 +00:00
Chandler Carruth	3707dda904	[SDAG] Let the DAG combiner take care of dead nodes rather than manually deleting them. This already seems to work, as no tests fail without this. llvm-svn: 214601	2014-08-02 00:19:10 +00:00
Greg Clayton	7fde1b3b83	Now that setting an architecture from a mach-o CPU type and subtype doesn't set the OS type, make sure to set it. llvm-svn: 214600	2014-08-02 00:15:37 +00:00
Tyler Nowicki	064896bbc5	Add diagnostics to the vectorizer cost model. When the cost model determines vectorization is not possible/profitable these remarks print an analysis of that decision. Note that in selectVectorizationFactor() we can assume that OptForSize and ForceVectorization are mutually exclusive. Reviewed by Arnold Schwaighofer llvm-svn: 214599	2014-08-02 00:14:03 +00:00
NAKAMURA Takumi	78c32d75e0	BitcodeTests: Fix LINK_COMPONENTS. llvm-svn: 214598	2014-08-02 00:12:54 +00:00
Duncan P. N. Exon Smith	a5a8d3f6f2	verify-uselistorder: Reverse use-lists at every verification Updated `verify-uselistorder` to more than double the number of use-list orders it checks. - Every time it verifies an order, it then reverses the order and verifies again. - It now verifies the initial order, before running any shuffles. Changed the default to `-num-shuffles=1`, since this is already four checks, and after r214584 shuffling is guaranteed to make a new order. This is part of PR5680. llvm-svn: 214596	2014-08-01 23:49:41 +00:00
Duncan P. N. Exon Smith	9a2017bfc3	verify-uselistorder: Add missing `static` llvm-svn: 214595	2014-08-01 23:31:13 +00:00
Duncan P. N. Exon Smith	3441ffe98d	IR: Add Value::reverseUseList() I'm going to use this to improve `verify-uselistorder`. Part of PR5680. llvm-svn: 214594	2014-08-01 23:28:49 +00:00
Peter Collingbourne	e52646cd80	PartiallyInlineLibCalls: Check sqrt result type before transforming it. Some configure scripts declare this with the wrong prototype, which can lead to an assertion failure. llvm-svn: 214593	2014-08-01 23:21:21 +00:00
Duncan P. N. Exon Smith	3da117d272	verify-uselistorder: Move shuffleUseLists() out of lib/IR `shuffleUseLists()` is only used in `verify-uselistorder`, so move it there to avoid bloating other executables. As a drive-by, update some of the header docs. This is part of PR5680. llvm-svn: 214592	2014-08-01 23:03:36 +00:00
Adrian Prantl	d13dba42f6	Cleanup this test some more. llvm-svn: 214591	2014-08-01 23:01:32 +00:00
Adrian Prantl	a717a6da3d	Add the missing target triple to this testcase. llvm-svn: 214590	2014-08-01 23:01:30 +00:00
Ben Langmuir	4ad99c3b2e	Fix test from r214577 for other timezones Unsurprisingly, changing a file modification time to a specific date/time doesn't give the same epoch time everywhere. Just make the file move into the past and look at only the first few digits of the epoch time. llvm-svn: 214589	2014-08-01 22:58:19 +00:00
Adrian Prantl	a6cf448226	Attempt to increase the overall happiness of the MSVC-based buildbots. llvm-svn: 214588	2014-08-01 22:56:10 +00:00
Duncan P. N. Exon Smith	36d57a2303	verify-uselistorder: Make the verification logic easier to reuse llvm-svn: 214587	2014-08-01 22:52:06 +00:00
Justin Bogner	9c6818ef00	InstrProf: Update for LLVM API change We've added support for multiple functions with the same name in LLVM's profile data, so the lookup returning the function hash it found doesn't make sense anymore. Update to pass in the hash we expect. This also adds a test that the version 1 format is still readable, since the new API is expected to handle that. llvm-svn: 214586	2014-08-01 22:50:16 +00:00
Justin Bogner	821d7471f9	InstrProf: Allow multiple functions with the same name This updates the instrumentation based profiling format so that when we have multiple functions with the same name (but different function hashes) we keep all of them instead of rejecting the later ones. There are a number of scenarios where this can come up where it's more useful to keep multiple function profiles: * Name collisions in unrelated libraries that are profiled together. * Multiple "main" functions from multiple tools built against a common library. * Combining profiles from different build configurations (i.e., asserts and no-asserts) The profile format now stores the number of counters between the hash and the counts themselves, so that multiple sets of counts can be stored. Since this is backwards incompatible, I've bumped the format version and added some trivial logic to skip this when reading the old format. llvm-svn: 214585	2014-08-01 22:50:07 +00:00
Duncan P. N. Exon Smith	6d3adac217	UseListOrder: Guarantee that shuffles change use-list order Change shuffleUseLists() always to change use-list order by rejecting orders that have no changes. This is part of PR5680. llvm-svn: 214584	2014-08-01 22:50:04 +00:00
Sean Callanan	608fb390a8	Fixed a problem in the Clang AST importer where we overrode debug information as the authoritative source for type information, substituting types from the Objective-C runtime. The runtime should never be the primary source. <rdar://problem/16065049> llvm-svn: 214583	2014-08-01 22:42:38 +00:00
Richard Smith	7f5755cfac	Notional simplification: defer emitting deferred inline methods until we finish emitting everything, rather than potentially doing this reentrantly. llvm-svn: 214582	2014-08-01 22:42:16 +00:00
Duncan P. N. Exon Smith	6e1009b65e	UseListOrder: Fix blockaddress use-list order `parseBitcodeFile()` uses the generic `getLazyBitcodeFile()` function as a helper. Since `parseBitcodeFile()` isn't actually lazy -- it calls `MaterializeAllPermanently()` -- bypass the unnecessary call to `materializeForwardReferencedFunctions()` by extracting out a common helper function. This removes the last of the use-list churn caused by blockaddresses. This highlights that we can't reproduce use-list order of globals and constants when parsing lazily -- but that's necessarily out of scope. When we're parsing lazily, we never have all the functions in memory, so the use-lists of globals (and constants that reference globals) are always incomplete. This is part of PR5680. llvm-svn: 214581	2014-08-01 22:27:19 +00:00
Akira Hatanaka	3516669a50	[X86] Simplify X87 stackifier pass. Stop using ST registers for function returns and inline-asm instructions and use FP registers instead. This allows removing a large amount of code in the stackifier pass that was needed to track register liveness and handle copies between ST and FP registers and function calls returning floating point values. It also fixes a bug which manifests when an ST register defined by an inline-asm instruction was live across another inline-asm instruction, as shown in the following sequence of machine instructions: 1. INLINEASM <es:frndint> $0:[regdef], %ST0<imp-def,tied5> 2. INLINEASM <es:fldcw $0> 3. %FP0<def> = COPY %ST0 <rdar://problem/16952634> llvm-svn: 214580	2014-08-01 22:19:41 +00:00
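
The counter-bisecting workflow described in r214610 above (a script takes a count as its first argument and reports success or failure through its exit status) can be sketched roughly as follows. This is not the code of the committed bisect utility; it is a minimal illustration in Python, and the names `first_failing_count` and `run_script` are hypothetical. It also assumes the outcome is monotone: the script succeeds for every count below some threshold and fails for every count at or above it.

```python
import subprocess
from typing import Callable, Optional, Sequence


def run_script(command_template: Sequence[str], count: int) -> bool:
    """Run the user's script with "%(count)s" replaced by the count.

    Hypothetical helper: returns True when the script exits with status 0.
    """
    argv = [arg % {"count": count} for arg in command_template]
    return subprocess.call(argv) == 0


def first_failing_count(
    start: int, end: int, succeeds: Callable[[int], bool]
) -> Optional[int]:
    """Binary-search the half-open range [start, end) for the first failure.

    Assumes (this is an assumption of the sketch) that succeeds(c) is True
    for all counts below some threshold and False from the threshold onward.
    Returns None when every count in the range succeeds.
    """
    lo, hi = start, end
    while lo < hi:
        mid = (lo + hi) // 2
        if succeeds(mid):
            lo = mid + 1  # the first failure, if any, lies above mid
        else:
            hi = mid  # mid fails, so the first failure is mid or below
    return lo if lo < end else None


if __name__ == "__main__":
    # Stand-in for a real test script: pretend counts of 37 and up break.
    print(first_failing_count(0, 100, lambda count: count < 37))
```

With a real script, the predicate would be something like `lambda c: run_script(["./script.sh", "%(count)s"], c)`, mirroring the command line quoted in the commit message. A binary search needs only about log2(end - start) script invocations, which matters when each invocation is a full compile and test run.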


179997 Commits
