llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrew Trick	4d4b5469ab	Fix a naughty header include that breaks "installed" builds. llvm-svn: 155486	2012-04-24 20:36:19 +00:00
Nadav Rotem	450d69a5ee	ConstantFoldSelectInstruction swapped the operands of the select. Fix 12592. Patch by Matt Pharr. llvm-svn: 155480	2012-04-24 20:18:49 +00:00
Nadav Rotem	d50c3b2c57	Fix the testcase. We do expect two vblendw on XMMs. llvm-svn: 155477	2012-04-24 19:57:38 +00:00
Nadav Rotem	edef71790b	Add a testcase for 155440 llvm-svn: 155475	2012-04-24 19:45:28 +00:00
Evan Cheng	2d14d8aca1	MachineBasicBlock::SplitCriticalEdge() should follow LLVM IR variant and refuse to break edge to EH landing pad. rdar://11300144 llvm-svn: 155470	2012-04-24 19:06:55 +00:00
Lang Hames	84531c2b5f	Add support for llvm.arm.neon.vmull* intrinsics to InstCombine. This fixes <rdar://problem/11291436>. llvm-svn: 155468	2012-04-24 18:58:36 +00:00
Chandler Carruth	aacb8a5809	Fix a crash on valid (if UB) bitcode that is produced for some global constants in C++11 mode. I have no idea why it required such particular circumstances to get here, the code seems clearly to rely upon unchecked assumptions. Specifically, when we decide to form an index into a struct type, we may have gone through (at least one) zero-length array indexing round, which would have left the offset un-adjusted, and thus not necessarily valid for use when indexing the struct type. This is just a canonicalization step, so the correct thing is to refuse to canonicalize nonsensical GEPs of this form. Implemented, and test case added. Fixes PR12642. Pair debugged and coded with Richard Smith. =] I credit him with most of the debugging, and preventing me from writing the wrong code. llvm-svn: 155466	2012-04-24 18:42:47 +00:00
Jim Grosbach	1e75fc1fe1	ARM: Nuke remnant bogus code. r154362 was supposed to delete this bit, but obviously didn't. rdar://11305594 llvm-svn: 155465	2012-04-24 18:39:47 +00:00
Stepan Dyatkovskiy	040978403a	Related to PR1255. Let's begin. I'll commit classes that correspond to our latest PR1255 discussion posts in llvm-commits. Strategy. 0. Implement new classes. Classes don't affect anything. They still work with ConstantInt base values at this stage. 1. Fictitious replacement of current ConstantInt case values with ConstantRangesSet. Case ranges set will still hold single value, and ConstantInt getCaseValue() will return it. But additionally implement new method in SwitchInst that allows to work with case ranges. Currently I think it should be some wrapper that returns either single value or ConstantRangesSet object. 2. Step-by-step replacement of old "ConstantInt getCaseValue()" with new alternative. Modify algorithms for all passes that work with SwitchInst. But don't modify LLParser and BitcodeReader/Writer. Still hold single value in each ConstantRangesSet object. On this stage some parts of LLVM will use old-style methods, and some ones new-style. 3. After all getCaseValue() usages will be removed and whole LLVM and its clients will work in new style - modify LLParser, Reader and Writer. Remove getCaseValue(). 4. Replace ConstantInt-based case ranges set items with APInt ones. Currently we are on Zero Stage: New classes. ConstantRangesSet. I selected ConstantArrays as case ranges set "holder" object (it is a temporary decision, I'll explain why below). The array items may be ConstantVectors with single item, and ConstantVectors with two items (that means single number and range respectively). The ConstantInt will be used as basic value representation. It will be replaced with APInt then. Of course ConstantArray and ConstantVector will go away after ConstantInt => APInt replacement. New class mandatory features: - bool isSatisfies(ConstantInt V) method (need better name?). Returns true if the given value satisfies this case. - Case's ranges and values enumeration. In some passes we need to analyze each case (SwitchLowering for example). Factory + unified clusterify. I also propose to implement the factory that allows to build case object in a user-friendly way. I called it CRSBuilder by now. Currently I implemented the factory that allows adding and removing pairs of range+successor. It also allows adding an existing ConstantRangesSet, decompiling it into separate ranges. Factory can emit either clusters set (single case range + successor) or the set of "ConstantRangesSet + Successor" pairs. So you can use it either as builder for new cases set for SwitchInst, or for clusterification of existing cases set. Just call Factory.optimize() and it emits optimized and sorted clusters collection for you! I tested clusterification on SelectionDAGBuilder - it works fine. Don't worry it was not included in this patch. Just new classes. Factory is a template. There are two params: SuccessorClass and IsReadonly. So you can specify what successor you need (BB or MBB). And you can also restrict your factory to use values in read-only mode (SelectionDAGBuilder needs IsReadonly=true). Read-only factory couldn't build the cases ranges. llvm-svn: 155464	2012-04-24 18:31:10 +00:00
Nadav Rotem	810734b7f4	AVX: Add additional vbroadcast replacement sequences for integers. Remove the v2f64 patterns because it does not match any vbroadcast instruction. llvm-svn: 155461	2012-04-24 18:09:59 +00:00
Andrew Trick	26bdff9b82	cmake: new file llvm-svn: 155460	2012-04-24 18:06:49 +00:00
Andrew Trick	9e9a9f1465	misched: DAG builder must special case earlyclobber llvm-svn: 155459	2012-04-24 18:04:41 +00:00
Andrew Trick	c3ea00565f	misched: try (not too hard) to place debug values where they belong llvm-svn: 155458	2012-04-24 18:04:37 +00:00
Andrew Trick	cc45a28320	misched: ignore debug values during scheduling llvm-svn: 155457	2012-04-24 18:04:34 +00:00
Andrew Trick	88639928bd	misched: DAG builder support for tracking register pressure within the current scheduling region. The DAG builder is a convenient place to do it. Hopefully this is more efficient than a separate traversal over the same region. llvm-svn: 155456	2012-04-24 17:56:43 +00:00
Andrew Trick	3cd53a1a52	RegisterPressure: A utility for computing register pressure within a MachineInstr sequence. This uses the new target interface for tracking register pressure using pressure sets to model overlapping register classes and subregisters. RegisterPressure results can be tracked incrementally or stored at region boundaries. Global register pressure can be deduced from local RegisterPressure results if desired. This is an early, somewhat untested implementation. I'm working on testing it within the context of a register pressure reducing MachineScheduler. llvm-svn: 155454	2012-04-24 17:53:35 +00:00
Kevin Enderby	70be447e5c	Add missing test cases for ARM VLD3 (single 3-element structure to all lanes) instructions. llvm-svn: 155453	2012-04-24 17:45:56 +00:00
Kevin Enderby	c8d223e41e	Add missing test cases for ARM VLD4 (single 4-element structure to all lanes) instructions. llvm-svn: 155444	2012-04-24 15:55:00 +00:00
Nadav Rotem	7b7b99c74a	AVX2: The BLENDPW instruction selects between vectors of v16i16 using an i8 immediate. We can't use it here because the shuffle code does not check that the lower part of the word is identical to the upper part. llvm-svn: 155440	2012-04-24 11:27:53 +00:00
Richard Barton	e9600009e9	Refactor Thumb ITState handling in ARM Disassembler to more efficiently use its vector llvm-svn: 155439	2012-04-24 11:13:20 +00:00
Nadav Rotem	aa3ff8da00	AVX: We lower VECTOR_SHUFFLE and BUILD_VECTOR nodes into vbroadcast instructions using the pattern (vbroadcast (i32load src)). In some cases, after we generate this pattern new users are added to the load node, which prevent the selection of the blend pattern. This commit provides fallback patterns which perform in-vector broadcast (using in-vector vbroadcast in AVX2 and pshufd on AVX1). llvm-svn: 155437	2012-04-24 11:07:03 +00:00
Bill Wendling	f1b14b719f	Look for the 'Is Simulated' module flag. This indicates that the program is compiled to run on a simulator. llvm-svn: 155435	2012-04-24 11:03:50 +00:00
Bill Wendling	1981c0e533	FileCheck-ize tests. llvm-svn: 155434	2012-04-24 10:45:44 +00:00
Bill Wendling	4cf911c0cd	FileCheck-ize these tests. llvm-svn: 155433	2012-04-24 10:36:42 +00:00
Bill Wendling	cd6df16cb4	FileCheck-ize these tests. Harden some of them. llvm-svn: 155432	2012-04-24 09:15:38 +00:00
Craig Topper	0b65c40821	Remove dangling spaces. Fix some other formatting. llvm-svn: 155429	2012-04-24 06:36:35 +00:00
Craig Topper	6f2a535de2	Simplify code a bit and make it compile better. Remove unused parameters. llvm-svn: 155428	2012-04-24 06:02:29 +00:00
Evan Cheng	7fd160700f	Add a missing cpu subtype. llvm-svn: 155402	2012-04-23 22:41:39 +00:00
Jim Grosbach	671ad2a572	Tidy up. 80 columns, whitespace, et al. llvm-svn: 155399	2012-04-23 22:04:10 +00:00
Nadav Rotem	3f8acfc3c4	Optimize the vector UINT_TO_FP, SINT_TO_FP and FP_TO_SINT operations where the integer type is i8 (commonly used in graphics). llvm-svn: 155397	2012-04-23 21:53:37 +00:00
Preston Gurd	9a0914753a	This patch fixes a problem which arose when using the Post-RA scheduler on X86 Atom. Some of our tests failed because the tail merging part of the BranchFolding pass was creating new basic blocks which did not contain live-in information. When the anti-dependency code in the Post-RA scheduler ran, it would sometimes rename the register containing the function return value because the fact that the return value was live-in to the subsequent block had been lost. To fix this, it is necessary to run the RegisterScavenging code in the BranchFolding pass. This patch makes sure that the register scavenging code is invoked in the X86 subtarget only when post-RA scheduling is being done. Post RA scheduling in the X86 subtarget is only done for Atom. This patch adds a new function to the TargetRegisterClass to control whether or not live-ins should be preserved during branch folding. This is necessary in order for the anti-dependency optimizations done during the PostRASchedulerList pass to work properly when doing Post-RA scheduling for the X86 in general and for the Intel Atom in particular. The patch adds and invokes the new function trackLivenessAfterRegAlloc() instead of using the existing requiresRegisterScavenging(). It changes BranchFolding.cpp to call trackLivenessAfterRegAlloc() instead of requiresRegisterScavenging(). It changes all the targets that implemented requiresRegisterScavenging() to also implement trackLivenessAfterRegAlloc(). It adds an assertion in the Post RA scheduler to make sure that post RA liveness information is available when it is needed. It changes the X86 break-anti-dependencies test to use -mcpu=atom, in order to avoid running into the added assertion. Finally, this patch restores the use of anti-dependency checking (which was turned off temporarily for the 3.1 release) for Intel Atom in the Post RA scheduler. Patch by Andy Zhang! Thanks to Jakob and Anton for their reviews. llvm-svn: 155395	2012-04-23 21:39:35 +00:00
Jim Grosbach	41e94d79be	ARM: VSLI two-operand assembly aliases are tblgen'erated. llvm-svn: 155393	2012-04-23 21:22:04 +00:00
Jim Grosbach	3dada484c3	ARM: tblgen'erate VSRA/VRSRA/VSRI assembly two-operand aliases. llvm-svn: 155392	2012-04-23 21:00:49 +00:00
Jim Grosbach	f6371b5238	ARM: Add testcases for two-operand variants of VSRA/VRSRA/VSRI. llvm-svn: 155391	2012-04-23 21:00:47 +00:00
Jim Grosbach	76cdd136bf	Add ARM mode tests for the NEON vector shift-accumulate tests. llvm-svn: 155390	2012-04-23 21:00:44 +00:00
Jim Grosbach	5c7e9e5e1b	Tidy up. Reformat for ease of reading. llvm-svn: 155389	2012-04-23 21:00:42 +00:00
Jim Grosbach	e5012fbad3	ARM: vqdmulh two-operand aliases are tblgen'erated now. llvm-svn: 155387	2012-04-23 20:37:20 +00:00
Michael J. Spencer	04b795bc1d	[Support/Unix] Unconditionally include time.h. When building LLVM on Linux with libc++ with CMake TIME_WITH_SYS_TIME is undefined, and HAVE_SYS_TIME_H is defined. This ends up including sys/time.h but not time.h. Unix/TimeValue.inc requires time.h for asctime_r and localtime. libstdc++ seems to include time.h anyway, but libc++ does not. Fix this by always including time.h llvm-svn: 155382	2012-04-23 19:00:27 +00:00
Eric Christopher	27deb265f9	Allow forward declarations to take a context. This helps the debugger find forward declarations in the context that the actual definition will occur. rdar://11291658 llvm-svn: 155380	2012-04-23 19:00:11 +00:00
Chandler Carruth	af0f8bf595	Temporarily revert r155364 until the upstream review can complete, per the stated developer policy. llvm-svn: 155373	2012-04-23 18:28:57 +00:00
Chandler Carruth	3c3bb55a85	Revert r155365, r155366, and r155367. All three of these have regression test suite failures. The failures occur at each stage, and only get worse, so I'm reverting all of them. Please resubmit these patches, one at a time, after verifying that the regression test suite passes. Never submit a patch without running the regression test suite. llvm-svn: 155372	2012-04-23 18:25:57 +00:00
Sirish Pande	a3f8ba2439	Hexagon V5 (floating point) support. llvm-svn: 155367	2012-04-23 17:49:40 +00:00
Sirish Pande	2c7bf00fba	Support for Hexagon architectural feature, new value jump. llvm-svn: 155366	2012-04-23 17:49:28 +00:00
Sirish Pande	6cd2251598	Support for Hexagon VLIW Packetizer. llvm-svn: 155365	2012-04-23 17:49:20 +00:00
Sirish Pande	995c8dbfd2	Hexagon Packetizer's target independent fix. llvm-svn: 155364	2012-04-23 17:49:09 +00:00
Jakob Stoklund Olesen	43bcb970e5	Reapply r155136 after fixing PR12599. Original commit message: Defer some shl transforms to DAGCombine. The shl instruction is used to represent multiplication by a constant power of two as well as bitwise left shifts. Some InstCombine transformations would turn an shl instruction into a bit mask operation, making it difficult for later analysis passes to recognize the constant multiplication. Disable those shl transformations, deferring them to DAGCombine time. An 'shl X, C' instruction is now treated mostly the same way as 'mul X, C'. These transformations are deferred: (X >>? C) << C --> X & (-1 << C) (When X >> C has multiple uses) (X >>? C1) << C2 --> X << (C2-C1) & (-1 << C2) (When C2 > C1) (X >>? C1) << C2 --> X >>? (C1-C2) & (-1 << C2) (When C1 > C2) The corresponding exact transformations are preserved, just like div-exact + mul: (X >>?,exact C) << C --> X (X >>?,exact C1) << C2 --> X << (C2-C1) (X >>?,exact C1) << C2 --> X >>?,exact (C1-C2) The disabled transformations could also prevent the instruction selector from recognizing rotate patterns in hash functions and cryptographic primitives. I have a test case for that, but it is too fragile. llvm-svn: 155362	2012-04-23 17:39:52 +00:00
Sylvestre Ledru	3099f4bda8	Conflict with st_dev/st_ino identifiers under Debian GNU/Hurd. The problem is that the struct file_status on UNIX systems has two members called st_dev and st_ino; those are also members of the struct stat, and they are reserved identifiers which can also be provided as #define (and this is the case for st_dev on Hurd). The solution (attached) is to rename them, for example adding a "fs_" prefix (= file status) to them. Patch by Pino Toscano llvm-svn: 155354	2012-04-23 16:37:23 +00:00
Alexander Potapenko	056e27ea49	Fix issue 67 by checking that the interface functions weren't redefined in the compiled source file. llvm-svn: 155346	2012-04-23 10:47:31 +00:00
Kostya Serebryany	5a4b7a232c	[tsan] use llvm/ADT/Statistic.h for tsan stats llvm-svn: 155341	2012-04-23 08:44:59 +00:00
Craig Topper	153bb34a3c	Use MVT instead of EVT through all of LowerVECTOR_SHUFFLEtoBlend and not just the switch. Saves a little bit of binary size. llvm-svn: 155339	2012-04-23 07:36:33 +00:00


81880 Commits