llvm-project

Commit Graph

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	b474937929	AsmPrinter: Stop creating DebugLocs While looking at a heap profile of a clang LTO bootstrap with -g, I noticed that 2.2% of memory in an `llvm-lto` of clang is from calling `DebugLoc::get()` in `collectVariableInfo()` (accounting for ~40% of memory used for `MDLocation`s). I suspect this was introduced by r226736, whose goal was to prevent uniquing of `DebugLoc`s (goal achieved, if so). There's no reason we need a `DebugLoc` here at all -- it was just being used for (in)convenient API -- so the fix is to pass the scope and inlined-at directly to `LexicalScopes::findInlinedScope()`. llvm-svn: 229459	2015-02-17 00:02:27 +00:00
Hal Finkel	5cedafb8cd	[PowerPC] Support non-direct-sub/superclass VSX copies Our register allocation has become better recently, it seems, and is now starting to generate cross-block copies into inflated register classes. These copies are not transformed into subregister insertions/extractions by the PPCVSXCopy class, and so need to be handled directly by PPCInstrInfo::copyPhysReg. The code to do this was almost there, but not quite (it was unnecessarily restricting itself to only the direct sub/super-register-class case (not copying between, for example, something in VRRC and the lower-half of VSRC which are super-registers of F8RC). Triggering this behavior manually is difficult; I'm including two bugpoint-reduced test cases from the test suite. llvm-svn: 229457	2015-02-16 23:46:30 +00:00
Justin Bogner	fcb2de694a	Revert "InstrProf: Add unit tests for the profile reader and writer" Looks like the bots don't like my initializer lists. This reverts r229455 llvm-svn: 229456	2015-02-16 23:31:07 +00:00
Justin Bogner	f83e895fa7	InstrProf: Add unit tests for the profile reader and writer This required some minor API to be added to these types to avoid needing temp files. Also, I've used initializer lists in the tests, as MSVC 2013 claims to support them. I'll redo this without them if the bots complain. llvm-svn: 229455	2015-02-16 23:27:48 +00:00
Simon Atanasyan	79ba8407d2	[Mips] Add .MIPS.options section descriptor kinds enumeration No functional changes. llvm-svn: 229452	2015-02-16 22:59:29 +00:00
Lang Hames	05fa2b0a14	[Orc] Add an emitAndFinalize method to the ObjectLinkingLayer, IRCompileLayer and LazyEmittingLayer of Orc. This method allows you to immediately emit and finalize a module. It is required by an upcoming refactor of the indirection utils and the compile-on-demand layer. I've filed http://llvm.org/PR22608 to write unit tests for this and other Orc APIs. llvm-svn: 229451	2015-02-16 22:36:25 +00:00
Ahmed Bougacha	bf2b90e92d	[ARM] Remove unused declaration. NFC. GlobalMerge was moved to lib/CodeGen a while ago, and is no longer called "ARMGlobalMerge". llvm-svn: 229448	2015-02-16 22:30:08 +00:00
Cameron McInally	c5764cbe4e	[AVX512] Make 512b vector floating point rounds legal on AVX512. llvm-svn: 229445	2015-02-16 22:15:42 +00:00
Matthias Braun	15635c5f85	RegisterCoalescer: Don't rematerialize subregister definitions. We cannot simply rematerialize instructions which only defining a subregister, as the final value also depends on the previous instructions. This fixes test/CodeGen/R600/subreg-coalescer-bug.ll with subreg liveness enabled. llvm-svn: 229444	2015-02-16 22:05:17 +00:00
Matthias Braun	1b901a4435	RegisterCoalescer: Do not look for regclass of IMPLICIT_DEF. IMPLICIT_DEF is a generic instruction and has no (fixed) output register class defined. The rematerialization code of the register coalescer should not scan the instruction description for a register class. This fixes a problem showing up in test/CodeGen/R600/subreg-coalescer-crash.ll with subregister liveness enabled. llvm-svn: 229443	2015-02-16 22:05:12 +00:00
Simon Pilgrim	b2c00f3286	[X86][SSE] Add SSE MOVQ instructions to SSEPackedInt domain Patch to explicitly add the SSE MOVQ (rr,mr,rm) instructions to SSEPackedInt domain - prevents a number of costly domain switches. Differential Revision: http://reviews.llvm.org/D7600 llvm-svn: 229439	2015-02-16 21:50:56 +00:00
Mehdi Amini	3e0023b8f6	SelectionDAG: fold (fp_to_u/sint (s/uint_to_fp)) here too Update SPARC tests to match. From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229438	2015-02-16 21:47:58 +00:00
Mehdi Amini	b9a0fa4822	InstCombine: fold more cases of (fp_to_u/sint (u/sint_to_fp val)) Fixes radar 15486701. From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229437	2015-02-16 21:47:54 +00:00
Mehdi Amini	7aab8752ba	Tests: reformat sitofp.ll and use FileCheck From: Fiona Glaser <fglaser@apple.com> llvm-svn: 229436	2015-02-16 21:47:50 +00:00
Justin Bogner	ab89ed7dd5	InstrProf: Use ErrorOr for IndexedInstrProfReader::create (NFC) The other InstrProfReader::create factories were updated to return ErrorOr in r221120, and it's odd for these APIs not to match. llvm-svn: 229433	2015-02-16 21:28:58 +00:00
Craig Topper	49df44e2e2	[X86] Remove the multiply by 8 that goes into the shift constant for X86ISD::VSHLDQ and X86ISD::VSRLDQ. This simplifies the pattern matching in isel and allows these nodes to become the patterns embedded in the instruction. llvm-svn: 229431	2015-02-16 20:52:07 +00:00
Craig Topper	44026efa88	[X86] Remove x86.avx2.psll.dq.bs and x86.avx2.psrl.dq.bs intrinsics. llvm-svn: 229430	2015-02-16 20:51:59 +00:00
Matthias Braun	d6b108e445	ARM: Transfer kill flag when lowering VSTMQIA to VSTMDIA. llvm-svn: 229425	2015-02-16 19:34:30 +00:00
Matthias Braun	2eab0e30e1	RegisterCoalescer: Improve previous fix for wrong def after. The previous fix in r225503 was needlessly complicated. The problem goes away as well if the arguments to MergeValueNumberInto are supplied in the correct order. This was previously missed because the existing code already had the wrong order but an additional later Merge was hiding the bug for the main liverange VNI. llvm-svn: 229424	2015-02-16 19:34:27 +00:00
Aaron Ballman	97a59fb464	MSVC 2013 does not ICE on this code in the same fashion that MSVC 2012 did; NFC. llvm-svn: 229422	2015-02-16 19:33:36 +00:00
Duncan P. N. Exon Smith	060ee625b8	Bitcode: Fix major regression: large files w/ debug info The metadata/value split introduced a major regression reading large bitcode files that contain debug info (or other cyclic (non-self reference) metadata graphs). For the first time in a while, I dropped from libLTO.dylib down to `llvm-lto` with a non-trivial bitcode file (~350MB), and I hit this when reading the result of ld64's `-save-temps` in `llvm-lto`. Here's pseudo-code for what was going on: read-main-metadata-block: for each md: if has-fwd-ref: // Only true for cyclic graphs. any-fwd-refs <- true if any-fwd-refs: foreach md: resolve-cycles(md) // Handle cycles. foreach function: read-function-metadata-block: // Such as !alias, !loop if any-fwd-refs: foreach md: // (all metadata, not just this block) resolve-cycles(md) // A no-op, but the loop is expensive!! This commit resets the `AnyFwdRefs` flag to `false`. This on its own was enough to change my Release+Asserts `llvm-lto` time for reading this bitcode from over 20 minutes (I gave up on it) to 20 seconds. I've gone further by tracking the min/max metadata forward-references in a metadata block. This protects against a schema that has lots of functions that each reference their own metadata cycle. Unfortunately, this regression is in the 3.6 branch as well. llvm-svn: 229421	2015-02-16 19:18:01 +00:00
David Majnemer	8b77454dff	ConstantFold: Properly fold GEP indices wider than i64 llvm-svn: 229420	2015-02-16 19:10:02 +00:00
James Molloy	83570247f1	Run LICM as part of the cleanup phase from the scalar optimizer. Things like LoopUnrolling can produce loop invariant values - make sure we pick them up. llvm-svn: 229419	2015-02-16 18:59:54 +00:00
Aaron Ballman	da9501b25c	We require MSVC 1800 as our minimum, so these checks can safely go away; NFC. (It seems this code has been copy/pasted around, unfortunately.) llvm-svn: 229417	2015-02-16 18:34:57 +00:00
Aaron Ballman	b664e2a24b	We require MSVC 1800 as our minimum, so these checks can safely go away; NFC. llvm-svn: 229415	2015-02-16 18:23:00 +00:00
Aaron Ballman	f17583ee9c	MSVC 2013 supports std::forward_as_tuple, while MSVC 2012 did not; so we can move to using the improved API. llvm-svn: 229414	2015-02-16 18:21:19 +00:00
Andrew Trick	05938a5481	AArch64: Safely handle the incoming sret call argument. This adds a safe interface to the machine independent InputArg struct for accessing the index of the original (IR-level) argument. When a non-native return type is lowered, we generate the hidden machine-level sret argument on-the-fly. Before this fix, we were representing this argument as OrigArgIndex == 0, which is an outright lie. In particular this crashed in the AArch64 backend where we actually try to access the type of the original argument. Now we use a sentinel value for machine arguments that have no original argument index. AArch64, ARM, Mips, and PPC now check for this case before accessing the original argument. Fixes <rdar://19792160> Null pointer assertion in AArch64TargetLowering llvm-svn: 229413	2015-02-16 18:10:47 +00:00
Hal Finkel	c64150b8f3	[ADCE] Don't indent inside an anonymous namespace To be consistent with what clang-format does, don't add extra indentation inside an anonymous namespace. NFC. llvm-svn: 229412	2015-02-16 18:08:00 +00:00
James Molloy	e32d806b5f	[LoopReroll] Relax some assumptions a little. We won't find a root with index zero in any loop that we are able to reroll. However, we may find one in a non-rerollable loop, so bail gracefully instead of failing hard. llvm-svn: 229406	2015-02-16 17:02:00 +00:00
James Molloy	4c7deb2259	[LoopReroll] Don't crash on dead code If a PHI has no users, don't crash; bail gracefully. This shouldn't happen often, but we can make no guarantees that previous passes didn't leave dead code around. llvm-svn: 229405	2015-02-16 17:01:52 +00:00
Jonas Paulsson	f81c3ebeb7	[PBQP] Improve the assert for conservatively allocatables. Remember if the node ever was in this state instead of checking just the final state. Reviewed by Arnaud de Grandmaison. llvm-svn: 229400	2015-02-16 15:39:26 +00:00
Evgeniy Stepanov	292acab847	[asan] Reuse a common function. Do not reimplement RoundUpToAlignment. llvm-svn: 229397	2015-02-16 14:49:37 +00:00
Chandler Carruth	1e57e2deb8	[x86] Add a generic unpack-targeted lowering technique. This can be used to generically lower blends and is particularly nice because it is available frome SSE2 onward. This removes a lot of the remaining domain crossing blends in SSE2 code. I'm hoping to replace some of the "interleaved" lowering hacks with something closer to this which should be more principled. First, this needs to learn how to detect and use other interleavings besides that of the natural type provided. That will be a follow-up patch though. llvm-svn: 229378	2015-02-16 12:28:18 +00:00
Chandler Carruth	50dc783d75	[x86] Switch this test to use checks generated by my update script. NFC llvm-svn: 229377	2015-02-16 12:23:22 +00:00
Michael Kuperstein	fc3e626752	Fix quoting of #pragma comment for MS compat, LLVM part. For #pragma comment(linker, ...) MSVC expects the comment string to be quoted, but for #pragma comment(lib, ...) the compiler itself quotes the library name. Since this distinction disappears by the time the directive reaches the backend, move quoting for the "lib" version to the frontend. Differential Revision: http://reviews.llvm.org/D7652 llvm-svn: 229375	2015-02-16 11:57:17 +00:00
Chandler Carruth	c802085b3a	[x86] Add initial basic support for forming blends of v16i8 vectors. This blend instruction is ... really lame. The register usage is insane. As a consequence this is probably only barely better than 2 pshufbs followed by a por, and that mostly because it only has to read from a single memory location. However, this doesn't fix as much as I kind of expected, so more to go. Pretty sure that the ordering and delegation of v16i8 is just really, really bad. llvm-svn: 229373	2015-02-16 10:58:23 +00:00
Chandler Carruth	e8b558c336	[x86] Add some more test cases for i8 vector blends. llvm-svn: 229372	2015-02-16 10:51:49 +00:00
Benjamin Kramer	499473c201	Document that defaulted & deleted methods and explicit conversions are allowed now. llvm-svn: 229369	2015-02-16 10:28:41 +00:00
Chandler Carruth	e63bbd97a7	[x86] Switch my usage of VariadicFunction to a "normal" variadic template now that we can use them. This is, of course, horribly ugly because of the required recursive formulation. Suggestions for making it less ugly welcome. llvm-svn: 229367	2015-02-16 09:59:48 +00:00
David Majnemer	8b576a579a	IR: SrcTy == DstTy doesn't imply that a cast is valid Cast validity depends on the cast's kind, not just its types. llvm-svn: 229366	2015-02-16 09:37:35 +00:00
David Majnemer	7ccc34dbc1	AsmParser: extractvalue requires at least one index operand llvm-svn: 229365	2015-02-16 09:18:13 +00:00
David Majnemer	49b3d9bc84	AsmParser: Make sure GlobalVariables have sane types llvm-svn: 229364	2015-02-16 08:41:08 +00:00
David Majnemer	a3b0eb2f7f	AsmParser: Reject alloca with function type llvm-svn: 229363	2015-02-16 08:38:03 +00:00
Chandler Carruth	8510f73740	Switch our index sequence away from template aliases and just use classes. We can't use template aliases because on MSVC they don't appear to work correctly in the common usage such as Format.h. Many thanks to Zach for doing all the testing and debugging here. I just slotted the fix into the code. llvm-svn: 229362	2015-02-16 08:22:35 +00:00
David Majnemer	04b4ed329e	Verifier: Diagnose module flags which have null ID operands llvm-svn: 229361	2015-02-16 08:14:22 +00:00
Craig Topper	7e8dcef094	[X86] Add support for lowering shuffles to 256-bit PALIGNR instruction. llvm-svn: 229359	2015-02-16 06:29:06 +00:00
Craig Topper	b2b4f8a721	[X86] Remove some hard tab characters from tests. llvm-svn: 229358	2015-02-16 06:29:02 +00:00
David Majnemer	e7a9cdbd20	DebugInfo: Don't crash if 'Debug Info Version' has a strange value llvm-svn: 229356	2015-02-16 06:04:53 +00:00
David Majnemer	4b04292643	DataLayout: Validate that the pref alignment is at least the ABI align llvm-svn: 229355	2015-02-16 05:41:55 +00:00
David Majnemer	1b9fc3a186	DataLayout: Report when the datalayout type alignment/width is too large llvm-svn: 229354	2015-02-16 05:41:53 +00:00
David Majnemer	9b529a76e9	IR: Properly return nullptr when getAggregateElement is out-of-bounds We didn't properly handle the out-of-bounds case for ConstantAggregateZero and UndefValue. This would manifest as a crash when the constant folder was asked to fold a load of a constant global whose struct type has no operands. This fixes PR22595. llvm-svn: 229352	2015-02-16 04:02:09 +00:00
NAKAMURA Takumi	7540eafb5c	[CMake] Add RuntimeDyld to libdeps corresponding to r229343. llvm-svn: 229351	2015-02-16 02:13:30 +00:00
Chandler Carruth	87e580a659	[x86] Teach the 128-bit vector shuffle lowering routines to take advantage of the existence of a reasonable blend instruction. The 256-bit vector shuffle lowering has leveraged the general technique of decomposed shuffles and blends for quite some time, but this never made it back into the 128-bit code, and there are a large number of patterns where this is substantially better. For example, this removes almost all domain crossing in vector shuffles that involve some blend and some permutation with SSE4.1 and later. See the massive reduction in 'shufps' for integer test cases in this commit. This isn't perfect yet for a few reasons: 1) The v8i16 shuffle lowering continues to plague me. We don't always form an unpack-based blend when that would be better. But the wins pretty drastically outstrip the losses here. 2) The v16i8 shuffle lowering is just a disaster here. I never went and implemented blend support here for some terrible reason. I'll do that next probably. I've not updated it for now. More variations on this technique are coming as well -- we don't shuffle-into-unpack or shuffle-into-palignr, both of which would also be profitable. Note that some test cases grow significantly in the number of instructions, but I expect to actually be faster. We use pshufd+pshufd+blendw instead of a single shufps, but the pshufd's are very likely to pipeline well (two ports on most modern intel chips) and the blend is a very fast instruction. The domain switch penalty will essentially always be more than a blend instruction, which is the only increase in tree height. llvm-svn: 229350	2015-02-16 01:52:02 +00:00
Chandler Carruth	c06b7fbfc3	[x86] Clean up a few test cases with the update script. NFC llvm-svn: 229349	2015-02-16 01:39:50 +00:00
Craig Topper	df6797fe83	[X86] Remove gcc builtins for AVX2 psll_dq and psrl_dq intrinsics. Clang no longer needs them. llvm-svn: 229347	2015-02-16 00:42:36 +00:00
Filipe Cabecinhas	ecf8f7f49b	[Bitcode reader] Fix a few assertions when reading invalid files Summary: When creating {insert,extract}value instructions from a BitcodeReader, we weren't verifying the fields were valid. Bugs found with afl-fuzz Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7325 llvm-svn: 229345	2015-02-16 00:03:11 +00:00
Lang Hames	da9d387929	[ExecutionEngine] Fix dependence issue by moving RTDyldMemoryManager into RuntimeDyld. This should fix http://llvm.org/PR22593. llvm-svn: 229343	2015-02-15 23:22:43 +00:00
Benjamin Kramer	432092ebdb	MinGW's snprintf is not exposed through std::. llvm-svn: 229342	2015-02-15 23:17:20 +00:00
Aaron Ballman	f9a1897c72	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229340	2015-02-15 22:54:22 +00:00
Benjamin Kramer	af09f22c4b	Format: Modernize using variadic templates. Introduces a subset of C++14 integer sequences in STLExtras. This is just enough to support unpacking a std::tuple into the arguments of snprintf, we can add more of it when it's actually needed. Also removes an ancient macro hack that leaks a macro into the global namespace. Clean up users that made use of the convenient hack. llvm-svn: 229337	2015-02-15 22:15:41 +00:00
Aaron Ballman	b46962fe5d	Removing LLVM_EXPLICIT, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229335	2015-02-15 22:00:20 +00:00
Aaron Ballman	b1607ab354	Since MSVC 1800 is our lowest common denominator, we don't need an explicit check for it in these macros any longer; NFC. llvm-svn: 229333	2015-02-15 21:21:52 +00:00
Benjamin Kramer	1b6955a723	CommandLine: Use variadic templates to simplify opt constructors. llvm-svn: 229332	2015-02-15 21:11:25 +00:00
Zachary Turner	16e504f5d5	llvm-pdbdump: Fix warning caused by missing sentinel value. llvm-svn: 229331	2015-02-15 20:37:44 +00:00
Zachary Turner	c0acf6837b	llvm-pdbdump: Add flags controlling the type of values to dump. llvm-svn: 229330	2015-02-15 20:27:53 +00:00
Benjamin Kramer	a1e23d950b	FoldingSet: Replace faux variadics with real variadics. NFC. llvm-svn: 229328	2015-02-15 20:12:17 +00:00
Benjamin Kramer	bec02ccdea	Remove LLVM_HAS_VARIADIC_TEMPLATES and all the faux variadic workarounds guarded by it. We no longer support compilers without variadic template support. llvm-svn: 229324	2015-02-15 19:34:28 +00:00
Benjamin Kramer	de1a193d24	Update the docs to require at least MSVC 2013. llvm-svn: 229323	2015-02-15 19:34:17 +00:00
Philip Reames	090a8242c3	Revert 229175 This change is a logical suspect in 22587 and 22590. Given it's of minimal importanance and I can't get clang to build on my home machine, I'm reverting so that I can deal with this next week. llvm-svn: 229322	2015-02-15 19:07:31 +00:00
Simon Pilgrim	d4ed5df3a6	Added (still inefficient) shuffle test case for PR21138 llvm-svn: 229321	2015-02-15 18:21:39 +00:00
NAKAMURA Takumi	301e190c54	Reapply r229185(cbieneman) -- Raising minimum required Visual Studio version to 2013. This is based on the discussions on: [LLVMdev] [RFC] Raising LLVM minimum required MSVC version to 2013 for trunk llvm-svn: 229320	2015-02-15 17:53:10 +00:00
Hal Finkel	8626ed2eae	[ADCE] Convert another loop for a range-based for We can use a range-based for for the operands loop too; NFC. llvm-svn: 229319	2015-02-15 15:51:25 +00:00
Hal Finkel	92fb2d3803	[ADCE] Use inst_range and range-based fors Convert a few loops to range-based fors; NFC. llvm-svn: 229318	2015-02-15 15:51:23 +00:00
Hal Finkel	c6035cff55	[ADCE] Fix formatting of pointer types We prefer to put the * with the variable, not with the type; NFC. llvm-svn: 229317	2015-02-15 15:47:52 +00:00
Hal Finkel	234d8fea7b	[ADCE] Fix capitalization of another local variable Bring another local variable in compliance with our naming conventions, NFC. llvm-svn: 229316	2015-02-15 15:45:30 +00:00
Hal Finkel	75901293a1	[ADCE] Fix capitalization of some local variables Bring some local variables in compliance with our naming conventions, NFC. llvm-svn: 229315	2015-02-15 15:45:28 +00:00
Simon Pilgrim	5a6375c3ba	Added some test cases of missed opportunities to use unpckl/unpckh shuffles llvm-svn: 229313	2015-02-15 15:07:45 +00:00
Simon Pilgrim	2a7bedb73e	Coding style fixes to recent patches. NFC. llvm-svn: 229312	2015-02-15 14:19:29 +00:00
Simon Pilgrim	00bd79d794	[X86][AVX2] vpslldq/vpsrldq byte shifts for AVX2 This patch refactors the existing lowerVectorShuffleAsByteShift function to add support for 256-bit vectors on AVX2 targets. It also fixes a tablegen issue that prevented the lowering of vpslldq/vpsrldq vec256 instructions. Differential Revision: http://reviews.llvm.org/D7596 llvm-svn: 229311	2015-02-15 13:19:52 +00:00
Chandler Carruth	6a61efdce5	[x86] Add the test case from PR22412, we now get this right even with the new vector shuffle legality. llvm-svn: 229310	2015-02-15 12:45:05 +00:00
Chandler Carruth	bf0fb06e0d	[x86] Teach the decomposed shuffle/blend lowering to use an early blend when that will allow it to lower with a single permute instead of multiple permutes. It tries to detect when it will only have to do a single permute in either case to maximize folding of loads and such. This cuts a lot of the avx2 shuffle permute counts in half. =] llvm-svn: 229309	2015-02-15 12:42:15 +00:00
Chandler Carruth	1b5285dd57	[SDAG] Teach the SelectionDAG to canonicalize vector shuffles of splats directly into blends of the splats. These patterns show up even very late in the vector shuffle lowering where we don't have any chance for DAG combining to kick in, and blending is a tremendously simpler operation to model. By coercing the shuffle into a blend we can much more easily match and lower shuffles of splats. Immediately with this change there are significantly more blends being matched in the x86 vector shuffle lowering. llvm-svn: 229308	2015-02-15 12:18:12 +00:00
Chandler Carruth	75d9a97569	[x86] Teach the shuffle mask equivalence test to look through build vectors and detect equivalent inputs. This lets the code match unpck-style instructions when only one of the inputs are lined up but the other input is a splat and so which lanes we pull from doesn't matter. Today, this doesn't really happen, but just by accident. I have a patch that normalizes how we shuffle splats, and with that patch this will be necessary for a lot of the mask equivalence tests to work. I don't really know how to write a test case for this specific change until the other change lands though. llvm-svn: 229307	2015-02-15 12:07:55 +00:00
Chandler Carruth	4fe214b1f2	[x86] Tweak the ordering of unpack matching vs. element insertion, and don't try to do element insertion for non-zero-index floating point vectors. We don't have any useful patterns or lowering for element insertion into high elements of a floating point vector, and the generic shuffle lowering will end up being better -- namely it will fall back to unpck. But we should try to handle other forms of element insertion before matching unpck patterns. While this doesn't matter much right now, I'm working on a patch that makes unpck matching much more powerful, and that patch will break without this re-ordering. llvm-svn: 229306	2015-02-15 12:01:14 +00:00
Arnaud A. de Grandmaison	69f2b3b87a	[PBQP] Assert conservativelly allocatable nodes are spilled by choice. llvm-svn: 229302	2015-02-15 10:35:31 +00:00
Chandler Carruth	56e0ceda0d	[x86] Stop shuffling zero vectors. =] I was somewhat surprised this pattern really came up, but it does. It seems better to just directly handle it than try to special case every place where we end up forming a shuffle that devolves to a shuffle of a zero vector. llvm-svn: 229301	2015-02-15 10:34:52 +00:00
Chandler Carruth	3d272daaed	[x86] Use a more helpful parenthesizing of these comparisons. Silences a -Wparentheses complaint from GCC. llvm-svn: 229300	2015-02-15 10:15:20 +00:00
Chandler Carruth	62558c1d4d	[x86] When splitting 256-bit vectors into 128-bit vectors, don't extract subvectors from buildvectors. That doesn't really make any sense and it breaks all of the down-stream matching of buildvectors to cleverly lower shuffles. With this, we now get the shift-based lowering of 256-bit vector shuffles with AVX1 when we split them into 128-bit vectors. We also do much better on the zero-extension patterns, although there remains quite a bit of room for improvement here. llvm-svn: 229299	2015-02-15 10:12:02 +00:00
Chandler Carruth	a6f8a3661c	[x86] Make computing the zeroable elements slightly more powerful, at least in theory. I don't actually have a test case that benefits from this, but theoretically, it could come up, and I don't want to try to think about whether this is the culprit or something else is, so I'd rather just make this code powerful. =/ Makes me sad that I can't really test it though. llvm-svn: 229298	2015-02-15 09:33:36 +00:00
Michael Kuperstein	2ccbc08b8e	gold-plugin: fix test to allow default visibility on local symbols GNU ld sets default, not hidden, visibility on local symbols. Having default or hidden visibility on local symbols makes no difference in run-time behavior. Patch by: H.J. Lu <hjl.tools@gmail.com> llvm-svn: 229297	2015-02-15 09:32:30 +00:00
Chandler Carruth	1c60d18aee	[x86] Update some tests with the latest version of my script and llc. This mostly adds some shuffle decode comments and cleans up indentation. llvm-svn: 229296	2015-02-15 09:26:15 +00:00
Chandler Carruth	0ddfe0c7c5	[x86] Add a slight variation on some of the other generic shuffle lowerings -- one which decomposes into an initial blend followed by a permute. Particularly on newer chips, blends are handled independently of shuffles and so this is much less bottlenecked on the single port that floating point shuffles are executed with on Intel. I'll be adding this lowering to a bunch of other code paths in subsequent commits to handle still more places where we can effectively leverage blends when they're available in the ISA. llvm-svn: 229292	2015-02-15 08:26:30 +00:00
Elena Demikhovsky	6f5a859633	Enabled cost calculation for masked memory operations. We already have implementation for cost calculation for masked memory operations. I just call it from the loop vectorizer. llvm-svn: 229290	2015-02-15 08:08:48 +00:00
Craig Topper	78c424dfca	[X86] Add assembly parser support for mnemonic aliases for AVX-512 vpcmp instructions. llvm-svn: 229287	2015-02-15 07:13:48 +00:00
Chandler Carruth	97381fd0af	[x86] Add a test case for PR22390 which was a dup of PR22377 and fixed by r229285. This is a nice different test case though, so I'd like to have the extra testing of these kinds of patterns. llvm-svn: 229286	2015-02-15 07:05:50 +00:00
Chandler Carruth	499d7332c5	[x86] Fix PR22377, a regression with the new vector shuffle legality test. This was just a matter of the DAG combine for vector shuffles being too aggressive. This is a bit of a grey area, but I think generally if we can re-use intermediate shuffles, we should. Certainly, given the test cases I have available, this seems like the right call. llvm-svn: 229285	2015-02-15 07:01:10 +00:00
Chandler Carruth	fe69608839	[x86] Switch a collection of tests explicitly to the new vector shuffle legality test (essentially, everything is legal). I'm planning to make this the default shortly, but I'd like to fix a collection of the bugs it exposes first, and this will let me easily test them. It also showcases both the improvements and a few of the regressions triggered by the change. The biggest improvements by far are the significantly reduced shuffling and domain crossing in the combining test case. The biggest regressions are missing some clever blending patterns. llvm-svn: 229284	2015-02-15 06:37:21 +00:00
Chandler Carruth	89a60770e0	[x86] Remove the now-default-on flag for the new vector shuffle lowering strategy from a bunch of tests. llvm-svn: 229283	2015-02-15 06:20:51 +00:00
Craig Topper	f02ad93270	[X86] Add assembler predicates for the rest of the AVX512 feature flags. This makes the assembly matching consistent across all AVX512 instructions. Without this we were allowing some AVX512 instructions to be parsed always, but not the foundation instructions. llvm-svn: 229280	2015-02-15 04:54:55 +00:00
Craig Topper	a3776de242	[X86] Add the remaining 11 possible exact ModRM formats. This makes their encodings linear which can then be used to simplify some other code. llvm-svn: 229279	2015-02-15 04:16:44 +00:00
David Blaikie	0c1a24026e	FileCheck-ize a test to make it easier to migrate to typeless pointers llvm-svn: 229278	2015-02-15 04:14:00 +00:00
David Blaikie	15a8444d16	Update a test to make it easier to migrate to untyped pointers llvm-svn: 229277	2015-02-15 04:13:58 +00:00
David Blaikie	21caa5c85d	Update a test to use FileCheck so it's easier to migrate to future typeless pointer changes llvm-svn: 229276	2015-02-15 04:13:57 +00:00
David Blaikie	eba8c88a90	Reformat test case to be easier to migrate to typeless pointers. llvm-svn: 229275	2015-02-15 04:13:53 +00:00
Chandler Carruth	0613751dcb	[x86] Teach my test updating script about another quirk of the printed asm and port the mmx vector shuffle test to it. Not thrilled with how it handles the stack manipulation logic, but I'm much less bothered by that than I am by updating the test manually. =] If anyone wants to teach the test checks management script about stack adjustment patterns, that'd be cool too. llvm-svn: 229268	2015-02-15 00:08:01 +00:00
Simon Pilgrim	31457d54f7	[X86][XOP] Enable commutation for XOP instructions Patch to allow XOP instructions (integer comparison and integer multiply-add) to be commuted. The comparison instructions sometimes require the compare mode to be flipped but the remaining instructions can use default commutation modes. This patch also sets the SSE domains of all the XOP instructions. Differential Revision: http://reviews.llvm.org/D7646 llvm-svn: 229267	2015-02-14 22:40:46 +00:00
Craig Topper	43860838dc	[X86] Improve parsing support AVX/SSE floating point compare instruction mnemonic aliases. They'll now print with the alias the parser received instead of converting to the explicit immediate form. llvm-svn: 229266	2015-02-14 21:54:03 +00:00
Ramkumar Ramachandra	8fcb498a9a	InstCombine: propagate deref via new addDereferenceableAttr The "dereferenceable" attribute cannot be added via .addAttribute(), since it also expects a size in bytes. AttrBuilder#addAttribute or AttributeSet#addAttribute is wrapped by classes Function, InvokeInst, and CallInst. Add corresponding wrappers to AttrBuilder#addDereferenceableAttr. Having done this, propagate the dereferenceable attribute via gc.relocate, adding a test to exercise it. Note that -datalayout is required during execution over and above -instcombine, because InstCombine only optionally requires DataLayoutPass. Differential Revision: http://reviews.llvm.org/D7510 llvm-svn: 229265	2015-02-14 19:37:54 +00:00
Duncan P. N. Exon Smith	025c0ad74c	Target: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229261	2015-02-14 15:36:52 +00:00
Duncan P. N. Exon Smith	b5054333ec	NVPTX: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229260	2015-02-14 15:35:43 +00:00
Andrea Di Biagio	f54432388f	[optnone] Skip pass Constant Hoisting on optnone functions. Added test CodeGen/X86/constant-hoisting-optnone.ll to verify that pass Constant Hoisting is not run on optnone functions. llvm-svn: 229258	2015-02-14 15:11:48 +00:00
Simon Pilgrim	6eb925a3ed	[X86] Ensure integer domain on scalar load/store stack folding tests. NFC llvm-svn: 229257	2015-02-14 14:10:44 +00:00
Simon Pilgrim	b0bac23fcd	Line ending fix. NFC. llvm-svn: 229256	2015-02-14 13:27:53 +00:00
Chandler Carruth	edd92948d1	[gold] Consolidate the gold plugin options and actually search for a gold binary explicitly. Substitute this binary into the tests rather than just directly executing the 'ld' binary. This should allow folks to inject a cross compiling gold binary, or in my case to use a gold binary built and installed somewhere other than /usr/bin/ld. It should also allow the tests to find 'ld.gold' so that things work even if gold isn't the default on the system. I've only stubbed out support in the makefile to preserve the existing behavior with none of the fancy logic. If someone else wants to add logic here, they're welcome to do so. llvm-svn: 229251	2015-02-14 09:43:57 +00:00
Chandler Carruth	003ed332bf	Remove a variable only used in an assert and sink its initializer into the assert. Fixes -Wunused-variable on non-asserts builds. llvm-svn: 229250	2015-02-14 09:14:44 +00:00
Chandler Carruth	a217173f4b	Back out two accidental changes that snuck in with r229245. Sorry these snuck in, they weren't ready for prime time and had nothing to do with that commit. llvm-svn: 229248	2015-02-14 09:05:58 +00:00
Chandler Carruth	8756afc5a9	[lit] Make the gold plugin support testing work with a python3 interpreter. Seems that's a better path than pinning to python2.7. Thanks to Justin for prodding me toward a fix. =] llvm-svn: 229247	2015-02-14 09:05:56 +00:00
Chandler Carruth	f9dd7edd49	Revert r229224: Make the 'llvm-lit' utility defend against a system where Python3 Apparantly python2.7 also doesn't work. Awesome. llvm-svn: 229245	2015-02-14 07:11:25 +00:00
Chandler Carruth	edb9ece4d3	[lit] Make the 'llvm-lit' utility defend against a system where Python3 is the default. The lit.cfg files are not all valid Python3 and I've no idea if anyone is really prepared to update them. The easiest way I know of to ensure that this script uses Python 2 is to use 'python2.7' in the command. Mac and Linux are definitely fine with this and I think other platforms will be as well, but if anyone struggles with this set up and has better ideas, let me know. llvm-svn: 229244	2015-02-14 07:05:15 +00:00
Richard Smith	221b22d695	[modules] Try harder to stop DebugInfo/PDB/DIA being built if not available. llvm-svn: 229243	2015-02-14 05:54:56 +00:00
Matt Arsenault	0bbcd8ba2f	R600/SI: Implement correct f64 fdiv This version passes the OpenCL conformance test. llvm-svn: 229239	2015-02-14 04:30:08 +00:00
Matt Arsenault	044f1d19cf	R600/SI: Use complex operand folding for div_scale llvm-svn: 229238	2015-02-14 04:24:28 +00:00
Matt Arsenault	7eaee80675	R600/SI: Add tests for div_fmas with inline immediate operands llvm-svn: 229237	2015-02-14 04:22:02 +00:00
Matt Arsenault	1bc9d95047	R600/SI: Fix implicit vcc operand to v_div_fmas_* This should allow finally fixing the f64 fdiv implementation. Test is disabled for VI since there seems to be a problem with one of the buffer load instructions on it. llvm-svn: 229236	2015-02-14 04:22:00 +00:00
Matt Arsenault	6e26b8d854	R600/SI: Fix schedule model for v_div_scale_{f32\|f64} llvm-svn: 229235	2015-02-14 04:03:18 +00:00
Matt Arsenault	35733e2dec	R600/SI: Really fix size of VReg_1 llvm-svn: 229234	2015-02-14 03:54:32 +00:00
Matt Arsenault	1bcc8cba5a	R600/SI: Rename encoding field to match docs for VOP3b llvm-svn: 229233	2015-02-14 03:54:29 +00:00
Zachary Turner	26ebe3fbcd	llvm-pdbdump: Only dump whitelisted global symbols. Dumping the global scope contains a lot of very uninteresting things and is generally polluted with a lot of random junk. Furthermore, it dumps values unsorted, making it hard to read. This patch dumps known interesting types only, and as a side effect sorts the list by symbol type. llvm-svn: 229232	2015-02-14 03:54:28 +00:00
Zachary Turner	52c9f881de	llvm-pdbdump: Re-order header files according to LLVM style guide. llvm-svn: 229231	2015-02-14 03:53:56 +00:00
Matt Arsenault	31ec598a2a	R600/SI: Fix not encoding src2 for v_div_scale_{f32\|f64} This apparently got lost in the VI changes. llvm-svn: 229230	2015-02-14 03:40:35 +00:00
Matt Arsenault	692acf1438	R600/SI: Fix VOP3b encoding on VI llvm-svn: 229228	2015-02-14 03:02:23 +00:00
Matt Arsenault	95546b46ab	R600/SI: Fix phys reg copies in SIFoldOperands llvm-svn: 229227	2015-02-14 02:55:57 +00:00
Matt Arsenault	9998168982	R600/SI: Fix copies from SGPR to VCC This shows up without optimizations when vcc is required to be used. llvm-svn: 229226	2015-02-14 02:55:56 +00:00
Matt Arsenault	834b1aa806	R600/SI: Add hack to copy from a VGPR to VCC This hopefully should be fixed when VReg_1 is removed. llvm-svn: 229225	2015-02-14 02:55:54 +00:00
Duncan P. N. Exon Smith	5bedaf934f	PowerPC: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229224	2015-02-14 02:54:07 +00:00
Matt Arsenault	f417ff8f2a	R600/SI: Fix size of VReg_1 This is really a 32-bit register, if we try to check the size of it, we want 32-bits. llvm-svn: 229223	2015-02-14 02:51:44 +00:00
Duncan P. N. Exon Smith	8480c87ce6	R600: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229222	2015-02-14 02:45:45 +00:00
Duncan P. N. Exon Smith	2e75314352	Mips: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229221	2015-02-14 02:37:48 +00:00
Duncan P. N. Exon Smith	2cff9e19a2	ARM: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229220	2015-02-14 02:24:44 +00:00
Duncan P. N. Exon Smith	003bb7d96e	AArch64: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229218	2015-02-14 02:09:06 +00:00
Justin Bogner	0ef7a2a250	llvm-cov: Actually use the command line arguments when reporting This code didn't really make sense as is. If a filename is passed in, the user obviously wants the coverage for that file, not for everything. llvm-svn: 229217	2015-02-14 02:05:05 +00:00
Justin Bogner	f91bc6cdd8	llvm-cov: Simplify coverage reports, fixing PR22575 in the process PR22575 occurred because we were unsafely storing references into a std::vector. If the vector moved because it grew, we'd be left iterating through garbage memory. This avoids the issue by simplifying the logic to gather coverage information as we go, rather than storing it and iterating over it. I'm relying on the existing tests showing that this is semantically NFC, since it's difficult to hit the issue this fixes without relatively large covered programs. llvm-svn: 229215	2015-02-14 02:01:24 +00:00
Duncan P. N. Exon Smith	5975a703e6	X86: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229214	2015-02-14 01:59:52 +00:00
Peter Collingbourne	46f4b48e62	llvm-go: Set $GCCGO instead of putting a gccgo executable on $PATH. Now that llgo ships its own go command we can rely on it having support for $GCCGO. Differential Revision: http://reviews.llvm.org/D7628 llvm-svn: 229210	2015-02-14 01:45:57 +00:00
Peter Collingbourne	5570708ca5	llvm-go: Add flag for specifying path to go command. Differential Revision: http://reviews.llvm.org/D7627 llvm-svn: 229209	2015-02-14 01:45:56 +00:00
Duncan P. N. Exon Smith	70eb9c5ae5	CodeGen: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) Also, add `Function::getFnStackAlignment()`, and canonicalize: getAttributes().getStackAlignment(AttributeSet::FunctionIndex) => getFnStackAlignment() llvm-svn: 229208	2015-02-14 01:44:41 +00:00
Ahmed Bougacha	8f2b4f0be8	[X86] Factor out the CMOV pseudo definitions. NFCI. llvm-svn: 229206	2015-02-14 01:36:53 +00:00
Matthias Braun	33cc10724d	Revert "On ELF, put PIC jump tables in a non executable section." This reverts commit r228939. The commit broke something in the output of exception handling tables on darwin x86-64. llvm-svn: 229203	2015-02-14 01:16:54 +00:00
Duncan P. N. Exon Smith	2c79ad974c	Transforms: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229202	2015-02-14 01:11:29 +00:00
Richard Smith	845af6c46d	[modules] Split off a separate module for DebugInfo/PDB/DIA so that its headers don't get included on systems where the DIA SDK is unavailable. llvm-svn: 229200	2015-02-14 00:47:20 +00:00
NAKAMURA Takumi	3d1b0aa008	Revert r229185, "Raising minimum required Visual Studio version to 2013." All builders are not ready yet. llvm-svn: 229199	2015-02-14 00:45:32 +00:00
Reid Kleckner	2d5fb68ee0	Unify the two EH personality classification routines I wrote We only need one. llvm-svn: 229193	2015-02-14 00:21:02 +00:00
Duncan P. N. Exon Smith	b3fc83c403	Analysis: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229192	2015-02-14 00:12:15 +00:00
Eric Christopher	b2a5fa98e4	Use the template method to grab the target specific subtarget. llvm-svn: 229191	2015-02-14 00:09:46 +00:00
Philip Reames	9ae15209ad	[InstCombine] When canonicalizing gep indices, prefer zext when possible If we know that the sign bit of a value being sign extended is zero, we can use a zero extension instead. This is motivated by the fact that zero extensions are generally cheaper on x86 (and most other architectures?). We already apply a similar transform in DAGCombine, this just extends that to the IR level. This comes up when we eagerly canonicalize gep indices to the width of a machine register (i64 on x86_64). To do so, we insert sign extensions (sext) to promote smaller types. Differential Revision: http://reviews.llvm.org/D7255 llvm-svn: 229189	2015-02-14 00:05:36 +00:00
Chris Bieneman	eff99e8ac5	Raising minimum required Visual Studio version to 2013. This is based on the discussions on: [LLVMdev] [RFC] Raising LLVM minimum required MSVC version to 2013 for trunk llvm-svn: 229185	2015-02-13 23:24:14 +00:00
Frederic Riss	84c09a51e9	[dsymutil] Add DIE selection algorithm. With this commit, llvm-dsymutil learns how to choose which DIEs it will link in the final output and which ones it won't. This is based on the 'valid relocation' information that has been built in the previous commits. The test only tests that we choose the right 'root DIEs'. The selection algorithm (and especially the part that walk the dependencies of a root DIE) lacks a bit test coverage. This will be much easier to cover when we output actual Dwarf and thus can use llvm-dwarfdump to verify the structure of the emitted DIE trees. I'll add more tests then. llvm-svn: 229183	2015-02-13 23:18:34 +00:00
Frederic Riss	9aa725ba76	[dsymutil] Downcase a function name. llvm-svn: 229182	2015-02-13 23:18:31 +00:00
Frederic Riss	1b9da425c3	[dsymutil] Add a few generic helper methods. To be used in subsequent commits (separated to keep only core logic in the follow-ups). llvm-svn: 229181	2015-02-13 23:18:29 +00:00
Frederic Riss	c3349d4fb4	[dsymutil] constify trivial function. llvm-svn: 229180	2015-02-13 23:18:27 +00:00
Frederic Riss	898a67dfb7	DWARFUnit: Add a couple of helpers to access the DIE array. To be used in dsymutil (or any other client that wants to take advantage of the fact that DIEs are stored in a vector). llvm-svn: 229179	2015-02-13 23:18:24 +00:00
Frederic Riss	1036e64bcc	[dsymutil] Find relocations that correspond to debug map entries. These 'valid relocations' in the debug_info section will be how dsymutil identifies the DIEs it needs to keep in the linked debug information. llvm-svn: 229178	2015-02-13 23:18:22 +00:00
Frederic Riss	1595c5d37d	[dsymutil] Add DebugMapObject::lookupObjectAddress() It turns out the debug map will be interogated both by name and by object file address. Add the latter capability. llvm-svn: 229177	2015-02-13 23:18:16 +00:00
Chris Bieneman	f942c0c89d	Fixing broken bots. llvm-svn: 229176	2015-02-13 23:10:31 +00:00
Philip Reames	66facd6c14	Minor tweak to MDA Two minor tweaks I noticed when reading through the code: - No need to recompute begin() on every iteration. We're not modifying the instructions in this loop. - We can ignore PHINodes and Dbg intrinsics. The current code does this anyways, but it will spend slightly more time doing so and will count towards the limit of instructions in the block. It seems really silly to give up due the presence of PHIs... Differential Revision: http://reviews.llvm.org/D7624 llvm-svn: 229175	2015-02-13 23:08:37 +00:00
Chris Bieneman	67e426a022	NFC. Moving the RegisteredOptionCategories global into the CommandLineParser class. llvm-svn: 229172	2015-02-13 22:54:32 +00:00
Chris Bieneman	ceaf5f660d	NFC. clang-format wants to change this from two lines to one. llvm-svn: 229171	2015-02-13 22:54:29 +00:00
Chris Bieneman	542f56a512	NFC. More code cleanup making LookupOption a member of the CommandLineParser. llvm-svn: 229170	2015-02-13 22:54:27 +00:00
Eric Christopher	fcd3d87ad8	The base pointer save offset can be computed at initialization time, do so and fix up the calls. llvm-svn: 229169	2015-02-13 22:48:53 +00:00
Eric Christopher	a10d58dba8	Move the target machine variable so that it's initialized early enough we can use it to initialize frame lowering. llvm-svn: 229168	2015-02-13 22:48:51 +00:00
Eric Christopher	e8dbfe1cf8	Stash the TargetMachine on the subtarget so we can access it later. Clean up a subtarget function that has it passed in while we're at it. llvm-svn: 229164	2015-02-13 22:23:04 +00:00
Eric Christopher	a4ae213193	PPC LinkageSize can be computed at initialization time, do so. llvm-svn: 229163	2015-02-13 22:22:57 +00:00
Reid Kleckner	5fe405df36	Triple: Make setEnvironment not override the object format Discovered by Halide users who had C++ code like this: Triple.setArch(Triple::x86); Triple.setOS(Triple::Windows); Triple.setObjectFormat(Triple::ELF); Triple.setEnvironment(Triple::MSVC); This would produce the stringified triple of x86-windows-msvc, instead of the x86-windows-msvc-elf string needed to run MCJIT. With this change, they retain the -elf suffix. llvm-svn: 229160	2015-02-13 22:05:50 +00:00
Sanjay Patel	baa6bc378f	[SSE/AVX] Use multiclasses to reduce the mass of scalar math patterns; NFCI This takes the preposterous number of patterns in this section that were last added to in r219033 down to just plain obnoxious. With a little more work, we might get this down to just comical. I've added more test cases to the existing file that checks these patterns, but it seems that some of these patterns simply don't exist with today's shuffle lowering. llvm-svn: 229158	2015-02-13 21:52:42 +00:00
Reid Kleckner	65842b461a	Fix R600 test deadlock on Windows by giving FileCheck an argument llc would hang trying to write output to a full pipe that FileCheck wasn't reading. FileCheck wasn't reading from stdin because it needs a file as a positional argument. llvm-svn: 229157	2015-02-13 21:27:28 +00:00
Chandler Carruth	5700b3798f	[PM] Fix a compile error I introduced in r229094 and didn't notice because I didn't have binutils set up properly to build the gold plugin. Fixes PR22581 which was filed because this broke the build for folks relying on the plugin. Very sorry! =] I've gotten the plugin stuff building now as well so it shouldn't keep happening. llvm-svn: 229156	2015-02-13 21:10:58 +00:00
Sanjay Patel	34da52a894	fix typos; NFC llvm-svn: 229155	2015-02-13 21:07:22 +00:00
Richard Smith	254ee9f335	[modules] Mark include/llvm/Support/Dwarf.def as being a textually-included header. llvm-svn: 229154	2015-02-13 21:06:45 +00:00
Richard Smith	7b408025fd	Clean up some inappropriate choices of type in the bitcode reader. None of these are expected to fix any 64->32 bit real truncation issues. llvm-svn: 229153	2015-02-13 21:05:11 +00:00
Tom Stellard	e1e4a2d310	R600/SI: Refactor SOP1 classes llvm-svn: 229152	2015-02-13 21:02:37 +00:00
Tom Stellard	6c65e9a99a	R600/SI: Lowercase register names llvm-svn: 229151	2015-02-13 21:02:36 +00:00
Tom Stellard	d09fa9cec8	R600/SI: Remove some unused TableGen classes llvm-svn: 229150	2015-02-13 21:02:33 +00:00
Benjamin Kramer	bad5a46c05	Reapply r229142 with some enable_if magic to avoid memcpying between differing types. Original commit message: SmallVector: Resolve a long-standing fixme by using the existing unitialized_copy dispatch. This makes append() use memcpy for trivially copyable types. llvm-svn: 229149	2015-02-13 20:45:14 +00:00
Benjamin Kramer	36913deebe	Revert r229142. It breaks the world for unknown reasons. llvm-svn: 229144	2015-02-13 19:45:28 +00:00
Dimitry Andric	31421fba87	Increase the CPU time timeout for testing from 10 to 20 minutes, to compensate for slow machines, and the growing number of tests. Prodded by: Hans Wennborg llvm-svn: 229143	2015-02-13 19:45:19 +00:00
Benjamin Kramer	9d26f6462e	SmallVector: Resolve a long-standing fixme by using the existing unitialized_copy dispatch. This makes append() use memcpy for trivially copyable types. llvm-svn: 229142	2015-02-13 19:20:39 +00:00
Vasileios Kalintiris	99eeb8aae4	[mips] Refactor and simplify MipsSEDAGToDAGISel::selectIntAddrLSL2MM(). NFC. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7618 llvm-svn: 229140	2015-02-13 19:14:22 +00:00
Vasileios Kalintiris	46963f6e73	[mips] Use isa<> instead of dyn_cast<> with unused value. NFC. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7615 llvm-svn: 229138	2015-02-13 19:12:16 +00:00
Matt Arsenault	774e20b42a	R600/SI: Remove handling of fpimm llvm-svn: 229136	2015-02-13 19:05:07 +00:00
Matt Arsenault	11a4d6774b	R600/SI: Allow f64 inline immediates in i64 operands This requires considering the size of the operand when checking immediate legality. llvm-svn: 229135	2015-02-13 19:05:03 +00:00
Matt Arsenault	8a9e404c0e	R600/SI: Minor test scheduling fixes This prevents these from failing in a later commit. llvm-svn: 229134	2015-02-13 19:04:56 +00:00
Zachary Turner	6a582f9fc8	Fix -Wunused-variable warning. llvm-svn: 229130	2015-02-13 18:11:49 +00:00
Zachary Turner	04b966d9dc	llvm-pdbdump: Improve printing of functions and signatures. This correctly prints the function pointers, and also prints function signatures for symbols as opposed to just types. So actual functions in your program will now be printed with full name and signature, as opposed to just name as before. llvm-svn: 229129	2015-02-13 17:57:09 +00:00
Jozef Kolek	650a61a943	[mips][microMIPS] Delay slot filler: Replace the microMIPS JR with the JRC This patch adds functionality in MIPS delay slot filler such as if delay slot filler have to put NOP instruction into the delay slot of microMIPS JR instruction, then instead of emitting NOP this instruction is replaced by compact jump instruction JRC. Differential Revision: http://reviews.llvm.org/D7522 llvm-svn: 229128	2015-02-13 17:51:27 +00:00
Andrea Di Biagio	30d471f6aa	[InstCombine] Fix regression introduced at r227197. This patch fixes a problem I accidentally introduced in an instruction combine on select instructions added at r227197. That revision taught the instruction combiner how to fold a cttz/ctlz followed by a icmp plus select into a single cttz/ctlz with flag 'is_zero_undef' cleared. However, the new rule added at r227197 would have produced wrong results in the case where a cttz/ctlz with flag 'is_zero_undef' cleared was follwed by a zero-extend or truncate. In that case, the folded instruction would have been inserted in a wrong location thus leaving the CFG in an inconsistent state. This patch fixes the problem and add two reproducible test cases to existing test 'InstCombine/select-cmp-cttz-ctlz.ll'. llvm-svn: 229124	2015-02-13 16:33:34 +00:00
Tom Stellard	3236c72458	Help: Document how to build and install with CMake. Resolves PR21569. Patch by: Stephen Kelly llvm-svn: 229122	2015-02-13 16:15:32 +00:00
Tom Stellard	94d1521fcd	Help: Document the minimum CMake version required. Patch by: Stephen Kelly llvm-svn: 229121	2015-02-13 16:15:29 +00:00
Akira Hatanaka	59f3073d23	Add run line that was missing in r228999. Also, change the run lines to use -allow-empty. llvm-svn: 229118	2015-02-13 16:00:03 +00:00
Andrea Di Biagio	b14ae8692d	[CodeGenPrepare] Removed duplicate logic. SimplifyCFG already knows how to speculate calls to cttz/ctlz. SimplifyCFG now knows how to speculate calls to intrinsic cttz/ctlz that are 'cheap' for the target. Therefore, some of the logic in CodeGenPrepare that was originally added at revision 224899 can now be removed. This patch is basically a no functional change. It removes the duplicated logic in CodeGenPrepare and converts all the existing target specific tests for cttz/ctlz into SimplifyCFG tests. Differential Revision: http://reviews.llvm.org/D7608 llvm-svn: 229105	2015-02-13 14:15:48 +00:00
Arnaud A. de Grandmaison	a7c90d8487	[PBQP] Conservativelly allocatable nodes can be spilled and give a better solution Although such nodes are allocatable, the cost of spilling may be less than allocating to register, so spilling the node may provide a better solution. The assert does not account for this case, so remove it for now. llvm-svn: 229103	2015-02-13 12:04:42 +00:00
James Molloy	5eb75aced4	[SimplifyCFG] Add test for r229099 Add extra test that was accidentally not staged. llvm-svn: 229101	2015-02-13 11:08:40 +00:00
James Molloy	1b6207e6eb	[SimplifyCFG] Be more aggressive Up the phi node folding threshold from a cheap "1" to a meagre "2". Update tests for extra added selects and slight code churn. llvm-svn: 229099	2015-02-13 10:48:30 +00:00
Toma Tabacu	16a74499af	[mips] Improve support for the .set at/noat assembler directives. Summary: Made the following changes: Added calls to emitDirectiveSetNoAt() and emitDirectiveSetAt(). Added special emit function for .set at=$reg, emitDirectiveSetAtWithArg(unsigned RegNo). Improved parsing error checks for .set at. Refactored parser code for .set at. Improved testing of both directives. Improved code readability and comments. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7176 llvm-svn: 229097	2015-02-13 10:30:57 +00:00
Chandler Carruth	7ecd99163c	[PM] Update the examples to reflect the removal of the llvm/PassManager.h wrapper header and its using declarations. These now directly use the legacy namespace. I had updated the #include lines in my large commit but forgot that the examples weren't being built and didn't update the code to use the correct namespace. Sorry for the noise here. llvm-svn: 229095	2015-02-13 10:21:05 +00:00
Chandler Carruth	30d69c2e36	[PM] Remove the old 'PassManager.h' header file at the top level of LLVM's include tree and the use of using declarations to hide the 'legacy' namespace for the old pass manager. This undoes the primary modules-hostile change I made to keep out-of-tree targets building. I sent an email inquiring about whether this would be reasonable to do at this phase and people seemed fine with it, so making it a reality. This should allow us to start bootstrapping with modules to a certain extent along with making it easier to mix and match headers in general. The updates to any code for users of LLVM are very mechanical. Switch from including "llvm/PassManager.h" to "llvm/IR/LegacyPassManager.h". Qualify the types which now produce compile errors with "legacy::". The most common ones are "PassManager", "PassManagerBase", and "FunctionPassManager". llvm-svn: 229094	2015-02-13 10:01:29 +00:00
Chandler Carruth	1f832f7c27	Re-sort the #include lines in bindings and examples which I managed to miss previously. llvm-svn: 229089	2015-02-13 09:14:30 +00:00
Chandler Carruth	71f308adb7	Re-sort #include lines using my handy dandy ./utils/sort_includes.py script. This is in preparation for changes to lots of include lines. llvm-svn: 229088	2015-02-13 09:09:03 +00:00
Zachary Turner	2a903b900d	Fix the windows build again. Grrr, MSVC. llvm-svn: 229081	2015-02-13 07:55:29 +00:00
Chandler Carruth	d99f427e31	Revert a series of commits starting at r228886 which is triggering some regressions for LLDB on Linux. Rafael indicated on lldb-dev that we should just go ahead and revert these but that he wasn't at a computer. The patches backed out are as follows: r228980: Add support for having multiple sections with the name and ... r228889: Invert the section relocation map. r228888: Use the existing SymbolTableIndex intsead of doing a lookup. r228886: Create the Section -> Rel Section map when it is first needed. These patches look pretty nice to me, so hoping its not too hard to get them re-instated. =D llvm-svn: 229080	2015-02-13 07:52:39 +00:00
Zachary Turner	191b57478e	Fix non-windows builds unhappy about a missing header. llvm-svn: 229079	2015-02-13 07:45:49 +00:00
Craig Topper	916708f152	[X86] Add support for parsing and printing the mnemonic aliases for the XOP VPCOM instructions. llvm-svn: 229078	2015-02-13 07:42:25 +00:00
Craig Topper	e32546dd29	[X86] Fix XOP vpcom intrinsic autoupgrade to map 'true' and 'false' to the correct immediates. Seems they were swapped. llvm-svn: 229077	2015-02-13 07:42:15 +00:00
Zachary Turner	a952c49c20	llvm-pdbdump: Add more comprehensive dumping of symbol types. In particular this patch adds the ability to dump complete function signature information including argument types as correctly formatted strings. A side effect of this is that almost all symbol and meta types are now formatted. llvm-svn: 229076	2015-02-13 07:40:03 +00:00
Mehdi Amini	383d7ae0bd	InstCombine: cleanup redundant dyn_cast<> (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 229075	2015-02-13 07:38:04 +00:00
Craig Topper	007a713ebf	Fix a typo in a comment. NFC llvm-svn: 229071	2015-02-13 06:07:29 +00:00
Craig Topper	d1e1bf5d78	Fix probable typo in test. llvm-svn: 229070	2015-02-13 06:07:27 +00:00
Craig Topper	4e0700f365	[X86] Remove int_x86_sse2_psll_dq_bs and int_x86_sse2_psrl_dq_bs intrinsics. The builtins aren't used by clang. llvm-svn: 229069	2015-02-13 06:07:24 +00:00
Craig Topper	65e2cb5676	[X86] Remove references to builtin names that have been removed from clang. Hope to remove the intrinsics themselves soon. llvm-svn: 229068	2015-02-13 06:07:14 +00:00
Chandler Carruth	1fbc316534	[unroll] Concede defeat and disable the unroll analyzer for now. The issues with the new unroll analyzer are more fundamental than code cleanup, algorithm, or data structure changes. I've sent an email to the original commit thread with details and a proposal for how to redesign things. I'm disabling this for now so that we don't spend time debugging issues with it in its current state. llvm-svn: 229064	2015-02-13 05:31:46 +00:00
Michael Liao	d266b928ae	[InstCombine] Fix a bug when combining `icmp` from `ptrtoint` - First, there's a crash when we try to combine that pointers into `icmp` directly by creating a `bitcast`, which is invalid if that two pointers are from different address spaces. - It's not always appropriate to cast one pointer to another if they are from different address spaces as that is not no-op cast. Instead, we only combine `icmp` from `ptrtoint` if that two pointers are of the same address space. llvm-svn: 229063	2015-02-13 04:51:26 +00:00
Chandler Carruth	6c03dff7cc	[unroll] Merge the simplification and DCE estimation methods on the UnrollAnalyzer. Now they share a single worklist and have less implicit state between them. There was no real benefit to separating these two things out. I'm going to subsequently refactor things to share even more code. llvm-svn: 229062	2015-02-13 04:39:05 +00:00
Chandler Carruth	d9591d8922	[unroll] Remove pointless dyn_cast<>s to Instruction - the users of an instruction must by definition be instructions. llvm-svn: 229061	2015-02-13 04:33:21 +00:00
Chandler Carruth	5457e20d27	[unroll] Don't check the loop set for whether an instruction is contained in it each time we try to add it to the worklist, just check this when pulling it off the worklist. That way we do it at most once per instruction with the cost of the worklist set we would need to pay anyways. llvm-svn: 229060	2015-02-13 04:30:44 +00:00
Chandler Carruth	e5c30e4e10	[unroll] Change the other worklist in the unroll analyzer to be a set vector. In addition to dramatically reducing the work required for contrived example loops, this also has to correct some serious latent bugs in the cost computation. Previously, we might add an instruction onto the worklist once for every load which it used and was simplified. Then we would visit it many times and accumulate "savings" each time. I mean, fortunately this couldn't matter for things like calls with 100s of operands, but even for binary operators this code seems like it must be double counting the savings. I just noticed this by inspection and due to the runtime problems it can introduce, I don't have any test cases for cases where the cost produced by this routine is unacceptable. llvm-svn: 229059	2015-02-13 04:27:50 +00:00
Chandler Carruth	7824bc9241	[unroll] Replace a boolean, for loop, condition, and break with std::all_of and a lambda. Much cleaner, no functionality changed. llvm-svn: 229058	2015-02-13 04:18:14 +00:00
Chandler Carruth	06d537cdd6	[unroll] Directly query for dead instructions. In the unroll analyzer, it is checking each user to see if that user will become dead. However, it first checked if that user was missing from the simplified values map, and then if was also missing from the dead instructions set. We add everything from the simplified values map to the dead instructions set, so the first step is completely subsumed by the second. Moreover, the first step requires inserting something into the simplified value map which isn't what we want at all. This also replaces a dyn_cast with a cast as an instruction cannot be used by a non-instruction. llvm-svn: 229057	2015-02-13 04:14:05 +00:00
Chandler Carruth	82cb30f10c	[unroll] Replace a linear time check for no uses with a constant time check. Also hoist this into the enqueue process as it is faster even than testing the worklist set, we should just directly filter these out much like we filter out constants and such. llvm-svn: 229056	2015-02-13 04:06:08 +00:00
Chandler Carruth	3b057b3216	[unroll] Rather than an operand set, use a setvector for the worklist. We don't just want to handle duplicate operands within an instruction, but also duplicates across operands of different instructions. I should have gone straight to this, but I had convinced myself that it wasn't going to be necessary briefly. I've come to my senses after chatting more with Nick, and am now happier here. llvm-svn: 229054	2015-02-13 03:57:40 +00:00
Chandler Carruth	17a0496b5a	[unroll] Extract the code to enqueue operansd for the worklist in the unroll analysis into a lambda and call it. That's much simpler than duplicating all the code. llvm-svn: 229053	2015-02-13 03:49:41 +00:00
Chandler Carruth	8c86375a10	[unroll] Use a small set to de-duplicate operands prior to putting them into the worklist. This avoids allocating lots of worklist memory for them when there are large numbers of repeated operands. llvm-svn: 229052	2015-02-13 03:48:38 +00:00
Chandler Carruth	93063e6191	[unroll] Make the unroll cost analysis terminate deterministically and reasonably quickly. I don't have a reduced test case, but for a version of FFMPEG, this makes the loop unroller start finishing at all (after over 15 minutes of running, it hadn't terminated for me, no idea if it was a true infloop or just exponential work). The key thing here is to check the DeadInstructions set when pulling things off the worklist. Without this, we would re-walk the user list of already dead instructions again and again and again. Consider phi nodes with many, many operands and other patterns. The other important aspect of this is that because we would keep re-visiting instructions that were already known dead, we kept adding their cost savings to this! This would cause our cost savings to be insanely inflated from this. While I was here, I also rotated the operand walk out of the worklist loop to make the code easier to read. There is still work to be done to minimize worklist traffic because we don't de-duplicate operands. This means we may add the same instruction onto the worklist 1000s of times if it shows up in 1000s of operansd to a PHI node for example. Still, with this patch, the ffmpeg testcase I have finishes quickly and I can't measure the runtime impact of the unroll analysis any more. I'll probably try to do a few more cleanups to this code, but not sure how much cleanup I can justify right now. llvm-svn: 229038	2015-02-13 03:40:58 +00:00
Duncan P. N. Exon Smith	b4aa16f2bc	IR: Drop never-used defaults for DIBuilder::createTemplate*(), NFC No caller specifies anything different; these parameters are dead code and probably always have been. The new hierarchy doesn't bother with the fields at all (see r228607 and r228652). llvm-svn: 229037	2015-02-13 03:35:29 +00:00
Matt Arsenault	63bef0d177	R600/SI: Remove unnecessary check for fpimm llvm-svn: 229034	2015-02-13 02:47:22 +00:00
Chandler Carruth	dd6029fc6e	[unroll] Make range based for loops a bit more explicit and more readable. The biggest thing that was causing me problems is recognizing the references vs. poniters here. I also found that for maps naming the loop variable as KeyValue helps make it obvious why you don't actually use it directly. Finally, using 'auto' instead of 'User *' doesn't seem like a good tradeoff. Much like with the other cases, I like to know its a pointer, and 'User' is just as long and tells the reader a lot more. llvm-svn: 229033	2015-02-13 02:45:17 +00:00
Duncan P. N. Exon Smith	5012868431	Bitcode: Remove confusing '?' from r229004, NFC The name is always part of the record, it just might be empty. Remove the `?` for clarity. llvm-svn: 229032	2015-02-13 02:43:38 +00:00
Duncan P. N. Exon Smith	59ad8d374b	Bitcode: Add trailing comma to MetadataCodes, NFC Suggested in the review of r229004, this should simplify diffs in the future. llvm-svn: 229031	2015-02-13 02:41:36 +00:00
Chandler Carruth	87fdafc7b2	[IC] Fix a bug with the instcombine canonicalizing of loads and propagating of metadata. We were propagating !nonnull metadata even when the newly formed load is no longer of a pointer type. This is clearly broken and results in LLVM failing the verifier and aborting. This patch just restricts the propagation of !nonnull metadata to when we actually have a pointer type. This bug report and the initial version of this patch was provided by Charles Davis! Many thanks for finding this! We still need to add logic to round-trip the metadata correctly if we combine from pointer types to integer types and then back by using range metadata for the integer type loads. But this is the minimal and safe version of the patch, which is important so we can backport it into 3.6. llvm-svn: 229029	2015-02-13 02:30:01 +00:00
Chandler Carruth	415f41258f	[unroll] Avoid the "Insn" abbreviation of Instruction. This is quite hard to type and read for me, and is inconsistent with the other abbreviation in the base class "Inst". For most of these (where they are used widely) I prefer just spelling it out as Instruction. I've changed two of the short-lived variables to use "Inst" to match the base class. llvm-svn: 229028	2015-02-13 02:17:39 +00:00
Olivier Sallenave	83aec218e7	Check interleaving without relying on debug output. llvm-svn: 229027	2015-02-13 02:13:57 +00:00
Chandler Carruth	302a133b1e	[unroll] Tidy up the integer we use to accumululate the number of instructions optimized. NFC, just separating this out from the functionality changing commit. llvm-svn: 229026	2015-02-13 02:10:56 +00:00
Duncan P. N. Exon Smith	1c93116489	AsmWriter/Bitcode: MDImportedEntity llvm-svn: 229025	2015-02-13 01:46:02 +00:00
Duncan P. N. Exon Smith	d45ce96c38	AsmWriter/Bitcode: MDObjCProperty llvm-svn: 229024	2015-02-13 01:43:22 +00:00
Duncan P. N. Exon Smith	0c5c0124ac	AsmWriter/Bitcode: MDExpression llvm-svn: 229023	2015-02-13 01:42:09 +00:00
Duncan P. N. Exon Smith	72fe2d0b79	AsmWriter/Bitcode: MDLocalVariable llvm-svn: 229022	2015-02-13 01:39:44 +00:00
Zachary Turner	00dbc753db	Fix the build, I forgot to check that UnitTests still built. llvm-svn: 229021	2015-02-13 01:39:22 +00:00
Duncan P. N. Exon Smith	c8f810a017	AsmWriter/Bitcode: MDGlobalVariable llvm-svn: 229020	2015-02-13 01:35:40 +00:00
Duncan P. N. Exon Smith	2847f3805e	AsmWriter/Bitcode: MDTemplate{Type,Value}Parameter llvm-svn: 229019	2015-02-13 01:34:32 +00:00
Duncan P. N. Exon Smith	e146000565	AsmWriter/Bitcode: MDNamespace llvm-svn: 229018	2015-02-13 01:32:09 +00:00
Duncan P. N. Exon Smith	06a0702e40	AsmWriter/Bitcode: MDLexicalBlockFile llvm-svn: 229017	2015-02-13 01:30:42 +00:00
Duncan P. N. Exon Smith	a96d409997	AsmWriter/Bitcode: MDLexicalBlock llvm-svn: 229016	2015-02-13 01:29:28 +00:00

... 3 4 5 6 7 ...

113636 Commits