llvm-project

Commit Graph

Author	SHA1	Message	Date
Elena Demikhovsky	496656900e	AVX-512: Implemented CMOV for 512-bit vectors llvm-svn: 193747	2013-10-31 13:15:32 +00:00
Richard Sandiford	f834ea19db	[SystemZ] Automatically detect zEC12 and z196 hosts As on other hosts, the CPU identification instruction is priveleged, so we need to look through /proc/cpuinfo. I copied the PowerPC way of handling "generic". Several tests were implicitly assuming z10 and so failed on z196. llvm-svn: 193742	2013-10-31 12:14:17 +00:00
Amara Emerson	f80f95fcc7	[AArch64] Make the use of FP instructions optional, but enabled by default. This adds a new subtarget feature called FPARMv8 (implied by NEON), and predicates the support of the FP instructions and registers on this feature. llvm-svn: 193739	2013-10-31 09:32:11 +00:00
NAKAMURA Takumi	160cef8ddc	llvm/test/Bitcode/invalid.ll: Tweak expresion to mach "llvm-dis.EXE:" llvm-svn: 193738	2013-10-31 06:21:00 +00:00
Rafael Espindola	26b43cac18	Fix a use after free on invalid input. llvm-svn: 193737	2013-10-31 04:20:23 +00:00
Rafael Espindola	8fb73c8778	Fix most memory leaks in tablegen. Found by the valgrind bot. llvm-svn: 193736	2013-10-31 04:07:41 +00:00
Rafael Espindola	6554e5a94d	Merge CallGraph and BasicCallGraph. llvm-svn: 193734	2013-10-31 03:03:55 +00:00
Yuchen Wu	9194d7b063	Updated llvm-cov's OVERVIEW description llvm-svn: 193732	2013-10-31 02:01:24 +00:00
Jim Grosbach	7236678687	Legalize: Improve legalization of long vector extends. When an extend more than doubles the size of the elements (e.g., a zext from v16i8 to v16i32), the normal legalization method of splitting the vectors will run into problems as by the time the destination vector is legal, the source vector is illegal. The end result is the operation often becoming scalarized, with the typical horrible performance. For example, on x86_64, the simple input of: define void @bar(<16 x i8> %a, <16 x i32>* %p) nounwind { %tmp = zext <16 x i8> %a to <16 x i32> store <16 x i32> %tmp, <16 x i32>*%p ret void } Generates: .section __TEXT,__text,regular,pure_instructions .section __TEXT,__const .align 5 LCPI0_0: .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .section __TEXT,__text,regular,pure_instructions .globl _bar .align 4, 0x90 _bar: vpunpckhbw %xmm0, %xmm0, %xmm1 vpunpckhwd %xmm0, %xmm1, %xmm2 vpmovzxwd %xmm1, %xmm1 vinsertf128 $1, %xmm2, %ymm1, %ymm1 vmovaps LCPI0_0(%rip), %ymm2 vandps %ymm2, %ymm1, %ymm1 vpmovzxbw %xmm0, %xmm3 vpunpckhwd %xmm0, %xmm3, %xmm3 vpmovzxbd %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vandps %ymm2, %ymm0, %ymm0 vmovaps %ymm0, (%rdi) vmovaps %ymm1, 32(%rdi) vzeroupper ret So instead we can check if there are legal types that enable us to split more cleverly when the input vector is already legal such that we don't turn it into an illegal type. If the extend is such that it's more than doubling the size of the input we check if - the number of vector elements is even, - the source type is legal, - the type of a split source is illegal, - the type of an extended (by doubling element size) source is legal, and - the type of that extended source when split is legal. If the conditions are met, instead of just splitting both the destination and the source types, we create an extend that only goes up one "step" (doubling the element width), and the continue legalizing the rest of the operation normally. The result is that this operates as a new, more effecient, termination condition for the loop of "split the operation until the destination type is legal." With this change, the above example now compiles to: _bar: vpxor %xmm1, %xmm1, %xmm1 vpunpcklbw %xmm1, %xmm0, %xmm2 vpunpckhwd %xmm1, %xmm2, %xmm3 vpunpcklwd %xmm1, %xmm2, %xmm2 vinsertf128 $1, %xmm3, %ymm2, %ymm2 vpunpckhbw %xmm1, %xmm0, %xmm0 vpunpckhwd %xmm1, %xmm0, %xmm3 vpunpcklwd %xmm1, %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vmovaps %ymm0, 32(%rdi) vmovaps %ymm2, (%rdi) vzeroupper ret This generalizes a custom lowering that was added a while back to the ARM backend. That lowering is no longer necessary, and is removed. The testcases for it, however, provide excellent ARM tests for this change and so remain. rdar://14735100 llvm-svn: 193727	2013-10-31 00:20:48 +00:00
Matt Arsenault	909d0c063f	Fix a few typos llvm-svn: 193723	2013-10-30 23:43:29 +00:00
Matt Arsenault	2ba54c3d90	Fix CodeGen for unaligned loads with address spaces llvm-svn: 193721	2013-10-30 23:30:05 +00:00
Matt Arsenault	38b8ecf378	Teach scalarrepl about address spaces llvm-svn: 193720	2013-10-30 22:54:58 +00:00
Rafael Espindola	55fdcff446	Add calls to doInitialization() and doFinalization() in verifyFunction() The function verifyFunction() in lib/IR/Verifier.cpp misses some calls. It creates a temporary FunctionPassManager that will run a single Verifier pass. Unfortunately, FunctionPassManager is no PassManager and does not call doInitialization() and doFinalization() by itself. Verifier does important tasks in doInitialization() such as collecting type information used to check DebugInfo metadata and doFinalization() does some additional checks. Therefore these checks were missed and debug info couldn't be verified at all, it just crashed if the function had some. verifyFunction() is currently not used in llvm unless -debug option is enabled, and in unittests/IR/VerifierTest.cpp VerifierTest had to be changed to create the function in a module from which the type debug info can be collected. Patch by Michael Kruse. llvm-svn: 193719	2013-10-30 22:37:51 +00:00
Rafael Espindola	6f1b2852fc	Produce .weak_def_can_be_hidden for some linkonce_odr values With this patch llvm produces a weak_def_can_be_hidden for linkonce_odr if they are also unnamed_addr or don't have their address taken. There is not a lot of documentation about .weak_def_can_be_hidden, but from the old discussion about linkonce_odr_auto_hide and the name of the directive this looks correct: these symbols can be hidden. Testing this with the ld64 in Xcode 5 linking clang reduces the number of exported symbols from 21053 to 19049. llvm-svn: 193718	2013-10-30 22:08:11 +00:00
David Blaikie	6b288cfa7a	DebugInfo: Push header handling down into CompileUnit This is a preliminary step to handling type units by abstracting over all (type or compile) units. llvm-svn: 193714	2013-10-30 20:42:41 +00:00
Simon Atanasyan	6a2aaecd66	[Mips] Add more SHF_MIPS_xxx ELF section flags. llvm-svn: 193713	2013-10-30 20:41:45 +00:00
Will Dietz	b67a714d37	Add DebugInfo testcase for high_pc encoded as constant, fixed in r193555. llvm-svn: 193711	2013-10-30 20:27:17 +00:00
Matt Arsenault	614ea99da7	Fix GVN creating bitcast between address spaces llvm-svn: 193710	2013-10-30 19:05:41 +00:00
Tom Roeder	04d88fba3e	This commit adds some (but not all) of the x86-64 relocations that are not currently supported in the ELF object writer, along with a simple test case. llvm-svn: 193709	2013-10-30 18:47:25 +00:00
Rui Ueyama	00e24e48b6	Add {start,end}with_lower methods to StringRef. startswith_lower is ocassionally useful and I think worth adding. endwith_lower is added for completeness. Differential Revision: http://llvm-reviews.chandlerc.com/D2041 llvm-svn: 193706	2013-10-30 18:32:26 +00:00
Artyom Skrobov	c1be9c16bc	[ARM] NEON instructions were erroneously decoded from certain invalid encodings llvm-svn: 193705	2013-10-30 18:10:09 +00:00
Tom Stellard	c947d8ca64	R600: Custom lower f32 = uint_to_fp i64 llvm-svn: 193701	2013-10-30 17:22:05 +00:00
David Blaikie	2d4e11228b	DwarfDebug: Change Abbreviations member from pointer to reference llvm-svn: 193699	2013-10-30 17:14:24 +00:00
Benjamin Kramer	0463e83b1b	fix RST reference in Writing an LLVM Pass Currently, instead of showing up as link, it is rendered as ...of FunctionPass <writing-an-llvm-pass-FunctionPass>. The... PR17733. Patch by Tay Ray Chuan! llvm-svn: 193698	2013-10-30 17:09:32 +00:00
Hans Wennborg	3e9b1c1010	Add #include of raw_ostream.h to MipsSEISelLowering.cpp Fixing this Windows build error: ..\lib\Target\Mips\MipsSEISelLowering.cpp(997) : error C2027: use of undefined type 'llvm::raw_ostream' llvm-svn: 193696	2013-10-30 16:10:10 +00:00
Daniel Sanders	d5f554f0bb	[mips][msa] Correct definition of bins[lr] and CHECK-DAG-ize related tests llvm-svn: 193695	2013-10-30 15:45:42 +00:00
Nuno Lopes	1112eca0af	make ConstantRange::signExtend() optimal the case [x, INT_MIN) was not handled optimally llvm-svn: 193694	2013-10-30 15:36:50 +00:00
Daniel Sanders	ab94b537d7	[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693	2013-10-30 15:20:38 +00:00
Chad Rosier	be020d0309	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193691	2013-10-30 15:19:37 +00:00
Cameron McInally	d184466d1b	Refactor the AVX512 intrinsics. Cluster the intrinsics into the appropriate vector extension class within the .td file. llvm-svn: 193690	2013-10-30 15:19:10 +00:00
Howard Hinnant	811c96fa0e	Rehash but don't grow when full of tombstones. This problem was found and fixed by José Fonseca in March 2011 for SmallPtrSet, committed r128566. But as far as I can tell, all other llvm hash tables retain the same problem: the bucket count can grow without bound while size() remains near constant by repeated insert/erase cycles that tend to fill the container with tombstones. Here is a demo that has been reduced to a trivial case: int main() { llvm::DenseSet<unsigned> d; for (unsigned i = 0; i < 0xFFFFFFF; ++i) { d.insert(i); d.erase(i); } } While the container size() never grows above 1, the bucket count grows like this: nb = 64 nb = 128 nb = 256 nb = 512 nb = 1024 nb = 2048 nb = 4096 nb = 8192 nb = 16384 nb = 32768 nb = 65536 nb = 131072 nb = 262144 nb = 524288 nb = 1048576 nb = 2097152 nb = 4194304 nb = 8388608 nb = 16777216 nb = 33554432 nb = 67108864 nb = 134217728 nb = 268435456 The above program currently consumes a few GB ram. This patch brings the memory consumption down by several orders of magnitude, and keeps the bucket count at 64 for the above test. llvm-svn: 193689	2013-10-30 15:10:54 +00:00
Daniel Sanders	d74b130cc9	[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687	2013-10-30 14:45:14 +00:00
Daniel Sanders	53fe6c4d56	[mips][msa] Combine binsri-like DAG of AND and OR into equivalent VSELECT (or (and $a, $mask), (and $b, $inverse_mask)) => (vselect $mask, $a, $b). where $mask is a constant splat. This allows bitwise operations to make use of bsel. It's also a stepping stone towards matching bins[lr], and bins[lr]i from normal IR. Two sets of similar tests have been added in this commit. The bsel_* functions test the case where binsri cannot be used. The binsr_*_i functions will start to use the binsri instruction in the next commit. llvm-svn: 193682	2013-10-30 13:51:01 +00:00
Daniel Sanders	62aeab83e7	[mips] MipsSETargetLowering now reports DAGCombiner changes when using -debug-only=mips-isel No test since -debug output is intended for developers and not end-users. llvm-svn: 193681	2013-10-30 13:31:27 +00:00
Daniel Sanders	e7ef0c817b	[mips][msa] Added support for matching splat.[bhw] from normal IR (i.e. not intrinsics) splat.d is implemented but this subtest is currently disabled. This is because it is difficult to match the appropriate IR on MIPS32. There is a patch under review that should help with this so I hope to enable the subtest soon. llvm-svn: 193680	2013-10-30 13:07:44 +00:00
Juergen Ributzka	3bd686d493	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." Now Hexagon and SystemZ are not happy with it :-( llvm-svn: 193677	2013-10-30 06:36:19 +00:00
Juergen Ributzka	6ad05d6b95	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. This mask has usually the same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 193676	2013-10-30 05:48:18 +00:00
Bill Wendling	d3b4344af9	Reformat Makefile. No other changes. llvm-svn: 193675	2013-10-30 04:03:03 +00:00
Akira Hatanaka	3048b0248a	[mips] Compute stack alignment on the fly. llvm-svn: 193673	2013-10-30 02:29:43 +00:00
Josh Magee	7245f1d85d	Reformat code with clang-format. Differential Revision: http://llvm-reviews.chandlerc.com/D2057 llvm-svn: 193672	2013-10-30 02:25:14 +00:00
NAKAMURA Takumi	c6823c760c	StackProtector.h: Fix trailing comments for doxygen. [-Wdocumentation] s!//<!///<! llvm-svn: 193669	2013-10-30 00:49:39 +00:00
NAKAMURA Takumi	8970f5386c	Trailing whitespace in a comment line. llvm-svn: 193668	2013-10-30 00:49:33 +00:00
Manman Ren	251a1bd215	Debug Info: code clean up. Use EmitLabelOffsetDifference for handling on darwin platform when non-darwin platforms use EmitLabelPlusOffset. Also fix a bug in EmitLabelOffsetDifference where the size is hard-coded to 4 even though Size is passed in as an argument. llvm-svn: 193660	2013-10-29 23:14:15 +00:00
Manman Ren	ce20d460e2	Debug Info: support for DW_FORM_ref_addr. To support ref_addr, we calculate the section offset of a DIE (i.e. offset of a DIE from beginning of the debug info section). The Offset field in DIE is currently CU-relative. To calculate the section offset, we add a DebugInfoOffset field in CompileUnit to store the offset of a CU from beginning of the debug info section. We set the value in DwarfUnits::computeSizeAndOffset for each CompileUnit. A helper function DIE::getCompileUnit is added to return the CU DIE that the input DIE belongs to. We also add a map CUDieMap in DwarfDebug to help finding the CU for a given CU DIE. For a cross-referenced DIE, we first find the CU DIE it belongs to with getCompileUnit, then we use CUDieMap to get the corresponding CU for the CU DIE. Adding the section offset of the CU with the CU-relative offset of a DIE gives us the seciton offset of the DIE. We correctly emit ref_addr with relocation using EmitLabelPlusOffset when doesDwarfUseRelocationsAcrossSections is true. This commit handles the emission of DW_FORM_ref_addr when we have an attribute with FORM_ref_addr. A follow-on patch will start using ref_addr when adding a DIEEntry. This commit will be tested and verified in the follow-on patch. Reviewed off-list by Eric, Thanks. llvm-svn: 193658	2013-10-29 22:57:10 +00:00
Manman Ren	f4c339e04a	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. Ensure that we create the context before we create a type so that we can add the newly created type to the parent. Remove last use of addToContextOwner now that it's not needed. We use createAndAddDIE to wrap around "new DIE(". Now all shareable DIEs should be added to their parents right after the creation. Reviewed off-list by Eric, Thanks. llvm-svn: 193657	2013-10-29 22:49:29 +00:00
Manman Ren	b504f49448	Struct byval cleanup: add helper functions to reduce code duplication. Helper functions are added: emitPostLd: emit a post-increment load operation with given size. emitPostSt: emit a post-increment store operation with given size. No functionality change. llvm-svn: 193656	2013-10-29 22:27:32 +00:00
Josh Magee	3f1c0e35e6	[stackprotector] Update the StackProtector pass to perform datalayout analysis. This modifies the pass to classify every SSP-triggering AllocaInst according to an SSPLayoutKind (LargeArray, SmallArray, AddrOf). This analysis is collected by the pass and made available for use, but no other pass uses it yet. The next patch will make use of this analysis in PEI and StackSlot passes. The end goal is to support ssp-strong stack layout rules. WIP. Differential Revision: http://llvm-reviews.chandlerc.com/D1789 llvm-svn: 193653	2013-10-29 21:16:16 +00:00
Matt Arsenault	87596662cd	Update comment llvm-svn: 193651	2013-10-29 21:04:19 +00:00
Matt Arsenault	a1ca46d003	Workaround MSVC 32-bit miscompile of getCondCodeAction. Use 32-bit types for the array instead of 64. This should generally be better anyway. In optimized + assert builds, I saw a failure when a cond code / type combination that is never set was loading a non-zero value and hitting the != Promote assert. It turns out when loading the 64-bit value to do the shift, the assembly loads the 2 32-bit halves from non-consecutive addresses. The address the second half of the loaded uint64_t doesn't include the offset of the array in the struct. Instead of being offset + 4, it's just + 4. I'm not entirely sure why this wasn't observed before. setCondCodeAction isn't heavily used by the in-tree targets, and not with the higher valued vector SimpleValueTypes. Only PPC is using one of the > 32 valued types, and that is probably never used by anyone on a 32-bit MSVC compiled host. I ran into this when upgrading LLVM versions, so I guess the value loaded from the nonsense address happened to work out before. No test since I'm not really sure if / how it can be reproduced with the current in tree targets, and it's not supposed to change anything. llvm-svn: 193650	2013-10-29 20:59:29 +00:00
Aaron Ballman	9ab670fb54	Removing a switch statement that contains only a default label. This resolves an MSVC warning. No functional change intended. llvm-svn: 193649	2013-10-29 20:40:52 +00:00
Akira Hatanaka	6b2d841975	[mips] Align the stack to 16-bytes for mfp64. llvm-svn: 193641	2013-10-29 19:29:03 +00:00
Rafael Espindola	88034af278	Remove declared but not implemented function. llvm-svn: 193637	2013-10-29 18:31:14 +00:00
Benjamin Kramer	3b32b2ff10	Fix common typos in the docs. llvm-svn: 193632	2013-10-29 17:53:27 +00:00
Rafael Espindola	e133ed88b5	Move getSymbol to TargetLoweringObjectFile. This allows constructing a Mangler with just a TargetMachine. llvm-svn: 193630	2013-10-29 17:28:26 +00:00
Manman Ren	75cc7658e1	Debug Info: clean up testing case. Add a tag before the name attribute for readability. Use CHECK-NEXT instead of CHECK-NOT followed by a CHECK. Add new lines to separate checking of different DIEs. llvm-svn: 193629	2013-10-29 17:27:14 +00:00
Rafael Espindola	79858aa3df	Add a helper getSymbol to AsmPrinter. llvm-svn: 193627	2013-10-29 17:07:16 +00:00
Weiming Zhao	acf48d75e5	add test cases for frameaddr and returnaddr for aarch64 llvm-svn: 193626	2013-10-29 17:01:29 +00:00
Weiming Zhao	ffade617bd	[AArch64] Implement FrameAddr and ReturnAddr Fixes PR17690 llvm-svn: 193625	2013-10-29 17:00:25 +00:00
Amara Emerson	f9a67fce26	[ARM] Make sure HasCRC is initialized to false in Subtarget. llvm-svn: 193624	2013-10-29 16:54:52 +00:00
Zoran Jovanovic	507e084a18	Support for microMIPS jump instructions llvm-svn: 193623	2013-10-29 16:38:59 +00:00
Tom Stellard	6e1ee476ab	R600/SI: Add compute support for CI v2 v2: - Fix LDS size calculation Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 193621	2013-10-29 16:37:28 +00:00
Tom Stellard	e118b8becd	R600: Expand vector FSQRT ops llvm-svn: 193620	2013-10-29 16:37:20 +00:00
Alexey Samsonov	cbd806aef8	DWARF parser: propery handle DW_FORM_ref_sig8 and fix Windows build. Based on D2050 by Timur Iskhodzhanov. llvm-svn: 193619	2013-10-29 16:32:19 +00:00
Rafael Espindola	7d78b2ae3a	The asm printer has a mangler. Use it. llvm-svn: 193618	2013-10-29 16:24:21 +00:00
Rafael Espindola	69c1d631f2	The AsmPrinter has a Mangler. Use it. llvm-svn: 193617	2013-10-29 16:18:15 +00:00
Rafael Espindola	38c2e65e78	The asm printer has a mangler. Don't keep a second pointer to it. llvm-svn: 193616	2013-10-29 16:11:22 +00:00
Rafael Espindola	e804b1a44e	Support names like llvm-ar-3.4 and llvm-ranlib-3.4. They are used in some packages. For example: http://packages.ubuntu.com/saucy/i386/llvm-3.4/filelist This fixes pr17721. llvm-svn: 193612	2013-10-29 14:25:43 +00:00
Bernard Ogden	fce246f0c6	Test cleanup for v8 instructions Add some missing tests, factor out a test not specific to v8 into its own file. llvm-svn: 193611	2013-10-29 14:16:09 +00:00
Rafael Espindola	5d1b745689	Clarify that GlobalVariables definitions must have an initializer. llvm-svn: 193609	2013-10-29 13:44:11 +00:00
Timur Iskhodzhanov	cb4e7550eb	Quick-fix DebugInfo build on Windows MSVC can't comprehend template<typename T, size_t N> ArrayRef<T> makeArrayRef(const T (&Arr)[N]) { return ArrayRef<T>(Arr); } if Arr is static const uint8_t sizes[]; declared in a templated and defined a few lines later. I'll send a proper fix (i.e. get rid of unnecessary templates) for review soon. llvm-svn: 193604	2013-10-29 12:13:22 +00:00
Bernard Ogden	ee87e85505	ARM: Add subtarget feature for CRC Adds a subtarget feature for the CRC instructions (optional in v8-A) to the ARM (32-bit) backend. Differential Revision: http://llvm-reviews.chandlerc.com/D2036 llvm-svn: 193599	2013-10-29 09:47:35 +00:00
Anders Waldenborg	a36a7825fb	Fix misapplied patch in r193597 Sorry Peter Zotov, entirely my fault. llvm-svn: 193598	2013-10-29 09:37:28 +00:00
Anders Waldenborg	213a63fe53	llvm-c: Make LLVM{Get,Set}Alignment work on {Load,Store}Inst too Patch by Peter Zotov Differential Revision: http://llvm-reviews.chandlerc.com/D1910 llvm-svn: 193597	2013-10-29 09:02:02 +00:00
Tim Northover	d29ddf6713	AArch64: add 'a' inline asm operand modifier This is used in the Linux kernel, and effectively just means "print an address". llvm-svn: 193593	2013-10-29 08:22:33 +00:00
Manman Ren	f6b936bc06	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. This touches creation of namespaces and global variables. The purpose is to handle all DIE creations similarly: constructs the context first, then creates the DIE and immediately adds the DIE to its parent. We use createAndAddDIE to wrap around "new DIE(". llvm-svn: 193589	2013-10-29 05:49:41 +00:00
NAKAMURA Takumi	16c7184ba4	Add llvm/test/Transforms/SLPVectorizer/ARM/lit.local.cfg. Tests there require ARM in targets. llvm-svn: 193580	2013-10-29 02:46:00 +00:00
Alp Toker	6a03374526	Fix "existant" typos llvm-svn: 193579	2013-10-29 02:35:28 +00:00
Richard Smith	58d575926c	Clean up. llvm-svn: 193576	2013-10-29 01:44:23 +00:00
NAKAMURA Takumi	83a05039eb	DWARFFormValue.cpp: Appease gcc to give explicit constructors. error: conversion from `const uint8_t*' to non-scalar type `llvm::ArrayRef<unsigned char>' requested llvm-svn: 193575	2013-10-29 01:43:05 +00:00
Arnold Schwaighofer	89ae217422	ARM cost model: Unaligned vectorized double stores are expensive Updated a test case that assumed that <2 x double> would vectorize to use <4 x float>. radar://15338229 llvm-svn: 193574	2013-10-29 01:33:57 +00:00
Arnold Schwaighofer	77af0f6e82	ARM cost model: Account for zero cost scalar SROA instructions By vectorizing a series of srl, or, ... instructions we have obfuscated the intention so much that the backend does not know how to fold this code away. radar://15336950 llvm-svn: 193573	2013-10-29 01:33:53 +00:00
Arnold Schwaighofer	86252451c4	SLPVectorizer: Use vector type for vectorized memory operations No test case, because with the current cost model we don't see a difference. An upcoming ARM memory cost model change will expose and test this bug. radar://15332579 llvm-svn: 193572	2013-10-29 01:33:50 +00:00
Andrew Kaylor	8935258b4e	Cleaning up comments in lli llvm-svn: 193571	2013-10-29 01:33:14 +00:00
Andrew Kaylor	1ca510ea67	Adding a workaround for __main linking with remote lli and Cygwin/MinGW llvm-svn: 193570	2013-10-29 01:29:56 +00:00
Joerg Sonnenberger	fc18473400	Move the STT_FILE symbols out of the normal symbol table processing for ELF. They can overlap with the other symbols, e.g. if a source file "foo.c" contains a function "foo" with a static variable "c". llvm-svn: 193569	2013-10-29 01:06:17 +00:00
Manman Ren	4a841a86bd	Debug Info: use createAndAddDIE to wrap around "new DIE" in DwarfDebug. This commit ensures DIEs are constructed within a compile unit and immediately added to their parents. Reviewed off-list by Eric. llvm-svn: 193568	2013-10-29 01:03:01 +00:00
Manman Ren	73d697c641	Debug Info: use createAndAddDIE for newly-created Subprogram DIEs. More patches will be submitted to convert "new DIE(" to use createAddAndDIE in DwarfCompileUnit.cpp. This will simplify implementation of addDIEEntry where we have to decide between ref4 and ref_addr, because DIEs that can be shared across CU will be added to a CU already. Reviewed off-list by Eric. llvm-svn: 193567	2013-10-29 00:58:04 +00:00
Manman Ren	b987e517f2	Debug Info: add a helper function createAndAddDIE. It wraps around "new DIE(" and handles the bookkeeping part of the newly-created DIE. It adds the DIE to its parent, and calls insertDIE if necessary. It makes sure that bookkeeping is done at the earliest time and we should not see parentless DIEs if all constructions of DIEs go through this helper function. Later on, we can use an allocator for DIE allocation, and will only need to change createAndAddDIE instead of modifying all the "new DIE(". Reviewed off-list by Eric. llvm-svn: 193566	2013-10-29 00:53:03 +00:00
Alexey Samsonov	330b8939bb	Merge DWARFDIE::extractFast and DWARFDIE::extract into one function. Complicated CU-DIE-specific logic in the latter was never used, and it makes sense to have safety checks for broken dwarf in the former. llvm-svn: 193563	2013-10-28 23:58:58 +00:00
Andrew Kaylor	2873b38e69	Renaming MCJIT .ir files to .ll and moving them to Inputs llvm-svn: 193562	2013-10-28 23:51:03 +00:00
Alexey Samsonov	a56bbf0c8c	DWARF parser: Use ArrayRef to represent form sizes and simplify DWARFDIE::extractFast() interface. No functionality change. llvm-svn: 193560	2013-10-28 23:41:49 +00:00
Alp Toker	5e9ed7cf1d	lit: add missing substitutions for recently added tools llvm-mcmarkup, obj2yaml and yaml2obj were missing from the substitutions list, causing the test suite to fail in a sandboxed environment. llvm-svn: 193559	2013-10-28 23:37:49 +00:00
Alp Toker	0d44e49e92	Quote potential shell expansions found in tests llvm-svn: 193558	2013-10-28 23:37:45 +00:00
Alexey Samsonov	7614212fd1	DWARF parser: since DWARF4, DW_AT_high_pc may be a constant representing function size llvm-svn: 193555	2013-10-28 23:15:15 +00:00
Alexey Samsonov	48cbda5850	DebugInfo: Introduce the notion of "form classes" Summary: Use DWARF4 table of form classes to fetch attributes from DIE in a more consistent way. This shouldn't change the functionality and serves as a refactoring for upcoming change: DW_AT_high_pc has different semantics depending on its form class. Reviewers: dblaikie, echristo Reviewed By: echristo CC: echristo, llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1961 llvm-svn: 193553	2013-10-28 23:01:48 +00:00
Alp Toker	0a09ebf445	Fix the lli --extra-module value_desc llvm-svn: 193552	2013-10-28 22:51:25 +00:00
Rui Ueyama	b6decb0a80	Add a few tests for StringRef::{start,end}with. llvm-svn: 193550	2013-10-28 22:42:54 +00:00
Rafael Espindola	d1cac0af6b	Convert another llc -filetype=obj test. llvm-svn: 193548	2013-10-28 22:17:19 +00:00
Rafael Espindola	3a8c0734f9	Convert another llc -filetype=obj test. llvm-svn: 193547	2013-10-28 22:11:47 +00:00
Rafael Espindola	060e6444ea	Convert another llc -filetype=obj test. llvm-svn: 193546	2013-10-28 22:05:05 +00:00
Andrew Kaylor	4404eb4857	Standardizing lli's extra module command line option llvm-svn: 193544	2013-10-28 21:58:15 +00:00
Bill Wendling	4965e900d9	Remove stray '_'. llvm-svn: 193543	2013-10-28 21:43:54 +00:00
Bill Wendling	c14b8043bb	Use the correct reference. Spotted by Sean Silva. llvm-svn: 193542	2013-10-28 21:43:11 +00:00
Bill Wendling	8edd8f9298	Remove 2.4 from the list of supported Python versions. llvm-svn: 193541	2013-10-28 21:22:23 +00:00
Akira Hatanaka	7d82252d4b	[mips] Simplify LowerFormalArguments using getRegClassFor. No functionality change. llvm-svn: 193540	2013-10-28 21:21:36 +00:00
Rafael Espindola	940ca0bada	Convert another llc -filetype=obj test. llvm-svn: 193539	2013-10-28 21:12:15 +00:00
Rafael Espindola	57ec995c37	Convert another llc -filetype=obj test. llvm-svn: 193538	2013-10-28 21:06:12 +00:00
Rafael Espindola	3a5eecb57c	Convert another llc -filetype=obj test. llvm-svn: 193537	2013-10-28 20:59:41 +00:00
Rafael Espindola	3f018baac0	Convert another llc -filetype=obj test. llvm-svn: 193536	2013-10-28 20:54:33 +00:00
Lang Hames	b52816615b	Return early from getUnconditionalBranchTargetOpValue if the branch target is an MCExpr, in order to avoid writing an encoded zero value in the immediate field. When getUnconditionalBranchTargetOpValue is called with an MCExpr target, we don't know what the final immediate field value should be. We shouldn't explicitly set the immediate field to an encoded zero value as zero is encoded with a non-zero bit pattern. This leads to bits being set that pollute the final immediate value. The nature of the encoding is such that the polluted bits only affect very large immediate values, explaining why this hasn't caused problems earlier. Fixes <rdar://problem/15155975>. llvm-svn: 193535	2013-10-28 20:51:11 +00:00
Rafael Espindola	889a180e5a	Convert a llc -filetype=obj test into a llvm-mc test. llvm-svn: 193534	2013-10-28 20:40:20 +00:00
Ahmed Bougacha	a70ecdc3ac	TableGen: remove unused variable. llvm-svn: 193527	2013-10-28 18:19:04 +00:00
Ahmed Bougacha	141075110c	TableGen: Refactor DAG patterns to enable parsing one pattern at a time. llvm-svn: 193526	2013-10-28 18:07:21 +00:00
Ahmed Bougacha	bd2140018b	TableGen: Refactor AsmWriterEmitter to keep AsmWriterInsts. These used to be referenced by the CGI->AWI map (in AsmWriterEmitter), but stored in a vector local to EmitPrintInstruction. Move the vector to AsmWriterEmitter too. llvm-svn: 193525	2013-10-28 18:07:17 +00:00
Logan Chien	8cbb80d159	[arm] Implement eabi_attribute, cpu, and fpu directives. This commit allows the ARM integrated assembler to parse and assemble the code with .eabi_attribute, .cpu, and .fpu directives. To implement the feature, this commit moves the code from AttrEmitter to ARMTargetStreamers, and several new test cases related to cortex-m4, cortex-r5, and cortex-a15 are added. Besides, this commit also change the Subtarget->isFPOnlySP() to Subtarget->hasD16() to match the usage of .fpu directive. This commit changes the test cases: * Several .eabi_attribute directives in 2010-09-29-mc-asm-header-test.ll are removed because the .fpu directive already cover the functionality. * In the Cortex-A15 test case, the value for Tag_Advanced_SIMD_arch has be changed from 1 to 2, which is more precise. llvm-svn: 193524	2013-10-28 17:51:12 +00:00
Nuno Lopes	8a24152048	simplify ConstantRange::getSetSize() llvm-svn: 193523	2013-10-28 16:52:38 +00:00
Richard Sandiford	094e609716	[SystemZ] Set usaAA to true useAA significantly improves the handling of vector code that has TBAA information attached. It also helps other cases, as shown by the testsuite changes here. The only real downside I've seen is that it interferes with MergeConsecutiveStores. The problem is that that optimization works top down, starting at the first store in the chain, and looks for cases where the chain result is only used by a single related store. These related stores don't alias, so useAA will have rewritten all the later stores to use a different chain input (typically the same one as the first store). I think the advantages outweigh the disadvantages though, so for now I've just disabled alias analysis for the unaligned-01.ll test. llvm-svn: 193521	2013-10-28 13:53:37 +00:00
Richard Sandiford	981fdeb477	[DAGCombiner] Respect volatility when checking for aliases Making useAA() default to true for SystemZ showed that the combiner alias analysis wasn't handling volatile accesses. This hit many of the SystemZ tests, but I arbitrarily picked one for the purpose of this patch. llvm-svn: 193518	2013-10-28 12:00:00 +00:00
Richard Sandiford	39c1ce4dc1	Keep TBAA info when rewriting SelectionDAG loads and stores Most SelectionDAG code drops the TBAA info when creating a new form of a load and store (e.g. during legalization, or when converting a plain load to an extending one). This patch tries to catch all cases where the TBAA information can legitimately be carried over. The patch adds alternative forms of getLoad() and getExtLoad() that take a MachineMemOperand instead of individual fields. (The corresponding getTruncStore() already exists.) The idea is to use the MachineMemOperand forms when all fields are carried over (size, pointer info, isVolatile, isNonTemporal, alignment and TBAA info). If some adjustment is being made, e.g. to narrow the load, then we still pass the individual fields but also pass the TBAA info. llvm-svn: 193517	2013-10-28 11:17:59 +00:00
Alp Toker	d0cdc67caa	lit: multiprocessing platform fix attempt The error raised by Python varies by platform(!), so let's just catch any exception and fall back. Thanks to Sylvestre Ledru for noticing this on a Debian / Python 2.7 system running code coverage. llvm-svn: 193516	2013-10-28 10:26:13 +00:00
Benjamin Kramer	6094f30da2	SCEV: Make the final add of an inbounds GEP nuw if we know that the index is positive. We can't do this for the general case as saying a GEP with a negative index doesn't have unsigned wrap isn't valid for negative indices. %gep = getelementptr inbounds i32* %p, i64 -1 But an inbounds GEP cannot run past the end of address space. So we check for the very common case of a positive index and make GEPs derived from that NUW. Together with Andy's recent non-unit stride work this lets us analyze loops like void foo3(int a, int b) { for (; a < b; a++) {} } PR12375, PR12376. Differential Revision: http://llvm-reviews.chandlerc.com/D2033 llvm-svn: 193514	2013-10-28 07:30:06 +00:00
NAKAMURA Takumi	8a0464393f	Prune utf8 chars in comments. llvm-svn: 193512	2013-10-28 04:07:38 +00:00
NAKAMURA Takumi	0b865d445e	Prune trailing linefeeds. llvm-svn: 193511	2013-10-28 04:07:31 +00:00
NAKAMURA Takumi	4bb85f90fd	Target/R600: Un-tab-ify. llvm-svn: 193510	2013-10-28 04:07:23 +00:00
Reed Kotler	91ae9829a9	Make first substantial checkin of my port of ARM constant islands code to Mips. Before I just ported the shell of the pass. I've tried to keep everything nearly identical to the ARM version. I think it will be very easy to eventually merge these two and create a new more general pass that other targets can use. I have some improvements I would like to make to allow pools to be shared across functions and some other things. When I'm all done we can think about making a more general pass. More to be ported but the basic mechanism works now almost as good as gcc mips16. llvm-svn: 193509	2013-10-27 21:57:36 +00:00
Alp Toker	31bd72fb22	Clarify the comment about BSD versions in r193465 llvm-svn: 193508	2013-10-27 20:49:19 +00:00
Benjamin Kramer	7ad4100f8b	NVPTX: Remove unused globals. llvm-svn: 193500	2013-10-27 11:31:46 +00:00
Benjamin Kramer	602bb4ad86	Hexagon: Remove global state. llvm-svn: 193499	2013-10-27 11:16:09 +00:00
NAKAMURA Takumi	5bb014371e	MCJIT-remote: __main should be resolved in child context. - Mark tests as XFAIL:cygming in test/ExecutionEngine/MCJIT/remote. Rather to suppress them, I'd like to leave them running as XFAIL. - Revert r193472. RecordMemoryManager no longer resolves __main on cygming. There are a couple of issues. - X86 Codegen emits "call __main" in @main for targeting cygming. It is useless in JIT. FYI, tests are passing when emitting __main is disabled. - Current remote JIT does not resolve any symbols in child context. FIXME: __main should be disabled, or remote JIT should resolve __main. llvm-svn: 193498	2013-10-27 10:22:52 +00:00
Elena Demikhovsky	199c823555	AVX-512: PMIN/PMAX intrinsics and patterns Patch by Cameron McInally <cameron.mcinally@nyu.edu> llvm-svn: 193497	2013-10-27 08:18:37 +00:00
Bill Wendling	6822ecb087	A small grammar-os fixed. llvm-svn: 193496	2013-10-27 05:09:12 +00:00
Bill Wendling	e814a37a72	Update to current output. PR14039 llvm-svn: 193494	2013-10-27 04:50:34 +00:00
Bill Wendling	29c7f168cb	Fix Sphinx warning. llvm-svn: 193493	2013-10-27 04:25:02 +00:00
Bill Wendling	e9d5c4809d	Update to specify that both metadata and label types aren't proper return types. PR15447 llvm-svn: 193492	2013-10-27 04:19:29 +00:00
Bill Wendling	27f96dae10	Update the Python version. And Perl isn't used anymore. PR17608 llvm-svn: 193491	2013-10-27 04:02:21 +00:00
Bill Wendling	7bf172cd45	Update link. PR17608 llvm-svn: 193490	2013-10-27 03:57:10 +00:00
Shuxin Yang	2e1890e18b	Revert r193251 : Use address-taken to disambiguate global variable and indirect memops. llvm-svn: 193489	2013-10-27 03:08:44 +00:00
NAKAMURA Takumi	da469ecbbd	lli/RemoteMemoryManager.cpp: Resurrect __main stuff removed in r192504 to unbreak mingw32. llvm-svn: 193472	2013-10-26 13:52:31 +00:00
Joerg Sonnenberger	853b460e4f	self.path may be empty or otherwise miss the normal system directories, so try PATH next. Assume it is sane enough to cover the usual system bash locations too, but the old list is not good enough for NetBSD. llvm-svn: 193471	2013-10-26 13:25:45 +00:00
Alp Toker	54d210b205	lit: Issue a note when multiprocessing fails to load If multiprocessing was requested, detected as available and subsequently failed to initialize it's worth letting the user know about it before falling back to threads. This condition can arise in certain OpenBSD / FreeBSD Python versions. llvm-svn: 193465	2013-10-26 09:29:58 +00:00
Alp Toker	6c5dbd7a0a	Fix a referenced before assignment in r193463 Some versions of Python on the builders seem strict about this. llvm-svn: 193464	2013-10-26 08:46:05 +00:00
Alp Toker	9ade45482a	lit: handle late multiprocessing errors gracefully This should be a better fix for lit multiprocessing failures, replacing the OpenBSD and FreeBSD workarounds in r193413 and r193457. Reference: http://bugs.python.org/issue3770 llvm-svn: 193463	2013-10-26 08:22:44 +00:00
Wan Xiaofei	be640b28c0	Quick look-up for block in loop. This patch implements quick look-up for block in loop by maintaining a hash set for blocks. It improves the efficiency of loop analysis a lot, the biggest improvement could be 5-6%(458.sjeng). Below are the compilation time for our benchmark in llc before & after the patch. Benchmark llc - trunk llc - patched 401.bzip2 0.339081 100.00% 0.329657 102.86% 403.gcc 19.853966 100.00% 19.605466 101.27% 429.mcf 0.049823 100.00% 0.048451 102.83% 433.milc 0.514898 100.00% 0.510217 100.92% 444.namd 1.109328 100.00% 1.103481 100.53% 445.gobmk 4.988028 100.00% 4.929114 101.20% 456.hmmer 0.843871 100.00% 0.825865 102.18% 458.sjeng 0.754238 100.00% 0.714095 105.62% 464.h264ref 2.9668 100.00% 2.90612 102.09% 471.omnetpp 4.556533 100.00% 4.511886 100.99% bitmnp01 0.038168 100.00% 0.0357 106.91% idctrn01 0.037745 100.00% 0.037332 101.11% libquake2 3.78689 100.00% 3.76209 100.66% libquake_ 2.251525 100.00% 2.234104 100.78% linpack 0.033159 100.00% 0.032788 101.13% matrix01 0.045319 100.00% 0.043497 104.19% nbench 0.333161 100.00% 0.329799 101.02% tblook01 0.017863 100.00% 0.017666 101.12% ttsprk01 0.054337 100.00% 0.053057 102.41% Reviewer : Andrew Trick <atrick@apple.com>, Hal Finkel <hfinkel@anl.gov> Approver : Andrew Trick <atrick@apple.com> Test : Pass make check-all & llvm test-suite llvm-svn: 193460	2013-10-26 03:08:02 +00:00
NAKAMURA Takumi	e00225bf14	llvm/test/lit.cfg: Tighten conditions to enable 'native'. I saw the case that 'native' was mis-enabled when x86_64-pc-win32 on x86_64-linux. FIXME: Consider cases that target can be executed even if host_triple were different from target_triple. llvm-svn: 193459	2013-10-26 02:50:20 +00:00
NAKAMURA Takumi	0328dfa6a4	llvm/test/Other/close-stderr.ll: Remove "XFAIL:win32". It reverts r173509. "REQUIRES: shell" should cover if this failed. llvm-svn: 193458	2013-10-26 02:50:14 +00:00
Alp Toker	5853534b03	Attempt to fix the FreeBSD build, disable multiprocessing Speculative quick fix based on clang-X86_64-freebsd output: File "/usr/local/lib/python2.6/multiprocessing/synchronize.py", line 33, in <module> " function, see issue 3770.") ImportError: This platform lacks a functioning sem_open implementation, therefore, the required synchronization primitives needed will not function, see issue 3770. llvm-svn: 193457	2013-10-26 02:43:08 +00:00
Andrew Trick	57243da70f	Fix SCEVExpander: don't try to expand quadratic recurrences outside a loop. Partial fix for PR17459: wrong code at -O3 on x86_64-linux-gnu (affecting trunk and 3.3) When SCEV expands a recurrence outside of a loop it attempts to scale by the stride of the recurrence. Chained recurrences don't work that way. We could compute binomial coefficients, but would hve to guarantee that the chained AddRec's are in a perfectly reduced form. llvm-svn: 193438	2013-10-25 21:35:56 +00:00
Andrew Trick	29abce3189	Fix LSR: don't normalize quadratic recurrences. Partial fix for PR17459: wrong code at -O3 on x86_64-linux-gnu (affecting trunk and 3.3) ScalarEvolutionNormalization was attempting to normalize by adding and subtracting strides. Chained recurrences don't work that way. llvm-svn: 193437	2013-10-25 21:35:52 +00:00
Rafael Espindola	7749d7ccc7	Handle calls and invokes in GlobalStatus. This patch teaches GlobalStatus to analyze a call that uses the global value as a callee, not as an argument. With this change internalize call handle the common use of linkonce_odr functions. This reduces the number of linkonce_odr functions in a LTO build of clang (checked with the emit-llvm gold plugin option) from 1730 to 60. llvm-svn: 193436	2013-10-25 21:29:52 +00:00
Hal Finkel	02f562df43	LoopVectorizer: Don't attempt to vectorize extractelement instructions The loop vectorizer does not currently understand how to vectorize extractelement instructions. The existing check, which excluded all vector-valued instructions, did not catch extractelement instructions because it checked only the return value. As a result, vectorization would proceed, producing illegal instructions like this: %58 = extractelement <2 x i32> %15, i32 0 %59 = extractelement i32 %58, i32 0 where the second extractelement is illegal because its first operand is not a vector. llvm-svn: 193434	2013-10-25 20:40:15 +00:00
David Blaikie	8bc7db777d	DIEHash: Summary hashing of member functions llvm-svn: 193432	2013-10-25 20:04:25 +00:00
Rafael Espindola	e5bf24684f	Try to fix the build on windows. llvm-svn: 193431	2013-10-25 19:47:55 +00:00
Rafael Espindola	1d19c8f03a	Change MemoryBuffer::getFile to take a Twine. llvm-svn: 193429	2013-10-25 19:06:52 +00:00
David Blaikie	65cc969f50	DIEHash: Summary hashing of nested types llvm-svn: 193427	2013-10-25 18:38:43 +00:00
Quentin Colombet	8761a8f5c0	[X86][AVX512] Add patterns that match the AVX512 floating point register vbroadcast intrinsics. Patch by Cameron McInally <cameron.mcinally@nyu.edu> llvm-svn: 193422	2013-10-25 18:04:12 +00:00
Quentin Colombet	4bf1c282c2	[X86][AVX512] Add patterns that match the AVX512 floating point vbroadcast intrinsics. Patch by Cameron McInally <cameron.mcinally@nyu.edu> llvm-svn: 193421	2013-10-25 17:47:18 +00:00
Daniel Sanders	1b71f42f7d	[bugpoint] Increase the default memory limit for subprocesses to 300MB. Summary: Currently shared library builds (BUILD_SHARED_LIBS=ON in cmake) fail three bugpoint tests (BugPoint/remove_arguments_test.ll, BugPoint/crash-narrowfunctiontest.ll, and BugPoint/metadata.ll). If I run the bugpoint commands that llvm-lit runs with without -silence-passes I see errors such as this: opt: error while loading shared libraries: libLLVMSystemZInfo.so: failed to map segment from shared object: Cannot allocate memory It seems that the increased size of the binaries in a shared library build is causing the subprocess to exceed the 100MB memory limit. This patch therefore increases the default limit to a level at which these tests pass. Reviewers: dsanders Reviewed By: dsanders CC: llvm-commits, rafael Differential Revision: http://llvm-reviews.chandlerc.com/D2013 llvm-svn: 193420	2013-10-25 17:41:41 +00:00
Benjamin Kramer	2daaea5db7	llvm-c-test: Don't leak memory buffers. Detected by valgrind. llvm-svn: 193416	2013-10-25 15:58:58 +00:00
Rafael Espindola	5e82540d11	Try to fix the openbsd bot. llvm-svn: 193413	2013-10-25 15:07:59 +00:00
Rafael Espindola	64cc1b0043	Call destroy from ~BasicCallGraph. This fix a memory leak found by valgrind. Calling it from the base class destructor would not destroy the BasicCallGraph bits. FIXME: BasicCallGraph is the only thing that inherits from CallGraph. Can we merge the two? llvm-svn: 193412	2013-10-25 15:01:34 +00:00
Rafael Espindola	fe3be1153f	Use c comments. llvm-svn: 193404	2013-10-25 12:59:02 +00:00
Tim Northover	1744d0ad83	ARM: allow .thumb_func to be separated from symbol definition When assembling, a .thumb_func directive is supposed to be applicable to the next symbol definition, even if there are intervening directives. We were racing ahead to try and find it, and this commit should fix the issue. Patch by Gabor Ballabas llvm-svn: 193403	2013-10-25 12:49:50 +00:00
Yaron Keren	2eac89868c	The FIXME was indeed fixed in the linker, comment removed. llvm-svn: 193402	2013-10-25 12:01:53 +00:00
Tim Northover	c7ea8048e7	ARM: don't expand atomicrmw inline on Cortex-M0 There's a barrier instruction so that should still be used, but most actual atomic operations are going to need a platform decision on the correct behaviour (either nop if single-threaded or OS-support otherwise). rdar://problem/15287210 llvm-svn: 193399	2013-10-25 09:30:24 +00:00
Tim Northover	a564d329c2	LegalizeDAG: allow libcalls for max/min atomic operations ARM processors without ldrex/strex need to be able to make libcalls for all atomic operations, including the newer min/max versions. The alternative would probably be expanding these operations in terms of cmpxchg (as x86 does always), but in the configurations where this matters code-size tends to be paramount so the libcall is more desirable. llvm-svn: 193398	2013-10-25 09:30:20 +00:00
Tim Northover	41d2049180	ARM: tweak test to pass on all platforms A TableGen indeterminacy means that the reason for the failure can vary, and Windows gets the other option. llvm-svn: 193394	2013-10-25 07:34:56 +00:00
Nadav Rotem	d369d4bdf9	Optimize concat_vectors(X, undef) -> scalar_to_vector(X). This optimization is not SSE specific so I am moving it to DAGco. The new scalar_to_vector dag node exposed a missing pattern in the AArch64 target that I needed to add. llvm-svn: 193393	2013-10-25 06:41:18 +00:00
Richard Smith	a2d566fa98	Fix ODR violation. llvm-svn: 193391	2013-10-25 03:29:42 +00:00
Yuchen Wu	03678157b5	llvm-cov dump to dbgs() instead of outs(). llvm-svn: 193390	2013-10-25 02:22:24 +00:00
Yuchen Wu	14ae8e6195	Support for reading program counts in llvm-cov. llvm-cov will now be able to read program counts from the GCDA file and output it in the same format as gcov. The program summary tag was identified from gcov-io.h as "\0\0\0\a3". There is currently a bug in GCOVProfiling.cpp which does not generate the run- or program-counting IR, so this change was tested manually by modifying the GCDA file and comparing the gcov and llvm-cov outputs. llvm-svn: 193389	2013-10-25 02:22:21 +00:00
Jim Grosbach	c16a657ad0	ARM: Test r193381 a bit more thoroughly. Make sure we're predicating right based on CPU even if the triple is 'wrong'. llvm-svn: 193382	2013-10-24 23:11:05 +00:00
Jim Grosbach	1d1d6d4675	ARM: Tweak usage of '*vfp' compiler_rt functions. Only use them if the subtarget has ARM mode, as these routines are implemented as ARM code. rdar://15302004 llvm-svn: 193381	2013-10-24 23:07:11 +00:00
David Blaikie	d8c5b4e8ef	MCStreamer: Reimplement the virtual EmitRawText as a protected member, EmitRawTextImpl, to avoid string literal ambiguities Also improve the implementation of EmitRawText(Twine) so it doesn't bother using the SmallString buffer if the Twine is a simple StringRef anyway. llvm-svn: 193378	2013-10-24 22:43:10 +00:00
Reid Kleckner	ddac15108a	lto.h: Use lto_bool_t instead of int to restore the ABI This reverts commit r193255 and instead creates an lto_bool_t typedef that points to bool, _Bool, or unsigned char depending on what is available. Only recent versions of MSVC provide a stdbool.h header. Reviewers: rafael.espindola Differential Revision: http://llvm-reviews.chandlerc.com/D2019 llvm-svn: 193377	2013-10-24 22:26:04 +00:00
David Blaikie	68642d3118	DWARF emission: Remove unnecessary/redundant DIE reference code The default case at the end of the switch handles this just fine. llvm-svn: 193374	2013-10-24 22:00:44 +00:00
Eric Christopher	e34116750f	Fix name of variable in comment. llvm-svn: 193373	2013-10-24 21:54:58 +00:00
Eric Christopher	670ee0e941	Grammar. llvm-svn: 193372	2013-10-24 21:20:23 +00:00
Eric Christopher	b088d2d0bc	Update misleading comment. llvm-svn: 193371	2013-10-24 21:05:08 +00:00
Eric Christopher	dd542ef786	Formatting and whitespace. llvm-svn: 193370	2013-10-24 21:04:51 +00:00
David Blaikie	2aee7be871	DIEHash: Const correct and use references where non-null/non-rebound. llvm-svn: 193363	2013-10-24 18:29:03 +00:00
David Blaikie	32744412d2	DIEHash: Do not use shallow type hashing for unnamed types llvm-svn: 193361	2013-10-24 17:53:58 +00:00
David Blaikie	afcb9656c3	DIEHash: Refactor ref attribute hashing into smaller functions llvm-svn: 193360	2013-10-24 17:51:43 +00:00
David Blaikie	e568225fc3	Remove unused debug-only member variable. This may've been used at some point but the 'print' member function grew an Indent parameter that entirely shadows this parameter. llvm-svn: 193358	2013-10-24 17:10:13 +00:00
David Peixotto	b0653e539b	Remove class abstraction from ARM struct byval lowering This commit changes the struct byval lowering for arm to use inline checks for the subtarget instead of a class abstraction to represent the differences. The class abstraction was judged to be too much code for this task. No intended functionality change. llvm-svn: 193357	2013-10-24 16:39:36 +00:00
Tom Stellard	bc7d87f07c	Inliner: Handle readonly attribute per argument when adding memcpy Patch by: Vincent Lejeune llvm-svn: 193356	2013-10-24 16:38:33 +00:00
Renato Golin	9f36932c8d	I had to move and remove llvm-svn: 193355	2013-10-24 16:31:43 +00:00
Tim Northover	5620faf771	ARM: Mark double-precision instructions as such This prevents us from silently accepting invalid instructions on (for example) Cortex-M4 with just single-precision VFP support. No tests for the extra Pat Requires because they're essentially assertions: the affected code should have been lowered to libcalls before ISel. rdar://problem/15302004 llvm-svn: 193354	2013-10-24 15:49:39 +00:00
Renato Golin	e865d70678	Fix broken builds by moving test to x86 dir llvm-svn: 193351	2013-10-24 15:11:03 +00:00
John Thompson	6cd5bd4a3d	Reverting my r193344 checkin due to build breakage. llvm-svn: 193350	2013-10-24 14:52:56 +00:00
Renato Golin	1ba143e140	Mark vector loops as already vectorized Make sure we mark all loops (scalar and vector) when vectorizing, so that we don't try to vectorize them anymore. Also, set unroll to 1, since this is what we check for on early exit. llvm-svn: 193349	2013-10-24 14:50:51 +00:00
John Thompson	e38e57206f	Added std::string as a built-in type for mapping. llvm-svn: 193344	2013-10-24 13:36:58 +00:00
Tim Northover	225bcbbe71	ARM: add a couple more NEON predicates. The fused multiply instructions were added in VFPv4 but are still NEON instructions, in particular they shouldn't be available on a Cortex-M4 not matter how floaty it is. llvm-svn: 193342	2013-10-24 12:48:05 +00:00
Tim Northover	64dacb2b8a	ARM: mark various aliases with their architecture requirements. If an alias inherits directly from InstAlias then it doesn't get any default "Requires" values, so llvm-mc will allow it even on architectures that don't support the underlying instruction. This tidies up the obvious VFP and NEON cases I found. llvm-svn: 193340	2013-10-24 12:22:58 +00:00
Zoran Jovanovic	2f0a712e18	Added tests for microMIPS relocations 1. llvm-svn: 193332	2013-10-24 10:55:00 +00:00
Tim Northover	94ecbd2e6c	ARM: Use non-VFP softcalls on embedded Darwinish targets The compiler-rt functions __adddf3vfp and so on exist purely to allow Thumb1 code to make use of VFP instructions by switching back to ARM mode, they make no sense for M-class processors which don't even have an ARM mode. Given that justification, in practice this is a platform ABI decision so the actual check is based on that rather than CPU features. rdar://problem/15302004 llvm-svn: 193327	2013-10-24 10:37:09 +00:00
Yaron Keren	1ec9df3322	Replaced non-ASCII character. llvm-svn: 193324	2013-10-24 10:04:47 +00:00
Chandler Carruth	d55d159d09	Revert part of r193291, restoring the deletion of loaded objects. Without this, customers of the MCJIT were leaking memory like crazy. It's not really clear what the right memory management is here, so I'm not trying to add lots of tests or other logic, just trying to get us back to a better baseline. I'll follow up on the original commit to figure out the right path forward. llvm-svn: 193323	2013-10-24 09:52:56 +00:00
Tim Northover	741e6ef4d4	ARM: fix assert on unpredictable POP instruction. POP instructions are aliased to the ARM LDM variants but have different syntax. This caused two problems: we tried to access a non-existent operand to annotate the '!', and the error message didn't make much sense. With some vigorous hand-waving in the error message both problems can be fixed. llvm-svn: 193322	2013-10-24 09:37:18 +00:00
Yaron Keren	744fcdf587	Added test for -elf configuration, to see that _alloca call is properly generated. See: http://llvm.org/viewvc/llvm-project?view=revision&revision=193289 llvm-svn: 193321	2013-10-24 09:36:08 +00:00
Job Noorman	a8d35c98fd	Make sure SP is always aligned on a 2 byte boundary llvm-svn: 193320	2013-10-24 09:32:31 +00:00

... 2 3 4 5 6 ...

97219 Commits