llvm-project

Commit Graph

Author	SHA1	Message	Date
Rafael Espindola	aaaf02206a	Also test the created stubs on 32 bits. llvm-svn: 196052	2013-12-01 21:24:30 +00:00
Andrew Trick	ca45c817c3	Add -mcpu to stackmap.ll llvm-svn: 196051	2013-12-01 18:17:05 +00:00
Tim Northover	45479dcf49	ARM: fix bug in -Oz stack adjustment folding Previously, we clobbered callee-saved registers when folding an "add sp, #N" into a "pop {rD, ...}" instruction. This change checks whether a register we're going to add to the "pop" could actually be live outside the function before doing so and should fix the issue. This should fix PR18081. llvm-svn: 196046	2013-12-01 14:16:24 +00:00
Benjamin Kramer	951b15eb09	Revamp error checking in the ms inline asm parser. - Actually abort when an error occurred. - Check that the frontend lookup worked when parsing length/size/type operators. Tested by a clang test. PR18096. llvm-svn: 196044	2013-12-01 11:47:42 +00:00
Michael Kuperstein	1627eeba71	Ensure bitcode encoding of linkage types stays stable. Patch by Boaz Ouriel llvm-svn: 196042	2013-12-01 10:16:35 +00:00
Bill Wendling	cbcb02c35a	Use accessor methods instead. llvm-svn: 196006	2013-12-01 03:40:42 +00:00
Bill Wendling	2798f1ef58	Use 'unsigned char' to get this past gcc error message: error: invalid conversion from 'unsigned char' to '{anonymous}::Sequence' llvm-svn: 196004	2013-12-01 03:36:07 +00:00
Hal Finkel	42daeae9bd	Add a scheduling model (with itinerary) for the PPC POWER7 This adds a scheduling model for the POWER7 (P7) core, and enables the machine-instruction scheduler when targeting the P7. Scheduling for the P7, like earlier ooo PPC cores, requires considering both dispatch group hazards, and functional unit resources and latencies. These are both modeled in a combined itinerary. Dispatch group formation is still handled by the post-RA scheduler (which still needs to be updated for the P7, but nevertheless does a pretty good job). One interesting aspect of this change is that I've also enabled to use of AA duing CodeGen for the P7 (just as it is for the embedded cores). The benchmark results seem to support this decision (see below), and while this is normally useful for in-order cores, and not for ooo cores like the P7, I think that the dispatch slot hazards are enough like in-order resources to make the AA useful. Test suite significant performance differences (where negative is a speedup, and positive is a regression) vs. the current situation: MultiSource/Benchmarks/BitBench/drop3/drop3 with AA: N/A without AA: -28.7614% +/- 19.8356% (significantly against AA) MultiSource/Benchmarks/FreeBench/neural/neural with AA: -17.7406% +/- 11.2712% without AA: N/A (significantly in favor of AA) MultiSource/Benchmarks/SciMark2-C/scimark2 with AA: -11.2079% +/- 1.80543% without AA: -11.3263% +/- 2.79651% MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt with AA: -41.8649% +/- 17.0053% without AA: -34.5256% +/- 23.7072% MultiSource/Benchmarks/mafft/pairlocalalign with AA: 25.3016% +/- 17.8614% without AA: 38.6629% +/- 14.9391% (significantly in favor of AA) MultiSource/Benchmarks/sim/sim with AA: N/A without AA: 13.4844% +/- 7.18195% (significantly in favor of AA) SingleSource/Benchmarks/BenchmarkGame/Large/fasta with AA: 15.0664% +/- 6.70216% without AA: 12.7747% +/- 8.43043% SingleSource/Benchmarks/BenchmarkGame/puzzle with AA: 82.2713% +/- 26.3567% without AA: 75.7525% +/- 41.1842% SingleSource/Benchmarks/Misc/flops-2 with AA: -37.1621% +/- 20.7964% without AA: -35.2342% +/- 20.2999% (significantly in favor of AA) These are 99.5% confidence intervals from 5 runs per configuration. Regarding the choice to turn on AA during CodeGen, of these results, four seem significantly in favor of using AA, and one seems significantly against. I'm not making this decision based on these numbers alone, but these results seem consistent with results I have from other tests, and so I think that, on balance, using AA is a win. llvm-svn: 195981	2013-11-30 20:55:12 +00:00
Hal Finkel	46402a4211	Split some PPC itinerary classes In preparation for adding scheduling definitions for the POWER7, split some PPC itinerary classes so that the P7's latencies and hazards can be better described. For the most part, this means differentiating indexed from non-index pre-increment loads and stores. Also, differentiate single from double-precision sqrt. No functionality change intended (except for a more-specific latency for single-precision sqrt on the A2). llvm-svn: 195980	2013-11-30 20:41:13 +00:00
Hal Finkel	ca93e47258	Convert a PPC test from grep to FileCheck Convert this test to FileCheck, and improve it to check for the instructions it is trying to exclude instead of checking for register use (especially because grepping for r1 can be thrown off, for example, by a use of r12). llvm-svn: 195979	2013-11-30 20:04:33 +00:00
Hal Finkel	2651f97333	Desensitize a couple of PPC regression tests Use CHECK-DAG to make these regression tests more resilient against changes in instruction scheduling. llvm-svn: 195978	2013-11-30 19:52:28 +00:00
Hal Finkel	2b655bb228	Update the cpu specified on some PPC regression tests Some of these tests did not specify a cpu but were also sensitive to instruction scheduling and/or register assignment choices. A few others similarly-sensitive tests specified a cpu (often the POWER7), and while the P7 currently uses the default model for PPC64, this will soon change. For those tests which should not really be cpu-dependent anyway, the cpu is set to the generic 'ppc64'. llvm-svn: 195977	2013-11-30 19:39:27 +00:00
Zoran Jovanovic	472486714e	Test case for issue with microMIPS long branch. llvm-svn: 195976	2013-11-30 19:13:15 +00:00
Zoran Jovanovic	9d86e26e62	Fixed issue with microMIPS long branch. llvm-svn: 195975	2013-11-30 19:12:28 +00:00
Daniel Sanders	7fd68d6018	[mips][msa] MSA loads and stores have a 10-bit offset. Account for this when lowering FrameIndex. This prevents the compiler from emitting invalid ld.[bhwd]'s and st.[bhwd]'s when the stack frame is between 512 and 32,768 bytes in size. llvm-svn: 195973	2013-11-30 13:47:57 +00:00
Daniel Sanders	7153414768	[mips][msa] A small refactor to reduce patch noise in my next commit No functional change. An if-statement has been split into two nested if-statements. llvm-svn: 195972	2013-11-30 13:15:21 +00:00
Juergen Ributzka	5b6234dc4a	Force CPU type to unbreak unit tests on Haswell machines. llvm-svn: 195971	2013-11-30 03:07:16 +00:00
Andrew Trick	c2ab53a318	Reverse the order of eviction checks for possible compile time savings. No functionality. llvm-svn: 195969	2013-11-29 23:49:38 +00:00
Reed Kotler	ad450f239f	Part 1 of 3 patches that completes very long conditional branches in constant islands for Mips16. We introdcuce JalB16 as a synomnym for Jal16. It makes it easier to read and is also necessary because Jal16 is a call instruction but JalB16 is being used as a branch. Various parts of LLVM will not work properly even in this late stage of the backend if we use what was declared as a call instruction to function as a branch. For one, basic block labels may not get emitted in some situations. llvm-svn: 195968	2013-11-29 22:32:56 +00:00
Zoran Jovanovic	1bc3cce040	Revert revision 195965. llvm-svn: 195967	2013-11-29 22:10:02 +00:00
Petar Jovanovic	e3e940d887	mips: XFAIL llvm-cov test XFAIL llvm-cov.test for MIPS until big-endian issues are fixed for llvm-cov. The test does pass on MIPS little-endian. llvm-svn: 195966	2013-11-29 21:59:09 +00:00
Zoran Jovanovic	ff2a40ce4d	Fixed issue with microMIPS long branch. llvm-svn: 195965	2013-11-29 21:41:24 +00:00
Hal Finkel	1df3205e8c	Adjust PPC A2 input operand latencies On the PPC A2, instructions are only issued after their input operands are ready. Model this by specifying that input operands are read at dispatch (0 cycles after issue). This changes all input operand latencies from 1 to 0. Significant test-suite performance changes (these are 99.5% confidence intervals on 6 runs for both before and after): speedups: MultiSource/Benchmarks/sim/sim -1.21915% +/- 0.175063% MultiSource/Benchmarks/TSVC/LinearDependence-flt/LinearDependence-flt -1.23946% +/- 1.05133% SingleSource/Benchmarks/Misc/flops-2 -1.24237% +/- 0.681362% MultiSource/Applications/JM/lencod/lencod -1.33992% +/- 0.757498% MultiSource/Benchmarks/TSVC/InductionVariable-flt/InductionVariable-flt -1.51802% +/- 1.21468% MultiSource/Benchmarks/TSVC/GlobalDataFlow-flt/GlobalDataFlow-flt -2.18818% +/- 1.28605% MultiSource/Benchmarks/TSVC/Packing-flt/Packing-flt -2.21977% +/- 1.19499% SingleSource/Benchmarks/BenchmarkGame/spectral-norm -2.29822% +/- 0.671871% MultiSource/Benchmarks/TSVC/Packing-dbl/Packing-dbl -2.40975% +/- 0.355931% SingleSource/Benchmarks/Misc/fp-convert -2.41899% +/- 1.04751% MultiSource/Benchmarks/TSVC/Searching-dbl/Searching-dbl -2.50349% +/- 0.126765% SingleSource/Benchmarks/Misc/flops-3 -3.00214% +/- 0.700795% MultiSource/Benchmarks/TSVC/LoopRestructuring-flt/LoopRestructuring-flt -3.56995% +/- 3.2929% MultiSource/Applications/sgefa/sgefa -4.24908% +/- 2.00413% MultiSource/Benchmarks/ASC_Sequoia/IRSmk/IRSmk -18.1294% +/- 3.96489% regressions: MultiSource/Benchmarks/TSVC/Reductions-dbl/Reductions-dbl 1.03249% +/- 0.178547% MultiSource/Applications/hexxagon/hexxagon 1.16597% +/- 0.285235% MultiSource/Benchmarks/TSVC/IndirectAddressing-flt/IndirectAddressing-flt 1.39576% +/- 1.07855% SingleSource/Benchmarks/Misc-C++/stepanov_v1p2 1.71539% +/- 0.173182% MultiSource/Benchmarks/Fhourstones-3.1/fhourstones3.1 1.90013% +/- 0.866472% MultiSource/Benchmarks/TSVC/Recurrences-dbl/Recurrences-dbl 2.39854% +/- 1.05914% MultiSource/Benchmarks/TSVC/ControlFlow-dbl/ControlFlow-dbl 2.4402% +/- 0.817904% MultiSource/Benchmarks/TSVC/LoopRestructuring-dbl/LoopRestructuring-dbl 5.87997% +/- 3.3172% MultiSource/Benchmarks/Trimaran/netbench-crc/netbench-crc 9.02643% +/- 5.79591% MultiSource/Benchmarks/VersaBench/bmm/bmm 10.3517% +/- 1.227% Obviously, there are data points on both sides of this; but I think, overall, this supports making the change. llvm-svn: 195951	2013-11-29 07:04:59 +00:00
Lang Hames	7468daadda	Teach LocalStackSlotAllocation that stackmaps/patchpoints don't have range constraints on their frame offsets. llvm-svn: 195950	2013-11-29 06:35:30 +00:00
Hal Finkel	5a7162f36b	Create a PPC440 SchedMachineModel Some of the older PPC processor definitions don't have associated SchedMachineModels; correct this for the PPC440. llvm-svn: 195949	2013-11-29 06:32:17 +00:00
Hal Finkel	4035e8d86a	Fixup PPC440 load/store operand latencies The operand latencies for loads and stores in the PPC440 itinerary were wrong (the store operands are all inputs, and the "with update" (pre-increment) instructions need a latency for the additional output). llvm-svn: 195948	2013-11-29 06:19:43 +00:00
Hal Finkel	a10bd1d23a	Adjust PPC440 operand latencies The operand latencies for the PPC440 should be specified relative to dispatch, not relative to the initial fetch-and-decode stages. Because most instructions (ignoring bypass) wait in dispatch until their operands are ready, this is modeled as reading input operands "at dispatch" (0 cycles after issue), and so every input and output operand has 4 cycles subtracted from it. This could alter scheduling slightly, but I don't expect a large effect. llvm-svn: 195947	2013-11-29 05:59:00 +00:00
Hal Finkel	dd06369913	Don't model the fetch and decode units for the PPC440 Modeling the fetch and decode units in the PPC440 itinerary does not add anything to the hazard detection capability (and so modeling them just wastes compile time). No functionality change intended. llvm-svn: 195946	2013-11-29 05:58:38 +00:00
Lang Hames	c8a73af391	Remove unused variable from r195944. llvm-svn: 195945	2013-11-29 03:36:53 +00:00
Lang Hames	39609996d9	Refactor a lot of patchpoint/stackmap related code to simplify and make it target independent. Most of the x86 specific stackmap/patchpoint handling was necessitated by the use of the native address-mode format for frame index operands. PEI has now been modified to treat stackmap/patchpoint similarly to DEBUG_INFO, allowing us to use a simple, platform independent register/offset pair for frame indexes on stackmap/patchpoints. Notes: - Folding is now platform independent and automatically supported. - Emiting patchpoints with direct memory references now just involves calling the TargetLoweringBase::emitPatchPoint utility method from the target's XXXTargetLowering::EmitInstrWithCustomInserter method. (See X86TargetLowering for an example). - No more ugly platform-specific operand parsers. This patch shouldn't change the generated output for X86. llvm-svn: 195944	2013-11-29 03:07:54 +00:00
Hao Liu	ba38eee8ac	AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. llvm-svn: 195941	2013-11-29 02:11:22 +00:00
Jiangning Liu	f7b4c7c2ce	Add missing test case for bsl_f64 support of AArch64 NEON. llvm-svn: 195939	2013-11-29 01:38:08 +00:00
Jiangning Liu	c429c00f3b	Add missing pattern for supporting intrinsic function vbsl_f64 with argument double floating point. llvm-svn: 195938	2013-11-29 01:37:15 +00:00
Kevin Qin	337cfcc83c	[AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. llvm-svn: 195936	2013-11-29 01:29:16 +00:00
Stephen Canon	c454964c47	Rein in overzealous InstCombine of fptrunc(OP(fpextend, fpextend)). llvm-svn: 195934	2013-11-28 21:38:05 +00:00
Rafael Espindola	d5bd5a4716	Refactor to remove a bit of duplication. No functionality change. llvm-svn: 195933	2013-11-28 20:12:44 +00:00
Benjamin Kramer	ea1982aff9	Silence sign-compare warning and reduce nesting. No functionality change. llvm-svn: 195932	2013-11-28 19:58:56 +00:00
Rafael Espindola	61b3d0c1fb	Remove an always true parameter. llvm-svn: 195931	2013-11-28 19:35:07 +00:00
NAKAMURA Takumi	226e10edff	[CMake] Let add_public_tablegen_target() provide intrinsics_gen, too. I think, in principle, intrinsics_gen may be added explicitly. That said, it can be added incidentally, since each target already has dependencies to llvm-tblgen. Almost all source files depend on both CommonTaleGen and intrinsics_gen. Explicit add_dependencies() have been pruned under lib/Target. llvm-svn: 195929	2013-11-28 17:04:31 +00:00
NAKAMURA Takumi	c08227de0e	[CMake] Also OptionTests can be free from add_dependencies() with add_public_tablegen_target(). llvm-svn: 195928	2013-11-28 17:04:13 +00:00
NAKAMURA Takumi	ce746c6c49	[CMake] Let add_public_tablegen_target responsible to provide dependency to CommonTableGen. add_public_tablegen_target adds *CommonTableGen to LLVM_COMMON_DEPENDS. LLVM_COMMON_DEPENDS affects add_llvm_library (and other add_target stuff) within its scope. llvm-svn: 195927	2013-11-28 17:04:04 +00:00
Rafael Espindola	848493d886	The global prefix is always one char. Don't use a string for it. llvm-svn: 195926	2013-11-28 17:00:49 +00:00
NAKAMURA Takumi	b2abd160b3	[CMake] Prune include_directories() in llvm/lib/Target, take #2 . I forgot to commit them. They were staging in my local repo. llvm-svn: 195924	2013-11-28 15:30:37 +00:00
Daniel Sanders	063b74ad4e	[mips] Revert test commit r195922. llvm-svn: 195923	2013-11-28 15:26:33 +00:00
Daniel Sanders	eb16443fca	[mips] A test commit to test my Herald and Audit workflow Will be reverted in the next commit llvm-svn: 195922	2013-11-28 15:25:43 +00:00
NAKAMURA Takumi	413518f1f8	[CMake] Prune include_directories() in llvm/lib/Target. add_llvm_target() sets them. llvm-svn: 195921	2013-11-28 14:53:30 +00:00
NAKAMURA Takumi	979e604d8c	Add newline at eof. llvm-svn: 195920	2013-11-28 14:52:52 +00:00
Daniel Sanders	a3365ac962	As myself as code-owner of the MIPS backend (lib/Target/Mips/*) llvm-svn: 195915	2013-11-28 09:36:44 +00:00
Peter Zotov	e7255da917	[OCaml] Add a slash accidentally omitted from Makefile llvm-svn: 195912	2013-11-28 09:03:28 +00:00
Rafael Espindola	3e3a3f1f85	Use the mangler consistently instead of using getGlobalPrefix directly. llvm-svn: 195911	2013-11-28 08:59:52 +00:00
Hal Finkel	92720ab1b2	Don't share functional units among the PPC itineraries Instead of sharing functional unit names between the various PPC itineraries, give each core its own unit names prefixed with the core name. This follows the convention used by other backends (such as ARM), and removes a non-obvious ordering dependency between the various PPCSchedule*.td files. No functionality change intended. llvm-svn: 195908	2013-11-28 06:05:59 +00:00
Jiangning Liu	4bc9dbd846	Remove the variable only used by assert to avoid the build failure caused by build options [-Werror,-Wunused-variable]. llvm-svn: 195905	2013-11-28 01:34:55 +00:00
Hao Liu	f9f468abee	AArch64: Fix a bug about disassembling post-index load single element to 4 vectors llvm-svn: 195903	2013-11-28 01:07:45 +00:00
Reed Kotler	0d409e2dfe	Check in conditional branches for constant islands. Still need to finish conditional branches for very large targets. That will be the next small patch. Everything now should in principle work as good (functionality wise) as without constant islands so we decided at Mips/Imagination to make constant islands the default for Mips16 now so that it will get excercised a lot and this port is still experimentatl though hopefully soon we will change the status. Some more cleanup and code review is in order but things are converging fast. llvm-svn: 195902	2013-11-28 00:56:37 +00:00
Akira Hatanaka	f6109e4ad7	[mips] Redefine TAILCALL as a pseudo instruction. No functionality change. llvm-svn: 195896	2013-11-27 23:58:32 +00:00
David Blaikie	bc7e0d43bf	DebugInfo: Do not include variables only referenced by templates in aranges. ARanges included even extern variables referenced by pointer non-type template parameters even though that variable isn't part of this compilation unit. llvm-svn: 195895	2013-11-27 23:53:52 +00:00
Akira Hatanaka	f9a0ec4fc4	Add MipsOptimizePICCall.cpp to CMakeLists.txt. llvm-svn: 195894	2013-11-27 23:47:25 +00:00
Akira Hatanaka	168d4e5b20	[mips] Implement the following optimizations using dominance information to make PIC calls a little more efficient: 1. Remove instructions setting up $gp if it is known that a function has been called at least once. 2. Save the address of a called function in a register instead of loading it from the GOT at every call site. llvm-svn: 195892	2013-11-27 23:38:42 +00:00
Hal Finkel	3e5a360ba3	Add IIC_ prefix to PPC instruction-class names This adds the IIC_ prefix to the instruction itinerary class names, giving the PPC backend a naming convention for itinerary classes that is more consistent with that used by the X86 and ARM backends. Instruction scheduling in the PPC backend needs a bunch of cleanup and improvement (especially for the ooo cores). This is just a preliminary step. No functionality change intended. llvm-svn: 195890	2013-11-27 23:26:09 +00:00
Rafael Espindola	c90584b6f6	Don't set GlobalPrefix to the default value. llvm-svn: 195884	2013-11-27 21:57:54 +00:00
Rafael Espindola	429e3fb068	The R600 has its own asm printer which doesn't use GlobalPrefix. Drop it. llvm-svn: 195883	2013-11-27 21:52:37 +00:00
Tom Stellard	175e7a8c97	R600: Expand vector FABS NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195881	2013-11-27 21:23:39 +00:00
Tom Stellard	c149dc02d3	R600/SI: Implement spilling of SGPRs v5 SGPRs are spilled into VGPRs using the {READ,WRITE}LANE_B32 instructions. v2: - Fix encoding of Lane Mask - Use correct register flags, so we don't overwrite the low dword when restoring multi-dword registers. v3: - Register spilling seems to hang the GPU, so replace all shaders that need spilling with a dummy shader. v4: - Fix *LANE definitions - Change destination reg class for 32-bit SMRD instructions v5: - Remove small optimization that was crashing Serious Sam 3. https://bugs.freedesktop.org/show_bug.cgi?id=68224 https://bugs.freedesktop.org/show_bug.cgi?id=71285 NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195880	2013-11-27 21:23:35 +00:00
Tom Stellard	859199dad8	R600/SI: Use SGPR_32 register class for 32-bit SMRD outputs Writing to the M0 register from an SMRD instruction hangs the GPU, so we need to use the SGPR_32 register class, which does not include M0. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195879	2013-11-27 21:23:29 +00:00
Tom Stellard	4d566b2edf	R600: Add support for ISD::FROUND NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195878	2013-11-27 21:23:20 +00:00
Lang Hames	fde8e4b7c9	Show stackmap entry encodings in stackmap debug logs. This makes it easier to cross-reference debug output with encoded stack-maps, and to create stackmap test-cases. llvm-svn: 195874	2013-11-27 20:10:16 +00:00
Rafael Espindola	745bd85c6a	Use FileCheck and expand the test a bit. In particular, check the name of the symbol we are putting in the constant pool. llvm-svn: 195865	2013-11-27 19:22:14 +00:00
Rafael Espindola	3dc549dbe3	Remove dead code. MO_ExternalSymbol and MO_JumpTableIndex don't show up in inline asm. llvm-svn: 195861	2013-11-27 18:38:14 +00:00
Rafael Espindola	52434f9673	Convert two if sequences to switches. llvm-svn: 195859	2013-11-27 18:26:51 +00:00
Rafael Espindola	ed20f478bc	Use a switch. llvm-svn: 195857	2013-11-27 18:18:24 +00:00
Rafael Espindola	3c8e147a6b	Use the same tls section name as msvc. We currently error in clang with: "error: thread-local storage is unsupported for the current target", but we can start to get the llvm level ready. When compiling template<typename T> struct foo { static __declspec(thread) int bar; }; template<typename T> __declspec(therad) int foo<T>::bar; template struct foo<int>; msvc produces SECTION HEADER #3 .tls$ name 0 physical address 0 virtual address 4 size of raw data 12F file pointer to raw data (0000012F to 00000132) 0 file pointer to relocation table 0 file pointer to line numbers 0 number of relocations 0 number of line numbers C0301040 flags Initialized Data COMDAT; sym= "public: static int foo<int>::bar" (?bar@?$foo@H@@2HA) 4 byte align Read Write gcc produces a ".data$__emutls_v.<symbol>" for the testcase with __declspec(thread) replaced with thread_local. llvm-svn: 195849	2013-11-27 15:52:11 +00:00
Rafael Espindola	c5c7bb6b20	Remove more dead code now that this is only used for inline asm. MO_ConstantPoolIndex is handled in printLeaMemReference. MO_JumpTableIndex and MO_ExternalSymbol don't show up in inline asm. llvm-svn: 195847	2013-11-27 15:13:06 +00:00
Jiangning Liu	97aa8cf8b7	Fix the AArch64 NEON bug exposed by checking constant integer argument range of ACLE intrinsics. llvm-svn: 195843	2013-11-27 14:02:25 +00:00
Peter Zotov	40a5d378f6	[OCaml] Embed rpath into stub libraries and native executables This commit embeds a set of linker flags with hardcoded paths to the LLVM shared library on --enable-shared builds into .cmxa files and stub dynamic libraries. This solution closely follows existing rules for rpath in the LLVM tools, which had to be modified because of differences in toolchain. Without this patch, OCaml tests as well as opam bindings broke, as neither of those updates LD_LIBRARY_PATH to include the $prefix/lib directory. llvm-svn: 195834	2013-11-27 11:03:18 +00:00
Rafael Espindola	e370147b8c	Convert more methods in static helpers. llvm-svn: 195826	2013-11-27 07:34:09 +00:00
Rafael Espindola	7caa135677	Convert these methods into static functions. llvm-svn: 195825	2013-11-27 07:14:26 +00:00
Rafael Espindola	09cf06c75e	Cleanup and test X86AsmPrinter::printPCRelImm. It is only used for asm printing. On X86 we put basic block addresses on register before passing them to inline asm, so the MO_MachineBasicBlock case was dead. MO_ExternalSymbol was dead since any symbol being passed to inline asm is represented as MO_GlobalAddress. The MO_GlobalAddress and MO_Register cases were not tested. llvm-svn: 195824	2013-11-27 06:53:13 +00:00
Sean Silva	6cda6dc6cf	[docs] Mention gotcha regarding implicit BB numbering Impetus for the clarification by Mikael Lyngvig. llvm-svn: 195812	2013-11-27 04:55:23 +00:00
Hal Finkel	8081ae9134	Fix comment in PPCA2Model llvm-svn: 195807	2013-11-27 03:12:56 +00:00
Rafael Espindola	d0ed730f92	Remove dead argument. llvm-svn: 195806	2013-11-27 02:25:20 +00:00
Chad Rosier	75290c6307	[AArch64] Add support for NEON scalar floating-point absolute difference. llvm-svn: 195803	2013-11-27 01:45:58 +00:00
Chandler Carruth	16f56b4c23	[PM] Remove the underspecified 'getRoot' method from CallGraph. It's only user was an ancient SCC printing bit of the opt tool which really should be walking the call graph the same way the CGSCC pass manager does. llvm-svn: 195800	2013-11-27 01:32:17 +00:00
Rafael Espindola	2d30ae2be9	Use simple section names for COMDAT sections on COFF. With this patch we use simple names for COMDAT sections (like .text or .bss). This matches the MSVC behavior. When merging it is the COMDAT symbol that is used to decide if two sections should be merged, so there is no point in building a fancy name. This survived a bootstrap on mingw32. llvm-svn: 195798	2013-11-27 01:18:37 +00:00
Chandler Carruth	104ba2d09f	[PM] [cleanup] Replace a reserved identifier "_Self" with the injected class name. I think we're no longer using any compilers with sufficiently broken ICN for this use case, but I'll watch the bots and introduce a typedef without a reserved name if any yell at me. llvm-svn: 195793	2013-11-26 22:36:41 +00:00
Nadav Rotem	b0082d246a	PR1860 - We can't save a list of ExtractElement instructions to CSE because some of these instructions may be removed and optimized in future iterations. Instead we save a list of basic blocks that we need to CSE. llvm-svn: 195791	2013-11-26 22:24:25 +00:00
Eric Christopher	f52eddf9ca	80-column fixups. llvm-svn: 195790	2013-11-26 22:23:27 +00:00
Chad Rosier	9653d5c989	[AArch64] Add support for NEON scalar floating-point to integer convert instructions. llvm-svn: 195788	2013-11-26 22:17:37 +00:00
Arnold Schwaighofer	a2c8e008d2	LoopVectorizer: Truncate i64 trip counts of i32 phis if necessary In signed arithmetic we could end up with an i64 trip count for an i32 phi. Because it is signed arithmetic we know that this is only defined if the i32 does not wrap. It is therefore safe to truncate the i64 trip count to a i32 value. Fixes PR18049. llvm-svn: 195787	2013-11-26 22:11:23 +00:00
Chandler Carruth	2c9838f622	[PM] [cleanup] Run clang-format over this file. If fixes many inconsistencies that I'll just need to fix myself as I edit things. llvm-svn: 195784	2013-11-26 20:55:11 +00:00
Chandler Carruth	1b2b73a7a1	[PM] [cleanup] Update doxygen comments to use the new style, add some doxygen comments, make existing comments doxygen comments etc. Also, switch commented-out debug helpers to #if-0-ed out debug helpers. No functionality changed. llvm-svn: 195783	2013-11-26 20:51:48 +00:00
Peter Zotov	5d35f2ce03	[OCaml] Embed the flags necessary for linking with libLLVM.so into .cmxa files llvm-svn: 195782	2013-11-26 20:40:34 +00:00
Reed Kotler	3aeb1d0857	Fix a bug related to constant islands for Mips16 and mips16/32 dual mode. The determination of when we are doing constant pools was being made too early in the asm printer. llvm-svn: 195781	2013-11-26 20:38:40 +00:00
Diego Novillo	c0dd1037c8	Refactor some code in SampleProfile.cpp I'm adding new functionality in the sample profiler. This will require more data to be kept around for each function, so I moved the structure SampleProfile that we keep for each function into a separate class. There are no functional changes in this patch. It simply provides a new home where to place all the new data that I need to propagate weights through edges. There are some other name and minor edits throughout. llvm-svn: 195780	2013-11-26 20:37:33 +00:00
Michael Liao	d617a3015d	Fix PR18054 - Fix bug in (vsext (vzext x)) -> (vsext x) in SIGN_EXTEND_IN_REG lowering where we need to check whether x is a vector type (in-reg type) of i8, i16 or i32; otherwise, that optimization is not valid. llvm-svn: 195779	2013-11-26 20:31:31 +00:00
Diego Novillo	e43611fc45	Add PostDominatorTree::getDescendants. This patch adds the counter-part to DominatorTree::getDescendants. It also fixes a couple of comments I noticed out of date in the DominatorTree class. llvm-svn: 195778	2013-11-26 20:11:12 +00:00
David Blaikie	fd1eff5a0a	DwarfDebug: Include type units in accelerator tables. Since type units aren't in the CUMap, use the DwarfUnits list to iterate over units for tasks such as accelerator table building. llvm-svn: 195776	2013-11-26 19:14:34 +00:00
Renato Golin	1388f07053	Fix spurious return introduced by my earlier patch to DebugInfo llvm-svn: 195775	2013-11-26 18:54:37 +00:00
Nadav Rotem	f9f8482e3a	PR18060 - When we RAUW values with ExtractElement instructions in some cases we generate PHI nodes with multiple entries from the same basic block but with different values. Enabling CSE on ExtractElement instructions make sure that all of the RAUWed instructions are the same. llvm-svn: 195773	2013-11-26 17:29:19 +00:00
Renato Golin	47f46fd42c	Add return to DIType::Verify Code scanner ran by Sylvestre Ledru got a no_return bug in DebugInfo.cpp. Adding the return statements that should be there. llvm-svn: 195772	2013-11-26 16:47:00 +00:00
Stepan Dyatkovskiy	abb8505dc5	PR17925 bugfix. Short description. This issue is about case of treating pointers as integers. We treat pointers as different if they references different address space. At the same time, we treat pointers equal to integers (with machine address width). It was a point of false-positive. Consider next case on 32bit machine: void foo0(i32 addrespace(1)* %p) void foo1(i32 addrespace(2)* %p) void foo2(i32 %p) foo0 != foo1, while foo1 == foo2 and foo0 == foo2. As you can see it breaks transitivity. That means that result depends on order of how functions are presented in module. Next order causes merging of foo0 and foo1: foo2, foo0, foo1 First foo0 will be merged with foo2, foo0 will be erased. Second foo1 will be merged with foo2. Depending on order, things could be merged we don't expect to. The fix: Forbid to treat any pointer as integer, except for those, who belong to address space 0. llvm-svn: 195769	2013-11-26 16:11:03 +00:00
Timur Iskhodzhanov	119f307317	Rename DwarfException methods so the new names are consistent with DwarfDebug and the style guide llvm-svn: 195763	2013-11-26 13:34:55 +00:00
Tim Northover	fa36dfeeca	Darwin-ARM: use movw/movt for static relocations llvm-svn: 195759	2013-11-26 12:45:05 +00:00
Chandler Carruth	954ee10528	[PM] Fix a stale comment after my last refactoring spoted by Joey in review! llvm-svn: 195757	2013-11-26 12:00:58 +00:00
Chandler Carruth	cffb33c53f	[PM] Remove four extraneous 'typename's that Clang (in C++11 mode) is happy with but GCC complains about. I'm assuming both compilers are correct and these are optional in C++11 because I'm too tired to read the standard. ;] llvm-svn: 195748	2013-11-26 11:31:06 +00:00
Chandler Carruth	16ea68e806	[PM] Factor the overwhelming majority of the interface boiler plate out of the two analysis managers into a CRTP base class that can be shared and re-used in building any analysis manager. This will in turn simplify adding yet another analysis manager to the system. The base class provides all of the interface sugar for the analysis manager delegating the functionality back through DerivedT methods which operate on simple pass IDs. It also provides the pass registration, storage, and lookup system which is common across the various formulations of analysis managers. llvm-svn: 195747	2013-11-26 11:24:37 +00:00
Richard Sandiford	dd7dd930d1	[SystemZ] Fix incorrect use of RISBG for a zero-extended right shift We would wrongly transform the testcase into the equivalent of an AND with 1. The problem was that, when testing whether the shifted-in bits of the right shift were significant, we used the width of the final zero-extended result rather than the width of the shifted value. llvm-svn: 195731	2013-11-26 10:53:16 +00:00
Arnaud A. de Grandmaison	b697b538dc	CMake : optionaly enable LLVM to be compiled with -std=c++11 (default: off) In some case, it may be required to build LLVM in C++11 mode, as some the subprojects (like lldb) requires it. This mimics the autoconf behaviour. However, given the discussions on the switch to C++11 of the codebase, this behaviour should evolve to default to C++11 with some checks of the compiler capabilities. llvm-svn: 195727	2013-11-26 10:33:53 +00:00
Chandler Carruth	6378cf539f	[PM] Split the CallGraph out from the ModulePass which creates the CallGraph. This makes the CallGraph a totally generic analysis object that is the container for the graph data structure and the primary interface for querying and manipulating it. The pass logic is separated into its own class. For compatibility reasons, the pass provides wrapper methods for most of the methods on CallGraph -- they all just forward. This will allow the new pass manager infrastructure to provide its own analysis pass that constructs the same CallGraph object and makes it available. The idea is that in the new pass manager, the analysis pass's 'run' method returns a concrete analysis 'result'. Here, that result is a 'CallGraph'. The 'run' method will typically do only minimal work, deferring much of the work into the implementation of the result object in order to be lazy about computing things, but when (like DomTree) there is some up-front computation, the analysis does it prior to handing the result back to the querying pass. I know some of this is fairly ugly. I'm happy to change it around if folks can suggest a cleaner interim state, but there is going to be some amount of unavoidable ugliness during the transition period. The good thing is that this is very limited and will naturally go away when the old pass infrastructure goes away. It won't hang around to bother us later. Next up is the initial new-PM-style call graph analysis. =] llvm-svn: 195722	2013-11-26 04:19:30 +00:00
Chandler Carruth	878b55372a	[PM] Reformat some code with clang-format as I'm going to be editting as part of generalizing the call graph infrastructure for the new pass manager. llvm-svn: 195718	2013-11-26 03:45:26 +00:00
Chandler Carruth	1a60023b94	[PM] Add a really simple trait to the DOTGraphTraitsPass class templates that lets the analysis and graph types be separate and the graph computed from the analysis through some arbitrary user-supplied code. This will allow a call graph to an independent entity from the pass which creates it which is necessary for the new pass manager. llvm-svn: 195717	2013-11-26 03:43:52 +00:00
Kevin Qin	599c47d0de	Refactored the implementation of AArch64 NEON instruction ZIP, UZP and TRN. Fix a bug when mixed use of vget_high_u8() and vuzp_u8(). llvm-svn: 195716	2013-11-26 03:26:47 +00:00
Chandler Carruth	5477a592f9	[PM] Re-format this code with clang-format before making substantial changes to it. No functionality changed. You may wonder why on earth touching this code is involved in the pass manager work as indicated by my lovely '[PM]' tag? Let me tell you a story. <redacted> Yea, it's too long of a story. Let us say that there are yaks, many of them. I am busy shaving them as fast as I can. llvm-svn: 195715	2013-11-26 03:22:09 +00:00
Kevin Qin	33ca18fdcf	[AArch64]Implement 128 bit register copy with NEON. llvm-svn: 195713	2013-11-26 02:33:42 +00:00
Andrew Trick	391dbadb51	StackMap: Implement support for DirectMemRefOp. A Direct stack map location records the address of frame index. This address is itself the value that the runtime requested. This differs from IndirectMemRefOp locations, which refer to a stack locations from which the requested values must be loaded. Direct locations can directly communicate the address if an alloca, while IndirectMemRefOp handle register spills. For example: entry: %a = alloca i64... llvm.experimental.stackmap(i32 <ID>, i32 <shadowBytes>, i64* %a) Since both the alloca and stackmap intrinsic are in the entry block, and the intrinsic takes the address of the alloca, the runtime can assume that LLVM will not substitute alloca with any intervening value. This must be verified by the runtime by checking that the stack map's location is a Direct location type. The runtime can then determine the alloca's relative location on the stack immediately after compilation, or at any time thereafter. This differs from Register and Indirect locations, because the runtime can only read the values in those locations when execution reaches the instruction address of the stack map. llvm-svn: 195712	2013-11-26 02:03:25 +00:00
Andrew Trick	d3ab37cfeb	whitespace llvm-svn: 195711	2013-11-26 02:03:20 +00:00
Chandler Carruth	8a7bdd9194	[PM] Make the (really awesome) file comment here available as part of the Doxygen. llvm-svn: 195709	2013-11-26 01:27:20 +00:00
Chandler Carruth	e1901fafbd	[PM] Reformat this file with clang-format. Mostly fixes inconsistent spacing around the '*' in pointer types. Will let me use clang-format on subsequent changes without introducing any noise. No functionality changed. llvm-svn: 195708	2013-11-26 01:25:07 +00:00
David Blaikie	a357aa03af	DebugInfo: Update test case due to dumper improvements in r195698 The dumper was only dumping one pubtypes set and it was /always/ dumping one pubtypes set even when there were zero sets. Now that the dumper correctly dumps zero, one, or many sets, we can update this test case to test for the absolute absence of a set rather than a bogus/accidental zero-valued set. llvm-svn: 195706	2013-11-26 01:11:02 +00:00
Chandler Carruth	480f5d265a	Lift self-copy protection up to the header file and add self-move protection to the same layer. This is in line with Howard's advice on how best to handle self-move assignment as he explained on SO[1]. It also ensures that implementing swap with move assignment continues to work in the case of self-swap. [1]: http://stackoverflow.com/questions/9322174/move-assignment-operator-and-if-this-rhs llvm-svn: 195705	2013-11-26 00:54:44 +00:00
Chandler Carruth	2664317b66	Fix a self-memcpy which only breaks under Valgrind's memcpy implementation. Silliness, but it'll be a trivial performance optimization. This should clear up a failure on the vg_leak bot. llvm-svn: 195704	2013-11-26 00:44:36 +00:00
Chandler Carruth	14c87f48ec	[PM] Sink a trailing comment to be a doxygen comment. llvm-svn: 195702	2013-11-26 00:37:27 +00:00
Chandler Carruth	9a398f453d	[PM] Rename the 'Mod' member to the more idiomatic 'M'. No functionality changed. llvm-svn: 195701	2013-11-26 00:37:23 +00:00
David Blaikie	fbd29eb3b6	DebugInfo: Remove CompileUnit::constructTypeDIEImpl now that it's just a simple wrapper again. r195698 moved the type unit checking up into getOrCreateTypeDIE so remove the redundant check and fold the functions back together again. llvm-svn: 195700	2013-11-26 00:35:04 +00:00
Chandler Carruth	0decd7da5a	[PM] Clean up a bunch of comments, modernize the doxygen, nuke some whitespace, and a couple of argument name fixes before I start hacking on this code. No functionality changed here. llvm-svn: 195699	2013-11-26 00:29:36 +00:00
David Blaikie	8a263cbc99	DebugInfo: Avoid emitting pubtype entries for type DIEs that just indirect to a type unit. llvm-svn: 195698	2013-11-26 00:22:37 +00:00
Cameron McInally	c592e5251c	Add an intrinsic for the SSE2 PAUSE instruction. llvm-svn: 195697	2013-11-26 00:20:43 +00:00
David Blaikie	9d861bed9b	DebugInfo: Pubtypes: Coelesce pubtype registration with accelerator type registration. It might be possible to eventually use one data structure, but I haven't looked at the exact criteria used for accelerator tables and pubtypes to see if there's good reason for the differences between the two or not. llvm-svn: 195696	2013-11-26 00:15:27 +00:00
Chandler Carruth	ec1fb5c705	Add the test case that I missed when committing r195528. Doh! llvm-svn: 195691	2013-11-25 22:24:27 +00:00
Rafael Espindola	a834e30130	Do the string comparison in the constructor instead of once per nop. Thanks to Roman Divacky for the suggestion. llvm-svn: 195684	2013-11-25 20:50:03 +00:00
Rafael Espindola	009a390c8c	Use -triple to fix the test on non-ELF hosts. llvm-svn: 195682	2013-11-25 20:46:18 +00:00
Rafael Espindola	1b8bfdaae3	Don't use nopl in cpus that don't support it. Patch by Mikulas Patocka. I added the test. I checked that for cpu names that gas knows about, it also doesn't generate nopl. The modified cpus: i686 - there are i686-class CPUs that don't have nopl: Via c3, Transmeta Crusoe, Microsoft VirtualBox - see https://bbs.archlinux.org/viewtopic.php?pid=775414 k6, k6-2, k6-3, winchip-c6, winchip2 - these are 586-class CPUs via c3 c3-2 - see https://bugs.archlinux.org/task/19733 as a proof that Via c3 and c3-Nehemiah don't have nopl llvm-svn: 195679	2013-11-25 20:15:14 +00:00
David Peixotto	7266731f9e	ARM integrated assembler generates incorrect nop opcode This patch fixes a bug in the assembler that was causing bad code to be emitted. When switching modes in an assembly file (e.g. arm to thumb mode) we would always emit the opcode from the original mode. Consider this small example: $ cat align.s .code 16 foo: add r0, r0 .align 3 add r0, r0 $ llvm-mc -triple armv7-none-linux align.s -filetype=obj -o t.o $ llvm-objdump -triple thumbv7 -d t.o Disassembly of section .text: foo: 0: 00 44 add r0, r0 2: 00 f0 20 e3 blx #4195904 6: 00 00 movs r0, r0 8: 00 44 add r0, r0 This shows that we have actually emitted an arm nop (e320f000) instead of a thumb nop. Unfortunately, this encodes to a thumb branch which causes bad things to happen when compiling assembly code with align directives. The fix is to notify the ARMAsmBackend when we switch mode. The MCMachOStreamer was already doing this correctly. This patch makes the same change for the MCElfStreamer. There is still a bug in the way nops are emitted for alignment because the MCAlignment fragment does not store the correct mode. The ARMAsmBackend will emit nops for the last mode it knew about. In the example above, we still generate an arm nop if we add a `.code 32` to the end of the file. PR18019 llvm-svn: 195677	2013-11-25 19:11:13 +00:00
Bill Wendling	9200bb08f9	Unrevert r195599 with testcase fix. I'm not sure how it was checking for the wrong values... PR18023. llvm-svn: 195670	2013-11-25 18:05:22 +00:00
Tim Northover	d34094e525	Fix indentation typo llvm-svn: 195660	2013-11-25 17:04:35 +00:00
Tim Northover	db962e2c45	ARM: remove special cases for Darwin dynamic-no-pic mode. These are handled almost identically to static mode (and ELF's global address materialisation), except that a symbol may have "$non_lazy_ptr" appended. This can be handled by passing appropriate flags along with the instruction instead of using entirely separate pseudo-instructions. llvm-svn: 195655	2013-11-25 16:24:52 +00:00
Rafael Espindola	edcf1ff7d1	Fix .comm and .lcomm on COFF. These should not use COMDATs. GNU as uses .bss for .lcomm and section 0 for .comm. Given static int a; int b; MSVC puts both in .bss. This patch then puts both .comm and .lcomm on .bss. With this change we agree with gas on .lcomm, are much closer on .comm and clang-cl matches msvc on the above example. llvm-svn: 195654	2013-11-25 16:06:04 +00:00
Rafael Espindola	3294e05762	Refactor to make the .bss, .data and .text sections available for other uses. No functionality change. llvm-svn: 195653	2013-11-25 16:00:32 +00:00
Benjamin Kramer	583b00e60a	Make helper function static. llvm-svn: 195650	2013-11-25 15:40:24 +00:00
Tim Northover	dfe2156c91	ARM: remove unused patterns. There is no sane way for an LEApcrel (= single ADR) instruction to generate a global address on any ARM target I know of. Fortunately, no-one was trying to any more, but there were vestigial patterns. llvm-svn: 195644	2013-11-25 14:40:57 +00:00
Amara Emerson	34df448f7c	[ARM] Enable FeatureMP for Cortex-A5 by default. Patch by Oliver Stannard. llvm-svn: 195640	2013-11-25 13:17:15 +00:00
Amara Emerson	f59125f5bb	Revert r195599 as it broke the builds. llvm-svn: 195636	2013-11-25 11:24:18 +00:00
Daniel Sanders	b021c6fdbd	Fixed tryFoldToZero() for vector types that need expansion. Summary: Moved the requirement for SelectionDAG::getConstant() to return legally typed nodes slightly earlier. There were two optional DAGCombine passes that were missed out and were required to produce type-legal DAGs. Simplified a code-path in tryFoldToZero() to use SelectionDAG::getConstant(). This provides support for both promoted and expanded vector types whereas the previous code only supported promoted vector types. Fixes a "Type for zero vector elements is not legal" assertion detected by an llvm-stress generated test. Reviewers: resistor CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2251 llvm-svn: 195635	2013-11-25 11:14:43 +00:00
Tim Northover	89ccb616bd	X86: enable AVX2 under Haswell native compilation Patch by Adam Strzelecki llvm-svn: 195632	2013-11-25 09:52:59 +00:00
Bill Wendling	e3c48709ed	Don't look past volatile loads. A volatile load should block us from trying to coalesce stores. PR18023 llvm-svn: 195599	2013-11-25 05:01:21 +00:00
Hao Liu	fbd2b4484c	Fixed a bug about disassembling AArch64 post-index load/store single element instructions. ie. echo "0x00 0x04 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble echo "0x00 0x00 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble will be disassembled into the same instruction st1 {v0b}[0], [x0], x0. llvm-svn: 195591	2013-11-25 01:53:26 +00:00
NAKAMURA Takumi	edbeaee857	SparcFrameLowering.cpp: Prune 'DL' [-Wunused-variable] llvm-svn: 195590	2013-11-25 00:52:46 +00:00
Chandler Carruth	260258b9c0	Output a bit more information in the debug printing for MBP. This was useful when analyzing parts of zlib's behavior here. llvm-svn: 195588	2013-11-25 00:43:41 +00:00
Venkatraman Govindaraju	1116868a0d	[Sparc] Emit large negative adjustments to SP/FP with sethi+xor instead of sethi+or. This generates correct code for both sparc32 and sparc64. llvm-svn: 195576	2013-11-24 20:23:25 +00:00
Venkatraman Govindaraju	9c338504e5	[Sparc]: Implement LEA pattern for sparcv9. llvm-svn: 195575	2013-11-24 20:07:35 +00:00
Venkatraman Govindaraju	f79528c132	[SparcV9]: Do not emit .register directives for global registers that are clobbered by calls but not used in the function itself. llvm-svn: 195574	2013-11-24 18:41:49 +00:00
Venkatraman Govindaraju	0510db0597	[SparcV9] Enable custom lowering of DYNAMIC_STACKALLOC in sparc64. llvm-svn: 195573	2013-11-24 17:41:41 +00:00
Reed Kotler	a787aa2b1e	Make sure that for C++ emitting LwConstant32 pseudos, that it corresponds to what is needed for constant islands. The prescan method for Mips16 constant islands will eventually go away. It is only temporary and should be done earlier when the instructions are first created or from the DAG. If we keep it here we need to handle better the situation where constant islands is called multiple times since don't want to prescan more than once. llvm-svn: 195569	2013-11-24 06:18:50 +00:00
Bill Wendling	1585fea1a2	Default to a better compression algorithm. llvm-svn: 195567	2013-11-24 05:29:35 +00:00
Reed Kotler	ed00c59cdb	Update older test cases for latest patch. llvm-svn: 195566	2013-11-24 03:37:56 +00:00
Reed Kotler	d3b28ebe03	Fix a funny bug I introduced during conversion of ARM constant islands to Mips. I had to move some code and I moved a declaration forward past it's first use in the function but by nutty coincidence there was another variable of the same name and type and with completely unrelated function that was declared globally in the class so no compilation error ensued. It required some unusual conditions for it to even matter. Caused test case casts.c in test-suite to fail during compilation with a duplicate symbol error. I would have noticed it during final code review for this port. llvm-svn: 195565	2013-11-24 02:53:09 +00:00
Alp Toker	20be263c37	Put an unused result attribute on SmallSet::empty() This matches other empty() container functions in LLVM. No actual usage problems discovered in this instance. llvm-svn: 195562	2013-11-23 23:06:20 +00:00
Chandler Carruth	c1ff9ed6e0	[PM] Complete the cross-layer interfaces with a Module-to-Function proxy. This lets a function pass query a module analysis manager. However, the interface is const to indicate that only cached results can be safely queried. With this, I think the new pass manager is largely functionally complete for modules and analyses. Still lots to test, and need to generalize to SCCs and Loops, and need to build an adaptor layer to support the use of existing Pass objects in the new managers. llvm-svn: 195538	2013-11-23 01:25:07 +00:00
Chandler Carruth	2ad185836f	[PM] Rename TestAnalysisPass to TestFunctionAnalysis to clear the way for a TestModuleAnalysis. llvm-svn: 195537	2013-11-23 01:25:02 +00:00
David Blaikie	72f1a3ec76	DwarfDebug: Move ownership of CompileUnits into DwarfUnits This avoids the need for an extra list of SkeletonCUs and associated cleanup while staging things to be cleaner for further type unit improvements. Also hopefully fixes a memory leak introduced in r195166. llvm-svn: 195536	2013-11-23 01:17:34 +00:00
Manman Ren	d664bd7725	Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. Make tests more robust by removing hard-coded metadata numbers in CHECK lines. llvm-svn: 195535	2013-11-23 01:16:29 +00:00
Chandler Carruth	57458517ef	Migrate metadata information from scalar to vector instructions during SLP vectorization. Based on the code in BBVectorizer. Fixes PR17741. Patch by Raul Silvera, reviewed by Hal and Nadav. Reformatted by my driving of clang-format. =] llvm-svn: 195528	2013-11-23 00:48:34 +00:00
Chandler Carruth	de9afd845b	[PM] Add support to the analysis managers to query explicitly for cached results. This is the last piece of infrastructure needed to effectively support querying up the analysis layers. The next step will be to introduce a proxy which provides access to those layers with appropriate use of const to direct queries to the safe interface. llvm-svn: 195525	2013-11-23 00:38:42 +00:00
Eric Christopher	4751d701b7	Refactor DW_AT_ranges handling to use labels for ranges rather than a non-relocatable number offset. One fixme to make the ranges as discrete data structures and have range lists explicitly represented rather than as a list of symbols. llvm-svn: 195523	2013-11-23 00:05:29 +00:00
Eric Christopher	f8da6aa7c7	Reformat const for readability. llvm-svn: 195522	2013-11-23 00:05:06 +00:00
Chandler Carruth	bceeb22905	[PM] Switch the downward invalidation to be incremental where only the one function's analyses are invalidated at a time. Also switch the preservation of the proxy to fully preserve the lower (function) analyses. Combined, this gets both upward and downward analysis invalidation to a point I'm happy with: - A function pass invalidates its function analyses, and its parent's module analyses. - A module pass invalidates all of its functions' analyses including the set of which functions are in the module. - A function pass can preserve a module analysis pass. - If all function passes preserve a module analysis pass, that preservation persists. If any doesn't the module analysis is invalidated. - A module pass can opt into managing all function analysis invalidation itself or none. - The conservative default is none, and the proxy takes the maximally conservative approach that works even if the set of functions has changed. - If a module pass opts into managing function analysis invalidation it has to propagate the invalidation itself, the proxy just does nothing. The only thing really missing is a way to query for a cached analysis or nothing at all. With this, function passes can more safely request a cached module analysis pass without fear of it accidentally running part way through. llvm-svn: 195519	2013-11-22 23:38:07 +00:00
Chandler Carruth	bfb9bb2437	[PM] Remove a FIXME comment that was fixed by my recent refactorings: now the access to the manager is via the proxy that ensures it behaves correctly. llvm-svn: 195518	2013-11-22 23:37:54 +00:00
Tom Stellard	c0845334da	R600/SI: Fixing handling of condition codes We were ignoring the ordered/onordered bits and also the signed/unsigned bits of condition codes when lowering the DAG to MachineInstrs. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195514	2013-11-22 23:07:58 +00:00
Yuchen Wu	c87ca32163	llvm-cov: Split entry blocks in GCNOProfiling.cpp. gcov expects every function to contain an entry block that unconditionally branches into the next block. clang does not implement basic blocks in this manner, so gcov did not output correct branch info if the entry block branched to multiple blocks. This change splits every function's entry block into an empty block and a block with the rest of the instructions. The instrumentation code will take care of the rest. llvm-svn: 195513	2013-11-22 23:07:45 +00:00
Manman Ren	cb14bbcc48	Debug Info: move StripDebugInfo from StripSymbols.cpp to DebugInfo.cpp. We can share the implementation between StripSymbols and dropping debug info for metadata versions that do not match. Also update the comments to match the implementation. A follow-on patch will drop the "Debug Info Version" module flag in StripDebugInfo. llvm-svn: 195505	2013-11-22 22:06:31 +00:00
Manman Ren	409558f81e	Debug Info: update testing cases to specify the debug info version number. We are going to drop debug info without a version number or with a different version number, to make sure we don't crash when we see bitcode files with different debug info metadata format. llvm-svn: 195504	2013-11-22 21:49:45 +00:00
Jim Grosbach	860934a924	X86: Perform integer comparisons at i32 or larger. Utilizing the 8 and 16 bit comparison instructions, even when an input can be folded into the comparison instruction itself, is typically not worth it. There are too many partial register stalls as a result, leading to significant slowdowns. By always performing comparisons on at least 32-bit registers, performance of the calculation chain leading to the comparison improves. Continue to use the smaller comparisons when minimizing size, as that allows better folding of loads into the comparison instructions. rdar://15386341 llvm-svn: 195496	2013-11-22 19:57:47 +00:00
Manman Ren	fb6439654d	Debug Info: add a constant for debug info version number. This will be used to output the debug info version number as a module flag. llvm-svn: 195494	2013-11-22 19:41:59 +00:00
Matt Arsenault	6ea0aade26	StructurizeCFG: Fix verification failure with some loops. If the beginning of the loop was also the entry block of the function, branches were inserted to the entry block which isn't allowed. If this occurs, create a new dummy function entry block that branches to the start of the loop. llvm-svn: 195493	2013-11-22 19:24:39 +00:00
Matt Arsenault	9fb6e0ba58	StructurizeCFG: Fix inverting a branch on an argument llvm-svn: 195492	2013-11-22 19:24:37 +00:00
Paul Robinson	d89125a5d8	Teach ISel not to optimize 'optnone' functions (revised). Improvements over r195317: - Set/restore EnableFastISel flag instead of just running FastISel within SelectAllBasicBlocks; the flag is checked in various places, and FastISel won't run properly if those places don't do the right thing. - Test looks for normal ISel versus FastISel behavior, and not something more subtle that doesn't work everywhere. Based on work by Andrea Di Biagio. llvm-svn: 195491	2013-11-22 19:11:24 +00:00
Andrew Trick	059e800fda	DEBUG shouldEvict decisions llvm-svn: 195490	2013-11-22 19:07:42 +00:00
Andrew Trick	3621b8a217	Minor cleanup. EvictionCost ctor was confusing relative to the other costs floating around in the code. llvm-svn: 195489	2013-11-22 19:07:38 +00:00
Andrew Trick	4a1abb7ab5	patchpoint: factor SD builder code for live vars. Plain stackmap also optimizes Constant values now. llvm-svn: 195488	2013-11-22 19:07:36 +00:00
Andrew Trick	a2428e0f40	patchpoint: eliminate hard coded operand indices. llvm-svn: 195487	2013-11-22 19:07:33 +00:00
Hans Wennborg	a74768a12a	VS integration: use the correct registry key after r195379 I changed the registry key in that commit, but forgot to update the integration files. This change makes them use the same variable. llvm-svn: 195479	2013-11-22 18:25:43 +00:00
Rafael Espindola	6597992c69	Add a fixed version of r195470 back. The fix is simply to use CurI instead of I when handling aliases to avoid accessing a invalid iterator. original message: Convert linkonce* to weak* instead of strong. Also refactor the logic into a helper function. This is an important improve on mingw where the linker complains about mixed weak and strong symbols. Converting to weak ensures that the symbol is not dropped, but keeps in a comdat, making the linker happy. llvm-svn: 195477	2013-11-22 17:58:12 +00:00
Michael Liao	02160d580b	Fix PR18014 - When simplifying the mask generation for BLEND, check whether that mask is also consumed by other non-BLEND insns. If true, skip that simplification. llvm-svn: 195476	2013-11-22 17:56:57 +00:00
Richard Sandiford	f03789ca3f	[SystemZ] Fix TMHH and TMHL usage for z10 with -O0 I've no idea why I decided to handle TMxx differently from all the other high/low logic operations, but it was a stupid thing to do. The high registers aren't available as separate 32-bit registers on z10, so subreg_h32 can't be used on a GR64 there. I've normally been testing with z196 and with -O3 and so hadn't noticed this until now. llvm-svn: 195473	2013-11-22 17:28:28 +00:00
Rafael Espindola	77aa674cc4	Revert "Convert linkonce* to weak* instead of strong." This reverts commit r195470. Debugging failure in some bots. llvm-svn: 195472	2013-11-22 17:09:34 +00:00
Richard Sandiford	8ee1b77de3	Add a Scalarizer pass. llvm-svn: 195471	2013-11-22 16:58:05 +00:00
Rafael Espindola	5574032575	Convert linkonce* to weak* instead of strong. Also refactor the logic into a helper function. This is an important improvement on mingw where the linker complains about mixed weak and strong symbols. Converting to weak ensures that the symbol is not dropped, but keeps in a comdat, making the linker happy. llvm-svn: 195470	2013-11-22 16:14:30 +00:00
Daniel Sanders	b516aae48e	[mips][msa] Add test case that should have been added in r195456. llvm-svn: 195469	2013-11-22 15:47:18 +00:00
Arnold Schwaighofer	1756e1ea92	SLPVectorizer: Fix whitespace errors. llvm-svn: 195468	2013-11-22 15:47:17 +00:00
Rafael Espindola	5a8e985ad3	Don't produce tail calls when the caller is x86_thiscallcc. The callee will not pop the stack for us. llvm-svn: 195467	2013-11-22 15:18:28 +00:00
Tim Northover	74e3637a0c	ARM: use CHECK-LABEL on a test. llvm-svn: 195457	2013-11-22 13:25:07 +00:00
Daniel Sanders	d40aea8768	Fix typo in a comment added in r195455. Credit to Matheus Almeida for spotting it. llvm-svn: 195456	2013-11-22 13:22:52 +00:00
Daniel Sanders	630dbe0a14	[mips][msa] Fix corner case for integer constant splats with undef values. lowerBUILD_VECTOR() was treating integer constant splats as being legal regardless of whether they had undef values. This caused instruction selection failures when the undefs were legalized to zero, making the constant non-splat. Fixed this by requiring HasAnyUndef to be false for a integer constant splat to be legal. If it is true, a new node is generated with the undefs replaced with the necessary values to remain a splat. llvm-svn: 195455	2013-11-22 13:14:06 +00:00
Chandler Carruth	831bfabad9	[PM] Remove extraneous space that I left in there. llvm-svn: 195453	2013-11-22 12:26:40 +00:00
Chandler Carruth	f2edc07571	[PM] Teach the analysis managers to pass themselves as arguments to the run methods of the analysis passes. Also generalizes and re-uses the SFINAE for transformation passes so that users can write an analysis pass and only accept an analysis manager if that is useful to their pass. This completes the plumbing to make an analysis manager available through every pass's run method if desired so that passes no longer need to be constructed around them. llvm-svn: 195451	2013-11-22 12:11:02 +00:00
Chandler Carruth	71ec5a64dd	[PM] Reverse the template arguments 'PassT' and 'AnalysisManagerT' in several templates. The previous order didn't make any sense as it separated 'IRUnitT' and 'AnalysisManagerT', the types which are essentially paired and passed along together throughout the layers. llvm-svn: 195450	2013-11-22 11:55:38 +00:00
Richard Barton	c31078cded	Add support for Cortex-A12. Patch by Oliver Stannard! llvm-svn: 195448	2013-11-22 11:53:16 +00:00
Chandler Carruth	bf950c0f6f	[PM] Remove the IRUnitT typedef requirement for analysis passes. Since the analysis managers were split into explicit function and module analysis managers, it is now completely trivial to specify this when building up the concept and model types explicitly, and it is impossible to end up with a type error at run time. We instantiate a template when registering a pass that will enforce the requirement at a type-system level, and we produce a dynamic error on all the other query paths to the analysis manager if the pass in question isn't registered. llvm-svn: 195447	2013-11-22 11:46:33 +00:00
Chandler Carruth	5bf5e31c5a	[PM] Fix the analysis templates' usage of IRUnitT. This is supposed to be the whole type of the IR unit, and so we shouldn't pass a pointer to it but rather the value itself. In turn, we need to provide a 'Module *' as that type argument (for example). This will become more relevant with SCCs or other units which may not be passed as a pointer type, but also brings consistency with the transformation pass templates. llvm-svn: 195445	2013-11-22 11:34:43 +00:00
Daniel Sanders	fd8e416879	[mips][msa] Float vector constants cannot use ldi.[wd] directly. Bitcast from the appropriate integer vector type. Fixes an instruction selection failure detected by llvm-stress. llvm-svn: 195444	2013-11-22 11:24:50 +00:00
Kostya Serebryany	4007009815	Revert r195318 as it causes miscompilation (PR18029) llvm-svn: 195439	2013-11-22 10:30:39 +00:00

... 2 3 4 5 6 ...

98132 Commits