llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	3ef5e46b6d	MemCpyOpt: When merging memsets also merge the trivial case of two memsets with the same destination. The testcase is from PR19092, but I think the bug described there is actually a clang issue. llvm-svn: 203489	2014-03-10 21:05:13 +00:00
Evan Cheng	0e8f4612a9	For functions with ARM target specific calling convention, when simplify-libcall optimizes a call to an llvm intrinsic to something that involves a call to a C library call, make sure it sets the right calling convention on the call. e.g. extern double pow(double, double); double t(double x) { return pow(10, x); } Compiles to something like this for AAPCS-VFP: define arm_aapcs_vfpcc double @t(double %x) #0 { entry: %0 = call double @llvm.pow.f64(double 1.000000e+01, double %x) ret double %0 } declare double @llvm.pow.f64(double, double) #1 Simplify libcall (part of instcombine) will turn the above into: define arm_aapcs_vfpcc double @t(double %x) #0 { entry: %__exp10 = call double @__exp10(double %x) #1 ret double %__exp10 } declare double @__exp10(double) The pre-instcombine code works because calls to LLVM builtins are special. Instruction selection will choose the right calling convention for the call. However, the code after instcombine is wrong. The call to __exp10 will use the C calling convention. I can think of 3 options to fix this. 1. Make "C" calling convention just work since the target should know what CC is being used. This doesn't work because each function can use different CC with the "pcs" attribute. 2. Have Clang add the right CC keyword on the calls to LLVM builtin. This will work but it doesn't match the LLVM IR specification which states these are "Standard C Library Intrinsics". 3. Fix simplify libcall so the resulting calls to the C routines will have the proper CC keyword. e.g. %__exp10 = call arm_aapcs_vfpcc double @__exp10(double %x) #1 This works and is the solution I implemented here. Both solutions #2 and #3 would work. After carefully considering the pros and cons, I decided to implement #3 for the following reasons. 1. It doesn't change the "spec" of the intrinsics. 2. It's a self-contained fix. There are a couple of potential downsides. 1. There could be other places in the optimizer that are broken in the same way and are not addressed by this. 2. There could be other calling conventions that need to be propagated by simplify-libcall and are not handled. But for now, this is the fix that I'm most comfortable with. llvm-svn: 203488	2014-03-10 20:49:45 +00:00
Eli Bendersky	d47a5c2d3f	Followup to r203483 - add test. [forgot to 'svn add' before committing r203483] llvm-svn: 203485	2014-03-10 20:36:04 +00:00
Sasa Stankovic	5fddf61089	[mips] Implement NaCl sandboxing of loads, stores and SP changes: * Add masking instructions before loads and stores (in MC layer). * Add masking instructions after SP changes (in MC layer). * Forbid loads, stores and SP changes in delay slots (in MI layer). Differential Revision: http://llvm-reviews.chandlerc.com/D2904 llvm-svn: 203484	2014-03-10 20:34:23 +00:00
Adam Nemet	47492919c6	[bugpoint] Add testcase for r203343. llvm-svn: 203472	2014-03-10 16:58:54 +00:00
Reed Kotler	96b7402bac	Fix regression with -O0 for mips. llvm-svn: 203469	2014-03-10 16:31:25 +00:00
JF Bastien	76086c667d	Add test for LinkModules warning on triple, modified by r203009. Datalayout is already tested. llvm-svn: 203468	2014-03-10 15:54:49 +00:00
Matheus Almeida	64459d296b	[mips] Assembly parser must invoke the target streamer to handle .set reorder macro. llvm-svn: 203459	2014-03-10 13:21:10 +00:00
Tim Northover	2a661f3f73	AArch64: fix LowerCONCAT_VECTORS for new CodeGen. The function was making too many assumptions about its input: 1. The NEON_VDUP optimisation was far too aggressive, assuming (I think) that the input would always be BUILD_VECTOR. 2. We were treating most unknown concats as legal (by returning Op rather than SDValue()). I think only concats of pairs of vectors are actually legal. http://llvm.org/PR19094 llvm-svn: 203450	2014-03-10 09:34:07 +00:00
Venkatraman Govindaraju	f703132b09	[Sparc] Add support for decoding 'swap' instruction. llvm-svn: 203424	2014-03-09 23:32:07 +00:00
NAKAMURA Takumi	1783e1e984	Revert r203230, "CodeGenPrep: sink extends of illegal types into use block." It choked i686 stage2. llvm-svn: 203386	2014-03-09 11:01:07 +00:00
David Majnemer	c4ab61cb2f	IR: Change inalloca's grammar a bit The grammar for LLVM IR is not well specified in any document but seems to obey the following rules: - Attributes which have parenthesized arguments are never preceded by commas. This form of attribute is the only one which ever has optional arguments. However, not all of these attributes support optional arguments: 'thread_local' supports an optional argument but 'addrspace' does not. Interestingly, 'addrspace' is documented as being a "qualifier". What constitutes a qualifier? I cannot find a definition. - Some attributes use a space between the keyword and the value. Examples of this form are 'align' and 'section'. These are always preceded by a comma. - Otherwise, the attribute has no argument. These attributes do not have a preceding comma. Sometimes an attribute goes before the instruction, between the instruction and its type, or after its type. 'atomicrmw' has 'volatile' between the instruction and the type while 'call' has 'tail' preceding the instruction. With all this in mind, it seems most consistent for 'inalloca' on an 'alloca' instruction to occur between the instruction and the type. Unlike the current formulation, there would be no preceding comma. The combination 'alloca inalloca' doesn't look particularly appetizing, perhaps a better spelling of 'inalloca' is down the road. llvm-svn: 203376	2014-03-09 06:41:58 +00:00
Adam Nemet	4203039760	Update comment from r203315 based on review llvm-svn: 203361	2014-03-08 21:51:55 +00:00
David Blaikie	078278fe3a	DebugInfo: further improvements to test following up on r203329 llvm-svn: 203337	2014-03-08 02:45:53 +00:00
David Blaikie	f528f054d0	DebugInfo: Fix test fallout from r203323 Will fix this harder in a moment. llvm-svn: 203329	2014-03-08 01:32:51 +00:00
David Blaikie	26ab6c6dd5	DebugInfo: Use DW_FORM_data4 for DW_AT_high_pc in DW_TAG_lexical_blocks Suggested by Adrian Prantl in code review for r203187 llvm-svn: 203323	2014-03-08 00:58:20 +00:00
Eric Christopher	4f17ee09f9	Add support for hashing location information for CU level hashes. Add a testcase based on sret.cpp where we can now hash the entire compile unit. llvm-svn: 203319	2014-03-08 00:29:41 +00:00
Adam Nemet	5117f5dffc	[DAGCombiner] Recognize another rotation idiom This is the new idiom: x<<(y&31) | x>>((0-y)&31) which is recognized as: x ROTL (y&31) The change refines matchRotateSub. In Neg & (OpSize - 1) == (OpSize - Pos) & (OpSize - 1), if Pos is Pos' & (OpSize - 1) we can just use Pos' instead of Pos. llvm-svn: 203315	2014-03-07 23:56:28 +00:00
Arnold Schwaighofer	d33e942958	ISel: Make VSELECT selection terminate in cases where the condition type has to be split and the result type widened. When the condition of a vselect has to be split it makes no sense widening the vselect and thereby widening the condition. We end up in an endless loop of widening (vselect result type) and splitting (condition mask type) doing this. Instead, split both the condition and the vselect and widen the result. I ran this over the test suite with i686 and mattr=+sse and saw no regressions. Fixes PR18036. llvm-svn: 203311	2014-03-07 23:25:55 +00:00
Adrian Prantl	887e70786a	Remove unnecessary test for Darwin and update testcase to be a little less horrible/fragile. rdar://problem/16264854 llvm-svn: 203309	2014-03-07 23:07:21 +00:00
Sasa Stankovic	1e50b46bf9	Moved test file from test/MC/Mips to test/CodeGen/Mips. llvm-svn: 203298	2014-03-07 22:08:46 +00:00
David Blaikie	555e79a304	DebugInfo: Use DW_FORM_data4 for DW_AT_high_pc in inlined functions Suggested by Adrian Prantl in code review for r203187. llvm-svn: 203296	2014-03-07 22:00:56 +00:00
David Blaikie	3e4ff7a92a	DebugInfo: Update test to cover linux (with a FIXME...) too llvm-svn: 203295	2014-03-07 22:00:49 +00:00
Tom Stellard	e28859f8fa	R600/SI: Using SGPRs is illegal for instructions that read carry-out from VCC Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 203281	2014-03-07 20:12:39 +00:00
Tom Stellard	1c8788ef5a	R600/SI: Custom lower i1 stores These are sometimes created by the shrink to boolean optimization in the globalopt pass. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 203280	2014-03-07 20:12:33 +00:00
David Blaikie	d723f5186e	DebugInfo: Restrict DW_AT_high_pc encoding as data4 offset to DWARF 4 as per spec Code review feedback to r203187 from Oliver Stannard. Thanks! llvm-svn: 203256	2014-03-07 18:04:24 +00:00
Duncan P. N. Exon Smith	29db0eb855	ARM: Make .unreq directives case-insensitive Be case-insensitive when processing .unreq directives. Patch by Lin Zuojian! llvm-svn: 203251	2014-03-07 16:16:52 +00:00
Tim Northover	ad3d81d320	CodeGenPrep: sink extends of illegal types into use block. This helps the instruction selector to lower an i64 * i64 -> i128 multiplication into a single instruction on targets which support it. Patch by Manuel Jacob. llvm-svn: 203230	2014-03-07 11:04:30 +00:00
Tim Northover	fad2761ca0	InstCombine: form shuffles from wider range of insert/extractelements Sequences of insertelement/extractelements are sometimes used to build vectors; this code tries to put them back together into shuffles, but could only produce completely uniform shuffle types (<N x T> from two <N x T> sources). This should allow shuffles with different numbers of elements on the input and output sides as well. llvm-svn: 203229	2014-03-07 10:24:44 +00:00
Rafael Espindola	b1f25f1b93	Replace PROLOG_LABEL with a new CFI_INSTRUCTION. The old system was fairly convoluted: * A temporary label was created. * A single PROLOG_LABEL was created with it. * A few MCCFIInstructions were created with the same label. The semantics were that the cfi instructions were mapped to the PROLOG_LABEL via the temporary label. The output position was that of the PROLOG_LABEL. The temporary label itself was used only for doing the mapping. The new CFI_INSTRUCTION has a 1:1 mapping to MCCFIInstructions and points to one by holding an index into the CFI instructions of this function. I did consider removing MMI.getFrameInstructions completely and having CFI_INSTRUCTION own a MCCFIInstruction, but MCCFIInstructions have non trivial constructors and destructors and are somewhat big, so this setup is probably better. The net result is that we don't create temporary labels that are never used. llvm-svn: 203204	2014-03-07 06:08:31 +00:00
Karthik Bhat	b67688a87c	Allow constant folding of round function whenever feasible llvm-svn: 203198	2014-03-07 04:36:21 +00:00
David Blaikie	479323a62b	DebugInfo: Limit r203187 to non-darwin as lldb can't handle this yet llvm-svn: 203192	2014-03-07 02:19:41 +00:00
David Blaikie	48b1bdcf28	DebugInfo: Emit DW_TAG_subprogram's DW_AT_high_pc as an offset from the low_pc This removes a relocation from each subprogram, reducing link times, etc. llvm-svn: 203187	2014-03-07 01:30:55 +00:00
David Blaikie	f5040a64bb	DebugInfo: Refactor test to not rely on fixed DIE offsets llvm-svn: 203186	2014-03-07 01:19:31 +00:00
David Blaikie	b9a0265cc1	DebugInfo: Improve test to not depend on the specific naming of temporary symbols llvm-svn: 203184	2014-03-07 00:23:38 +00:00
Rafael Espindola	3b30cb41a9	Remove shouldEmitUsedDirectiveFor. Clang now uses llvm.compiler.used for these cases. llvm-svn: 203174	2014-03-06 22:47:08 +00:00
Rafael Espindola	123256a4aa	Convert test to FileCheck. llvm-svn: 203173	2014-03-06 22:21:43 +00:00
Andrea Di Biagio	6292a140ee	[X86] Teach the DAGCombiner how to fold an OR of two shufflevector nodes. This patch teaches the DAGCombiner how to fold a binary OR between two shufflevector nodes into a single shufflevector when possible. The rules are: 1. fold (or (shuf A, V_0, MA), (shuf B, V_0, MB)) -> (shuf A, B, Mask1) 2. fold (or (shuf A, V_0, MA), (shuf B, V_0, MB)) -> (shuf B, A, Mask2) The DAGCombiner can take advantage of the fact that OR is commutative and compute two possible shuffle masks (Mask1 and Mask2) for the resulting shuffle node. Before folding a dag according to either rule 1 or 2, DAGCombiner verifies that the resulting shuffle mask is legal for the target. DAGCombiner first tries to fold according to rule 1; if that is not possible, it tries to fold according to rule 2. If both Mask1 and Mask2 are illegal then we conservatively don't fold the OR instruction. llvm-svn: 203156	2014-03-06 20:19:52 +00:00
Rafael Espindola	1194e69fe6	Fix the printing of n_type. Despite the name, n_type contains the type of the symbol, but also if it is extern or private extern. llvm-svn: 203154	2014-03-06 20:13:41 +00:00
Matt Arsenault	f9a995d68c	R600: Fix extloads from i8 / i16 to i64. This appears to only be working for global loads. Private and local break for other reasons. llvm-svn: 203135	2014-03-06 17:34:12 +00:00
Matt Arsenault	9fe669c522	R600/SI: Expand selects on vectors. llvm-svn: 203134	2014-03-06 17:34:03 +00:00
Matt Arsenault	a236ea551c	Teach lint about address spaces llvm-svn: 203132	2014-03-06 17:33:55 +00:00
Richard Osborne	47155af5eb	[XCore] Add support for the "m" inline asm constraint. Summary: This provides support for CP and DP relative global accesses in inline asm. Reviewers: robertlytton Reviewed By: robertlytton Differential Revision: http://llvm-reviews.chandlerc.com/D2943 llvm-svn: 203129	2014-03-06 16:37:48 +00:00
Chad Rosier	86a8f72041	[AArch64] This is a work in progress to provide a machine description for the Cortex-A53 subtarget in the AArch64 backend. This patch lays the ground work to annotate each AArch64 instruction (no NEON yet) with a list of SchedReadWrite types. The patch also provides the Cortex-A53 processor resources, maps those to the default SchedReadWrites, and provides basic latency. NEON support will be added in a subsequent patch with proper forwarding logic. Verification was done by setting the pre-RA scheduler to linearize to better gauge the effect of the MIScheduler. Even without modeling the forward logic, the results show a modest improvement for Cortex-A53. Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 203125	2014-03-06 16:04:00 +00:00
Elena Demikhovsky	f7c1b16591	AVX-512: Added rrk, rrkz, rmk, rmkz, rmbk, rmbkz versions of AVX512 FP packed instructions, added encoding tests for them. By Robert Khazanov. llvm-svn: 203098	2014-03-06 08:45:30 +00:00
Elena Demikhovsky	8fae565f08	AVX-512: fixed compressed displacement - by Robert Khazanov llvm-svn: 203096	2014-03-06 08:15:35 +00:00
David Blaikie	47c254beb7	DebugInfo: Tag units as having been indexed in GNU pubnames by using a DW_AT_GNU_pubnames of DW_FORM_flag(_present) rather than sec_offsets to the pubnames/types sections This is consistent with GDB ToT and reduces the number of relocations in (type and compile) units, substantially reducing relocations and debug size in fission + type units builds. llvm-svn: 203082	2014-03-06 05:47:39 +00:00
Karthik Bhat	daa8cd10d9	Allow constant folding of copysign llvm-svn: 203076	2014-03-06 05:32:52 +00:00
David Blaikie	c3d9e9e55f	DebugInfo: Shrink pubnames/pubtypes in the presence of type units by only emitting pub sections for compile units llvm-svn: 203057	2014-03-06 01:42:00 +00:00
Hal Finkel	7f908e8ef4	Fixup PPC Darwin i1 argument handling Like on other targets, we need to zero_extend/truncate i1 args before copying them to GPRs. llvm-svn: 203045	2014-03-06 00:45:19 +00:00
Hal Finkel	2a9d318e4a	When using CR bit registers on PPC32, handle the i1 vaarg case When copying an i1 value into a GPR for a vaarg call, we need to explicitly zero-extend the i1 value (otherwise an invalid CRBIT -> GPR copy will be generated). llvm-svn: 203041	2014-03-06 00:23:33 +00:00
Raul E. Silvera	b741b945c5	Change math intrinsic attributes from readonly to readnone. These are operations that do not access memory but may be sensitive to floating-point environment changes. LLVM does not attempt to model FP environment changes, so this was unnecessarily conservative and was getting in the way of some optimizations, in particular SLP vectorization. llvm-svn: 203037	2014-03-06 00:18:15 +00:00
Jack Carter	6b9cf961bd	[Mips] Testcase typo fix. No functionality change. llvm-svn: 203020	2014-03-05 22:54:56 +00:00
Hal Finkel	6a56b21729	With PPC CR bit registers, handle int_to_fp on older cores On cores without fpcvt support, we cannot promote int_to_fp i1 operations, because there is nothing to promote them to. The most straightforward implementation of this uses a select to choose between the two possible resulting floating-point values (and that's what is done here). llvm-svn: 203015	2014-03-05 22:14:00 +00:00
JF Bastien	d44807ca67	Fix datalayout test that I broke with my previous LinkModules warning improvement. llvm-svn: 203011	2014-03-05 21:37:08 +00:00
Arnold Schwaighofer	ab12363c02	LoopVectorizer: Preserve fast-math flags Fixes PR19045. llvm-svn: 203008	2014-03-05 21:10:47 +00:00
Rafael Espindola	8377085657	Always print the implicit .text at the start of an asm file. Before, llvm-mc would print it, but llc was assuming that it would produce another section changing directive before one was needed. That assumption is false with inline asm. Fixes PR19049. Another option would be to always create the section, but in the asm printer avoid printing section changes during initialization. That would work, but * We do use the fact that llvm-mc prints it in testing. The tests can be changed if needed. * A quick poll on IRC suggests that most developers prefer the implicit .text to be printed. llvm-svn: 203001	2014-03-05 20:09:15 +00:00
Benjamin Kramer	061d147f74	ConstantFolding: Also fold the vector overloads of our math intrinsics. llvm-svn: 202997	2014-03-05 19:41:48 +00:00
Cameron McInally	791ae9927c	Lower AVX v4i64->v4i32 truncate to one shuffle. llvm-svn: 202996	2014-03-05 19:41:16 +00:00
Oliver Stannard	d55e115b58	ARM: Correctly align arguments after a byval struct is passed on the stack llvm-svn: 202985	2014-03-05 15:25:27 +00:00
Vladimir Medic	27c398e38c	This patch implements .set dsp directive and sets appropriate feature bits. This directive is a counterpart of -mattr=dsp command line option with the exception that it does not influence elf header flags. The usage example is given in test file. llvm-svn: 202966	2014-03-05 11:05:09 +00:00
Andrew Trick	fbb278c541	Make stackmap machineinstrs clobber the scratch regs too. Patchpoints already did this. Doing it for stackmaps is a convenience for the runtime in the event that it needs a scratch register to patch or perform a runtime call thunk. Unlike patchpoints, we just assume the AnyRegCC calling convention. This is the only language and target independent calling convention specific to stackmaps so it makes sense. Although the calling convention is not currently used to select the scratch registers. llvm-svn: 202943	2014-03-05 07:08:16 +00:00
Hans Wennborg	acb842d523	Check for dynamic allocas and inline asm that clobbers sp before building selection dag (PR19012) In X86SelectionDagInfo::EmitTargetCodeForMemcpy we check with MachineFrameInfo to make sure that ESI isn't used as a base pointer register before we choose to emit rep movs (which clobbers esi). The problem is that MachineFrameInfo wouldn't know about dynamic allocas or inline asm that clobbers the stack pointer until SelectionDAGBuilder has encountered them. This patch fixes the problem by checking for such things when building the FunctionLoweringInfo. Differential Revision: http://llvm-reviews.chandlerc.com/D2954 llvm-svn: 202930	2014-03-05 02:43:26 +00:00
Raul E. Silvera	18ebc7cd0a	Trivial test commit. llvm-svn: 202924	2014-03-05 02:09:51 +00:00
Matt Arsenault	8377858c55	Allow constant folding of fma and fmuladd llvm-svn: 202914	2014-03-05 00:02:00 +00:00
Rui Ueyama	595932f1b0	llvm-objdump: Indent unwind info contents. Unwind info contents were indented at the same level as function table contents. That's a bit confusing because the unwind info is pointed to by the function table. In other places we usually increment indentation depth by one when dereferencing a pointer. This patch also removes extraneous newlines between function tables. llvm-svn: 202879	2014-03-04 19:23:56 +00:00
Rui Ueyama	5aa88fe1e7	llvm-objdump: Fix typo in output. llvm-svn: 202875	2014-03-04 19:03:42 +00:00
Richard Osborne	1b5fc39710	[XCore] Fix call of absolute address. Previously for: tail call void inttoptr (i64 65536 to void ()*)() nounwind We would emit: bl 65536 The immediate operand of the bl instruction is a relative offset so it is wrong to use the absolute address here. llvm-svn: 202860	2014-03-04 16:50:30 +00:00
NAKAMURA Takumi	afd8d16bce	[CMake] check-llvm: Include "bugpoint" in dependent list. llvm-svn: 202858	2014-03-04 16:13:30 +00:00
Daniel Sanders	d920770add	[mips][msa] Correct the behaviour of the COPY_FW pseudo on lanes 2 and 3. Summary: Previously, attempting to extract lanes 2 and 3 would actually extract lane 1. The MSA CodeGen tests only covered lanes 0 and 1. Differential Revision: http://llvm-reviews.chandlerc.com/D2935 llvm-svn: 202848	2014-03-04 13:54:30 +00:00
Vladimir Medic	615b26e1cd	This patch implements .set mips32r2 directive and sets appropriate feature bits. It also introduces helper functions that are used to set and clear feature bits as necessary. This directive is a counterpart of -mips32r2 command line options with the exception that it does not influence elf header flags. The usage example is given in test file. llvm-svn: 202807	2014-03-04 09:54:09 +00:00
Rui Ueyama	9c674e6851	llvm-objdump: Print x64 unwind info in executable. The original code does not work correctly on executable files because the code is written in such a way that only object files are assumed to be given to llvm-objdump. Contents of RuntimeFunction are different between executables and objects. In executables, fields in RuntimeFunction have actual addresses to unwind info structures. On the other hand, in object files, the fields have zero value, but instead there are relocations pointing to the fields, so that Linker will fill them at link-time. So, when we are reading an object file, we need to use relocation info to find the location of unwind info. When executable, we should just look at the values in RuntimeFunction. llvm-svn: 202785	2014-03-04 04:00:55 +00:00
Rui Ueyama	432bc1048f	Make a test for llvm-objdump a little bit more readable. llvm-svn: 202783	2014-03-04 03:23:19 +00:00
Kevin Qin	b08c6746c4	[AArch64]Fix improper diagnostics about offset range of load/store instructions. llvm-svn: 202775	2014-03-04 02:05:13 +00:00
Reid Kleckner	d84e70ea1b	MC: Fix Intel assembly parser for [global + offset] We were dropping the displacement on the floor if we also had some immediate offset. Should fix PR19033. llvm-svn: 202774	2014-03-04 00:33:17 +00:00
Chad Rosier	70cb2311ab	Revert "[AArch64] This is a work in progress to provide a machine description" This reverts commit ff717c8fc786a0cfa1602982b91895fa09e514fc. llvm-svn: 202773	2014-03-04 00:32:07 +00:00
Chad Rosier	fe45290566	[AArch64] This is a work in progress to provide a machine description for the Cortex-A53 subtarget in the AArch64 backend. This patch lays the ground work to annotate each AArch64 instruction (no NEON yet) with a list of SchedReadWrite types. The patch also provides the Cortex-A53 processor resources, maps those to the default SchedReadWrites, and provides basic latency. NEON support will be added in a subsequent patch with proper forwarding logic. Verification was done by setting the pre-RA scheduler to linearize to better gauge the effect of the MIScheduler. Even without modeling the forward logic, the results show a modest improvement for Cortex-A53. Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 202767	2014-03-03 23:32:47 +00:00
Diego Novillo	f5041ce558	Pass to emit DWARF path discriminators. DWARF discriminators are used to distinguish multiple control flow paths on the same source location. When this happens, instructions across basic block boundaries will share the same debug location. This pass detects this situation and creates a new lexical scope to one of the two instructions. This lexical scope is a child scope of the original and contains a new discriminator value. This discriminator is then picked up from MCObjectStreamer::EmitDwarfLocDirective to be written on the object file. This fixes http://llvm.org/bugs/show_bug.cgi?id=18270. llvm-svn: 202752	2014-03-03 20:06:11 +00:00
Diego Novillo	282450d94c	Add DWARF discriminator support to DILexicalBlocks. This adds support for emitting discriminators from DILexicalBlocks. llvm-svn: 202736	2014-03-03 18:53:17 +00:00
Daniel Sanders	fa961d76f0	[mips] Prevent %lo relocation being used on MSA loads and stores. Summary: Parts of the compiler still believed MSA load/stores have a 16-bit offset when it is actually 10-bit. Corrected this, and fixed a closely related issue this uncovered where load/stores with 10-bit and 12-bit offsets (MSA and microMIPS respectively) could not load/store using offsets from the stack/frame pointer. They accepted frameindex+offset, but not frameindex by itself. Reviewers: jacksprat, matheusalmeida Reviewed By: jacksprat Differential Revision: http://llvm-reviews.chandlerc.com/D2888 llvm-svn: 202717	2014-03-03 14:31:21 +00:00
Ed Maste	2a710d0a5b	[mips] support FK_Data_2 and FK_Data_8 to fix big-endian debug data This fixes invalid lengths in .debug_aranges on big-endian mips64 (lengths appear to be left-shifted by 32 bits) and in .debug_loc. Differential Revision: http://llvm-reviews.chandlerc.com/D2517 llvm-svn: 202716	2014-03-03 14:27:49 +00:00
Evgeniy Stepanov	77be532f71	[msan] Handle X86 SIMD bitshift intrinsics. llvm-svn: 202712	2014-03-03 13:47:42 +00:00
Vladimir Medic	43e978234a	This patch implements jalx instruction for Mips architecture. This instruction executes a procedure call within the current 256 MB-aligned region and changes the ISA Mode from MIPS32 to microMIPS32 or MIPS16e. Usage samples for assembler and disassembler are provided as well. llvm-svn: 202706	2014-03-03 13:12:59 +00:00
Saleem Abdulrasool	19dcc312ee	AsmParser: add missed tests The diagnostics tests were missing from the previous introduction of ifeqs. llvm-svn: 202674	2014-03-03 06:35:00 +00:00
Venkatraman Govindaraju	925ec9b11e	[Sparc] Add trap on integer condition codes (Ticc) instructions to Sparc backend. llvm-svn: 202670	2014-03-02 23:39:07 +00:00
Venkatraman Govindaraju	07d3af2821	[Sparc] Add return/rett instruction to Sparc backend. llvm-svn: 202666	2014-03-02 22:55:53 +00:00
Venkatraman Govindaraju	4fa2ab26f5	[Sparc] Add support for decoding jmpl/retl/ret instruction. llvm-svn: 202663	2014-03-02 21:17:44 +00:00
Venkatraman Govindaraju	c3084ad294	[Sparc] Add fcmpe* instructions to Sparc backend. llvm-svn: 202661	2014-03-02 19:56:19 +00:00
Venkatraman Govindaraju	f9a202a9ac	[Sparc] Add VIS instructions to sparc backend. llvm-svn: 202660	2014-03-02 19:31:21 +00:00
Hal Finkel	6aca2373f2	Add a PPC inline asm constraint type for single CR bits Now that the PowerPC backend can track individual CR bits as first-class registers, we should also have a way of allocating them for inline asm statements. Because these registers are only one bit, if an output variable is implicitly cast to a larger integer size, we'll get an any_extend to that larger type (this is part of the existing target-independent logic). As a result, regardless of the size of the output type, only the first bit is meaningful. The constraint identifier "wc" has been chosen for this purpose. Although gcc does not currently support allocating individual CR bits, this identifier choice has been coordinated with the gcc PowerPC team, and will be marked as reserved for this purpose in the gcc constraints.md file. llvm-svn: 202657	2014-03-02 18:23:39 +00:00
Michael Kuperstein	661e288a70	Ensure bitcode encoding of instructions and their operands stays stable. This includes instructions that relate to memory access (load/store/GEP), comparison instructions and calls. Work was done by lama.saba@intel.com. llvm-svn: 202647	2014-03-02 15:26:36 +00:00
Venkatraman Govindaraju	b745e67a64	[SparcV9] Adds support for branch on integer register instructions (BPr) and conditional moves on integer register (MOVr/FMOVr). llvm-svn: 202628	2014-03-02 09:46:56 +00:00
Elena Demikhovsky	9737e3886b	AVX-512: Fixed extract_vector_elt for v8i1 vector llvm-svn: 202624	2014-03-02 09:19:44 +00:00
Venkatraman Govindaraju	600f390bb9	[Sparc] Add support for parsing branches and conditional move instructions with %fcc1-%fcc3 conditional registers. llvm-svn: 202616	2014-03-02 06:28:15 +00:00
Venkatraman Govindaraju	81aae57282	[Sparc] Add support for parsing fcmp with %fcc registers. llvm-svn: 202610	2014-03-02 03:39:39 +00:00
Venkatraman Govindaraju	c86e0f3873	[SparcV9] Add support for parsing branch instructions with prediction. llvm-svn: 202602	2014-03-01 22:03:07 +00:00
Matt Arsenault	2430958182	R600: Add failing control flow tests. Simple cases hit a variety of problems at -O0. llvm-svn: 202601	2014-03-01 21:45:41 +00:00
Hal Finkel	46043edc56	Remove extra truncs/exts around i32 bit operations on PPC64 This generalizes the code to eliminate extra truncs/exts around i1 bit operations to also do the same on PPC64 for i32 bit operations. This eliminates a fairly prevalent code wart: int foo(int a) { return a == 5 ? 7 : 8; } On PPC64, because of the extension implied by the ABI, this would generate: cmplwi 0, 3, 5 li 12, 8 li 4, 7 isel 3, 4, 12, 2 rldicl 3, 3, 0, 32 blr where the 'rldicl 3, 3, 0, 32', the extension, is completely unnecessary. At least for the single-BB case (which is all that the DAG combine mechanism can handle), this unnecessary extension is no longer generated. llvm-svn: 202600	2014-03-01 21:36:57 +00:00
Venkatraman Govindaraju	2286874119	[Sparc] Add support for parsing annulled branch instructions. llvm-svn: 202599	2014-03-01 20:08:48 +00:00
Venkatraman Govindaraju	e0c5bff720	[Sparc] Add support for parsing sparcv9 instructions addc/subc/addccc/subccc. llvm-svn: 202598	2014-03-01 18:54:52 +00:00
Venkatraman Govindaraju	2a9c430677	[Sparc] Add missing ALU instruction patterns. llvm-svn: 202597	2014-03-01 17:51:00 +00:00
Sasa Stankovic	075e339373	Add missing FileCheck in test command line. llvm-svn: 202594	2014-03-01 16:14:29 +00:00
Venkatraman Govindaraju	256735d485	[Sparc] Add support to decode unimp instruction. llvm-svn: 202581	2014-03-01 09:28:18 +00:00
Venkatraman Govindaraju	484ca1a030	[Sparc] Add support to decode negative simm13 operands in the sparc disassembler. llvm-svn: 202578	2014-03-01 09:11:57 +00:00
Venkatraman Govindaraju	78df2dec0c	[Sparc] Add support for decoding call instructions in the sparc disassembler. llvm-svn: 202577	2014-03-01 08:30:58 +00:00
Venkatraman Govindaraju	fb54821398	[Sparc] Add support to disassemble sparc memory instructions. llvm-svn: 202575	2014-03-01 07:46:33 +00:00
Venkatraman Govindaraju	bf70566a45	Add support for parsing sun-style section flags in ELFAsmParser. llvm-svn: 202573	2014-03-01 06:21:00 +00:00
Venkatraman Govindaraju	2b1682bcd4	[Sparc] Implement writeNopData. Emit actual NOP instruction instead of just filling with zeroes. llvm-svn: 202572	2014-03-01 05:45:09 +00:00
Venkatraman Govindaraju	9fc29098df	[Sparc] Teach SparcAsmParser to emit correct relocations for PIC code. llvm-svn: 202571	2014-03-01 05:07:21 +00:00
Venkatraman Govindaraju	6f2e08c8e1	[Sparc] Add support for parsing directives in SparcAsmParser. llvm-svn: 202564	2014-03-01 02:18:04 +00:00
Venkatraman Govindaraju	f7eecf80c4	[Sparc] Emit 'restore' instead of 'restore %g0, %g0, %g0'. This improves the readability of the generated code. llvm-svn: 202563	2014-03-01 01:04:26 +00:00
Manman Ren	709c951b42	SpillPlacement: fix a bug in iterate. Inside iterate, we scan backwards then scan forwards in a loop. When iteration is not zero, the last node was just updated so we can skip it. But when iteration is zero, we can't skip the last node. For the testing case, fixing this will save a spill and move register copies from hot path to cold path. llvm-svn: 202557	2014-02-28 23:05:31 +00:00
Tom Stellard	d61a1c3360	R600/SI: Expand all v16[if]32 operations llvm-svn: 202543	2014-02-28 21:36:37 +00:00
Justin Bogner	02b958422c	CommandLine: Exit successfully for -version and -help Tools that use the CommandLine library currently exit with an error when invoked with -version or -help. This is unusual and non-standard, so we'll fix them to exit successfully instead. I don't expect that anyone relies on the current behaviour, so this should be a fairly safe change. llvm-svn: 202530	2014-02-28 19:08:01 +00:00
Adam Nemet	6586e5d6ac	Test commit llvm-svn: 202528	2014-02-28 18:44:39 +00:00
Zoran Jovanovic	285cc289e8	Fixed operand of SC microMIPS instruction. llvm-svn: 202526	2014-02-28 18:22:56 +00:00
Zoran Jovanovic	7c6c36d92d	Fixed encoding of SYSCALL microMIPS instruction. llvm-svn: 202523	2014-02-28 18:17:08 +00:00
Zoran Jovanovic	d0a289003d	Revert revision 202518 because of wrong commit message. llvm-svn: 202521	2014-02-28 18:14:16 +00:00
Zoran Jovanovic	9874a2b1ef	Fix operand of SC instruction. llvm-svn: 202518	2014-02-28 18:02:17 +00:00
Rafael Espindola	11ac853774	With rpaths being set correctly, SHLIBPATH_VAR is not needed anymore. llvm-svn: 202510	2014-02-28 16:16:51 +00:00
Sasa Stankovic	8c5736b921	[mips] Implement NaCl sandboxing of indirect jumps: * Align targets of indirect jumps to instruction bundle boundaries (in MI layer). * Add masking instructions before indirect jumps (in MC layer). Differential Revision: http://llvm-reviews.chandlerc.com/D2847 llvm-svn: 202479	2014-02-28 10:00:38 +00:00
Hal Finkel	b998915ee1	Swap PPC isel operands to allow for 0-folding The PPC isel instruction can fold 0 into the first operand (thus eliminating the need to materialize a zero-containing register when the 'true' result of the isel is 0). When the isel is fed by a bit register operation that we can invert, do so as part of the bit-register-operation peephole routine. llvm-svn: 202469	2014-02-28 06:11:16 +00:00
Rafael Espindola	a51f0f8367	Now that it is possible, use the mangler in IRObjectFile. A really simple patch marks the end of a lot of yak shaving :-) llvm-svn: 202463	2014-02-28 02:17:23 +00:00
Hal Finkel	940ab934d4	Add CR-bit tracking to the PowerPC backend for i1 values This change enables tracking i1 values in the PowerPC backend using the condition register bits. These bits can be treated on PowerPC as separate registers; individual bit operations (and, or, xor, etc.) are supported. Tracking booleans in CR bits has several advantages: - Reduction in register pressure (because we no longer need GPRs to store boolean values). - Logical operations on booleans can be handled more efficiently; we used to have to move all results from comparisons into GPRs, perform promoted logical operations in GPRs, and then move the result back into condition register bits to be used by conditional branches. This can be very inefficient, because these CR <-> GPR moves have high latency and low throughput (especially when other associated instructions are accounted for). - On the POWER7 and similar cores, we can increase total throughput by using the CR bits. CR bit operations have a dedicated functional unit. Most of this is more-or-less mechanical: Adjustments were needed in the calling-convention code, support was added for spilling/restoring individual condition-register bits, and conditional branch instruction definitions taking specific CR bits were added (plus patterns and code for generating bit-level operations). This is enabled by default when running at -O2 and higher. For -O0 and -O1, where the ability to debug is more important, this feature is disabled by default. Individual CR bits do not have assigned DWARF register numbers, and storing values in CR bits makes them invisible to the debugger. It is critical, however, that we don't move i1 values that have been promoted to larger values (such as those passed as function arguments) into bit registers only to quickly turn around and move the values back into GPRs (such as happens when values are returned by functions).
A pair of target-specific DAG combines are added to remove the trunc/extends in: trunc(binary-ops(binary-ops(zext(x), zext(y)), ...) and: zext(binary-ops(binary-ops(trunc(x), trunc(y)), ...) In short, we only want to use CR bits where some of the i1 values come from comparisons or are used by conditional branches or selects. To put it another way, if we can do the entire i1 computation in GPRs, then we probably should (on the POWER7, the GPR-operation throughput is higher, and for all cores, the CR <-> GPR moves are expensive). POWER7 test-suite performance results (from 10 runs in each configuration): SingleSource/Benchmarks/Misc/mandel-2: 35% speedup MultiSource/Benchmarks/Prolangs-C++/city/city: 21% speedup MultiSource/Benchmarks/MiBench/automotive-susan: 23% speedup SingleSource/Benchmarks/CoyoteBench/huffbench: 13% speedup SingleSource/Benchmarks/Misc-C++/Large/sphereflake: 13% speedup SingleSource/Benchmarks/Misc-C++/mandel-text: 10% speedup SingleSource/Benchmarks/Misc-C++-EH/spirit: 10% slowdown MultiSource/Applications/lemon/lemon: 8% slowdown llvm-svn: 202451	2014-02-28 00:27:01 +00:00
Roman Divacky	7a9c6549ba	Lower FNEG just like FABS to fneg[ds] and fmov[ds], thus avoiding expensive libcall. Also, Qp_neg is not implemented on at least FreeBSD. This is also what gcc is doing. llvm-svn: 202422	2014-02-27 19:26:29 +00:00
Adrian Prantl	7072073cc9	Debug info: Remove ARMAsmPrinter::EmitDwarfRegOp(). AsmPrinter can now scan the register file for sub- and super-registers. No functionality change intended. (Tests are updated because the comments in the assembler output are different.) llvm-svn: 202416	2014-02-27 17:56:08 +00:00
Richard Osborne	521bdf211d	[XCore] Support functions returning more than 4 words. If a function returns a large struct by value return the first 4 words in registers and the rest on the stack in a location reserved by the caller. This is needed to support the xC language which supports functions returning an arbitrary number of return values. This is r202397 reapplied with a fix to avoid an uninitialized read of a member. llvm-svn: 202414	2014-02-27 17:47:54 +00:00
Richard Osborne	527aa5052d	Revert r202396, r202397. These are causing test failures, revert for now. llvm-svn: 202398	2014-02-27 14:24:13 +00:00
Richard Osborne	e82bf0988e	[XCore] Support functions returning more than 4 words. Summary: If a function returns a large struct by value return the first 4 words in registers and the rest on the stack in a location reserved by the caller. This is needed to support the xC language which supports functions returning an arbitrary number of return values. Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2889 llvm-svn: 202397	2014-02-27 14:00:40 +00:00
Richard Osborne	a283d24ad9	[XCore] Target optimized library function __memcpy_4() Summary: If the src, dst and size of a memcpy are known to be 4 byte aligned we can call __memcpy_4() instead of memcpy(). Reviewers: robertlytton Reviewed By: robertlytton CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2871 llvm-svn: 202395	2014-02-27 13:39:07 +00:00
Richard Osborne	d6e85018c5	[XCore] Add dag combines for instructions that ignore some input bits. These instructions ignore the high bits of one of their input operands - try and use this to simplify the code. llvm-svn: 202394	2014-02-27 13:20:11 +00:00
Richard Osborne	2d3a2bee41	[XCore] Provide information about known zero bits of resource instructions. llvm-svn: 202393	2014-02-27 13:20:06 +00:00
Daniel Sanders	9f088ba322	Stop test/CodeGen/X86/v4i32load-crash.ll targeting non-X86-64 targets. Summary: Fixes an issue where a test attempts to use -mcpu=x86-64 on non-X86-64 targets. This triggers an assertion in the MIPS backend since it doesn't know what ABI to use by default for unrecognized processors. CC: llvm-commits, rafael Differential Revision: http://llvm-reviews.chandlerc.com/D2877 llvm-svn: 202369	2014-02-27 09:24:31 +00:00
Eric Christopher	a9a1d27677	Don't emit anything into the debug_ranges section if we aren't emitting any ranges - this includes CU ranges where we were previously emitting an end list marker even if we didn't have a list. Testcase includes a test for line table only code emission as the problem was noticed while writing this test. llvm-svn: 202357	2014-02-27 07:44:45 +00:00
Juergen Ributzka	95d11dee8b	Revert "Use count 0." This reverts commit r202283, because when we use GuardMalloc the test will fail due to additional output to std err. llvm-svn: 202341	2014-02-27 03:10:10 +00:00
Michel Danzer	9e61c4b6cd	R600/SI: Optimize SI_KILL for constant operands If the SI_KILL operand is constant, we can either clear the exec mask if the operand is negative, or do nothing otherwise. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202337	2014-02-27 01:47:09 +00:00
Michel Danzer	6f273c57db	R600/SI: Allow SI_KILL for geometry shaders Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202336	2014-02-27 01:47:02 +00:00
Eric Christopher	740a833a3b	If we're only emitting line tables for a particular CU then don't add any ranges to the list of ranges for the CU as we don't want to emit them anyway. This ensures that we will still emit ranges if we have a compile unit compiled with only line tables and one compiled with full debug info requested (we'll emit for the one with full debug info). Update testcase metadata accordingly to continue emitting ranges. llvm-svn: 202333	2014-02-27 01:25:00 +00:00
Eric Christopher	75d49db19b	Add a debug info code generation level to the compile unit metadata and update everything accordingly. This can be used to conditionalize the amount of output in the backend based on the amount of debug requested/metadata emission scheme by a front end (e.g. clang). Paired with a commit to clang. llvm-svn: 202332	2014-02-27 01:24:56 +00:00
Andrew Trick	9f240f742b	Use regnum regex in an XCore test case. llvm-svn: 202315	2014-02-26 23:22:49 +00:00
Andrew Trick	2560d11e72	Very temporarily XFAILing a test. Will be fixed shortly. llvm-svn: 202310	2014-02-26 22:39:59 +00:00
Nico Rieck	0a0c674b7a	Fix broken FileCheck prefixes llvm-svn: 202308	2014-02-26 22:29:11 +00:00
Andrew Trick	52a00936b4	Add a limit to the heuristic that register allocates instructions in local order. This handles pathological cases in which we see 2x increase in spill code for large blocks (~50k instructions). I don't have a unit test for this behavior. Fixes rdar://16072279. llvm-svn: 202304	2014-02-26 22:07:26 +00:00
Quentin Colombet	85c9e16291	Lower unsigned vsetcc to psubus in certain cases The current approach to lower a vsetult is to flip the sign bit of the operands, swap the operands and then use a (signed) pcmpgt. psubus (unsigned saturating subtract) can be used to emulate a vsetult more efficiently: + case ISD::SETULT: { + // If the comparison is against a constant we can turn this into a + // setule. With psubus, setule does not require a swap. This is + // beneficial because the constant in the register is no longer + // destructed as the destination so it can be hoisted out of a loop. I also enable lowering via psubus in a few other cases where it's clearly beneficial: setule and setuge if minu/maxu cannot be used. rdar://problem/14338765 Patch by Adam Nemet <anemet@apple.com>. llvm-svn: 202301	2014-02-26 21:39:12 +00:00
Reid Kleckner	22869378d9	GlobalOpt: Apply fastcc to internal x86_thiscallcc functions We should apply fastcc whenever profitable. We can expand this list, but there are lots of conventions with performance implications that we don't want to change. Differential Revision: http://llvm-reviews.chandlerc.com/D2705 llvm-svn: 202293	2014-02-26 19:57:30 +00:00
Nico Rieck	773a57958c	Relax COFF string table check COFF object files with 0 as string table size are currently rejected. This prevents us from reading object files written by tools like cvtres that violate the PECOFF spec and write 0 instead of 4 for the size of an empty string table. llvm-svn: 202292	2014-02-26 19:51:44 +00:00
Nico Rieck	5645b36306	Fix broken FileCheck prefix llvm-svn: 202291	2014-02-26 19:51:08 +00:00
Rafael Espindola	b556fcbdb5	Use count 0. Thanks to Roman Divacky for the suggestion. llvm-svn: 202283	2014-02-26 17:57:35 +00:00
Rafael Espindola	ae593f1563	Compare DataLayout by Value, not by pointer. This fixes spurious warnings in llvm-link about the datalayout not matching. Thanks to Zalman Stern for reporting the bug! llvm-svn: 202276	2014-02-26 17:02:08 +00:00
Andrew Trick	429e9edd08	Fix PR18165: LSR must avoid scaling factors that exceed the limit on truncated use. Patch by Michael Zolotukhin! llvm-svn: 202273	2014-02-26 16:31:56 +00:00
Alexey Samsonov	a5f0768f5e	llvm-symbolizer: use dynamic symbol table if the regular one is stripped. llvm-svn: 202265	2014-02-26 13:10:01 +00:00
Michael Kuperstein	9201fb9ce7	Ensure bitcode encoding of instructions and their operands stays stable. This includes instructions with aggregate operands (insert/extract), instructions with vector operands (insert/extract/shuffle), binary arithmetic and bitwise instructions, conversion instructions and terminators. Work was done by lama.saba@intel.com. llvm-svn: 202262	2014-02-26 12:06:36 +00:00
Tim Northover	ed9c20681d	AArch64: simplify tbl/tbx polymorphism The table argument is always 128-bit (and interpreted as <16 x i8>) so the extra specifier for it is just clutter. No user-visible behaviour change, so no tests. llvm-svn: 202258	2014-02-26 11:55:09 +00:00
Artyom Skrobov	1a6cd1d912	ARMv8 IfConversion must skip narrow instructions that a) define CPSR and b) wouldn't affect CPSR in an IT block llvm-svn: 202257	2014-02-26 11:27:28 +00:00
Daniel Sanders	91efd1c464	Stop test/CodeGen/ARM/a15.ll targeting non-ARM targets. Summary: Fixes an issue where a test attempts to use -mcpu=cortex-a15 on non-ARM targets. This triggers an assertion on MIPS since it doesn't know what ABI to use by default for unrecognized processors. Reviewers: rengolin Reviewed By: rengolin CC: llvm-commits, aemerson, rengolin Differential Revision: http://llvm-reviews.chandlerc.com/D2876 llvm-svn: 202256	2014-02-26 11:26:18 +00:00
Chandler Carruth	dfb2efd0da	[SROA] Use the correct index integer size in GEPs through non-default address spaces. This isn't really a correctness issue (the values are truncated) but it's much cleaner. Patch by Matt Arsenault! llvm-svn: 202252	2014-02-26 10:08:16 +00:00
Chandler Carruth	286d87ed38	[SROA] Teach SROA how to handle pointers from address spaces other than the default. Based on the patch by Matt Arsenault, D1764! I switched one place to use the more direct pointer type to compute the desired address space, and I reworked the memcpy rewriting section to reflect significant refactorings that this patch helped inspire. Thanks to several of the folks who helped review and improve the patch as well. llvm-svn: 202247	2014-02-26 08:25:02 +00:00
Chandler Carruth	aa72b93ae7	[SROA] Split the alignment computation complete for the memcpy rewriting to work independently for the slice side and the other side. This allows us to only compute the minimum of the two when we actually rewrite to a memcpy that needs to take the minimum, and preserve higher alignment for one side or the other when rewriting to loads and stores. This fix was inspired by seeing the result of some refactoring that makes addrspace handling better. llvm-svn: 202242	2014-02-26 07:29:54 +00:00
Chandler Carruth	6aedc106ba	[SROA] Fix PR18615 with some long overdue simplifications to the bounds checking in SROA. The primary change is to just rely on uge for checking that the offset is within the allocation size. This removes the explicit checks against isNegative which were terribly error prone (including the reversed logic that led to PR18615) and prevented us from supporting stack allocations larger than half the address space.... Ok, so maybe the latter isn't common but it's a silly restriction to have. Also, we used to try to support a PHI node which loaded from before the start of the allocation if any of the loaded bytes were within the allocation. This doesn't make any sense, we have never really supported loading or storing before the allocation starts. The simplified logic just doesn't care. We continue to allow loading past the end of the allocation in part to support cases where there is a PHI and some loads are larger than others and the larger ones reach past the end of the allocation. We could solve this a different and more conservative way, but I'm still somewhat paranoid about this. llvm-svn: 202224	2014-02-26 03:14:14 +00:00
Adrian Prantl	6aa7616355	Attempt to unbreak an MSVC buildbot by switching to %llc_dwarf. llvm-svn: 202202	2014-02-25 23:03:00 +00:00
David Blaikie	20474106a1	DwarfDebug: Avoid emitting an empty debug_aranges section when aranges are disabled llvm-svn: 202201	2014-02-25 22:46:44 +00:00
Adrian Prantl	69140d2c0f	Address review comments for r202188. This is refactoring / simplifying code, updating comments and enabling the testcase on non-x86 platforms. No functionality change. llvm-svn: 202199	2014-02-25 22:27:14 +00:00
Tom Stellard	1f15bff0df	R600/SI: Custom select 64-bit ADD llvm-svn: 202194	2014-02-25 21:36:18 +00:00
Hal Finkel	22304046c7	Account for 128-bit integer operations in PPCCTRLoops We need to abort the formation of counter-register-based loops where there are 128-bit integer operations that might become function calls. llvm-svn: 202192	2014-02-25 20:51:50 +00:00
Rafael Espindola	449d3b7493	Don't try to set a dummy DataLayout. It is parsed now. llvm-svn: 202191	2014-02-25 20:41:28 +00:00
Rafael Espindola	f863ee2949	Store a DataLayout in Module. Now that DataLayout is not a pass, store one in Module. Since the C API expects to be able to get a char* to the datalayout description, we have to keep a std::string somewhere. This patch keeps it in Module and also uses it to represent modules without a DataLayout. Once DataLayout is mandatory, we should probably move the string to DataLayout itself since it won't be necessary anymore to represent the special case of a module without a DataLayout. llvm-svn: 202190	2014-02-25 20:01:08 +00:00
Adrian Prantl	3f49c890bf	Debug info: Support variadic functions. Variadic functions have an unspecified parameter tag after the last argument. In IR this is represented as an unspecified parameter in the subroutine type. Paired commit with CFE r202185. rdar://problem/13690847 This re-applies r202184 + a bugfix in DwarfDebug's argument handling. llvm-svn: 202188	2014-02-25 19:57:42 +00:00
Adrian Prantl	fd1f82a711	Revert "Debug info: Support variadic functions." This reverts commit r202184 because of buildbot breakage. llvm-svn: 202187	2014-02-25 19:48:36 +00:00
Adrian Prantl	70ff4f7003	Debug info: Support variadic functions. Variadic functions have an unspecified parameter tag after the last argument. In IR this is represented as an unspecified parameter in the subroutine type. Paired commit with CFE. rdar://problem/13690847 llvm-svn: 202184	2014-02-25 19:38:07 +00:00
Richard Osborne	50e3b7f759	[XCore] Add intrinsic for CLRPT (clear port time) instruction. llvm-svn: 202172	2014-02-25 17:31:15 +00:00
Richard Osborne	92fdd3491a	[XCore] Add intrinsic for EDU (event disable unconditional) instruction. llvm-svn: 202171	2014-02-25 17:31:06 +00:00
Logan Chien	18583d71e8	Keep the link register for uwtable. A function with the uwtable attribute might be visited by the stack unwinder, thus the link register should be considered as clobbered after the execution of the branch and link instruction (i.e. the definition of the machine instruction can't be ignored) even when the callee functions are marked with noreturn. llvm-svn: 202165	2014-02-25 16:57:28 +00:00
Richard Osborne	8b7466e886	[XCore] Prefer to word align functions. The behaviour of the XCore's instruction buffer means that the performance of the same code sequence can differ depending on whether it starts at a 4 byte aligned address or not. Since we don't model the instruction buffer in the backend we have no way of knowing for sure if it is beneficial to word align a specific function. However, in the absence of precise modelling, it is better on balance to word align functions because: * It makes a fetch-nop while executing the prologue slightly less likely. * If we don't word align functions then a small perturbation in one function can have a dramatic knock on effect. If the size of the function changes it might change the alignment and therefore the performance of all the functions that happen to follow it in the binary. This butterfly effect makes it harder to reason about and measure the performance of code. llvm-svn: 202163	2014-02-25 16:37:15 +00:00
Renato Golin	69736692d8	Ignore old JIT tests in AArch64 - CMake style llvm-svn: 202126	2014-02-25 09:31:00 +00:00
Alp Toker	70b36995e4	Fix typos llvm-svn: 202107	2014-02-25 04:21:15 +00:00
Chandler Carruth	3bf18ed5e3	[SROA] Fix another instability in SROA with respect to the slice ordering. The fundamental problem that we're hitting here is that the use-def chain ordering is itself not a stable thing to be relying on in the rewriting for SROA. Further, we use a non-stable sort over the slices to arrange them based on the section of the alloca they're operating on. With a debugging STL implementation (or different implementations in stage2 and stage3) this can cause stage2 != stage3. The specific aspect of this problem fixed in this commit deals with the rewriting and load-speculation around PHIs and Selects. This, like many other aspects of the use-rewriting in SROA, is really part of the "strong SSA-formation" that is done by SROA where it works very hard to canonicalize loads and stores in just the right way to satisfy the needs of mem2reg[1]. When we have a select (or a PHI) with 2 uses of the same alloca, we test that loads downstream of the select are speculatable around it twice. If only one of the operands to the select needs to be rewritten, then if we get lucky we rewrite that one first and the select is immediately speculatable. This can cause the order of operand visitation, and thus the order of slices to be rewritten, to change an alloca from promotable to non-promotable and vice versa. The fix is to defer all of the speculation until after the rewrite phase is done. Once we've rewritten everything, we can accurately test for whether speculation will work (once, instead of twice!) and the order ceases to matter. This also happens to simplify the other subtlety of speculation -- we need to not speculate anything unless the result of speculating will make the alloca fully promotable by mem2reg. I had a previous attempt at simplifying this, but it was still pretty horrible. There is actually already a really nice test case for this in basictest.ll, but on multiple STL implementations and inputs, we just got "lucky".
Fortunately, the test case is very small and we can essentially build it in exactly the opposite way to get reasonable coverage in both directions even from normal STL implementations. llvm-svn: 202092	2014-02-25 00:07:09 +00:00
David Blaikie	1d4736e0b1	llvm-dwarfdump: Support for debug_line.dwo section for file names for type units under fission. llvm-svn: 202091	2014-02-24 23:58:54 +00:00
Simon Atanasyan	2b614e1163	llvm-objdump: Do not attempt to disassemble symbols outside of section boundaries. It is possible to create an ELF executable where a symbol from, say, the .text section 'points' to an address outside the section boundaries. It does not make sense to disassemble something outside the section. Without this fix llvm-objdump prints a finite or infinite (depending on the executable file architecture) number of 'invalid instruction encoding' warnings. llvm-svn: 202083	2014-02-24 22:12:11 +00:00
Matt Arsenault	41e2f2bacd	R600/SI - Add new CI arithmetic instructions. Does not yet include larger part required to match v_mad_i64_i32 / v_mad_u64_u32. llvm-svn: 202077	2014-02-24 21:01:28 +00:00
Arnold Schwaighofer	9611d23d63	SLPVectorizer: Try vectorizing 'splat' stores Vectorize sequential stores of a broadcasted value. 5% on eon. radar://16124699 llvm-svn: 202067	2014-02-24 19:52:29 +00:00
Reed Kotler	59ebf32d16	For lcov tests, don't XFAIL mips little endian (mipsel-... and mips64el-...) targets, just big endian (mips-... and mips64-...). llvm-svn: 202049	2014-02-24 16:33:56 +00:00
Alexey Samsonov	7860107c7d	[CMake] Remove dependency on non-existing profile_rt-shared. Patch by Brad King. llvm-svn: 202041	2014-02-24 15:07:06 +00:00
Kostya Serebryany	f72bdb47bc	[asan] remove test that should have been removed in r202033 llvm-svn: 202034	2014-02-24 13:44:24 +00:00
Saleem Abdulrasool	7ecc549724	Asm Parser: support .error directive The .error directive is similar to .err in that it will halt assembly if it is evaluated for assembly. However, it permits a user supplied message to be rendered. llvm-svn: 201999	2014-02-23 23:02:23 +00:00
Saleem Abdulrasool	00f53c103c	AsmParser: support .ifeqs directive The .ifeqs directive assembles the following code if the quoted string parameters are equal. The strings must be quoted using double quotes. llvm-svn: 201998	2014-02-23 23:02:18 +00:00
Benjamin Kramer	facca1f049	SPARC: Implement TRAP lowering. Matches what GCC emits. llvm-svn: 201994	2014-02-23 21:43:52 +00:00
Saleem Abdulrasool	fd6ed1ea6b	ARM IAS: support .align without parameters .align is handled specially on certain targets. .align without any parameters on ARM indicates a default alignment (4). Handle the special case in the target parser, but fall back to the generic parser for the normal version. llvm-svn: 201988	2014-02-23 17:45:32 +00:00
Saleem Abdulrasool	5852d6bc57	MCAsmParser: support .ifne The .ifne directive assembles the following section of code if the argument expression is non-zero. Effectively, it is equivalent to if. llvm-svn: 201986	2014-02-23 15:53:41 +00:00
Saleem Abdulrasool	5db529852e	MCAsmParser: handle space properly for .ifc/.ifnc If the strings are not quoted, the first string stops at the first comma, and the second string stops at the end of the line. Strings which contain whitespace should be quoted. Unquoted space is to be discarded. llvm-svn: 201985	2014-02-23 15:53:36 +00:00
Saleem Abdulrasool	b2ae2c0fd5	MCAsmParser: add support for .err directive The .err directive produces an error whenever it is assembled. This can be useful for preventing assembly when an unexpected condition occurs. llvm-svn: 201984	2014-02-23 15:53:30 +00:00
Elena Demikhovsky	3ebfe11532	AVX-512: Fixed encoding of VPTESTMQ llvm-svn: 201980	2014-02-23 14:28:35 +00:00
Saleem Abdulrasool	3897651250	ARM IAS: support .short and .hword This adds support for the .short and its alias .hword for adding literal values into the object file. This is similar to the .word directive, however, rather than inserting a value of 4 bytes, adds a 2-byte value. llvm-svn: 201968	2014-02-23 06:22:09 +00:00
Benjamin Kramer	d20d1adfb8	Make test more resilient against scheduling decisions. Should bring the atom buildbots back to life. llvm-svn: 201951	2014-02-22 20:14:02 +00:00
Nico Rieck	9d2c15eff7	MC: Support COFF string tables larger than 10MB Offsets past the range of single-slash encoding are encoded as base64, padded to 6 characters, and prefixed with two slashes. This encoding is undocumented but used by MSVC. llvm-svn: 201940	2014-02-22 16:12:20 +00:00
NAKAMURA Takumi	0607c15435	llvm/test/CodeGen/X86/shift-pcmp.ll: Tweak to appease FileCheck. "CHECK-LABEL" doesn't identify labels magically; it matches independently of the surrounding context. When targeting pecoff, ".def foo" appears before ".short 32". .def foo; ... .LCPI0_0: .short 32 foo: CHECK-LABEL seeks not from ".short 32" but from the top of the input. llvm-svn: 201931	2014-02-22 07:27:04 +00:00
Quentin Colombet	1627a4159e	[CodeGenPrepare] Fix the check of the legality of an instruction. The API expects an ISD opcode, not an IR opcode. Fixes a regression for R600. Related to <rdar://problem/15519855>. llvm-svn: 201923	2014-02-22 01:06:41 +00:00
Quentin Colombet	4db08df18e	[DAGCombiner] PCMP* sets its result to all ones or zeros so we can AND with the shifted mask rather than masking and shifting separately. The patch adds this transformation to the DAGCombiner: (shl (and (setcc:i8v16 ...) N01C) N1C) -> (and (setcc:i8v16 ...) N01C<<N1C) <rdar://problem/16054492> Patch by Adam Nemet <anemet@apple.com> llvm-svn: 201906	2014-02-21 23:42:41 +00:00
Rafael Espindola	f12b82824a	Add a SymbolicFile interface between Binary and ObjectFile. This interface allows IRObjectFile to be implemented without having dummy methods for all section and segment related methods. Both llvm-ar and llvm-nm are changed to use it. Unfortunately the mangler is still not plugged in since it requires some refactoring to make a Module hold a DataLayout. llvm-svn: 201881	2014-02-21 20:10:59 +00:00
Sebastian Pop	f05ba89bd3	add -da-delinearize runs and checks to MIV testcases llvm-svn: 201869	2014-02-21 18:15:18 +00:00
Kevin Qin	07334d37de	[AArch64] Add register constraints to avoid generating STLXR and STXR with unpredictable behavior. llvm-svn: 201841	2014-02-21 07:45:48 +00:00
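
The long-name encoding described in commit 9d2c15eff7 above (MC: Support COFF string tables larger than 10MB) can be sketched roughly as follows. This is a minimal Python illustration, not the LLVM implementation. The function name `encode_section_name_offset`, the use of the standard base64 alphabet, the most-significant-digit-first order, and the 7-digit limit for the single-slash form (inferred from the 8-byte section name field) are all assumptions, since the commit message notes that the encoding is undocumented.

```python
# Hypothetical sketch: how a COFF section name might refer to a
# string-table offset. All names and limits here are illustrative.

# Assumption: the standard base64 alphabet is used for the digits.
BASE64_ALPHABET = (
    "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
    "abcdefghijklmnopqrstuvwxyz"
    "0123456789+/"
)

# Assumption: the name field holds 8 characters, so "/" leaves room
# for 7 decimal digits and "//" leaves room for 6 base64 digits.
MAX_DECIMAL_OFFSET = 9_999_999
MAX_BASE64_OFFSET = 64**6 - 1


def encode_section_name_offset(offset: int) -> str:
    """Return the name-field text that refers to a string-table offset."""
    if offset < 0:
        raise ValueError("offset must be non-negative")
    if offset <= MAX_DECIMAL_OFFSET:
        # Single-slash form: "/" followed by the decimal offset.
        return f"/{offset}"
    if offset > MAX_BASE64_OFFSET:
        raise ValueError("offset does not fit in 6 base64 digits")
    # Double-slash form: 6 base64 digits, most significant first,
    # padded on the left with 'A' (the digit for zero).
    digits = []
    for _ in range(6):
        digits.append(BASE64_ALPHABET[offset % 64])
        offset //= 64
    return "//" + "".join(reversed(digits))
```

Under these assumptions, small offsets keep the readable decimal form, and only offsets of 10,000,000 or more switch to the two-slash base64 form, which always fills the 8-character field exactly.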


23195 Commits
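
The bounds-check simplification described in commit 6aedc106ba ([SROA] Fix PR18615) relies on a single unsigned comparison instead of separate sign checks. The following is a minimal Python sketch of that general idea, not the SROA code itself. The 64-bit width and the names `as_unsigned` and `offset_out_of_bounds` are assumptions made for illustration.

```python
# Hypothetical sketch: one unsigned >= (uge) comparison rejects both
# negative offsets and offsets at or past the end of the allocation.

BITS = 64                      # assumption: 64-bit offsets
MASK = (1 << BITS) - 1


def as_unsigned(value: int) -> int:
    """Reinterpret a signed integer as a BITS-wide unsigned value."""
    return value & MASK


def offset_out_of_bounds(offset: int, alloc_size: int) -> bool:
    """True if offset does not fall inside [0, alloc_size)."""
    # A negative offset wraps to a very large unsigned value, so it
    # fails the same comparison as an offset past the end.
    return as_unsigned(offset) >= as_unsigned(alloc_size)
```

Because no explicit sign test is involved, the same comparison also works when the allocation size itself exceeds half the address space, which is the restriction the commit message mentions removing.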