llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	57a3d084cd	Make static variables const if possible. Makes them go into a read-only section. Or fold them into a initializer list which has the same effect. NFC. llvm-svn: 231598	2015-03-08 16:07:39 +00:00
Simon Pilgrim	8c58c066b7	[DAGCombiner] Add a shuffle mask commutation helper function. NFCI. We have an increasing number of cases where we are creating commuted shuffle masks - all implementing nearly the same code. This patch adds a static helper function - ShuffleVectorSDNode::commuteMask() and replaces a number of cases to use it. Differential Revision: http://reviews.llvm.org/D8139 llvm-svn: 231581	2015-03-07 22:33:11 +00:00
David Majnemer	73460f94a2	Fix the autoconf build lib/ExecutionEngine/Targets has no Makefile, causing the autoconf build to fail. Solve this by bringing the COFF implementation of RuntimeDyld in line like the Mach-O and ELF implementations. llvm-svn: 231579	2015-03-07 21:47:46 +00:00
Benjamin Kramer	f027ad7883	Make the assertion macros in Verifier and Linter truly variadic. NFC. llvm-svn: 231577	2015-03-07 21:15:40 +00:00
David Majnemer	b654b55619	Fix unused variable/function warnings llvm-svn: 231576	2015-03-07 20:56:50 +00:00
David Majnemer	1a666e0f69	ExecutionEngine: Preliminary support for dynamically loadable coff objects Provide basic support for dynamically loadable coff objects. Only handles a subset of x64 currently. Patch by Andy Ayers! Differential Revision: http://reviews.llvm.org/D7793 llvm-svn: 231574	2015-03-07 20:21:27 +00:00
Benjamin Kramer	867bfc53ee	Make constant arrays that are passed to functions as const. In theory this allows the compiler to skip materializing the array on the stack. In practice clang often fails to do that, but that's a different story. NFC. llvm-svn: 231571	2015-03-07 17:41:00 +00:00
Simon Pilgrim	2dcbe74dfd	Use SDValue bool check to tidyup some possible combines. NFC. llvm-svn: 231569	2015-03-07 16:34:55 +00:00
Aaron Ballman	6b329f5cd2	Adding parenthesis around logical expressions to silence a -Wparentheses warning; NFC. llvm-svn: 231567	2015-03-07 15:16:27 +00:00
Aaron Ballman	5287f0c3c7	Removing spurious semi-colons; NFC llvm-svn: 231566	2015-03-07 15:10:32 +00:00
Benjamin Kramer	4e0c7928e2	X86: Roll repetitive code into a loop. NFC. llvm-svn: 231565	2015-03-07 15:06:16 +00:00
Andrea Di Biagio	c9d79e8103	[DAGCombiner] Fix wrong folding of AND dag nodes. This patch fixes the logic in the DAGCombiner that folds an AND node according to rule: (and (X (load V)), C) -> (X (load V)) An AND between a vector load 'X' and a constant build_vector 'C' can be folded into the load itself only if we can prove that the AND operation is redundant. The algorithm implemented by 'visitAND' firstly computes the splat value 'S' from C, and then checks if S has the lower 'B' bits set (where B is the size in bits of the vector element type). The algorithm takes into account also the 'undef' bits in the splat mask. Unfortunately, the algorithm only worked under the assumption that the size of S is a multiple of the vector element type. With this patch, we conservatively avoid folding the AND if the splat bits are not compatible with the vector element type. Added X86 test and-load-fold.ll Differential Revision: http://reviews.llvm.org/D8085 llvm-svn: 231563	2015-03-07 12:24:55 +00:00
Chandler Carruth	324f75aa47	[Modules] Include the header needed for make_unique, otherwise we can't build this header in a module. llvm-svn: 231561	2015-03-07 10:55:47 +00:00
Chandler Carruth	dc57c481af	Teach the LLVM CMake build how to explicitly use libc++abi when using libc++. This lets me almost self-host on Linux with libc++ and libc++abi very simply. Currently, MCJIT and OrcJIT are failing due to uncaught exceptions, and the Go binding tests are failing to build due to not linking in the correct C++ standard library. llvm-svn: 231560	2015-03-07 10:30:34 +00:00
Chandler Carruth	df397c520d	[PM] Fixup for r231556 where I missed a dependency on intrinsics generation. llvm-svn: 231558	2015-03-07 09:08:20 +00:00
Chandler Carruth	1ff7724da5	[PM] Create a separate library for high-level pass management code. This will provide the analogous replacements for the PassManagerBuilder and other code long term. This code is extracted from the opt tool currently, and I plan to extend it as I build up support for using the new pass manager in Clang and other places. Mailing this out for review in part to let folks comment on the terrible names here. A brief word about why I chose the names I did. The library is called "Passes" to try and make it clear that it is a high-level utility and where all of the passes come together and are registered in a common library. I didn't want it to be limited to a registry though, the registry is just one component. The class is a "PassBuilder" but this name I'm less happy with. It doesn't build passes in any traditional sense and isn't a Builder-style API at all. The class is a PassRegisterer or PassAdder, but neither of those really make a lot of sense. This class is responsible for constructing passes for registry in an analysis manager or for population of a pass pipeline. If anyone has a better name, I would love to hear it. The other candidate I looked at was PassRegistrar, but that doesn't really fit either. There is no register of all the passes in use, and so I think continuing the "registry" analog outside of the registry of pass names and types is a mistake. The objects themselves are just objects with the new pass manager. Differential Revision: http://reviews.llvm.org/D8054 llvm-svn: 231556	2015-03-07 09:02:36 +00:00
Simon Pilgrim	bede80a440	[DAGCombiner] SCALAR_TO_VECTOR(EXTRACT_VECTOR_ELT(V,C)) -> VECTOR_SHUFFLE This patch attempts to convert a SCALAR_TO_VECTOR using an operand from an EXTRACT_VECTOR_ELT into a VECTOR_SHUFFLE. This prevents many cases of spilling scalar data between the gpr + simd registers. At present the optimization only accepts cases where there is no TRUNC of the scalar type (i.e. all types must match). Differential Revision: http://reviews.llvm.org/D8132 llvm-svn: 231554	2015-03-07 05:52:42 +00:00
Eric Christopher	25dbdeb4d1	Typo. llvm-svn: 231547	2015-03-07 01:39:09 +00:00
Eric Christopher	e035e26655	Remove use of misched-bench from this test and replace it with non-temporary enabling options. This is part of removing misched-bench as an option. llvm-svn: 231546	2015-03-07 01:39:06 +00:00
Frederic Riss	23e20e95e9	[dsymutil] Apply relocations to DIE data before cloning. Doing this gets function's low_pc and global variable's locations right in the output debug info. It also could get right other attributes that need to be relocated (in linker terms), but I don't know of any other than the address attributes. This doesn't fixup low_pc attributes in compile_unit, lexical_block or inlined subroutine, nor does it get right high_pc attributes for function. This will come in a subsequent commit. llvm-svn: 231544	2015-03-07 01:25:09 +00:00
Eric Christopher	7e70aba1a8	Recommit r231324 with a fix to the ARM execution domain code to disable lane switching if we don't actually have the instruction set we want to switch to. Models the earlier check above the conditional for the pass. The testcase is one that triggered with the assert that's added as part of the fix, use it to avoid adding a new testcase as it highlights the same problem. llvm-svn: 231539	2015-03-07 00:12:22 +00:00
Richard Smith	6661a67d50	[modules] Mark Analysis/TargetLibraryInfo.def as a textual header. llvm-svn: 231532	2015-03-06 23:39:54 +00:00
Frederic Riss	9833de65a7	[dsymutil] Support cloning DIE reference attributes. Reference attributes are mainly handled by just creating DIEEntry attributes for them. There is a special case for DW_FORM_ref_addr attributes though, because the DIEEntry code needs a DwarfDebug code to emit them (and we don't have one as we do no CodeGen). In that case, just use DIEInteger attributes with the right form. llvm-svn: 231531	2015-03-06 23:22:53 +00:00
Frederic Riss	9d441b68a3	[dsymutil] Set linked unit start offset early. NFC. The start offset of a linked unit is known before starting to clone its DIEs. Handling DW_FORM_ref_addr attributes requires that this offset is set while cloning the unit. Split CompileUnit::computeOffsets() into setStartOffset() and computeNextUnitOffset() and call them repsectively before cloning the DIEs and right after. llvm-svn: 231530	2015-03-06 23:22:50 +00:00
Frederic Riss	718c60e203	Add DIEInteger::setValue() method. dsymutil needs to 'patch' attribute values after creating them. Just add this trivial capability. llvm-svn: 231529	2015-03-06 23:22:46 +00:00
Olivier Sallenave	049d803ce0	Do not restrict interleaved unrolling to small loops, depending on the target. llvm-svn: 231528	2015-03-06 23:12:04 +00:00
Quentin Colombet	66b616351c	[AArch64][LoadStoreOptimizer] Generate LDP + SXTW instead of LD[U]R + LD[U]RSW. Teach the load store optimizer how to sign extend a result of a load pair when it helps creating more pairs. The rational is that loads are more expensive than sign extensions, so if we gather some in one instruction this is better! <rdar://problem/20072968> llvm-svn: 231527	2015-03-06 22:42:10 +00:00
Sanjay Patel	3fee49b236	fixed to test features, not CPUs llvm-svn: 231524	2015-03-06 21:50:42 +00:00
Sanjay Patel	a800b6c04b	fixed to test features, not CPUs llvm-svn: 231523	2015-03-06 21:50:27 +00:00
Sanjay Patel	4593045f01	loosen checking for buildbots llvm-svn: 231522	2015-03-06 21:30:18 +00:00
Sanjay Patel	3fd51f3c4d	fixed to test only the feature, not the feature and a CPU llvm-svn: 231521	2015-03-06 21:24:56 +00:00
Sanjay Patel	eb60f0728d	fixed to test only the feature, not the feature and a CPU llvm-svn: 231520	2015-03-06 21:19:32 +00:00
Sanjay Patel	9c04ad5ed7	fixed test to use FileCheck llvm-svn: 231519	2015-03-06 21:16:15 +00:00
Sanjay Patel	9881f9531c	fixed to use CHECK-LABELs llvm-svn: 231517	2015-03-06 21:05:02 +00:00
Sanjay Patel	6a53998a48	fixed to test only the feature, not the feature and a CPU llvm-svn: 231516	2015-03-06 20:58:15 +00:00
Sanjay Patel	869cea48cc	fixed to test only the feature, not the feature and a CPU llvm-svn: 231515	2015-03-06 20:57:40 +00:00
Sanjay Patel	dba8012f69	fixed to test feature, not CPU llvm-svn: 231513	2015-03-06 20:51:25 +00:00
Sanjay Patel	7c6eaf03d7	fixed to test features, not CPUs llvm-svn: 231512	2015-03-06 20:46:16 +00:00
Sanjay Patel	829c7347d1	fixed test to use SSE2 attribute llvm-svn: 231510	2015-03-06 20:38:55 +00:00
Sanjay Patel	2b7229c34d	fixed to test only the feature, not the feature and a CPU llvm-svn: 231509	2015-03-06 20:34:20 +00:00
Matthias Braun	898d11e864	DAGCombiner: Canonicalize select(and/or,x,y) depending on target. This is based on the following equivalences: select(C0 & C1, X, Y) <=> select(C0, select(C1, X, Y), Y) select(C0 \| C1, X, Y) <=> select(C0, X, select(C1, X, Y)) Many target cannot perform and/or on the CPU flags and therefore the right side should be choosen to avoid materializign the i1 flags in an integer register. If the target can perform this operation efficiently we normalize to the left form. Differential Revision: http://reviews.llvm.org/D7622 llvm-svn: 231507	2015-03-06 19:49:10 +00:00
Matthias Braun	3ecb557739	DAGCombiner: Factor out some and/or combines. This is in preparation for changing visitSELECT to normalize towards select(Cond0, select(Cond1, X, Y), Y); select(Cond0, X, select(Cond1, X, Y)) which perfom an implicit and/or of the conditions. The factored function contains all DAGCombine rules which reduce two values combined by an And/Or operation to a single value. This does not include rules involving constants as visitSELECT already handles that case. Differential Revision: http://reviews.llvm.org/D8026 llvm-svn: 231506	2015-03-06 19:49:06 +00:00
Bruno Cardoso Lopes	61b9fd4686	[AsmPrinter][TLOF] Remove AArch64 test to appease buildbots Follow up from r231497. Using XFAIL would still trigger fail on some buildbots. Will re-introduce it as soon as I have a fix. llvm-svn: 231505	2015-03-06 19:42:18 +00:00
Benjamin Kramer	e8a64a20f2	LoopInterchange: Remove empty method. llvm-svn: 231503	2015-03-06 19:37:26 +00:00
Benjamin Kramer	79442920bf	LoopInterchange: Rephrase instruction moving using ilist's splice and factor it into a function + Random cleanups. No functional change. llvm-svn: 231501	2015-03-06 18:59:14 +00:00
Matthias Braun	046318b87e	ExecutionDepsFix: Indizes -> Indices. Translate german to english. llvm-svn: 231500	2015-03-06 18:56:20 +00:00
Bruno Cardoso Lopes	6e38693507	[AsmPrinter][TLOF] XFAIL AArch64 test to appease buildbots The checking for extgotequiv and localgotequiv rely on the emission order, which is not guaranteed because we use DenseMap to hold the GOT equivalents. XFAIL this now until I get time to use MapVector and test out the solution. In the meantime, appease buildbots. llvm-svn: 231497	2015-03-06 18:38:42 +00:00
Eric Christopher	6a8bfe7198	Fix typo. llvm-svn: 231495	2015-03-06 18:20:23 +00:00
Frederic Riss	ef648462d2	[dsymutil] Add debug_str construction support. With this comes the ability to correctly clone string attributes in DIEs. llvm-svn: 231493	2015-03-06 17:56:30 +00:00
Tom Stellard	6b42f2d8aa	R600/SI: Remove unused register class llvm-svn: 231491	2015-03-06 17:00:16 +00:00
Benjamin Kramer	298a3a0567	Fold init() helpers into constructors. NFC. llvm-svn: 231486	2015-03-06 16:21:15 +00:00
Chad Rosier	99b3e022c4	Avoid calls to dumpPassInfo and RegionBase<Tr>::getNameStr() in RGPassManager if -debug-pass is not specified, as the string is only used when dumping pass information. There is a big cost of determining the name in ReginBase<Tr>:getNameStr() if the region's entry or exit block doesn't have a name. This is the case for the Release build, as names are not preserved by the front-end. RegionPass is mainly used by Polly, resulting in long compile time for one file of a customer application with the Release build (1m24s) vs Release+Asserts build (10s) when Polly is used. With this change, the compile time with the Release build went down to 8s. Patch by Sanjin Sijaric <ssijaric@codeaurora.org>! Phabricator: http://reviews.llvm.org/D8076 llvm-svn: 231485	2015-03-06 16:15:04 +00:00
James Molloy	dcc78ec386	[ConstantRange] Teach multiply to be cleverer about signed ranges. Multiplication is not dependent on signedness, so just treating all input ranges as unsigned is not incorrect. However it will cause overly pessimistic ranges (such as full-set) when used with signed negative values. Teach multiply to try to interpret its inputs as both signed and unsigned, and then to take the most specific (smallest population) as its result. llvm-svn: 231483	2015-03-06 15:50:47 +00:00
Bruno Cardoso Lopes	5b75f4a356	[AsmPrinter][TLOF] Make AArch64 test a bit more flexible llvm-svn: 231481	2015-03-06 15:11:41 +00:00
Bruno Cardoso Lopes	2d54aa496e	[AsmPrinter][TLOF] Split tests and move to appropriate directories Follow up from r231474 and 231475 to appease buildbots llvm-svn: 231480	2015-03-06 14:41:56 +00:00
Bruno Cardoso Lopes	618c67a018	[AsmPrinter][TLOF] 32-bit MachO support for replacing GOT equivalents Add MachO 32-bit (i.e. arm and x86) support for replacing global GOT equivalent symbol accesses. Unlike 64-bit targets, there's no GOTPCREL relocation, and access through a non_lazy_symbol_pointers section is used instead. -- before _extgotequiv: .long _extfoo _delta: .long _extgotequiv-_delta -- after _delta: .long L_extfoo$non_lazy_ptr-_delta .section __IMPORT,__pointers,non_lazy_symbol_pointers L_extfoo$non_lazy_ptr: .indirect_symbol _extfoo .long 0 llvm-svn: 231475	2015-03-06 13:49:05 +00:00
Bruno Cardoso Lopes	52b1391df6	[AsmPrinter][TLOF] ARM64 MachO support for replacing GOT equivalents Follow up r230264 and add ARM64 support for replacing global GOT equivalent symbol accesses by references to the GOT entry for the final symbol instead, example: -- before .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta -- after .globl _foo _foo: .long 42 .globl _delta Ltmp3: .long _foo@GOT-Ltmp3 llvm-svn: 231474	2015-03-06 13:48:45 +00:00
Benjamin Kramer	6409a3c5d8	CodingStyle: Allow delegating ctors Delegating constructors seem to work fine with all supported compilers. llvm-svn: 231473	2015-03-06 13:46:50 +00:00
Toma Tabacu	4e0cf8e211	[mips] [IAS] Add missing constraints and improve testing for the .module directive. Summary: None of the .set directives can be used before the .module directives. The .set mips0/pop/push were not triggering this constraint. Also added testing for all the other implemented directives which are supposed to trigger this constraint. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7140 llvm-svn: 231465	2015-03-06 12:15:12 +00:00
Daniel Jasper	6adbd7aecf	Change the way in which error case is being handled. Specifically this: * Prevents an "unused" warning in non-assert builds. * In that error case return with out removing a child loop instead of looping forever. llvm-svn: 231459	2015-03-06 10:39:14 +00:00
Karthik Bhat	88db86dd29	Add a new pass "Loop Interchange" This pass interchanges loops to provide a more cache-friendly memory access. For e.g. given a loop like - for(int i=0;i<N;i++) for(int j=0;j<N;j++) A[j][i] = A[j][i]+B[j][i]; is interchanged to - for(int j=0;j<N;j++) for(int i=0;i<N;i++) A[j][i] = A[j][i]+B[j][i]; This pass is currently disabled by default. To give a brief introduction it consists of 3 stages- LoopInterchangeLegality : Checks the legality of loop interchange based on Dependency matrix. LoopInterchangeProfitability: A very basic heuristic has been added to check for profitibility. This will evolve over time. LoopInterchangeTransform : Which does the actual transform. LNT Performance tests shows improvement in Polybench/linear-algebra/kernels/mvt and Polybench/linear-algebra/kernels/gemver becnmarks. TODO: 1) Add support for reductions and lcssa phi. 2) Improve profitability model. 3) Improve loop selection algorithm to select best loop for interchange. Currently the innermost loop is selected for interchange. 4) Improve compile time regression found in llvm lnt due to this pass. 5) Fix issues in Dependency Analysis module. A special thanks to Hal for reviewing this code. Review: http://reviews.llvm.org/D7499 llvm-svn: 231458	2015-03-06 10:11:25 +00:00
David Majnemer	b61f4e403d	X86: Form IMGREL relocations for LLVM Functions We supported forming IMGREL relocations from ConstantExprs involving __ImageBase if the minuend was a GlobalVariable. Extend this functionality to all GlobalObjects. llvm-svn: 231456	2015-03-06 08:11:32 +00:00
Yaron Keren	322bdad085	Silence C4715 'not all control paths return a value' warnings. llvm-svn: 231455	2015-03-06 07:49:14 +00:00
Rui Ueyama	da9bc2e56d	Support: Improve performance of FileOutputBuffer on Windows We extend an underlying file before mmap'ing it, but it's not needed on Windows. Extending file is slow on Windows, so we should avoid doing that. The difference gets larger as the size of an output file gets larger. It shove off 2 seconds out of 25 seconds when linking chrome.dll with LLD, for example. llvm-svn: 231452	2015-03-06 06:07:32 +00:00
Michael Gottesman	6ff10c959a	[objc-arc] Sprinkle some more auto on some iterators. llvm-svn: 231447	2015-03-06 02:10:03 +00:00
Michael Gottesman	16e6a2057f	[objc-arc] Move the detection of potential uses or altering of a ref count onto PtrState. llvm-svn: 231446	2015-03-06 02:07:12 +00:00
Michael Zolotukhin	03dd1082ad	LegalizeTypes: Handle shift by 0 in ExpandShiftByConstant. Though such shifts are usually optimized away by combiner, we still can encounter them after a vector shift is legalized. llvm-svn: 231443	2015-03-06 01:13:01 +00:00
Rafael Espindola	a5b9e1cf39	Remember to move a type to the correct set when setting the body. We would set the body of a struct type (therefore making it non-opaque) but were forgetting to move it to the non-opaque set. Fixes pr22807. llvm-svn: 231442	2015-03-06 00:50:21 +00:00
Michael Gottesman	6080596328	[objc-arc] Move the checking of whether or not we can match onto PtrStates and out of the main dataflow. These refactored computations check whether or not we are at a stage of the sequence where we can perform a match. This patch moves the computation out of the main dataflow and into {BottomUp,TopDown}PtrState. llvm-svn: 231439	2015-03-06 00:34:42 +00:00
Michael Gottesman	4eae396ae9	[objc-arc] Refactor (Re-)initialization of PtrState from dataflow -> {TopDown,BottomUp}PtrState Class. This initialization occurs when we see a new retain or release. Before we performed the actual initialization inline in the dataflow. That is just messy. llvm-svn: 231438	2015-03-06 00:34:39 +00:00
Michael Gottesman	feb138e211	[objc-arc] Create two subclasses of PtrState in preparation for moving per ptr state change behavior onto a PtrState class. This will enable the main ObjCARCOpts dataflow to work with higher level concepts such as "can this ptr state be modified by this ref count" and not need to understand the nitty gritty details of how that is determined. This makes the dataflow cleaner. llvm-svn: 231437	2015-03-06 00:34:36 +00:00
Michael Gottesman	41c01005ed	[objc-arc] Extract out MDNodes into a cache structure so the information can be passed around. llvm-svn: 231436	2015-03-06 00:34:33 +00:00
Michael Gottesman	f6bcb81000	[objc-arc] Remove annotations code. It will always be in the history if it is needed again. Now it is just dead code. llvm-svn: 231435	2015-03-06 00:34:29 +00:00
Nadav Rotem	c99a38796c	Teach ComputeNumSignBits about signed reminder. This optimization a continuation of r231140 that reasoned about signed div. llvm-svn: 231433	2015-03-06 00:23:58 +00:00
Michael Gottesman	d45907bd38	Fix build error. llvm-svn: 231430	2015-03-05 23:57:07 +00:00
Michael Gottesman	a9fc016281	[objc-arc] Change some casts and loop iterators to use auto. llvm-svn: 231427	2015-03-05 23:29:06 +00:00
Michael Gottesman	68b91dbf84	[objc-arc] Extract out state specific to a ref count from the main objc arc sequence dataflow. This will allow me to separate the actual ARC queries from the meat of the dataflow algorithm. llvm-svn: 231426	2015-03-05 23:29:03 +00:00
Michael Gottesman	0be6920e23	[objc-arc] Extract blot map vector into its own file. NFC. llvm-svn: 231425	2015-03-05 23:28:58 +00:00
Ahmed Bougacha	c6dcf7a7cc	[X86] Remove stale comment. NFC. It turns out 256bit V[SZ]EXT nodes are still generated by the new shuffle lowering, so this is here to stay! llvm-svn: 231422	2015-03-05 23:18:41 +00:00
Paul Robinson	282b3d3ff5	All FileCheck directives allow patterns. llvm-svn: 231418	2015-03-05 23:04:26 +00:00
Peter Collingbourne	febd93c7c9	Go bindings: use MDNode::replaceAllUsesWith instead of MDTuple::replaceAllUsesWith. Fixes llgo following Duncan's changes to debug info in r231082. llgo needs to replace composite types, which are no longer represented using MDTuple. llvm-svn: 231416	2015-03-05 22:55:38 +00:00
Philip Reames	e21ce4540c	[RewriteStatepointsForGC] Yet more test cases for relocation At this point, we should have decent coverage of the involved code. I've got a few more test cases to cleanup and submit, but what's here is already reasonable. I've got a collection of liveness tests which will be posted for review along with a decent liveness algorithm in the next few days. Once those are in, the code in this file should be well tested and I can start renaming things without risk of serious breakage. llvm-svn: 231414	2015-03-05 22:28:06 +00:00
Quentin Colombet	2cd9d0b783	[CODE_OWNERS] Change the ownership of register allocators. llvm-svn: 231412	2015-03-05 22:15:17 +00:00
Benjamin Kramer	fc165f1434	Instructions: Use delegated constructors to reduce duplication NFC. llvm-svn: 231411	2015-03-05 22:05:26 +00:00
Sanjay Patel	302404b277	[AVX] Lower / fast-isel scalar FP selects into VBLENDV instructions (PR22483) This patch reduces code size for all AVX targets and increases speed for some chips. SSE 4.1 introduced the useless (see code comments) 2-register form of BLENDV and only in the packed float/double flavors. AVX subsequently made the instruction useful by adding a 4-register operand form. So we just need to paper over the lack of scalar forms of this instruction, complicate the code to choose float or double forms, and use blendv on scalars since all FP is in xmm registers anyway. This gives us an approximately 50% speed up for a blendv microbenchmark sequence on SandyBridge and Haswell: blendv : 29.73 cycles/iter logic : 43.15 cycles/iter No new test cases with this patch because: 1. fast-isel-select-sse.ll tests the positive side for regular X86 lowering and fast-isel 2. sse-minmax.ll and fp-select-cmp-and.ll confirm that we're not firing for scalar selects without AVX 3. fp-select-cmp-and.ll and logical-load-fold.ll confirm that we're not firing for scalar selects with constants. http://llvm.org/bugs/show_bug.cgi?id=22483 Differential Revision: http://reviews.llvm.org/D8063 llvm-svn: 231408	2015-03-05 21:46:54 +00:00
Benjamin Kramer	fb0abceb5c	SelectionDAGBuilder: Merge 3 copies of the limited precision exp2 emission code. NFC intended. llvm-svn: 231406	2015-03-05 21:13:08 +00:00
Andrew Kaylor	05ee8bd4e3	Fix uninitialized memory references in WinEHPrepare llvm-svn: 231405	2015-03-05 21:06:42 +00:00
Benjamin Kramer	c54c38e090	SDAG: Merge the meat of two ExpandAtomic implementations. The copies already diverged, don't let them become any worse. Reduce redundancy in code with a little macro metaprogramming. llvm-svn: 231401	2015-03-05 20:04:29 +00:00
Ahmed Bougacha	1b67630cb3	[AArch64] Teach AsmPrinter about GlobalAddress operands. Fixes PR22761, rdar://20024866. Differential Revision: http://reviews.llvm.org/D8042 llvm-svn: 231400	2015-03-05 20:04:21 +00:00
Philip Reames	03ea8642b1	[RewriteStatepointsForGC] Add additional tests around relocation These are focused around the actual relocation rewriting itself, not the rest of the infrastructure. llvm-svn: 231399	2015-03-05 19:52:13 +00:00
Rafael Espindola	092b619e55	Use the correct func begin symbol in all places in ppc. I missed an occurrence of the old symbol in my previous patch. llvm-svn: 231398	2015-03-05 19:47:50 +00:00
Tom Stellard	5698d63348	TableGen: Initialize ErrorInfo to ~0ULL in the MatchInstructionImpl This is what all the targets check for and is consistent with the initialized value of MissingFeatures, which is sometimes assinged to ErrorInfo. llvm-svn: 231397	2015-03-05 19:46:55 +00:00
Ahmed Bougacha	4200cc95b4	[ARM] Enable vector extload combine for legal types. This commit enables forming vector extloads for ARM. It only does so for legal types, and when we can't fold the extension in a wide/long form of the user instruction. Enabling it for larger types isn't as good an idea on ARM as it is on X86, because: - we pretend that extloads are legal, but end up generating vld+vmov - we have instructions like vld {dN, dM}, which can't be generated when we "manually expand" extloads to vld+vmov. For legal types, the combine doesn't fire that often: in the integration tests only in a big endian testcase, where it removes a pointless AND. Related to rdar://19723053 Differential Revision: http://reviews.llvm.org/D7423 llvm-svn: 231396	2015-03-05 19:37:53 +00:00
Zachary Turner	cd132c9b0d	Replace PrintStackTrace(FILE*) with PrintStackTrace(raw_ostream&) This will be followed by a change on the clang side to update the only user of this function with the new version. Differential Revision: http://reviews.llvm.org/D8074 Reviewed By: Reid Kleckner llvm-svn: 231392	2015-03-05 19:10:52 +00:00
Reid Kleckner	286b100750	Remove accidental errs() call in Verifier llvm-svn: 231391	2015-03-05 19:05:25 +00:00
Rafael Espindola	86bd6a1202	Use the generic Lfunc_begin label on ppc. This removes yet another custom label to mark the start of a function. llvm-svn: 231390	2015-03-05 18:55:50 +00:00
David Majnemer	71b9b6be1b	X86: Optimize address mode matching for FRAME_ALLOC_RECOVER nodes We know that the absolute symbol will be less than 2GB and thus will always fit. llvm-svn: 231389	2015-03-05 18:50:12 +00:00
Reid Kleckner	caf7444b80	Revert busted CallSite change from r231386 llvm-svn: 231388	2015-03-05 18:32:14 +00:00
Reid Kleckner	e658058cc0	Silence -Wmissing-braces warning from clang-cl The first element of STACKFRAME64 is a struct and Clang wants us to put braces around it's initialization. Instead, drop the zero. The result should be the same. llvm-svn: 231387	2015-03-05 18:26:58 +00:00
Reid Kleckner	cfb9ce53c1	Replace llvm.frameallocate with llvm.frameescape Turns out it's pretty straightforward and simplifies the implementation. Reviewers: andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D8051 llvm-svn: 231386	2015-03-05 18:26:34 +00:00
Erik Eckstein	8c76e669c5	Revert r231276 (including r231277): Add a lock() function in PassRegistry to speed up multi-thread synchronization. llvm-svn: 231385	2015-03-05 17:53:00 +00:00
Zachary Turner	62b7b617a8	[Windows] Implement PrintStackTrace(FILE) llvm::sys::PrintBacktrace(FILE) is supposed to print a backtrace of the current thread given the current PC. This function was unimplemented on Windows, and instead the only time we could print a backtrace was as the result of an exception through LLVMUnhandledExceptionFilter. This patch implements backtracing of self by using RtlCaptureContext to get a CONTEXT for the current thread, and moving the printing and StackWalk64 code to a common method that printing own stack trace and printing stack trace of an exception can use. Differential Revision: http://reviews.llvm.org/D8068 Reviewed by: Reid Kleckner llvm-svn: 231382	2015-03-05 17:47:52 +00:00
Simon Pilgrim	7189084bef	[DagCombiner] Allow shuffles to merge through bitcasts Currently shuffles may only be combined if they are of the same type, despite the fact that bitcasts are often introduced in between shuffle nodes (e.g. x86 shuffle type widening). This patch allows a single input shuffle to peek through bitcasts and if the input is another shuffle will merge them, shuffling using the smallest sized type, and re-applying the bitcasts at the inputs and output instead. Dropped old ShuffleToZext test - this patch removes the use of the zext and vector-zext.ll covers these anyhow. Differential Revision: http://reviews.llvm.org/D7939 llvm-svn: 231380	2015-03-05 17:14:04 +00:00
Duncan P. N. Exon Smith	cffbbe92f1	FileCheck: Document CHECK-SAME, follow-up to r230612 llvm-svn: 231379	2015-03-05 17:00:05 +00:00
Kit Barton	e48b1e1c4f	While reviewing the changes to Clang to add builtin support for the vsld, vsrd, and vsrad instructions, it was pointed out that the builtins are generating the LLVM opcodes (shl, lshr, and ashr) not calls to the intrinsics. This patch changes the implementation of the vsld, vsrd, and vsrad instructions from from intrinsics to VXForm_1 instructions and makes them legal with P8 Altivec. It also removes the definition of the int_ppc_altivec_vsld, int_ppc_altivec_vsrd, and int_ppc_altivec_vsrad intrinsics. llvm-svn: 231378	2015-03-05 16:24:38 +00:00
Igor Laevsky	8d0851f509	Revert change r231366 as it broke clang-native-arm-cortex-a9 Analysis/properties.m test. llvm-svn: 231374	2015-03-05 15:41:14 +00:00
Elena Demikhovsky	de05f10de2	AVX-512, SKX: Enabled masked_load/store operations for this target. Added lowering for ISD::CONCAT_VECTORS and ISD::INSERT_SUBVECTOR for i1 vectors, it is needed to pass all masked_memop.ll tests for SKX. llvm-svn: 231371	2015-03-05 15:11:35 +00:00
Frederic Riss	0d94ef9b2c	Fix -Woverflow warning in unittest. llvm-svn: 231368	2015-03-05 14:43:15 +00:00
Igor Laevsky	1725997f14	Teach lowering to correctly handle invoke statepoint and gc results tied to them. Note that we still can not lower gc.relocates for invoke statepoints. Also it extracts getCopyFromRegs helper function in SelectionDAGBuilder as we need to be able to customize type of the register exported from basic block during lowering of the gc.result. llvm-svn: 231366	2015-03-05 14:11:21 +00:00
Arnaud A. de Grandmaison	d8ed0d372c	[PBQP] Use a local bit-matrix to speedup searching an edge in the graph. Build time (user time) for building llvm+clang+lldb in release mode: - default allocator: 9086 seconds - with PBQP: 9126 seconds - with PBQP + local bit matrix cache: 9097 seconds llvm-svn: 231360	2015-03-05 09:12:59 +00:00
Michael Kuperstein	bcb26d6880	[InstCombine] Fix an assertion when fmul has a ConstantExpr operand isNormalFp and isFiniteNonZeroFp should not assume vector operands can not be constant expressions. Patch by Pawel Jurek <pawel.jurek@intel.com> Differential Revision: http://reviews.llvm.org/D8053 llvm-svn: 231359	2015-03-05 08:38:57 +00:00
Craig Topper	35b3dbc4a3	Revert "[TableGen] Implement at least some support for multiple explicit results in an instruction pattern. No functional change to existing patterns." This is failing on several build bots. llvm-svn: 231358	2015-03-05 07:17:52 +00:00
Craig Topper	5edbf1ccaf	[TableGen] Implement at least some support for multiple explicit results in an instruction pattern. No functional change to existing patterns. This should help with the AVX512 masked gather changes Elena is working on. This patch is derived from some of the changes Elena made to tablegen, but modified by me to support arbitrary number of results. llvm-svn: 231357	2015-03-05 07:11:36 +00:00
Craig Topper	0be3458006	[TableGen] Add support constraining a vector type in a pattern to have a specific element type and for constraining a vector type to have the same number of elements as another vector type. This is useful for AVX512 mask operations so we relate the mask type to the type of the other arguments. llvm-svn: 231356	2015-03-05 07:11:34 +00:00
Craig Topper	0ee8470a43	[X86] Use vmovss to handle inserting an element into index 0 of a v8f32 vector of zeros. llvm-svn: 231354	2015-03-05 06:38:42 +00:00
Frederic Riss	6e56345dbc	Remove useless break after return. Pointed out by Paul Robinson. llvm-svn: 231353	2015-03-05 06:13:39 +00:00
Philip Reames	34843ae51e	Add a few more performance tips These came from my own experience and may not apply equally to all use cases. Any alternate perspective anyone has should be used to refine these. As always, grammar and spelling adjustments are more than welcome. Please just directly commit a fix if you see something problematic. llvm-svn: 231352	2015-03-05 05:55:55 +00:00
Frederic Riss	2838f9ed61	Revert "[dsymutil] MSVC does generate move constructors, but it should accept to default them" This reverts commit r231350. It turns out MSVC doesn't generate implicit move constructors and also doesn't accept to default them... See for example http://lab.llvm.org:8011/builders/lldb-x86-windows-msvc/builds/2786 llvm-svn: 231351	2015-03-05 05:29:05 +00:00
Frederic Riss	1e9cd2910a	[dsymutil] MSVC does generate move constructors, but it should accept to default them llvm-svn: 231350	2015-03-05 05:17:06 +00:00
Philip Reames	aedd404a3d	Add a link to the new PerformanceTips docs from the 3.7 release notes llvm-svn: 231349	2015-03-05 05:11:05 +00:00
Hans Wennborg	6d8e6d5ee4	Revert r231324 "Remove the conditional addition of the execution dependency fixing" See PR22799. llvm-svn: 231348	2015-03-05 03:24:49 +00:00
Chandler Carruth	7a715dae05	[MBP] Use range based for-loops throughout this code. Several had already been added and the inconsistency made choosing names and changing code more annoying. Plus, wow are they better for this code! llvm-svn: 231347	2015-03-05 03:19:05 +00:00
Chandler Carruth	2fc3fe1282	[MBP] NFC, run clang-format over this code and tweak things to make the result reasonable. This code predated clang-format and so there was a reasonable amount of crufty formatting that had accumulated. This should ensure that neither myself nor others end up with formatting-only changes sneaking into other fixes. llvm-svn: 231341	2015-03-05 02:35:31 +00:00
Chandler Carruth	d0dced58ab	[MBP] This is no longer 'block-placement2'. ;] The old variants are long gone, update this code to reflect that. llvm-svn: 231340	2015-03-05 02:28:25 +00:00
Rafael Espindola	07c03d316d	Use the existing begin and end symbol for debug info. llvm-svn: 231338	2015-03-05 02:05:42 +00:00
NAKAMURA Takumi	478559a532	Reformat. llvm-svn: 231336	2015-03-05 01:25:19 +00:00
NAKAMURA Takumi	d8422ce0ec	Revert r231103, "FullDependenceAnalysis: Avoid using the (deprecated in C++11) copy ctor" It is miscompiled on msc18. llvm-svn: 231335	2015-03-05 01:25:12 +00:00
NAKAMURA Takumi	e110d641a0	Revert r231104, "unique_ptrify FullDependenceAnalysis::DV", to appease msc18 C2280. llvm-svn: 231334	2015-03-05 01:25:06 +00:00
Kostya Serebryany	83ce8779d5	[sanitizer] add nosanitize metadata to more coverage instrumentation instructions llvm-svn: 231333	2015-03-05 01:20:05 +00:00
Chandler Carruth	af7e99f2f4	[MBP] Revert r231238 which attempted to fix a nasty bug where MBP is just arbitrarily interleaving unrelated control flows once they get moved "out-of-line" (both outside of natural CFG ordering and with diamonds that cannot be fully laid out by chaining fallthrough edges). This easy solution doesn't work in practice, and it isn't just a small bug. It looks like a very different strategy will be required. I'm working on that now, and it'll again go behind some flag so that everyone can experiment and make sure it is working well for them. llvm-svn: 231332	2015-03-05 01:07:03 +00:00
NAKAMURA Takumi	8f49dd3687	ScalarEvolution.cpp: Appease g++-4.7. He missed implicit "this" in lambda. llvm-svn: 231331	2015-03-05 01:02:45 +00:00
Eric Christopher	385f4b36d8	Remove the conditional addition of the execution dependency fixing pass from the ARM backend as the pass itself will detect any use of the appropriate register class. llvm-svn: 231324	2015-03-05 00:28:55 +00:00
Eric Christopher	63b44882ef	Cleanup and remove a chunk of getARMSubtarget calls in the ARM TargetMachine pass pipeline construction by pushing them down into the appropriate pass. llvm-svn: 231323	2015-03-05 00:23:40 +00:00
Paul Robinson	49e38965dc	Turn off .debug_pubnames/pubtypes for PS4. Differential Revision: http://reviews.llvm.org/D8067 llvm-svn: 231322	2015-03-05 00:08:27 +00:00
Aaron Ballman	6ab161497a	Initializer lists are supported in MSVC 2013. Since that's our minimum required version, we can move that to the list of acceptable C++11 features. llvm-svn: 231313	2015-03-04 23:17:31 +00:00
Argyrios Kyrtzidis	dc8f979b41	[Support] Increase timeout for the LockFileManager back to 5 mins. Waiting for just 1 min may not be enough for some contexts. llvm-svn: 231309	2015-03-04 22:54:38 +00:00
Matthias Braun	eca5151780	Improve test robustness Improve test robustness in preparation of coming commits: - Avoid undefs which may get propagated too much. - Remove several pointless add 0, instructions llvm-svn: 231307	2015-03-04 22:31:18 +00:00
Sanjoy Das	a5397c0198	[IndVarSimplify] use the "canonical" way to infer no-wrap. Summary: rL225282 introduced an ad-hoc way to promote some additions to nuw or nsw. Since then SCEV has become smarter in directly proving no-wrap; and using the canonical "ext(A op B) == ext(A) op ext(B)" method of proving no-wrap is just as powerful now. Rip out the existing complexity in favor of getting SCEV to do all the heaving lifting internally. This change does not add any unit tests because it is supposed to be a non-functional change. Tests added in rL225282 and rL226075 are valid tests for this change. Reviewers: atrick, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7981 llvm-svn: 231306	2015-03-04 22:24:23 +00:00
Sanjoy Das	9e2c5010f6	[SCEV] make SCEV smarter about proving no-wrap. Summary: Teach SCEV to prove no overflow for an add recurrence by proving something about the range of another add recurrence a loop-invariant distance away from it. Reviewers: atrick, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7980 llvm-svn: 231305	2015-03-04 22:24:17 +00:00
David Blaikie	a8adc1349b	Provide an explicit move ctor because MSVC can't synthesize one llvm-svn: 231303	2015-03-04 22:20:52 +00:00
Frederic Riss	b8b43d5494	[dsymutil] Add minimal code to emit DIE trees. This commit adds code to emit DIE trees that have been pruned from the parts that haven't been marked as kept in the previous pass. It works by 'cloning' the input DIE tree (as read by libDebugInfoDwarf) into a tree of DIE objects. Cloning the DIEs means essentially cloning their attributes. The code in this commit does only handle scalar and block attributes (scalar because they are trivial, blocks because they can't be easily replaced by a scalr placeholder), all the other ones are replaced by placeholder zero values and will be handled in further commits. The added tests mostly check that the DIE tree has the correct layout and also verify that a few chosen scalar and block attributes correctly make their way into the output. llvm-svn: 231300	2015-03-04 22:07:44 +00:00
Frederic Riss	77f850e336	DWARFFormValue: Add getAsSignedConstant method. The implementation accepts explicitely signed forms (DW_FORM_sdata), but also unsigned forms as long as they fit in an int64_t. llvm-svn: 231299	2015-03-04 22:07:41 +00:00
Frederic Riss	ee17fb9b0e	Teach DIEInteger to emit FORM_strp and FORM_ref_addr attributes. To be used/tested by llvm-dsymutil. (llvm-dsymutil does a 'static' link, no need for relocations for most things, so it'll just emit raw integers for most attributes) llvm-svn: 231298	2015-03-04 22:07:36 +00:00
Frederic Riss	8874ce8d4b	Make the DWARFAbbreviationDeclaration::AttributeSpec type public. It was already exposed through the iterators anyway. llvm-svn: 231297	2015-03-04 22:07:30 +00:00
David Blaikie	c7aabbb78e	Update LangRef for explicit type changes to 'load' instruction llvm-svn: 231296	2015-03-04 22:06:14 +00:00
Rafael Espindola	266b8c8043	Expand variables when evaluating absolute expressions. This allows for variables to be used in .size. This matches gnu AS functionality. llvm-svn: 231295	2015-03-04 22:03:21 +00:00
David Blaikie	16a97ebf7f	Update LangRef for getelementptr explicit type changes Here's a rough/first draft - it at least hits the actual textual IR examples and some of the phrasing. It's probably worth a full pass over, but I'm not sure how much these docs should reflect the strange intermediate state we're in anyway. Totally open to lots of review/feedback/suggestions. llvm-svn: 231294	2015-03-04 22:02:58 +00:00
Sanjay Patel	edd04aeb3c	don't repeat class / function / variable names in comments; NFC llvm-svn: 231292	2015-03-04 21:49:03 +00:00
Paul Robinson	78cc0821f0	Support standard DWARF TLS opcode; Darwin and PS4 use it. Differential Revision: http://reviews.llvm.org/D8018 llvm-svn: 231286	2015-03-04 20:55:11 +00:00
Nemanja Ivanovic	e8effe1edb	Add LLVM support for PPC cryptography builtins Review: http://reviews.llvm.org/D7955 llvm-svn: 231285	2015-03-04 20:44:33 +00:00
Reid Kleckner	4276945161	Try to satisfy sanitizer lint check llvm-svn: 231284	2015-03-04 20:38:59 +00:00
David Blaikie	eb6c31f900	Add a FIXME for PR22796, broken ordering of ClassInfo in TableGen As discussed (at length) in code review of r222935, with Duncan. llvm-svn: 231282	2015-03-04 19:56:44 +00:00
Rafael Espindola	265ffbeb0c	Fix the build of the gold-plugin and examples. llvm-svn: 231279	2015-03-04 19:15:29 +00:00
Reid Kleckner	107f9e6fcd	Add missing <atomic> include to PassRegistry.h llvm-svn: 231277	2015-03-04 19:06:17 +00:00
Erik Eckstein	8c38b8b873	Add a lock() function in PassRegistry to speed up multi-thread synchronization. When calling lock() after all passes are registered, the PassRegistry doesn't need a mutex anymore to look up passes. This speeds up multithreaded llvm execution by ~5% (tested with 4 threads). In an asserts build of llvm this has an even bigger impact. Note that it's not required to use the lock function. llvm-svn: 231276	2015-03-04 18:57:11 +00:00
David Blaikie	1c47111a85	Recommit r231221: "Devirtualize ~parser<T> by making it protected in base classes and making derived classes final" Reverted in r231254 due to a self-hosting crash of Clang (see Clang PR22793). Workaround the crash by using {} instead of = default to define a dtor. llvm-svn: 231274	2015-03-04 18:52:32 +00:00
Rafael Espindola	f3f185486c	Bring r231132 back with a fix. The issue was that we were always printing the remarks. Fix that and add a test showing that it prints nothing if -pass-remarks is not given. Original message: Correctly handle -pass-remarks in the gold plugin. llvm-svn: 231273	2015-03-04 18:51:45 +00:00
Mehdi Amini	46a43556db	Make DataLayout Non-Optional in the Module Summary: DataLayout keeps the string used for its creation. As a side effect it is no longer needed in the Module. This is "almost" NFC, the string is no longer canonicalized, you can't rely on two "equals" DataLayout having the same string returned by getStringRepresentation(). Get rid of DataLayoutPass: the DataLayout is in the Module The DataLayout is "per-module", let's enforce this by not duplicating it more than necessary. One more step toward non-optionality of the DataLayout in the module. Make DataLayout Non-Optional in the Module Module->getDataLayout() will never returns nullptr anymore. Reviewers: echristo Subscribers: resistor, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D7992 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231270	2015-03-04 18:43:29 +00:00
Reid Kleckner	2ae03e1783	Revert "unique_ptrify ValID::ConstantStructElts" This reverts r231200 and r231204. The second one added an explicit move ctor for MSVC. This change broke the clang-cl self-host due to weirdness in MSVC's implementation of std::map::insert. Somehow we lost our rvalue ref-ness when going through variadic placement new: template <class _Objty, class... _Types> void construct(_Objty _Ptr, _Types &&... _Args) { // construct _Objty(_Types...) at _Ptr ::new ((void )_Ptr) _Objty(_STD forward<_Types>(_Args)...); } For some reason, Clang decided to call the deleted std::pair copy constructor at this point. Needs further investigation, once I can build. llvm-svn: 231269	2015-03-04 18:31:10 +00:00
Wei Mi	4d9347993b	Revert the test commit. llvm-svn: 231264	2015-03-04 17:44:22 +00:00
Wei Mi	20401eecd6	Test commit. It will be reverted in the next commit. llvm-svn: 231262	2015-03-04 17:41:17 +00:00
Adrian Prantl	afdac4b7f0	Update the out-of-date dwarf expressions in these testcases. llvm-svn: 231261	2015-03-04 17:39:59 +00:00
Adrian Prantl	0f61579602	Fix DwarfExpression::AddMachineRegExpression so it doesn't read past the end of an expression that ends with DW_OP_plus. Caught by the ASAN build bots. llvm-svn: 231260	2015-03-04 17:39:33 +00:00
Marek Olsak	d2af89df10	R600/SI: Add an intrinsic for S_FLBIT_I32 / V_FFBH_I32 Required by OpenGL (ARB_gpu_shader5). llvm-svn: 231259	2015-03-04 17:33:45 +00:00
Nemanja Ivanovic	d384cd9907	Test commit. Removed an unnecessary space llvm-svn: 231257	2015-03-04 17:09:12 +00:00
David Blaikie	0bc56dacba	Explicitly default ilistTest::Node's copy constructor In the presence of a user-declared dtor, calling an implicit copy ctor is deprecated in C++11. llvm-svn: 231256	2015-03-04 17:01:18 +00:00
NAKAMURA Takumi	2f99a77487	Revert r231221, "Devirtualize ~parser<T> by making it protected in base classes and making derived classes final" It broke seflhosting. llvm-svn: 231254	2015-03-04 16:24:40 +00:00
NAKAMURA Takumi	84a9697c17	Revert r231132, "Correctly handle -pass-remarks in the gold plugin.", for now, to suppress log floodng in LTO. llvm-svn: 231253	2015-03-04 16:24:28 +00:00
JF Bastien	f14889ee34	Mutate TargetLowering::shouldExpandAtomicRMWInIR to specifically dictate how AtomicRMWInsts are expanded. Summary: In PNaCl, most atomic instructions have their own @llvm.nacl.atomic.* function, each one, with a few exceptions, represents a consistent behaviour across all NaCl-supported targets. Unfortunately, the atomic RMW operations nand, [u]min, and [u]max aren't directly represented by any such @llvm.nacl.atomic.* function. This patch refines shouldExpandAtomicRMWInIR in TargetLowering so that a future `Le32TargetLowering` class can selectively inform the caller how the target desires the atomic RMW instruction to be expanded (ie via load-linked/store-conditional for ARM/AArch64, via cmpxchg for X86/others?, or not at all for Mips) if at all. This does not represent a behavioural change and as such no tests were added. Patch by: Richard Diamond. Reviewers: jfb Reviewed By: jfb Subscribers: jfb, aemerson, t.p.northover, llvm-commits Differential Revision: http://reviews.llvm.org/D7713 llvm-svn: 231250	2015-03-04 15:47:57 +00:00
Jozef Kolek	c925808ee5	[mips][microMIPS] Make usage of ADDU16 and SUBU16 by code generator Differential Revision: http://reviews.llvm.org/D7609 llvm-svn: 231249	2015-03-04 15:47:42 +00:00
Bill Schmidt	d90aff2c4f	[PowerPC] Remove unnecessary and incomplete commentary This "itinerary class map" in PPCSchedule.td is incomplete and redundant with the actual code. As it provides no value, we've decided to remove it. No functional change. llvm-svn: 231246	2015-03-04 14:56:05 +00:00
Andrea Di Biagio	df93ccf49a	[X86][FastISel] Simplify the logic in method X86SelectSIToFP. The target-independent selection algorithm in FastISel already knows how to select a SINT_TO_FP if the target is SSE but not AVX. On targets that have SSE but not AVX, the tablegen'd 'fastEmit' functions for ISD::SINT_TO_FP know how to select instruction X86::CVTSI2SSrr (for an i32 to f32 conversion) and X86::CVTSI2SDrr (for an i32 to f64 conversion). This patch simplifies the logic in method X86SelectSIToFP knowing that the code would not be reachable if the subtarget doesn't have AVX. No functional change intended. llvm-svn: 231243	2015-03-04 14:23:25 +00:00
Dmitry Vyukov	b37b95ed3e	asan: do not instrument direct inbounds accesses to stack variables Do not instrument direct accesses to stack variables that can be proven to be inbounds, e.g. accesses to fields of structs on stack. But it eliminates 33% of instrumentation on webrtc/modules_unittests (number of memory accesses goes down from 290152 to 193998) and reduces binary size by 15% (from 74M to 64M) and improved compilation time by 6-12%. The optimization is guarded by asan-opt-stack flag that is off by default. http://reviews.llvm.org/D7583 llvm-svn: 231241	2015-03-04 13:27:53 +00:00
Toma Tabacu	e1e3ffe71d	[mips] Rename the LA/LI/DLI TableGen definitions and classes. NFC. Summary: Use more reasonable names for these pseudo-instructions. As there's only one definition tied to any one of these classes, I named them with abbreviated versions of their respective class' name. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7831 llvm-svn: 231240	2015-03-04 13:01:14 +00:00
Vasileios Kalintiris	8761490d2e	[mips] Keep the parameter list of Filler::searchRange() consistent. NFC. Summary: Move the "Filler" parameter to the end of the parameter list as it is, conceptually, the only output parameter of that function. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7726 llvm-svn: 231239	2015-03-04 12:37:58 +00:00
Chandler Carruth	9a53fbe243	[MBP] Fix a really horrible bug in MachineBlockPlacement, but behind a flag for now. First off, thanks to Daniel Jasper for really pointing out the issue here. It's been here forever (at least, I think it was there when I first wrote this code) without getting really noticed or fixed. The key problem is what happens when two reasonably common patterns happen at the same time: we outline multiple cold regions of code, and those regions in turn have diamonds or other CFGs for which we can't just topologically lay them out. Consider some C code that looks like: if (a1()) { if (b1()) c1(); else d1(); f1(); } if (a2()) { if (b2()) c2(); else d2(); f2(); } done(); Now consider the case where a1() and a2() are unlikely to be true. In that case, we might lay out the first part of the function like: a1, a2, done; And then we will be out of successors in which to build the chain. We go to find the best block to continue the chain with, which is perfectly reasonable here, and find "b1" let's say. Laying out successors gets us to: a1, a2, done; b1, c1; At this point, we will refuse to lay out the successor to c1 (f1) because there are still un-placed predecessors of f1 and we want to try to preserve the CFG structure. So we go get the next best block, d1. ... wait for it ... Except that the next best block isn't d1. It is b2! d1 is waaay down inside these conditionals. It is much less important than b2. Except that this is exactly what we didn't want. If we keep going we get the entire set of the rest of the CFG interleaved!!! a1, a2, done; b1, c1; b2, c2; d1, f1; d2, f2; So we clearly need a better strategy here. =] My current favorite strategy is to actually try to place the block whose predecessor is closest. This very simply ensures that we unwind these kinds of CFGs the way that is natural and fitting, and should minimize the number of cache lines instructions are spread across. It also happens to be dead simple. It's like the datastructure was specifically set up for this use case or something. We only push blocks onto the work list when the last predecessor for them is placed into the chain. So the back of the worklist is the nearest next block. Unfortunately, a change like this is going to cause soooo many benchmarks to swing wildly. So for now I'm adding this under a flag so that we and others can validate that this is fixing the problems described, that it seems possible to enable, and hopefully that it fixes more of our problems long term. llvm-svn: 231238	2015-03-04 12:18:08 +00:00
Vasileios Kalintiris	2ef2888273	[mips] Specify the correct value type when combining a CMovFP node. This commit fixes a bug introduced in r230956 where we were creating CMovFP_{T,F} nodes with multiple return value types (one for each operand). With this change the return value type of the new node is the same as the value type of the True/False operands of the original node. llvm-svn: 231237	2015-03-04 12:10:18 +00:00
Daniel Jasper	471e856f49	Add a flag to experiment with outlining optional branches. In a CFG with the edges A->B->C and A->C, B is an optional branch. LLVM's default behavior is to lay the blocks out naturally, i.e. A, B, C, in order to improve code locality and fallthroughs. However, if a function contains many of those optional branches only a few of which are taken, this leads to a lot of unnecessary icache misses. Moving B out of line can work around this. Review: http://reviews.llvm.org/D7719 llvm-svn: 231230	2015-03-04 11:05:34 +00:00
Kristof Beyls	aea8461820	Fix PR22408 - LLVM producing AArch64 TLS relocations that GNU linkers cannot handle yet. As is described at http://llvm.org/bugs/show_bug.cgi?id=22408, the GNU linkers ld.bfd and ld.gold currently only support a subset of the whole range of AArch64 ELF TLS relocations. Furthermore, they assume that some of the code sequences to access thread-local variables are produced in a very specific sequence. When the sequence is not as the linker expects, it can silently mis-relaxe/mis-optimize the instructions. Even if that wouldn't be the case, it's good to produce the exact sequence, as that ensures that linkers can perform optimizing relaxations. This patch: * implements support for 16MiB TLS area size instead of 4GiB TLS area size. Ideally clang would grow an -mtls-size option to allow support for both, but that's not part of this patch. * by default doesn't produce local dynamic access patterns, as even modern ld.bfd and ld.gold linkers do not support the associated relocations. An option (-aarch64-elf-ldtls-generation) is added to enable generation of local dynamic code sequence, but is off by default. * makes sure that the exact expected code sequence for local dynamic and general dynamic accesses is produced, by making use of a new pseudo instruction. The patch also removes two (AArch64ISD::TLSDESC_BLR, AArch64ISD::TLSDESC_CALL) pre-existing AArch64-specific pseudo SDNode instructions that are superseded by the new one (TLSDESC_CALLSEQ). llvm-svn: 231227	2015-03-04 09:12:08 +00:00
Craig Topper	483a3000f5	[Tablegen] Use correct result number variables with the pattern nodes they go with when handling SDTCisSameAs. No functional change as they are always both 0 unless you try to define a multi result type profile that uses SDTCisSame on one of the other results. llvm-svn: 231226	2015-03-04 09:04:54 +00:00
David Blaikie	54a7ca3fcb	Explicitly default DenseMapTest::CtorTest::operator= Using the implicit default copy assignment operator in the presence of a user-declared copy ctor is deprecated in C++11. llvm-svn: 231225	2015-03-04 07:57:45 +00:00
David Blaikie	a985e1ebda	Remove explicit RNSuccIterator copy assignment in favor of implicit default Asserting that the source and destination iterators are from the same region is unnecessary - there's no reason to disallow reassignment from any regions, so long as they aren't compared. llvm-svn: 231224	2015-03-04 07:51:50 +00:00
David Blaikie	2d5f42928c	use = default instead of {} llvm-svn: 231223	2015-03-04 07:35:04 +00:00
David Blaikie	4e15257656	Make format_object_base explicitly copyable, so format_objects can be copied without relying on the implicit copy ctor Use of the implicit copy ctor is deprecated in C++11 in the presence of a user declared dtor. llvm-svn: 231222	2015-03-04 07:35:02 +00:00
David Blaikie	4d7eb730e4	Devirtualize ~parser<T> by making it protected in base classes and making derived classes final These objects are never owned/destroyed polymorphically, so there's no need for a virtual dtor. llvm-svn: 231221	2015-03-04 07:29:01 +00:00
David Blaikie	c08b2669a4	Avoid copying parser objects Use of their copy members is deprecated since they have a user-declared dtor. llvm-svn: 231220	2015-03-04 07:28:59 +00:00
Michael Kuperstein	fb95697c88	[DAGCombine] Fix a bug in a BUILD_VECTOR combine When trying to convert a BUILD_VECTOR into a shuffle, we try to split a single source vector that is twice as wide as the destination vector. We can not do this when we also need the zero vector to create a blend. This fixes PR22774. Differential Revision: http://reviews.llvm.org/D8040 llvm-svn: 231219	2015-03-04 07:27:39 +00:00
David Blaikie	43fbf68c0e	Make OptionValue explicitly copyable Since OptionValue (& its base classes) have user-declared dtors, use of the implicit copy ctor/assignment operator is deprecated in C++11. Provide them explicitly (defaulted) to avoid depending on this deprecated feature. llvm-svn: 231218	2015-03-04 07:09:53 +00:00
David Blaikie	4c154c6b91	Devirtualize OptionValue::~OptionValue in favor of protected in the base, with final derived classes These objects are never polymorphically owned, so there's no need for virtual dtors - just make the dtor protected in the base classes, and make the derived classes final. llvm-svn: 231217	2015-03-04 06:57:14 +00:00
Davide Italiano	fcae934c03	[MC][Target] Implement support for R_X86_64_SIZE{32,64}. Differential Revision: D7990 Reviewed by: rafael, majnemer llvm-svn: 231216	2015-03-04 06:49:39 +00:00
Zachary Turner	653236596a	[llvm-pdbdump] Display full enum definitions. This will now display enum definitions both at the global scope as well as nested inside of classes. Additionally, it will no longer display enums at the global scope if the enum is nested. Instead, it will omit the definition of the enum globally and instead emit it in the corresponding class definition. llvm-svn: 231215	2015-03-04 06:09:53 +00:00
Chaoren Lin	9074634002	Revert "[ADT] fail-fast iterators for DenseMap" This reverts commit 4b7263d855006988854036b4a4891fcf19aebe65. r231125 http://reviews.llvm.org/D7931 This was causing many LLDB tests to fail on OS X, Linux, and FreeBSD. https://bpaste.net/show/6a23e1f53623 llvm-svn: 231214	2015-03-04 06:05:37 +00:00
Frederic Riss	9412d63f68	Move emitDIE and emitAbbrevs to AsmPrinter. NFC. (They are called emitDwarfDIE and emitDwarfAbbrevs in their new home) llvm-dsymutil wants to reuse that code, but it doesn't have a DwarfUnit or a DwarfDebug object to call those. It has access to an AsmPrinter though. Having emitDIE in the AsmPrinter also removes the DwarfFile dependency on DwarfDebug, and thus the patch drops that field. Differential Revision: http://reviews.llvm.org/D8024 llvm-svn: 231210	2015-03-04 02:30:17 +00:00
Frederic Riss	cd04434cd5	Constify AsmPrinter passed to DIE methods. llvm-svn: 231209	2015-03-04 02:30:08 +00:00
Filipe Cabecinhas	0524acc727	Fix the test for r231201. We don't crash anymore. llvm-svn: 231207	2015-03-04 02:09:40 +00:00
David Blaikie	f22e370733	Workaround MSVC not providing implicit move members llvm-svn: 231204	2015-03-04 02:07:51 +00:00
Rui Ueyama	bd68504bf6	Object: Add range iterators to Archive symbols Also define operator* for symbol iterator just like Archive children iterator. llvm-svn: 231203	2015-03-04 02:05:06 +00:00
Mehdi Amini	367bfa42d8	Use report_fatal_error instead of unreachable for -fast-isel-abort Suggestion by Andrea Di Biagio From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 231201	2015-03-04 01:48:39 +00:00
David Blaikie	0afee85176	unique_ptrify ValID::ConstantStructElts llvm-svn: 231200	2015-03-04 01:41:01 +00:00
David Blaikie	b9cc659efe	LLParser: Avoid copying ValIDs, the copy ctor is deprecated in C++11 due to the presence of a user-declared dtor llvm-svn: 231199	2015-03-04 01:40:07 +00:00

... 2 3 4 5 6 ...

114639 Commits