llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Bieneman	1c89280916	[CMake] LLVM_PROFDATA_FILE only works if you're using clang, so we should error out if it is specified when not using clang. Also updated the CMake docs. Based on post-commit review of r250108 from Sean Silvas. llvm-svn: 250150	2015-10-13 05:35:12 +00:00
Craig Topper	24b56a62bb	[X86] Mark the AAD and AAM aliases as not valid in 64-bit mode. llvm-svn: 250148	2015-10-13 05:12:07 +00:00
Craig Topper	4f76372afc	[X86] Change all the i8imm operands in XOP instructions to u8imm so the parser will check the size. llvm-svn: 250147	2015-10-13 05:06:25 +00:00
Manman Ren	9f824dab1d	Revert 250089 due to bot failure. It failed when building clang itself with PGO. llvm-svn: 250145	2015-10-13 03:38:02 +00:00
Duncan P. N. Exon Smith	584af871cc	BitcodeWriter: Stop using implicit ilist iterator conversion, NFC Now LLVMBitWriter compiles without implicit ilist iterator conversions. In these cases, the cleanest thing was to switch to range-based for loops. Since there wasn't much noise I converted sub-loops and parent loops as a drive-by. llvm-svn: 250144	2015-10-13 03:26:19 +00:00
Sanjoy Das	1ed6910338	[SCEV] Put some utilities in the ScalarEvolution class In a later commit, `SplitBinaryAdd` will be used outside `IsConstDiff`, so lift that out. And lift out `IsConstDiff` as `computeConstantDifference` to keep things clean and to avoid playing C++ access specifier games. NFC. llvm-svn: 250143	2015-10-13 02:53:27 +00:00
Duncan P. N. Exon Smith	5b4c837c58	TransformUtils: Remove implicit ilist iterator conversions, NFC Continuing the work from last week to remove implicit ilist iterator conversions. First related commit was probably r249767, with some more motivation in r249925. This edition gets LLVMTransformUtils compiling without the implicit conversions. No functional change intended. llvm-svn: 250142	2015-10-13 02:39:05 +00:00
Kevin Enderby	19e291aac0	Looks like malformed-machos 00000031.a test is just getting a different error on some of the bots. I’ll remove this test for now. llvm-svn: 250141	2015-10-13 01:27:28 +00:00
Matt Arsenault	e5d9515fb7	DAGCombiner: Don't stop finding better chain on 2 aliases The comment says this was stopped because it was unlikely to be profitable. This is not true if you want to combine vector loads with multiple components. For a simple case that looks like t0 = load t0 ... t1 = load t0 ... t2 = load t0 ... t3 = load t0 ... t4 = store t0:1, t0:1 t5 = store t4, t1:0 t6 = store t5, t2:0 t7 = store t6, t3:0 We want to get all of these stores onto a chain that is a TokenFactor of these N loads. This mostly solves the AMDGPU merge-stores.ll regressions with -combiner-alias-analysis for merging vector stores of vector loads. llvm-svn: 250138	2015-10-13 00:49:00 +00:00
JF Bastien	986ed68eed	x86: preserve flags when folding atomic operations Summary: D4796 taught LLVM to fold some atomic integer operations into a single instruction. The pattern was unaware that the instructions clobbered flags. This patch adds the missing EFLAGS definition. Floating point operations don't set flags, the subsequent fadd optimization is therefore correct. The same applies for surrounding load/store optimizations. Reviewers: rsmith, rtrieu Subscribers: llvm-commits, reames, morisset Differential Revision: http://reviews.llvm.org/D13680 llvm-svn: 250135	2015-10-13 00:28:47 +00:00
Matt Arsenault	f0d9e47da2	AMDGPU: Refactor isVGPRToSGPRCopy It should now correctly handle physical registers and make it easier to identify the other direction. llvm-svn: 250132	2015-10-13 00:07:54 +00:00
Kevin Enderby	3c4927b723	Remove the correct unstable malformed-machos test mem-crup-0261.macho and restore the malformed-machos 00000031.a test. Hopefully this will get all the build bots happy again. I’ll again keep an eye on them. llvm-svn: 250130	2015-10-13 00:05:17 +00:00
Matt Arsenault	61dc235f20	DAGCombiner: Combine extract_vector_elt from build_vector This basic combine was surprisingly missing. AMDGPU legalizes many operations in terms of 32-bit vector components, so not doing this results in many extra copies and subregister extracts that need to be cleaned up later. InstCombine already does this for the hasOneUse case. The target hook is to fix a handful of tests which break (e.g. ARM/vmov.ll) which turn from a vector materialize repeated immediate instruction to a constant vector load with more scalar copies from it. llvm-svn: 250129	2015-10-12 23:59:50 +00:00
Simon Pilgrim	aa0ec7f45c	[InstCombine] Tidied up SSE4A tests. First stage of bugfix discussed in D13348 llvm-svn: 250121	2015-10-12 23:07:06 +00:00
Kevin Enderby	0b3bfd15fe	Temporarily remove the test added in r250117 while I investigate why two of the build bots get a different error on that malformed file. llvm-svn: 250120	2015-10-12 23:03:43 +00:00
Cong Hou	bf22f5063a	Assign correct edge weights to unwind destinations when lowering invoke statement. When lowering invoke statement, all unwind destinations are directly added as successors of call site block, and the weights of those new edges are not assigned properly. Actually, the default weight 16 is used for those edges. This patch calculates the proper edge weights for those edges when collecting all unwind destinations. Differential revision: http://reviews.llvm.org/D13354 llvm-svn: 250119	2015-10-12 23:02:58 +00:00
Simon Pilgrim	c8832fc233	[SelectionDAG] Add common vector constant folding helper function We have a number of functions that implement constant folding of vectors (unary and binary ops) in near identical manners (and the differences don't appear to be critical). This patch introduces a common implementation (SelectionDAG::FoldConstantVectorArithmetic) and calls this in both the unary and binary op cases. After this initial patch I intend to begin enabling vector constant folding for a wider number of opcodes in SelectionDAG::getNode(). Differential Revision: http://reviews.llvm.org/D13665 llvm-svn: 250118	2015-10-12 23:00:11 +00:00
Kevin Enderby	903955451e	Fixed bugs in llvm-objdump while parsing Mach-O files from malformed archives that caused aborts. This was because the characters of the ‘Size’ field in the archive header did not contain decimal characters. rdar://22983603 llvm-svn: 250117	2015-10-12 22:04:54 +00:00
Chris Bieneman	9ad0380b85	[CMake] Adding support for passing in profiling data. Adds LLVM_PROFDATA_FILE option to allow specifying a profile data file to be used during compilation of LLVM and subprojects. llvm-svn: 250108	2015-10-12 21:13:20 +00:00
Cong Hou	3320bcd815	Update the branch weight metadata in JumpThreading pass. In JumpThreading pass, the branch weight metadata is not updated after CFG modification. Consider the jump threading on PredBB, BB, and SuccBB. After jump threading, the weight on BB->SuccBB should be adjusted as some of it is contributed by the edge PredBB->BB, which doesn't exist anymore. This patch tries to update the edge weight in metadata on BB->SuccBB by scaling it by 1 - Freq(PredBB->BB) / Freq(BB->SuccBB). Differential revision: http://reviews.llvm.org/D10979 llvm-svn: 250089	2015-10-12 19:44:08 +00:00
Reid Kleckner	4a5f35c0ae	Make Win64 localescape offsets FP relative instead of SP relative We made them SP relative back in March (r233137) because that's the value the runtime passes to EH functions. With the new cleanuppad IR, funclets adjust their frame argument from SP to FP, so our offsets should now be FP-relative. llvm-svn: 250088	2015-10-12 19:43:34 +00:00
Hemant Kulkarni	80f82fb2d4	[llvm-symbolizer] Add -print-address option Differential Revision: http://reviews.llvm.org/D13518 llvm-svn: 250086	2015-10-12 19:26:44 +00:00
Andrea Di Biagio	b0fe4eb199	[x86] Fix wrong lowering of vsetcc nodes (PR25080). Function LowerVSETCC (in X86ISelLowering.cpp) worked under the wrong assumption that for non-AVX512 targets, the source type and destination type of a type-legalized setcc node were always the same type. This assumption was unfortunately incorrect; the type legalizer is not always able to promote the return type of a setcc to the same type as the first operand of a setcc. In the case of a vsetcc node, the legalizer firstly checks if the first input operand has a legal type. If so, then it promotes the return type of the vsetcc to that same type. Otherwise, the return type is promoted to the 'next legal type', which, for vectors of MVT::i1 is always a 128-bit integer vector type. Example (-mattr=+avx): %0 = trunc <8 x i32> %a to <8 x i23> %1 = icmp eq <8 x i23> %0, zeroinitializer The initial selection dag for the code above is: v8i1 = setcc t5, t7, seteq:ch t5: v8i23 = truncate t2 t2: v8i32,ch = CopyFromReg t0, Register:v8i32 %vreg1 t7: v8i32 = build_vector of all zeroes. The type legalizer would firstly check if 't5' has a legal type. If so, then it would reuse that same type to promote the return type of the setcc node. Unfortunately 't5' is of illegal type v8i23, and therefore it cannot be used to promote the return type of the setcc node. Consequently, the setcc return type is promoted to v8i16. Later on, 't5' is promoted to v8i32 thus leading to the following dag node: v8i16 = setcc t32, t25, seteq:ch where t32 and t25 are now values of type v8i32. Before this patch, function LowerVSETCC would have wrongly expanded the setcc to a single X86ISD::PCMPEQ. Surprisingly, ISel was still able to match an instruction. In our case, ISel would have matched a VPCMPEQWrr: t37: v8i16 = X86ISD::VPCMPEQWrr t36, t25 However, t36 and t25 are both VR256, while the result type is instead of class VR128. This inconsistency ended up causing the insertion of COPY instructions like this: %vreg7<def> = COPY %vreg3; VR128:%vreg7 VR256:%vreg3 Which is an invalid full copy (not a sub register copy). Eventually, the backend would have hit an UNREACHABLE "Cannot emit physreg copy instruction" in the attempt to expand the malformed pseudo COPY instructions. This patch fixes the problem adding the missing logic in LowerVSETCC to handle the corner case of a setcc with 128-bit return type and 256-bit operand type. This problem was originally reported by Dimitry as PR25080. It has been latent for a very long time. I have added the minimal reproducible from that bugzilla as test setcc-lowering.ll. Differential Revision: http://reviews.llvm.org/D13660 llvm-svn: 250085	2015-10-12 19:22:30 +00:00
Cong Hou	61e13de408	Add - and -= operators to BlockFrequency using saturating arithmetic. llvm-svn: 250077	2015-10-12 18:34:00 +00:00
Kostya Serebryany	928eb33a9e	[libFuzzer] mention more trophies and improve the link formatting llvm-svn: 250076	2015-10-12 18:15:42 +00:00
Sanjay Patel	0dc91b3143	combine predicates; NFCI llvm-svn: 250075	2015-10-12 18:15:08 +00:00
Cong Hou	90c6cf8e7d	Turn const/const& into value type for BlockFrequency in functions of this class. Also fix a naming issue. NFC. llvm-svn: 250074	2015-10-12 18:14:15 +00:00
Colin LeMahieu	e901616bf6	[llvm-symbolizer] Reverting r250067 llvm-svn: 250072	2015-10-12 17:57:02 +00:00
Matt Arsenault	8c0ef8b36d	AMDGPU: Register some more passes so -print-before works llvm-svn: 250071	2015-10-12 17:43:59 +00:00
Matt Arsenault	07a72bad0b	Enable verifier after PeepholeOptimizer No tests fail with this enabled so I assume it was an accident that it isn't enabled now. llvm-svn: 250070	2015-10-12 17:43:56 +00:00
Reid Kleckner	9abb3c06a6	Don't call PrepareEHLandingPad on non EH pads This was a minor bug in r249492. Calling PrepareEHLandingPad on a non-landingpad was a no-op, but it attempted to get the generic pointer register class, which apparently doesn't exist for some targets. llvm-svn: 250068	2015-10-12 17:42:32 +00:00
Hemant Kulkarni	c07c7eddad	[llvm-symbolizer] Add -print-address option Differential Revision: http://reviews.llvm.org/D13518 llvm-svn: 250067	2015-10-12 17:31:22 +00:00
David Majnemer	99c1d13e52	[WinEH] Remove CatchObjRecoverIdx CatchObjRecoverIdx was used for the old scheme, it is no longer relevant. llvm-svn: 250065	2015-10-12 16:44:22 +00:00
Sanjay Patel	b814ef1ad6	fix typos; NFC llvm-svn: 250059	2015-10-12 16:09:59 +00:00
Zoran Jovanovic	2e386d3d07	[mips][micromips] Initial support for micromips DSP instructions and addu.qb implementation Differential Revision: http://reviews.llvm.org/D12798 llvm-svn: 250058	2015-10-12 16:07:25 +00:00
Oliver Stannard	cca893ffac	[Debug] Look through bitcasts to find argument registers On targets where f32 is not legal, we have to look through a BITCAST SDNode to find the register that an argument is stored in when emitting debug info, or we will not be able to emit a DW_AT_location for it. Differential Revision: http://reviews.llvm.org/D13005 llvm-svn: 250056	2015-10-12 15:52:36 +00:00
Vasileios Kalintiris	2a95f82859	[mips][FastISel] Clang-format switch statement. NFC. llvm-svn: 250053	2015-10-12 15:39:41 +00:00
Jun Bum Lim	54f3ddfbe2	[AArch64]Fix bug in function names in test case Functions in this test case need to be renamed as their names are the same as the instructions we are comparing with. llvm-svn: 250052	2015-10-12 15:34:52 +00:00
Sanjay Patel	53d1d8b731	fix capitalization; NFC llvm-svn: 250049	2015-10-12 15:24:01 +00:00
Greg Bedwell	7f68a71669	Fix rename() sometimes failing if another process uses openFileForRead() On Windows, fs::rename() could fail if another process was reading the file at the same time using fs::openFileForRead(). In most cases the user wouldn't notice as fs::rename() will continue to retry for 2000ms. Typically this is enough for the read to complete and a retry to succeed, but if the disk is being hit too hard then the response time might be longer than the retry time and the rename would fail with a permission error. Add FILE_SHARE_DELETE to the sharing flags for CreateFileW() in fs::openFileForRead() and try ReplaceFileW() prior to MoveFileExW() in fs::rename(). Based on an initial patch by Edd Dawson! Differential Revision: http://reviews.llvm.org/D13647 llvm-svn: 250046	2015-10-12 15:11:47 +00:00
Daniel Sanders	b1ef88c172	[mips][ias] Implement macro expansion when bcc has an immediate where a register belongs. Summary: Fixes PR24915. Reviewers: vkalintiris Subscribers: emaste, seanbruno, llvm-commits Differential Revision: http://reviews.llvm.org/D13533 llvm-svn: 250042	2015-10-12 14:24:05 +00:00
Daniel Sanders	332cef6c5f	[mips] Whitespace cleanup in MIPS16 tests to reduce noise in following changes. NFC. Mostly tabs -> spaces and double spacing. llvm-svn: 250041	2015-10-12 14:16:52 +00:00
Daniel Sanders	2a5ce1ace0	[mips] Clean up most macro expansions to use the emit*() functions. Reviewers: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13591 llvm-svn: 250040	2015-10-12 14:09:12 +00:00
Daniel Sanders	2fb8564d99	[mips] Handle undef when extracting subregs from FP64 registers. Summary: This removes unnecessary instructions when extracting from an undefined register and also fixes a crash for O32 when passing undef to a double argument held in integer registers. Reviewers: vkalintiris Subscribers: llvm-commits, zoran.jovanovic, petarj Differential Revision: http://reviews.llvm.org/D13467 llvm-svn: 250039	2015-10-12 13:55:44 +00:00
Oliver Stannard	939724cd02	GlobalOpt does not treat externally_initialized globals correctly GlobalOpt currently merges stores into the initialisers of internal, externally_initialized globals, but should not do so as the value of the global may change between the initialiser and any code in the module being run. llvm-svn: 250035	2015-10-12 13:20:52 +00:00
James Molloy	fa4e994a7a	[ARM] Mark Swift MISched model as incomplete The Swift Machine Scheduler Model is incomplete. There are instructions missing which can trigger the "incomplete machine model" abort. This was observed when a downstream SchedMachineModel was added to the ARM target. Patch by Christof Douma! llvm-svn: 250033	2015-10-12 12:49:59 +00:00
James Molloy	55d633bd60	[LoopVectorize] Shrink integer operations into the smallest type possible C semantics force sub-int-sized values (e.g. i8, i16) to be promoted to int type (e.g. i32) whenever arithmetic is performed on them. For targets with native i8 or i16 operations, usually InstCombine can shrink the arithmetic type down again. However InstCombine refuses to create illegal types, so for targets without i8 or i16 registers, the lengthening and shrinking remains. Most SIMD ISAs (e.g. NEON) however support vectors of i8 or i16 even when their scalar equivalents do not, so during vectorization it is important to remove these lengthens and truncates when deciding the profitability of vectorization. The algorithm this uses starts at truncs and icmps, trawling their use-def chains until they terminate or instructions outside the loop are found (or unsafe instructions like inttoptr casts are found). If the use-def chains starting from different root instructions (truncs/icmps) meet, they are unioned. The demanded bits of each node in the graph are ORed together to form an overall mask of the demanded bits in the entire graph. The minimum bitwidth that graph can be truncated to is the bitwidth minus the number of leading zeroes in the overall mask. The intention is that this algorithm should "first do no harm", so it will never insert extra cast instructions. This is why the use-def graphs are unioned, so that subgraphs with different minimum bitwidths do not need casts inserted between them. This algorithm works hard to reduce compile time impact. DemandedBits are only queried if there are extends of illegal types and if a truncate to an illegal type is seen. In the general case, this results in a simple linear scan of the instructions in the loop. No non-noise compile time impact was seen on a clang bootstrap build. llvm-svn: 250032	2015-10-12 12:34:45 +00:00
Amjad Aboud	1db6d7af46	[X86] Add XSAVE intrinsic family Add intrinsics for the XSAVE instructions (XSAVE/XSAVE64/XRSTOR/XRSTOR64) XSAVEOPT instructions (XSAVEOPT/XSAVEOPT64) XSAVEC instructions (XSAVEC/XSAVEC64) XSAVES instructions (XSAVES/XSAVES64/XRSTORS/XRSTORS64) Differential Revision: http://reviews.llvm.org/D13012 llvm-svn: 250029	2015-10-12 11:47:46 +00:00
Andrea Di Biagio	a0922ed8fe	[x86] PR24562: fix incorrect folding of PSHUFB nodes with a mask where all indices have the most significant bit set. This patch fixes a problem in function 'combineX86ShuffleChain' that causes a chain of shuffles to be wrongly folded away when the combined shuffle mask has only one element. We may end up with a combined shuffle mask of one element as a result of multiple calls to function 'canWidenShuffleElements()'. Function canWidenShuffleElements attempts to simplify a shuffle mask by widening the size of the elements being shuffled. For every pair of shuffle indices, function canWidenShuffleElements checks if indices refer to adjacent elements. If all pairs refer to "adjacent" elements then the shuffle mask is safely widened. As a consequence of widening, we end up with a new shuffle mask which is half the size of the original shuffle mask. The byte shuffle (pshufb) from test pr24562.ll has a mask of all SM_SentinelZero indices. Function canWidenShuffleElements would combine each pair of SM_SentinelZero indices into a single SM_SentinelZero index. So, in a logarithmic number of steps (4 in this case), the pshufb mask is simplified to a mask with only one index which is equal to SM_SentinelZero. Before this patch, function combineX86ShuffleChain wrongly assumed that a mask of size one is always equivalent to an identity mask. So, the entire shuffle chain was just folded away as the combined shuffle mask was treated as a no-op mask. With this patch we now check if the only element of a combined shuffle mask is SM_SentinelZero. In that case, we propagate a zero vector. Differential Revision: http://reviews.llvm.org/D13364 llvm-svn: 250027	2015-10-12 11:25:41 +00:00
Zlatko Buljan	d76b666a06	Test commit llvm-svn: 250026	2015-10-12 11:19:40 +00:00
Pawel Bylica	b29e96d29f	cmake: Avoid leading space in LLVM_DEFINITIONS. Summary: Unnecessary space at the beginning of LLVM_DEFINITIONS in cmake shared files can break projects that use the variable. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13432 llvm-svn: 250025	2015-10-12 10:42:03 +00:00
Jonas Paulsson	233b9ce8bf	[SystemZ] testcase MC/SystemZ/insn-good-z13.s extended. New instructions using floating point registers have been added, to check that AsmParser can deal with fp regs in vector instructions. This tests r249810. llvm-svn: 250023	2015-10-12 10:13:57 +00:00
James Molloy	6558edcaf5	[MISched] Python script to check coverage of misched info This script prints a CSV of all misched models of a target when given the output of the debug output of subtarget using: llvm-tblgen --gen-subtarget --debug-only=subtarget-emitter ... With thanks to Dave Estes for mentioning the idea at the 2014 LLVM Developers' Meeting. Patch by Christof Douma! llvm-svn: 250020	2015-10-12 08:50:47 +00:00
Tobias Grosser	374bce0c22	SCEV: Allow simple AddRec * Parameter products in delinearization This patch also allows the -delinearize pass to delinearize expressions that do not have an outermost SCEVAddRec expression. The SCEV::delinearize infrastructure allowed this since r240952, but the -delinearize pass was not updated yet. llvm-svn: 250018	2015-10-12 08:02:00 +00:00
Craig Topper	8d2e6bc25b	[X86] Use u8imm for the immediate type for all shift and rotate instructions. This way the assembler will perform range checking. Believe this matches gas behavior. llvm-svn: 250016	2015-10-12 06:23:10 +00:00
Craig Topper	d6b661dbf0	[X86] Add support to assembler and MCInst lowering to use the other vmovq %xmmX, %xmmX encoding if it would be a shorter VEX encoding. llvm-svn: 250014	2015-10-12 04:57:59 +00:00
Craig Topper	635e05df0a	[X86] Cleanup formatting a bit. NFC llvm-svn: 250013	2015-10-12 04:27:17 +00:00
Craig Topper	5be914eda1	[X86] Change the immediate for IN/OUT instructions to u8imm so the assembly parser will check the size. llvm-svn: 250012	2015-10-12 04:17:55 +00:00
Craig Topper	95fffba227	[X86] Add some instruction aliases to get the assembly parser table to favor arithmetic instructions with 8-bit immediates over the forms that implicitly use the ax/eax/rax. This allows us to remove the explicit code for working around the existing priority llvm-svn: 250011	2015-10-12 03:39:57 +00:00
Davide Italiano	5d7e8fdaeb	[llvm-rtdyld] General modernization/cleanup in preparation for (bigger) changes. llvm-svn: 250004	2015-10-12 00:57:29 +00:00
Davide Italiano	1e91a278b6	[Bugpoint] Get rid of dead code. No functional change. llvm-svn: 249999	2015-10-11 21:36:11 +00:00
Craig Topper	fcc34bdee0	[X86] Fix CMP and TEST with al/ax/eax/rax to not mark EFLAGS as a use or al/ax/eax/rax as a def. Probably doesn't have a functional effect since these aren't used in isel. llvm-svn: 249994	2015-10-11 19:54:02 +00:00
Simon Pilgrim	d45c88bbb5	[DAGCombiner] Improved FMA combine support for vectors Enabled constant canonicalization for all constants. Improved combining of constant vectors. llvm-svn: 249993	2015-10-11 19:48:12 +00:00
Simon Pilgrim	18a048e1cd	[X86] Completed SHL cost model tests As discussed in D8690. llvm-svn: 249990	2015-10-11 18:33:48 +00:00
Craig Topper	87990ee4ec	[X86] Remove special validation for INT immediate operand from AsmParser. Instead mark its operand type as u8imm which will cause it to fail to match. This is more consistent with other instruction behavior. This also fixes a bug where negative immediates below -128 were not being reported as errors. llvm-svn: 249989	2015-10-11 18:27:24 +00:00
Simon Pilgrim	3bcf5bb79e	[X86] Renamed SHL cost model tests Matches naming conventions for ASHR/LSHR cost tests As discussed in D8690. llvm-svn: 249984	2015-10-11 17:34:32 +00:00
Simon Pilgrim	acbf51ab60	[X86] Added LSHR cost model tests There are several dodgy costings due to AVX1 legalizing 256-bit integer vectors that need fixing. As discussed in D8690. llvm-svn: 249983	2015-10-11 17:29:26 +00:00
Simon Pilgrim	602b0e1f0b	[X86] Added ASHR cost model tests There are several dodgy costings due to AVX1 legalizing 256-bit integer vectors that need fixing. As discussed in D8690. llvm-svn: 249981	2015-10-11 17:08:05 +00:00
Craig Topper	5b0f57df1c	[TableGen] Add a space between type and '*' in front of a variable name in output file. While there replace type with 'auto' since there's a cast on the right side of the assignment. NFC llvm-svn: 249980	2015-10-11 16:59:29 +00:00
Craig Topper	a71630729d	[X86] Simplify immediate range checking code. llvm-svn: 249979	2015-10-11 16:38:14 +00:00
Simon Pilgrim	5eac2607b9	[DAGCombiner] Tidyup FMINNUM/FMAXNUM constant folding Enable constant folding for vector splats as well as scalars. Enable constant canonicalization for all scalar and vector constants. llvm-svn: 249978	2015-10-11 16:02:28 +00:00
Simon Pilgrim	1d1c56e2df	[InstCombine][X86][XOP] Combine XOP integer vector comparisons to native IR We now have lowering support for XOP PCOM/PCOMU instructions. llvm-svn: 249977	2015-10-11 14:38:34 +00:00
Simon Pilgrim	52d47e5704	[X86][XOP] Added support for the lowering of 128-bit vector integer comparisons to XOP PCOM/PCOMU instructions. The XOP vector integer comparisons can deal with all signed/unsigned comparison cases directly and can be easily commuted as well (D7646). llvm-svn: 249976	2015-10-11 14:15:17 +00:00
Nathan Slingerland	5e896ce2d1	[ProfileData] Test commit for slingn This is a test of the LLVM commit system. In the event of a real commit there would be some useful code changes. llvm-svn: 249972	2015-10-11 13:30:56 +00:00
Simon Pilgrim	bdbf839a3b	[X86][SSE] Vector signed/unsigned integer compare tests. llvm-svn: 249954	2015-10-10 22:21:05 +00:00
Craig Topper	55b1f29203	Change isUIntN/isIntN calls with constant N to use the template version. NFC llvm-svn: 249952	2015-10-10 20:17:07 +00:00
Craig Topper	798cc60ad9	In isUIntN, make sure N is less than 64 before using in a shift to avoid undefined behavior. Also change it to use the same formula as the template version which I think results in less math in compiled code. llvm-svn: 249951	2015-10-10 18:54:26 +00:00
Teresa Johnson	1493ad9c24	Fix PR25101 - Handle anonymous functions without VST entries Summary: The change to use the VST function entries for lazy deserialization did not handle the case of anonymous functions without aliases. In that case we must fall back to scanning the function blocks as there is no VST entry. Reviewers: dexonsmith, joker.eph, davidxl Subscribers: tstellarAMD, llvm-commits Differential Revision: http://reviews.llvm.org/D13596 llvm-svn: 249947	2015-10-10 14:18:36 +00:00
Jonas Paulsson	28fa48de32	[SystemZ] CodeGen/SystemZ/asm-18.ll run with -verify-machineinstrs Relates to the fixes of r249811. llvm-svn: 249946	2015-10-10 07:20:23 +00:00
Jonas Paulsson	63a2b6862e	[SystemZ] Fixes in the backend I/R. expandPostRAPseudo(): STX -> 2 * STD: The first STD should not have the kill flag set for the address. SystemZElimCompare: BRC -> BRCT conversion: Don't forget to remove the CC<use,kill> operand. Needed to make SystemZ/asm-17.ll pass with -verify-machineinstrs, which now runs with this flag. Reviewed by Ulrich Weigand. llvm-svn: 249945	2015-10-10 07:14:24 +00:00
Sanjoy Das	cc16ccc1ab	[IndVars] Use `auto`; NFC llvm-svn: 249944	2015-10-10 06:33:33 +00:00
Craig Topper	84008481e4	Use range-based for loops. NFC llvm-svn: 249943	2015-10-10 05:38:14 +00:00
Keno Fischer	2cd66e9270	[RuntimeDyld] Fix performance problem in resolveRelocations with many sections Summary: Rather than just iterating over all sections and checking whether we have relocations for them, iterate over the relocation map instead. This showed up heavily in an artificial julia benchmark that does lots of compilation. On that particular benchmark, this patch gives ~15% performance improvements. As far as I can tell the primary reason why the original loop was so expensive is that Relocations[i] actually constructs a relocationList (allocating memory & doing lots of other unnecessary computing) if none is found. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13545 llvm-svn: 249942	2015-10-10 05:37:02 +00:00
Craig Topper	7143d8001a	Use range-based for loops. NFC. llvm-svn: 249941	2015-10-10 05:25:06 +00:00
Craig Topper	7d5b23101c	Use emplace_back instead of a constructor call and push_back. NFC llvm-svn: 249940	2015-10-10 05:25:02 +00:00
NAKAMURA Takumi	2b0e1730a0	Suppress LLVM::tools/llvm-symbolizer/coff-dwarf.test for mingw, for now. FIXME: Improve llvm-symbolizer, or rename the feature "system-windows". llvm-svn: 249937	2015-10-10 02:57:02 +00:00
Kostya Serebryany	45dac2a3ac	[libFuzzer] document more trophies llvm-svn: 249933	2015-10-10 02:14:18 +00:00
Kevin Enderby	78ab58077f	Move llvm-objdump malformed Mach-O tests to X86 test directory. rdar://22983603 llvm-svn: 249927	2015-10-10 01:06:20 +00:00
Duncan P. N. Exon Smith	5a82c916b0	Analysis: Remove implicit ilist iterator conversions Remove implicit ilist iterator conversions from LLVMAnalysis. I came across something really scary in `llvm::isKnownNotFullPoison()` which relied on `Instruction::getNextNode()` being completely broken (not surprising, but scary nevertheless). This function is documented (and coded to) return `nullptr` when it gets to the sentinel, but with an `ilist_half_node` as a sentinel, the sentinel check looks into some other memory and we don't recognize we've hit the end. Rooting out these scary cases is the reason I'm removing the implicit conversions before doing anything else with `ilist`; I'm not at all surprised that clients rely on badness. I found another scary case -- this time, not relying on badness, just bad (but I guess getting lucky so far) -- in `ObjectSizeOffsetEvaluator::compute_()`. Here, we save out the insertion point, do some things, and then restore it. Previously, we let the iterator auto-convert to `Instruction`, and then set it back using the `Instruction` version: Instruction *PrevInsertPoint = Builder.GetInsertPoint(); /* Logic that may change insert point */ if (PrevInsertPoint) Builder.SetInsertPoint(PrevInsertPoint); The check for `PrevInsertPoint` doesn't protect correctly against bad accesses. If the insertion point has been set to the end of a basic block (i.e., `SetInsertPoint(SomeBB)`), then `GetInsertPoint()` returns an iterator pointing at the list sentinel. The version of `SetInsertPoint()` that's getting called will then call `PrevInsertPoint->getParent()`, which explodes horribly. The only reason this hasn't blown up is that it's fairly unlikely the builder is adding to the end of the block; usually, we're adding instructions somewhere before the terminator. llvm-svn: 249925	2015-10-10 00:53:03 +00:00
Davide Italiano	d91bf8ec59	[llvm-rtdyld] Use range-based loop. NFC. llvm-svn: 249923	2015-10-10 00:45:24 +00:00
Duncan P. N. Exon Smith	a5f45da27e	MC: Remove implicit ilist iterator conversions, NFC llvm-svn: 249922	2015-10-10 00:13:11 +00:00
Kevin Enderby	d90a4176ff	Fix a bug in the Mach-O disassembler when disassembling from a malformed Mach-O file that caused a crash. This was because of an assert where the code was incorrectly attempting to parse relocation entries off of the sections and the filetype was not an MH_OBJECT. rdar://22983603 llvm-svn: 249921	2015-10-10 00:05:01 +00:00
David Majnemer	bfa5b98201	[WinEH] Remove more dead code wineh-parent is dead, so is ValueOrMBB. llvm-svn: 249920	2015-10-10 00:04:29 +00:00
Reid Kleckner	14e773500e	[WinEH] Delete the old landingpad implementation of Windows EH The new implementation works at least as well as the old implementation did. Also delete the associated preparation tests. They don't exercise interesting corner cases of the new implementation. All the codegen tests of the EH tables have already been ported. llvm-svn: 249918	2015-10-09 23:34:53 +00:00
Reid Kleckner	eb7cd6c889	[SEH] Update SEH codegen tests to use the new IR Also Fix a buglet where SEH tables had ranges that spanned funclets. The remaining tests using the old landingpad IR are preparation tests, and will be deleted along with the old preparation. llvm-svn: 249917	2015-10-09 23:05:54 +00:00
Duncan P. N. Exon Smith	f1ff53ecc2	CodeGen: Remove implicit ilist iterator conversions, NFC Finish removing implicit ilist iterator conversions from LLVMCodeGen. I'm sure there are lots more of these in lib/CodeGen/*/. llvm-svn: 249915	2015-10-09 22:56:24 +00:00
Chris Bieneman	8a3e6e19af	[CMake] Parallel make breaks on native tablegen Patch by Alex Wang This patch resolves a parallelization issue that occurs when native tablegen targets are built at the same time. They both try to build libSupport and clobber each other causing the builds to fail. llvm-svn: 249911	2015-10-09 22:26:04 +00:00
David Majnemer	35d27b21a1	[WinEH] Insert the catchpad return before CSR restoration x64 catchpads use rax to inform the unwinder where control should go next. However, we must initialize rax before the epilogue sequence so as to not perturb the unwinder. llvm-svn: 249910	2015-10-09 22:18:45 +00:00
Richard Smith	81ff44d89d	Fix use of uninitialized bool, found by ubsan in portion of test/tools/llvm-objdump/malformed-machos.test added in r249845. llvm-svn: 249909	2015-10-09 22:09:56 +00:00
James Y Knight	692e037499	Fix assert when emitting llvm.pow.f86. This occurred due to introducing the invalid i64 type after type legalization had already finished, in an attempt to workaround bitcast f64 -> v2i32 not doing constant folding. The right thing is to actually fix bitcast, but that has other complications. So, for now, just get rid of the broken workaround, and check in a test-case showing that it doesn't crash, with TODOs for emitting proper code. llvm-svn: 249908	2015-10-09 21:36:19 +00:00
Diego Novillo	935cc537a6	Remove unused function in sample profile writer API - NFC. These functions are not needed and are getting in the way of changes for implementing a table of contents for the binary format. llvm-svn: 249907	2015-10-09 21:33:13 +00:00
Reid Kleckner	e1c8a7f9c7	[SEH] Fix _except_handler4 table base states We got them right for the old IR, but not with funclets. Port the old test to the new IR and fix the code. llvm-svn: 249906	2015-10-09 21:27:28 +00:00
Duncan P. N. Exon Smith	6e98cd32dc	CodeGen: Avoid more ilist iterator implicit conversions, NFC llvm-svn: 249903	2015-10-09 21:08:19 +00:00
Duncan P. N. Exon Smith	1ff409802d	CodeGen: Use range-based for in PostRAScheduler, NFC llvm-svn: 249901	2015-10-09 21:05:00 +00:00
Reid Kleckner	d880dc7509	[SEH] Remember to emit the last invoke range for SEH This wasn't very observable in execution tests, because usually there is an invoke in the catchpad that unwinds to the catchendpad but never actually throws. llvm-svn: 249898	2015-10-09 20:39:39 +00:00
Owen Anderson	97ca0f3f2c	Generalize convergent check to handle invokes as well as calls. llvm-svn: 249892	2015-10-09 20:17:46 +00:00
James Y Knight	5b8217bc05	Fix assert in X86 backend. When running combine on an extract_vector_elt, it wants to look through a bitcast to check if the argument to the bitcast was itself an extract_vector_elt with particular operands. However, it called getOperand() on the argument to the bitcast before checking that the opcode was EXTRACT_VECTOR_ELT, assert-failing if there were zero operands for the actual opcode. Fix, and add trivial test. llvm-svn: 249891	2015-10-09 20:10:14 +00:00
Chad Rosier	47eba05b47	Revert "Simplify code. NFC." This reverts commit r248610. llvm-svn: 249887	2015-10-09 19:48:48 +00:00
Duncan P. N. Exon Smith	5ec1568c9c	CodeGen: Continue removing ilist iterator implicit conversions llvm-svn: 249884	2015-10-09 19:40:45 +00:00
Duncan P. N. Exon Smith	6ac07fd228	CodeGen: Remove implicit iterator conversions from MBB.cpp Remove implicit ilist iterator conversions from MachineBasicBlock.cpp. I've also added an overload of `splice()` that takes a pointer, since it's a natural API. This is similar to the overloads I added for `remove()` and `erase()` in r249867. llvm-svn: 249883	2015-10-09 19:36:12 +00:00
Duncan P. N. Exon Smith	0ac8eb9171	CodeGen: Avoid ilist iterator implicit conversions in a few more places, NFC llvm-svn: 249880	2015-10-09 19:23:20 +00:00
Duncan P. N. Exon Smith	5ae5939fa1	CodeGen: Remove more ilist iterator implicit conversions, NFC llvm-svn: 249879	2015-10-09 19:13:58 +00:00
Duncan P. N. Exon Smith	6c64aeb065	CodeGen: Use range-based for in IntrinsicLowering::AddPrototypes, NFC This happens to avoid a host of implicit ilist iterator conversions. llvm-svn: 249877	2015-10-09 19:07:41 +00:00
Duncan P. N. Exon Smith	530d040bd9	CodeGen: Use range-based for in GlobalMerge, NFC llvm-svn: 249876	2015-10-09 18:57:47 +00:00
Duncan P. N. Exon Smith	d83547a16e	CodeGen: Remove a few more ilist iterator implicit conversions, NFC llvm-svn: 249875	2015-10-09 18:44:40 +00:00
Owen Anderson	2c9978b12b	Teach LoopUnswitch not to perform non-trivial unswitching on loops containing convergent operations. Doing so could cause the post-unswitching convergent ops to be control-dependent on the unswitch condition where they were not before. This check could be refined to allow unswitching where the convergent operation was already control-dependent on the unswitch condition. llvm-svn: 249874	2015-10-09 18:40:20 +00:00
Owen Anderson	dbd02a40d2	Add iterator ranges for blocks in a Loop. llvm-svn: 249873	2015-10-09 18:40:15 +00:00
Duncan P. N. Exon Smith	6b40936b81	CodeGen: Remove implicit iterator conversions from SlotIndexes.h, NFC Be explicit about changes between pointers and iterators, as with other recent commits. This transitively removes implicit ilist iterator conversions from about 20 source files in CodeGen. llvm-svn: 249869	2015-10-09 18:35:09 +00:00
Duncan P. N. Exon Smith	980f8f2639	CodeGen: Remove implicit conversions from Analysis and BranchFolding Remove a few more implicit ilist iterator conversions, this time from Analysis.cpp and BranchFolding.cpp. I added a few overloads for `remove()` and `erase()`, which quite naturally take pointers as well as iterators as parameters. This will reduce the churn at least in the short term, but I don't really have a problem with these existing for longer. llvm-svn: 249867	2015-10-09 18:23:49 +00:00
Owen Anderson	d95b08a0a7	Refine the definition of convergent to only disallow the addition of new control dependencies. This covers the common case of operations that cannot be sunk. Operations that cannot be hoisted should already be handled properly via the safe-to-speculate rules and mechanisms. llvm-svn: 249865	2015-10-09 18:06:13 +00:00
Sanjay Patel	9fbe22bac6	fix typos; NFC llvm-svn: 249863	2015-10-09 18:01:03 +00:00
Chris Bieneman	0ac9109d22	[CMake] If LLVM_DYLIB_EXPORT_ALL is On don't generate an export list at all, just export the world. This should resolve Bug 24157 - CMake built shared library does not export all public symbols llvm-svn: 249862	2015-10-09 17:55:21 +00:00
Diego Novillo	a7f1e8ef83	Add inline stack streaming to binary sample profiles. With this patch we can now read and write inline stacks in sample profiles to the binary encoded profiles. In a subsequent patch, I will add a string table to the binary encoding. Right now function names are emitted as strings every time we find them. This is too bloated and will produce large files in applications with lots of inlining. llvm-svn: 249861	2015-10-09 17:54:24 +00:00
Dan Gohman	ee1588ce96	[WebAssembly] Rename floating-point operators to match their spec names. llvm-svn: 249859	2015-10-09 17:50:00 +00:00
Artur Pilipenko	cca800207a	Add verification for align, dereferenceable, dereferenceable_or_null load metadata Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13428 llvm-svn: 249856	2015-10-09 17:41:29 +00:00
Keno Fischer	21a7f23666	Clear SectionSymbols in MCContext::Reset This was just forgotten when SectionSymbols was introduced and could cause corruption if the MCContext was reused after Reset. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13547 llvm-svn: 249854	2015-10-09 17:24:54 +00:00
Duncan P. N. Exon Smith	769e1a972d	AArch64: Make getNextNode() cleanup in r249764 more clear After r249764, if you didn't see the full context, it looked like `std::next(I)` would get the same result as `++MachineBasicBlock::iterator(I)`. However, `I` is a `MachineInstr*` (not a `MachineBasicBlock::iterator`). Use the `getIterator()` helper I added later (r249782) to make this code more clear. llvm-svn: 249852	2015-10-09 16:54:54 +00:00
Duncan P. N. Exon Smith	8f11e1a713	CodeGen: Start removing implicit conversions to/from list iterators, NFC Start removing implicit conversions to/from list iterators in CodeGen, ala r249782 for IR. A lot more to go after this. llvm-svn: 249851	2015-10-09 16:54:49 +00:00
Duncan P. N. Exon Smith	8f39941669	Revert "Support: Partially revert r249782 to unbreak clang build" This reverts commit r249783, fully reinstating r249782. I've fixed the bug in clang: it was a non-const iterator that dereferenced to const (but had an implicit conversion to non-const). llvm-svn: 249850	2015-10-09 16:51:23 +00:00
Dehao Chen	41dc5a6e86	Make HeaderLineno a local variable. http://reviews.llvm.org/D13576 As we are using hierarchical profile, there is no need to keep HeaderLineno a member variable. This is because each level of the inline stack will have its own header lineno. One should use the head lineno of its own inline stack level instead of the actual symbol. llvm-svn: 249848	2015-10-09 16:50:16 +00:00
Reid Kleckner	848055ad16	Fix pdb.test when python is not on PATH llvm-svn: 249847	2015-10-09 16:49:56 +00:00
Kevin Enderby	af7c9d0123	Fixed two bugs in llvm-objdump’s printing of Objective-C meta data from malformed Mach-O files that caused crashes. The first because the offset in a dyld bind table entry was out of range. The second because there was no image info section and the routine printing it did not have the needed check to see whether the section existed. rdar://22983603 llvm-svn: 249845	2015-10-09 16:48:44 +00:00
Artur Pilipenko	ffd132878a	ValueTracking: use getAlignment in isAligned Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13517 llvm-svn: 249841	2015-10-09 15:58:26 +00:00
Frederic Riss	9f5013a138	[dsymutil] Prevent warning llvm-svn: 249836	2015-10-09 15:04:05 +00:00
Jun Bum Lim	0aace13d18	Improve ISel across lane float min/max reduction In vectorized float min/max reduction code, the final "reduce" step is sub-optimal. In AArch64, this change will combine : svn0 = vector_shuffle t0, undef<2,3,u,u> fmin = fminnum t0,svn0 svn1 = vector_shuffle fmin, undef<1,u,u,u> cc = setcc fmin, svn1, ole n0 = extract_vector_elt cc, #0 n1 = extract_vector_elt fmin, #0 n2 = extract_vector_elt fmin, #1 result = select n0, n1,n2 into : result = llvm.aarch64.neon.fminnmv t0 This change extends r247575. llvm-svn: 249834	2015-10-09 14:11:25 +00:00
Jonas Paulsson	ee3685fd45	[SystemZ] Remove unused code in SystemZElimCompare.cpp The Reference IndirectDef and IndirectUse members were unused and therefore removed. llvm-svn: 249824	2015-10-09 11:27:44 +00:00
Nemanja Ivanovic	d389657399	Vector element extraction without stack operations on Power 8 This patch corresponds to review: http://reviews.llvm.org/D12032 This patch builds onto the patch that provided scalar to vector conversions without stack operations (D11471). Included in this patch: - Vector element extraction for all vector types with constant element number - Vector element extraction for v16i8 and v8i16 with variable element number - Removal of some unnecessary COPY_TO_REGCLASS operations that ended up unnecessarily moving things around between registers Not included in this patch (will be in upcoming patch): - Vector element extraction for v4i32, v4f32, v2i64 and v2f64 with variable element number - Vector element insertion for variable/constant element number Testing is provided for all extractions. The extractions that are not implemented yet are just placeholders. llvm-svn: 249822	2015-10-09 11:12:18 +00:00
Andrea Di Biagio	99493df257	[MemCpyOpt] Fix wrong merging adjacent nontemporal stores into memset calls. Pass MemCpyOpt doesn't check if a store instruction is nontemporal. As a consequence, adjacent nontemporal stores are always merged into a memset call. Example: ;;; define void @foo(<4 x float>* nocapture %p) { entry: store <4 x float> zeroinitializer, <4 x float>* %p, align 16, !nontemporal !0 %p1 = getelementptr inbounds <4 x float>, <4 x float>* %dst, i64 1 store <4 x float> zeroinitializer, <4 x float>* %p1, align 16, !nontemporal !0 ret void } !0 = !{i32 1} ;;; In this example, the two nontemporal stores are combined to a memset of zero which does not preserve the nontemporal hint. Later on the backend (tested on an x86-64 corei7) expands that memset call into a sequence of two normal 16-byte aligned vector stores. opt -memcpyopt example.ll -S -o - | llc -mcpu=corei7 -o - Before: xorps %xmm0, %xmm0 movaps %xmm0, 16(%rdi) movaps %xmm0, (%rdi) With this patch, we no longer merge nontemporal stores into calls to memset. In this example, llc correctly expands the two stores into two movntps: xorps %xmm0, %xmm0 movntps %xmm0, 16(%rdi) movntps %xmm0, (%rdi) In theory, we could extend the usage of !nontemporal metadata to memcpy/memset calls. However a change like that would only have the effect of forcing the backend to expand !nontemporal memsets back to sequences of store instructions. A memset library call would not have exactly the same semantics as a builtin !nontemporal memset call. So, SelectionDAG will have to conservatively expand it back to a sequence of !nontemporal stores (effectively undoing the merging). Differential Revision: http://reviews.llvm.org/D13519 llvm-svn: 249820	2015-10-09 10:53:41 +00:00
Arnaud A. de Grandmaison	859b2ac07d	[EarlyCSE] Address post commit review for r249523. llvm-svn: 249814	2015-10-09 09:23:01 +00:00
Jonas Paulsson	5b3bab40b2	[SystemZ] Remove superfluous braces in SystemZShortenInst.cpp llvm-svn: 249812	2015-10-09 07:19:20 +00:00
Jonas Paulsson	18d877f79b	[SystemZ] Minor bugfixes. LLCH, LLHH and CLIH had the wrong register classes for the def-operand. Tie operands if changing opcode to an instruction with tied ops. Comment typo fix. These fixes were needed in order to make regression test case SystemZ/asm-18.ll pass with -verify-machineinstrs (not used by default). Reviewed by Ulrich Weigand. llvm-svn: 249811	2015-10-09 07:19:16 +00:00
Jonas Paulsson	0a9049ba82	[SystemZ] Bugfix in SystemZAsmParser.cpp. Let parseRegister() allow RegFP Group if expecting RegV Group, since the %f register prefix yields the FP group even while used with vector instructions. Reviewed by Ulrich Weigand. llvm-svn: 249810	2015-10-09 07:19:12 +00:00
Kostya Serebryany	e95022ac14	[libFuzzer] don't print large artifacts to stderr llvm-svn: 249808	2015-10-09 04:03:14 +00:00
Kostya Serebryany	bd5d1cdbb9	[libFuzzer] add -artifact_prefix flag llvm-svn: 249807	2015-10-09 03:57:59 +00:00
Saleem Abdulrasool	1825fac3c9	ARM: tweak WoA frame lowering Accept r11 when targeting Windows on ARM rather than just low registers. Because we are in a thumb-2 only mode, this may be slightly more expensive in code size, but results in better code for the environment since it spills the frame register, which is generally desired for fast stack walking as per the ABI. llvm-svn: 249804	2015-10-09 03:19:03 +00:00
Sanjoy Das	648956118b	[SCEV] Call `StrengthenNoWrapFlags` after `GroupByComplexity`; NFCI The current implementation of `StrengthenNoWrapFlags` is agnostic to the order of `Ops`, so this commit should not change anything semantic. An upcoming change will make `StrengthenNoWrapFlags` sensitive to the order of `Ops`. llvm-svn: 249802	2015-10-09 02:44:45 +00:00
Reid Kleckner	ba77cd2737	Re-enable the coff-dwarf test on Windows Apparently system-windows was only a clang lit suite feature. llvm-svn: 249797	2015-10-09 01:18:27 +00:00
Reid Kleckner	ae44e871cd	Revert "Revert "Revert r248959, "[WinEH] Emit int3 after noreturn calls on Win64""" This reverts commit r249794. Apparently my checkouts are full of unexpected surprises today. llvm-svn: 249796	2015-10-09 01:13:17 +00:00
Reid Kleckner	37bb6810f2	Fix coff-dwarf test for non-Windows platforms that cannot demangle MS C++ names llvm-svn: 249795	2015-10-09 01:11:40 +00:00
Reid Kleckner	b510401785	Revert "Revert r248959, "[WinEH] Emit int3 after noreturn calls on Win64"" This reverts commit r249032. TODO write commit msg llvm-svn: 249794	2015-10-09 01:11:37 +00:00
Joseph Tremoulet	676e5cf07f	[WinEH] Fix cleanup state numbering Summary: - Recurse from cleanupendpads to their cleanuppads, to make sure the cleanuppad is visited if it has a cleanupendpad but no cleanupret. - Check for and avoid double-processing cleanuppads, to allow for them to have multiple cleanuprets (plus cleanupendpads). - Update Cxx state numbering to visit toplevel cleanupendpads and to recurse from cleanupendpads to their preds, to ensure we number any funclets in inlined cleanups. SEH state numbering already did this. Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13374 llvm-svn: 249792	2015-10-09 00:46:08 +00:00
Reid Kleckner	ebef256269	[SEH] Fix llvm.eh.exceptioncode fast register allocation assertion I called the wrong MachineBasicBlock::addLiveIn() overload. llvm-svn: 249786	2015-10-09 00:15:13 +00:00
Reid Kleckner	21427ada3e	Address review comments, remove error case and return 0 instead as required by tests llvm-svn: 249785	2015-10-09 00:15:08 +00:00
Reid Kleckner	e94fef7b3d	[llvm-symbolizer] Make --relative-address work with DWARF contexts Summary: Previously the relative address flag only affected PDB debug info. Now both DIContext implementations always expect to be passed virtual addresses. llvm-symbolizer is now responsible for adding ImageBase to module offsets when --relative-address is passed. Reviewers: zturner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D12883 llvm-svn: 249784	2015-10-09 00:15:01 +00:00
Duncan P. N. Exon Smith	d837cd044f	Support: Partially revert r249782 to unbreak clang build Apparently the iterators in `clang::CFGBlock` have an auto-conversion to `CFGBlock *`, but the dereference operator gives `const CFGBlock &`. Until I have a moment to fix that, revert the GenericDomTree changes from r249782. llvm-svn: 249783	2015-10-09 00:03:57 +00:00
Duncan P. N. Exon Smith	52888a6738	IR: Remove implicit iterator conversions from lib/IR, NFC Stop converting implicitly between iterators and pointers/references in lib/IR. For convenience, I've added a `getIterator()` accessor to `ilist_node` so that callers don't need to know how to spell the iterator class (i.e., they can use `X.getIterator()` instead of `Function::iterator(X)`). I'll eventually disallow these implicit conversions entirely, but there's a lot of code, so it doesn't make sense to do it all in one patch. One library or so at a time. Why? To root out cases of `getNextNode()` and `getPrevNode()` being used in iterator logic. The design of `ilist` makes that invalid when the current node could be at the back of the list, but it happens to "work" right now because of a bug where those functions never return `nullptr` if you're using a half-node sentinel. Before I can fix the function, I have to remove uses of it that rely on it misbehaving. (Maybe the function should just be deleted anyway? But I don't want deleting it -- potentially a huge project -- to block fixing ilist/iplist.) llvm-svn: 249782	2015-10-08 23:49:46 +00:00
Sanjoy Das	3c520a1272	[RS4GC] Refactoring to make a later change easier, NFCI Summary: These non-semantic changes will help make a later change adding support for deopt operand bundles more streamlined. Reviewers: reames, swaroop.sridhar Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13491 llvm-svn: 249779	2015-10-08 23:18:38 +00:00
Sanjoy Das	4fd3d400fa	[IRBuilder] Change the `gc.statepoint` creation interface This is to enable me to address review for D13491 -- `Flags` is a bitfield of `StatepointFlags`, not an individual item out of the enum, so it should be represented as an `uint32_t`. llvm-svn: 249778	2015-10-08 23:18:33 +00:00
Sanjoy Das	c21a05a3a4	[PlaceSafepoints] Extract out `callsGCLeafFunction`, NFC Summary: This will be used in a later change to RewriteStatepointsForGC. Reviewers: reames, swaroop.sridhar Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13490 llvm-svn: 249777	2015-10-08 23:18:30 +00:00
Sanjoy Das	1ede5367ba	[RS4GC] Don't copy ADTs unnecessarily, NFCI Summary: Use `const auto &` instead of `auto` in `makeStatepointExplicit`. Reviewers: reames, swaroop.sridhar Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13454 llvm-svn: 249776	2015-10-08 23:18:22 +00:00
Kevin Enderby	46e642f8c5	Fix a bug in llvm-objdump’s printing of Objective-C meta data from malformed Mach-O files that caused a crash because a section header had a size that extended past the end of the file. rdar://22983603 llvm-svn: 249768	2015-10-08 22:50:55 +00:00
Duncan P. N. Exon Smith	6eeaff169d	Support: Stop relying on iterator auto-conversion, NFC Stop relying on ilist implicit conversions from `value_type&` to `iterator` in YAMLParser.cpp. I eventually want to outlaw this entirely. It encourages `getNextNode()` and `getPrevNode()` in iterator logic, which is extremely fragile (and relies on them never returning `nullptr`). FTR, there's nothing nefarious going on in this case, it was just easy to clean up since the callers really wanted iterators to begin with. llvm-svn: 249767	2015-10-08 22:47:55 +00:00
Reid Kleckner	066c8db347	Enable gtest SEH when building with clang-cl Clang supports SEH well enough that this should work out of the box. If it doesn't, we'll hear about it. llvm-svn: 249766	2015-10-08 22:44:39 +00:00
Duncan P. N. Exon Smith	d389165c14	AArch64: Stop using MachineInstr::getNextNode() Stop using `getNextNode()` to get an insertion point (at least, in this one place). Instead, use iterator logic directly. The `getNextNode()` interface isn't actually supposed to work for creating iterators; it's supposed to return `nullptr` (not a real iterator) if this is the last node. It's currently broken and will "happen" to work, but if we ever fix the function, we'll get some strange failures in places like this. llvm-svn: 249764	2015-10-08 22:43:26 +00:00
Duncan P. N. Exon Smith	ece61624b1	MC: Stop using Fragment::getNextNode() Stop using `getNextNode()` to get an iterator to a fragment (at least, in this one place). Instead, use iterator logic directly. The `getNextNode()` interface isn't actually supposed to work for creating iterators; it's supposed to return `nullptr` (not a real iterator) if this is the last node. It's currently broken and will "happen" to work, but if we ever fix the function, we'll get some strange failures in places like this. llvm-svn: 249763	2015-10-08 22:36:08 +00:00
Frederic Riss	02cccde95b	[dsymutil] Try to find lipo first beside dsymutil before looking up the PATH. Even if we don't have it in PATH, lipo should usually exist in the same directory as dsymutil. Keep the fallback looking up the PATH, it's very useful when testing a non-installed executable. llvm-svn: 249762	2015-10-08 22:35:53 +00:00
Duncan P. N. Exon Smith	a3da44882f	PowerPC: Don't use getNextNode() for insertion point Stop using `getNextNode()` to create an insertion point for machine instructions (at least, in this one place). Instead, use an iterator. As a drive-by, clean up dump statements to use iterator logic. The `getNextNode()` interface isn't actually supposed to work for insertion points; it's supposed to return `nullptr` if this is the last node. It's currently broken and will "happen" to work, but if we ever fix the function, we'll get some strange failures. llvm-svn: 249758	2015-10-08 22:20:37 +00:00
Evgeniy Stepanov	d12212bc8c	New MSan mapping layout (llvm part). This is an implementation of https://github.com/google/sanitizers/issues/579 It has a number of advantages over the current mapping: * Works for non-PIE executables. * Does not require ASLR; as a consequence, debugging MSan programs in gdb no longer requires "set disable-randomization off". * Supports linux kernels >=4.1.2. * The code is marginally faster and smaller. This is an ABI break. We never really promised ABI stability, but this patch includes a courtesy escape hatch: a compile-time macro that reverts back to the old mapping layout. llvm-svn: 249753	2015-10-08 21:35:26 +00:00
Evgeniy Stepanov	5fe279e727	Add Triple::isAndroid(). This is a simple refactoring that replaces Triple.getEnvironment() checks for Android with Triple.isAndroid(). llvm-svn: 249750	2015-10-08 21:21:24 +00:00
Teresa Johnson	881e8860ec	Fix another UBSan test error from r248897 and follow on fix r249689 While here fix a few more issues with potential overflow and add new tests for these cases. Ensured that test now passes with UBSan. llvm-svn: 249745	2015-10-08 20:52:23 +00:00
Eric Christopher	ab2241f1b8	Remove a '#' so that we can check either form for the various targets. llvm-svn: 249734	2015-10-08 20:18:15 +00:00
Eric Christopher	11e5983658	Move the MMX subtarget feature out of the SSE set of features and into its own variable. This is needed so that we can explicitly turn off MMX without turning off SSE and also so that we can diagnose feature set incompatibilities that involve MMX without SSE. Rationale: // sse3 __m128d test_mm_addsub_pd(__m128d A, __m128d B) { return _mm_addsub_pd(A, B); } // mmx void shift(__m64 a, __m64 b, int c) { _mm_slli_pi16(a, c); _mm_slli_pi32(a, c); _mm_slli_si64(a, c); _mm_srli_pi16(a, c); _mm_srli_pi32(a, c); _mm_srli_si64(a, c); _mm_srai_pi16(a, c); _mm_srai_pi32(a, c); } clang -msse3 -mno-mmx file.c -c For this code we should be able to explicitly turn off MMX without affecting the compilation of the SSE3 function and then diagnose and error on compiling the MMX function. This matches the existing gcc behavior and follows the spirit of the SSE/MMX separation in llvm where we can (and do) turn off MMX code generation except in the presence of intrinsics. Updated a couple of tests, but primarily tested with a couple of tests for turning on only mmx and only sse. This is paired with a patch to clang to take advantage of this behavior. llvm-svn: 249731	2015-10-08 20:10:06 +00:00
Diego Novillo	aae1ed8e08	Re-apply r249644: Handle inline stacks in gcov-encoded sample profiles. This fixes memory allocation problems by making the merge operation keep the profile readers around until the merged profile has been emitted. This is needed to prevent the inlined function names from disappearing from the function profiles. Since all the names are kept as references, once the reader disappears, the names are also deallocated. Additionally, XFAIL on big-endian architectures. The test case uses a gcov file generated on a little-endian system. llvm-svn: 249724	2015-10-08 19:40:37 +00:00
Alexei Starovoitov	87f83e6926	[bpf] Do not expand UNDEF SDNode during insn selection lowering o Before this patch, BPF backend will expand UNDEF node to i64 constant 0. o For second pass of dag combiner, legalizer will run through each to-be-processed dag node. o If any new SDNode is generated and has an undef operand, dag combiner will put undef node, newly-generated constant-0 node, and any node which uses these nodes in the working list. o During this process, it is possible undef operand is generated again, and this will form an infinite loop for dag combiner pass2. o This patch allows UNDEF to be a legal type. Signed-off-by: Yonghong Song <yhs@plumgrid.com> Signed-off-by: Alexei Starovoitov <ast@plumgrid.com> llvm-svn: 249718	2015-10-08 18:52:40 +00:00
Sanjoy Das	413dbbb1c2	[SCEV] Bring some methods up to coding style; NFC - Start methods with lower case - Reflow a comment - Delete header comment repeated in .cpp file llvm-svn: 249716	2015-10-08 18:46:59 +00:00
Reid Kleckner	b2244cb8f0	[WinEH] Relax assertion in the presence of stack realignment The code is correct as is, but we should test it. llvm-svn: 249715	2015-10-08 18:41:52 +00:00
Hal Finkel	e5d3ac8240	[PowerPC] Add R_PPC64_GLOB_DAT and R_PPC64_RELATIVE to PowerPC64.def These are not used by LLVM proper, but will be used by upcoming commits to lld (and will receive test coverage there). llvm-svn: 249714	2015-10-08 18:30:27 +00:00
Sanjoy Das	3bf22b1883	[SCEV] Remove comment repeated in cpp file; NFC llvm-svn: 249713	2015-10-08 18:28:42 +00:00
Sanjoy Das	dd70996a5c	[SCEV] Pick backedge values for phi nodes correctly Summary: `getConstantEvolutionLoopExitValue` and `ComputeExitCountExhaustively` assumed all phi nodes in the loop header have the same order of incoming values. This is not correct, and this commit changes `getConstantEvolutionLoopExitValue` and `ComputeExitCountExhaustively` to look up the backedge value of a phi node using the loop's latch block. Unfortunately, there is still some code duplication between `getConstantEvolutionLoopExitValue` and `ComputeExitCountExhaustively`. At some point in the future we should extract out a helper class / method that can evolve constant evolution phi nodes across iterations. Fixes 25060. Thanks to Mattias Eriksson for the spot-on analysis! Depends on D13457. Reviewers: atrick, hfinkel Subscribers: materi, llvm-commits Differential Revision: http://reviews.llvm.org/D13458 llvm-svn: 249712	2015-10-08 18:28:36 +00:00
Rafael Espindola	483ad20009	Handle Archive::getNumberOfSymbols being called in an archive with no symbols. No change in llvm, but will be tested from lld. llvm-svn: 249709	2015-10-08 18:06:20 +00:00
Ulrich Weigand	f4d14f781f	[SystemZ] Fix another assertion failure in tryBuildVectorShuffle This fixes yet another scenario where tryBuildVectorShuffle would attempt to create a BUILD_VECTOR node with an invalid combination of types. This can happen if the incoming BUILD_VECTOR has elements of a type different from the vector element type, which is allowed in certain cases as long as they are all the same type. When one of these elements is used in the residual vector, and UNDEF elements are added to fill up the residual vector, those UNDEFs then have to use the type of the original element, not the vector element type, or else the resulting BUILD_VECTOR will have an invalid type combination. llvm-svn: 249706	2015-10-08 17:46:59 +00:00
David Blaikie	ad60be9bdc	Make the Kaleidoscope Orc examples -Wdeprecated clean by avoiding copying some AST nodes llvm-svn: 249703	2015-10-08 17:22:12 +00:00
Sanjay Patel	f61a08fbf1	[InstCombine] transform masking off of an FP sign bit into a fabs() intrinsic call (PR24886) This is a partial fix for PR24886: https://llvm.org/bugs/show_bug.cgi?id=24886 Without this IR transform, the backend (x86 at least) was producing inefficient code. This patch is making 2 assumptions: 1. The canonical form of a fabs() operation is, in fact, the LLVM fabs() intrinsic. 2. The high bit of an FP value is always the sign bit; as noted in the bug report, this isn't specified by the LangRef. Differential Revision: http://reviews.llvm.org/D13076 llvm-svn: 249702	2015-10-08 17:09:31 +00:00
Sanjay Patel	9115cf8c9d	[ValueTracking] teach computeKnownBits that a fabs() clears sign bits This was requested in D13076: if we're going to canonicalize to fabs(), ValueTracking should know that fabs() clears sign bits. In this patch (as in D13076), we're not handling vectors yet even though computeKnownBits' fabs() case itself should be vector-ready via the splat in this patch. Fixing this will require follow-on patches to correct other logic that uses 'getScalarType'. Differential Revision: http://reviews.llvm.org/D13222 llvm-svn: 249701	2015-10-08 16:56:55 +00:00
Kevin Enderby	aac7538216	Fix a bug in llvm-objdump’s printing of Objective-C meta data from malformed Mach-O files that caused a crash because of loops in the class meta data. llvm-svn: 249700	2015-10-08 16:56:35 +00:00
George Rimar	87780300f6	Windows: Fixed sys::findProgramByName to work with files containing a dot in their name. The problem was in the SearchPathW function, which does not attach an extension if the file already has one. That does not work for executables like ld.lld2, for example, which require an .exe extension, but SearchPath thinks that the extension is "lld2". The solution was to add the extension manually. Differential Revision: http://reviews.llvm.org/D13536 llvm-svn: 249696	2015-10-08 16:03:19 +00:00
Teresa Johnson	b1cfcd4a53	Support for llvm-bcanalyzer dumping of record array strings. Summary: Adds support for automatically detecting and printing strings represented by Array abbrev operands, analogous to the string dumping performed for Blob abbrev operands. Enhanced the ThinLTO combined index test to check for the appropriate module and function strings. Reviewers: dexonsmith, joker.eph, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13553 llvm-svn: 249695	2015-10-08 15:56:24 +00:00
Frederic Riss	263b772bda	[X86] Disable X86CallFrameOptimization on Darwin in presence of EH We emit 1 compact unwind encoding per function, and this can’t represent the varying stack pointer that will be generated by X86CallFrameOptimization. Disable the optimization on Darwin. (It might be possible to split the function into multiple ranges and emit 1 compact unwind info per range. The compact unwind emission code isn’t ready for that and this kind of info certainly isn’t tested/used anywhere. It might be worth exploring this path if we want to get the space savings at some point though) llvm-svn: 249694	2015-10-08 15:45:08 +00:00
Teresa Johnson	ca6b64ff04	Fix combined function index abbrev (NFC) Removed an unused abbrev op in the VST_CODE_COMBINED_FNENTRY abbrev. I noticed while writing/testing an array string dumper for llvm-bcanalyzer that the combined function's VST entry abbrevs contained an old field that I am not using. Everything was working fine since the bitcode writer and reader were in sync on how the record fields were actually being set up and interpreted. llvm-svn: 249691	2015-10-08 13:52:56 +00:00
Rafael Espindola	b02895cab6	Add a helper for getting a section's content as an array. It will be used in lld shortly. llvm-svn: 249690	2015-10-08 13:38:42 +00:00
Teresa Johnson	01a02d36b5	Fix UBSan test error from r248897 about left shift of unsigned value. Fixed by masking off the upper bits that we are shifting off before doing the left shift. llvm-svn: 249689	2015-10-08 13:14:59 +00:00
Igor Breger	defab3c1ef	AVX512: vpextrb/w/d/q and vpinsrb/w/d/q implementation. These instructions don't have intrinsics. Added tests for lowering and encoding. Differential Revision: http://reviews.llvm.org/D12317 llvm-svn: 249688	2015-10-08 12:55:01 +00:00
James Molloy	e9d50dc9f7	Compute demanded bits for icmp instructions Instead of bailing out when we see an icmp, we can instead at least say that if the upper bits of both operands are known zero, they are not demanded. This doesn't help with signed comparisons, but it's at least better than bailing out. llvm-svn: 249687	2015-10-08 12:40:06 +00:00
James Molloy	bcd7f0ac98	Treat Mul just like Add and Subtract Like adds and subtracts, muls ripple only to the left so we can use the same logic. While we're here, add a print method to DemandedBits so it can be used with -analyze, which we'll use in the testcase. llvm-svn: 249686	2015-10-08 12:39:59 +00:00
James Molloy	ab9fdb9226	Make demanded bits lazy The algorithm itself is still eager, but it doesn't get run until a query function is called. This greatly reduces the compile-time impact of requiring DemandedBits when at runtime it is not often used. NFCI. llvm-svn: 249685	2015-10-08 12:39:50 +00:00
Michael Kuperstein	04e79329d0	[X86] Fix wrong treatment of multi-lane blends in BUILD_VECTORtoBlendMask() This fixes two separate bugs: 1) The mask for the high lane was not set correctly. That fixes PR24532. 2) The transformation should bail out if it believes it involves more than 2 lanes, as it does not currently do anything sensible in this case. Differential Revision: http://reviews.llvm.org/D13505 llvm-svn: 249669	2015-10-08 08:13:02 +00:00
Michael Kuperstein	2b3c16ca17	Do not assert on first non-prologue instruction being a CFI directive. llvm-svn: 249668	2015-10-08 07:48:49 +00:00
Jonas Paulsson	5d3fbd3733	[SystemZ] SystemZElimCompare pass improved. Compare elimination extended to recognize load-and-test instructions used for comparison and eliminate them the same way as with compare instructions. Test case fp-cmp-05.ll updated to expect optimized results now also for z13. The order of the instruction shortening and compare elimination passes has been changed so that opcodes do not have to be handled in both passes. Reviewed by Ulrich Weigand. llvm-svn: 249666	2015-10-08 07:40:23 +00:00
Jonas Paulsson	29d9d8d955	[SystemZ] Bugfix: check CC reg liveness in SystemZShortenInst. The following instruction shortening transformations would introduce a definition of the CC reg, so therefore liveness of CC reg must be checked: WFADB -> ADBR WFSDB -> SDBR Also add the CC reg implicit def operand to the MI in case of change of opcode. Reviewed by Ulrich Weigand. llvm-svn: 249665	2015-10-08 07:40:19 +00:00
Jonas Paulsson	7c5ce10a07	[SystemZ] Use load-and-test for fp compare with 0 if vector support is present. Since the LTxBRCompare instructions can't be used with vector registers, a normal load-and-test instruction (with a modelled def operand) is used instead. Reviewed by Ulrich Weigand. llvm-svn: 249664	2015-10-08 07:40:16 +00:00
Jonas Paulsson	2c96dd64fc	[SystemZ] More minor fixing in SystemZElimCompare.cpp Don't use subreg indices since they are not used after regalloc. Reviewed by Ulrich Weigand. llvm-svn: 249663	2015-10-08 07:40:11 +00:00
Jonas Paulsson	9e1f3bd1bd	[SystemZ] Minor fixes in SystemZElimCompare.cpp Reviewed by Ulrich Weigand. llvm-svn: 249662	2015-10-08 07:39:55 +00:00
Craig Topper	da5168b7ce	Use range-based for loops. NFC. llvm-svn: 249659	2015-10-08 06:06:42 +00:00
Sanjoy Das	10dffcb36b	[SCEV] Check `Pred` first in isKnownPredicateViaSplitting Comparing `Pred` with `ICmpInst::ICMP_ULT` is cheaper than a memory access -- do that check before loading / storing `ProvingSplitPredicate`. llvm-svn: 249654	2015-10-08 03:46:00 +00:00
Sanjoy Das	1195dbee66	[SCEV] Use `auto *` instead of `auto`; NFCI (As prescribed by the coding style document) llvm-svn: 249653	2015-10-08 03:45:58 +00:00
Diego Novillo	a082040ded	Revert "Handle inline stacks in gcov-encoded sample profiles." This reverts commit r249644. The buildbots are failing the new test I added. Investigating. llvm-svn: 249648	2015-10-08 01:17:26 +00:00
Kostya Serebryany	3b804877fd	[libFuzzer] fix 32-bit build llvm-svn: 249646	2015-10-08 00:59:25 +00:00
Diego Novillo	b7fca57493	Handle inline stacks in gcov-encoded sample profiles. This patch adds support for reading sample profiles with inline stacks. Inline stacks in a profile are generated when the sampled binary has samples in inlined functions. For instance, if main() calls foo() and foo() calls bar(), and bar() is inlined into foo() and foo() inlined into main(), the profile may look something like: main total:364084 head:0 [ ... ] 2.3: _Z3fool total:243786 1: 60149 1.2: 38568 1.4: 46511 1.7: _Z3bari total:98558 1.1: 52672 1.2: 45886 At line 2, discriminator 3, main() calls foo(). In turn, foo() calls bar() at line 1, discriminator 7. In the textual format, this stacking of inline calls is represented with indentation. With this change, LLVM can now read sample profile files generated by the create_gcov tool from https://github.com/google/autofdo. llvm-svn: 249644	2015-10-08 00:39:11 +00:00
Justin Bogner	468c998031	CodeGen: print and verify after TargetPassConfig::insertPass by default In r224059, we started verifying after addPass, but missed doing so on insertPass. There isn't a good reason for the discrepancy, and skipping the verifier in these cases causes bugs. This also exposes a verifier error that was introduced in r249087, but the verifier doesn't run until after the register coalescer, when the issue happens to have been resolved. I've skipped the verifier after SIFixSGPRLiveRangesID to avoid the failures for now and will follow up with Matt for a proper fix. llvm-svn: 249643	2015-10-08 00:36:22 +00:00
Reid Kleckner	94fe836afa	[WinEH] Add missing test case for llvm.eh.exceptioncode llvm-svn: 249638	2015-10-07 23:55:06 +00:00
Reid Kleckner	97797419e6	[WinEH] Fix 32-bit funclet epilogues in the presence of dynamic allocas In particular, passing non-trivially copyable objects by value on win32 uses a dynamic alloca (inalloca). We would clobber ESP in the epilogue and end up returning to outer space. llvm-svn: 249637	2015-10-07 23:55:01 +00:00
Pete Cooper	e11c9de83d	Stop linking all target libraries in llvm-nm and llvm-objdump. llvm-nm only needs the target to parse module level assembly in bitcode. It doesn't need a disassembler or codegen. llvm-objdump needs to be able to disassemble a file, but doesn't need asm parsers or codegen. This reduces the sizes of these tools by a few MB each, depending on how many backends are linked in. llvm-svn: 249632	2015-10-07 22:39:17 +00:00
Lang Hames	6df48a97d2	[Orc] Enable user supplied partitioning functors in the CompileOnDemand layer. Previously the CompileOnDemand layer always created single-function partitions. In theory this new API allows for more interesting partitions, though this has not been well tested yet. llvm-svn: 249623	2015-10-07 21:53:41 +00:00
David Majnemer	6af5f82c20	[WinEH] Refer to filter funclets using their symbol-table symbol The relocation for the filter funclet will be against a symbol table entry for a function instead of the section, making it easier to understand what is going on. llvm-svn: 249621	2015-10-07 21:34:00 +00:00
Sanjoy Das	40bdd041db	[RS4GC] Use AssertingVH for RematerializedValueMapTy, NFCI Reviewers: reames, swaroop.sridhar Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13489 llvm-svn: 249620	2015-10-07 21:32:35 +00:00
Reid Kleckner	70bf6bb5e6	[WinEH] Undo the effect of r249578 for 32-bit The __CxxFrameHandler3 tables for 32-bit are supposed to hold stack offsets relative to EBP, not ESP. I blindly updated the win-catchpad.ll test case, and immediately noticed that 32-bit catching stopped working. While I'm at it, move the frame index to frame offset WinEH table logic out of PEI. PEI shouldn't have to know about WinEHFuncInfo. I realized we can calculate frame index offsets just fine from the table printer. llvm-svn: 249618	2015-10-07 21:13:15 +00:00
David Majnemer	c289c9ff55	[WinEH] Remove unreachable blocks before preparation We remove unreachable blocks because it is pointless to consider them for coloring. However, we still had stale pointers to these blocks in some data structures after we removed them from the function. Instead, remove the unreachable blocks before attempting to do anything with the function. This fixes PR25099. llvm-svn: 249617	2015-10-07 21:08:25 +00:00
Duncan P. N. Exon Smith	4462c6190e	Support: Stop using iplist in Recycler Recycler just needs a singly-linked list, and it takes less (and simpler) code to hand-roll one of those than to build up the equivalent `iplist_traits`. In theory, this should speed things up a bit too, but this is really just a drive-by cleanup so I haven't measured. llvm-svn: 249615	2015-10-07 20:49:09 +00:00
Rafael Espindola	284093033f	git-clang-format r249548. Sorry for missing this the first time. llvm-svn: 249610	2015-10-07 20:32:24 +00:00
Vasileios Kalintiris	b876b58d38	[mips][FastISel] Factor out common code from switch statement. NFC llvm-svn: 249603	2015-10-07 20:06:30 +00:00
Duncan P. N. Exon Smith	37bf678a0d	IR: Create SymbolTableList wrapper around iplist, NFC Create `SymbolTableList`, a wrapper around `iplist` for lists that automatically manage a symbol table. This commit reduces a ton of code duplication between the six traits classes that were used previously. As a drive by, reduce the number of template parameters from 2 to 1 by using a SymbolTableListParentType metafunction (I originally had this as a separate commit, but it touched most of the same lines so I squashed them). I'm in the process of trying to remove the UB in `createSentinel()` (see the FIXMEs I added for `ilist_embedded_sentinel_traits` and `ilist_half_embedded_sentinel_traits`). My eventual goal is to separate the list logic into a base class layer that knows nothing about (and isn't templated on) the downcasted nodes -- removing the need to invoke UB -- but for now I'm just trying to get a handle on all the current use cases (and cleaning things up as I see them). Besides these six SymbolTable lists, there are two others that use the addNode/removeNode/transferNodes() hooks: the `MachineInstruction` and `MachineBasicBlock` lists. Ideally there'll be a way to factor these hooks out of the low-level API entirely, but I'm not quite there yet. llvm-svn: 249602	2015-10-07 20:05:10 +00:00
Sanjoy Das	af6980c70a	[IRBuilder] Add gc.statepoint related methods to IRBuilder Summary: This adds some more routines to `IRBuilder` around creating calls and invokes to `gc.statepoint`. These will be used later. Reviewers: reames, swaroop.sridhar Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13371 llvm-svn: 249596	2015-10-07 19:52:12 +00:00
Vasileios Kalintiris	6ae1b35cda	[mips][FastISel] Use ternary operator to select opcode. NFC llvm-svn: 249594	2015-10-07 19:43:31 +00:00
Joseph Tremoulet	39234fc67e	[WinEH] Set NoModuleLevelChanges in clone flags Summary: This is necessary to keep the cloner from making bogus copies of debug metadata attached to the IR it is cloning. Also, avoid running RemapInstruction over all instructions in the common case that no cloning was performed. Reviewers: rnk, andrew.w.kaylor, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13514 llvm-svn: 249591	2015-10-07 19:29:56 +00:00
Rafael Espindola	4264e2d531	Use SpecificBumpPtrAllocator to simplify the MCSection destruction. llvm-svn: 249589	2015-10-07 19:08:19 +00:00
Kevin B. Smith	99e8c0fffb	[X86]Update test to use FileCheck. Updates this test to use FileCheck and a single llc invocation rather than 3 llc invocations and grep. llvm-svn: 249583	2015-10-07 18:21:41 +00:00
Mehdi Amini	044cb34bdc	Revert "Revert "This patch builds on top of D13378 to handle constant condition."" This reverts commit r249528 and reapplies r249431. The fix for the fallout has been committed in r249575. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 249581	2015-10-07 18:14:25 +00:00
Vasileios Kalintiris	daad571ba4	[mips][FastISel] Simple refactoring of MipsFastISel::emitLogicalOP(). NFC. llvm-svn: 249580	2015-10-07 18:14:24 +00:00
Chad Rosier	7c6ac2b8f9	[AArch64] Fold a floating-point divide by power of two into fp conversion. Part of http://reviews.llvm.org/D13442 llvm-svn: 249579	2015-10-07 17:51:37 +00:00
Reid Kleckner	33bd2d99d8	[WinEH] Fix two minor issues in __CxxFrameHandler3 tables There was an off-by-one bug in ip2state tables which manifested when one call immediately preceded the try-range of the next. The return address of the previous call would appear to be within the try range of the next scope, resulting in extra destructors or catches running. We also computed the wrong offset for catch parameter stack objects. The offset should be from RSP, not from RBP. llvm-svn: 249578	2015-10-07 17:49:32 +00:00
Matt Arsenault	fc0ad42516	AMDGPU: Fix missing implicit m0 uses on movrel instructions llvm-svn: 249577	2015-10-07 17:46:32 +00:00
Chad Rosier	fa30c9b436	[AArch64] Fold a floating-point multiply by power of two into fp conversion. Part of http://reviews.llvm.org/D13442 llvm-svn: 249576	2015-10-07 17:39:18 +00:00
Sanjoy Das	0015e5a088	[IndVars] Preserve LCSSA in `eliminateIdentitySCEV` Summary: After r249211, SCEV can see through some LCSSA phis. Add a `replacementPreservesLCSSAForm` check before replacing uses of these phi nodes with a simplified use of the induction variable to avoid breaking LCSSA. Fixes 25047. Depends on D13460. Reviewers: atrick, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13461 llvm-svn: 249575	2015-10-07 17:38:31 +00:00
Sanjoy Das	4493b40002	[SCEV] Use some C++11'ism, NFC Summary: Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13457 llvm-svn: 249574	2015-10-07 17:38:25 +00:00
Chad Rosier	169865ffda	[ARM] Promote helper function to SelectionDAG. I'll be using the function in a similar combine for AArch64. The helper was also improved to handle undef values. Part of http://reviews.llvm.org/D13442 llvm-svn: 249572	2015-10-07 17:28:58 +00:00
Kevin B. Smith	9c7408807f	Test commit access. Fixed comment to have correct input parameter name and period termination. llvm-svn: 249571	2015-10-07 17:24:25 +00:00
Joseph Tremoulet	bde46c5642	[WinEH] Update CoreCLR EH for catchpad MBBs Summary: Set the pad MBB as a funclet entry for CoreCLR as well as MSVCCXX, and update state numbering to put the catchpad block rather than its normal successor into the unwind map. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13492 llvm-svn: 249569	2015-10-07 17:16:25 +00:00
Oliver Stannard	d3d114ba54	[ARM] Use correct half-precision functions in EABI mode The ARM RTABI defines the half- to single-precision float conversion functions with an __aeabi prefix, but libgcc only has them with a __gnu prefix. Therefore we need to emit the __aeabi version when compiling with an eabi or eabihf triple, and the __gnu version with a gnueabi or gnueabihf triple. llvm-svn: 249565	2015-10-07 16:58:49 +00:00
David Blaikie	30f07f9326	Move test back to Generic now it's fixed the right way (thanks Eric!) I knee-jerk tried to fix this in completely the wrong way - it's not a CPU limitation, but an OS/object file type one, so moving it into a CPU-specific classification didn't help at all. llvm-svn: 249562	2015-10-07 16:26:28 +00:00
Chad Rosier	17436bf64e	[ARM] Prevent PerformVDIVCombine from combining a vcvt/vdiv with 8 lanes. This would result in a crash since the vcvt used does not support v8i32 types. llvm-svn: 249560	2015-10-07 16:15:40 +00:00
Artur Pilipenko	d94903c9f8	Teach computeKnownBits to use new align attribute/metadata Reviewed By: reames Differential Revision: http://reviews.llvm.org/D13470 llvm-svn: 249557	2015-10-07 16:01:18 +00:00
Jeroen Ketema	aebca09543	[ARM][AArch64] Only lower to interleaved load/store if the target has NEON Without an additional check for NEON, the compiler crashes during legalization of NEON ldN/stN. Differential Revision: http://reviews.llvm.org/D13508 llvm-svn: 249550	2015-10-07 14:53:29 +00:00
Rafael Espindola	30d77777e7	Use non virtual destructors for sections. llvm-svn: 249548	2015-10-07 13:46:06 +00:00
Chad Rosier	db71abf2d4	[ARM] Push more complex check down to reduce compile time. NFC. llvm-svn: 249547	2015-10-07 13:40:44 +00:00
Rafael Espindola	665b0d3a4e	Don't repeat names in comments and don't indent in namespaces. NFC. llvm-svn: 249546	2015-10-07 13:38:49 +00:00
Scott Egerton	9004cc7942	Revert: r249536 - Testing commit access with a trivial whitespace change. llvm-svn: 249537	2015-10-07 10:57:06 +00:00
Scott Egerton	be6b54b691	Testing commit access with a trivial whitespace change. llvm-svn: 249536	2015-10-07 10:49:49 +00:00
James Molloy	47efaeb36e	Revert "This patch builds on top of D13378 to handle constant condition." This reverts commit r249431. This caused failures in sqlite3: http://lab.llvm.org:8011/builders/clang-native-arm-lnt/builds/14453 llvm-svn: 249528	2015-10-07 09:03:34 +00:00
Arnaud A. de Grandmaison	a6178a179d	[EarlyCSE] Fix handling of target memory intrinsics for CSE'ing loads. Summary: Some target intrinsics can access multiple elements, using the pointer as a base address (e.g. AArch64 ld4). When trying to CSE such instructions, it must be checked that the available value comes from a compatible instruction because the pointer is not enough to discriminate whether the value is correct. Reviewers: ssijaric Subscribers: mcrosier, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D13475 llvm-svn: 249523	2015-10-07 07:41:29 +00:00
Michael Kuperstein	259f1508f0	[X86] Emit .cfi_escape GNU_ARGS_SIZE when adjusting the stack before calls When outgoing function arguments are passed using push instructions, and EH is enabled, we may need to indicate to the stack unwinder that the stack pointer was adjusted before the call. This should fix the exception handling issues in PR24792. Differential Revision: http://reviews.llvm.org/D13132 llvm-svn: 249522	2015-10-07 07:01:31 +00:00


122683 Commits