llvm-project

Commit Graph

Author	SHA1	Message	Date
Diego Novillo	c04270d2e4	Fix SamplePGO segfault when debug info is missing. When emitting a remark for a conditional branch annotation, the remark uses the line location information of the conditional branch in the message. In some cases, that information is unavailable and the optimization would segfaul. I'm still not sure whether this is a bug or WAI, but the optimizer should not die because of this. llvm-svn: 251420	2015-10-27 17:37:00 +00:00
Reid Kleckner	fb1c1c7e4d	[ms-inline-asm] Leave alignment in bytes if the native assembler uses bytes The existing behavior was correct on Darwin, which is probably the platform it was written for. Before this change, we would rewrite "align 8" to ".align 3" and then fail to make it through the integrated assembler because 3 is not a power of 2. Differential Revision: http://reviews.llvm.org/D14120 llvm-svn: 251418	2015-10-27 17:32:48 +00:00
Rui Ueyama	5579e0b88a	Rename qsort -> multikey_qsort. NFC. `qsort` as a file-scope local function name was confusing. llvm-svn: 251414	2015-10-27 16:57:50 +00:00
Asaf Badouh	c7cb880669	[X86][AVX512] [X86][AVX512] add convert float to half convert float to half with mask/maskz for the reg to reg version and mask for the reg to mem version (there is no maskz version for reg to mem). Differential Revision: http://reviews.llvm.org/D14113 llvm-svn: 251409	2015-10-27 15:37:17 +00:00
Charlie Turner	458e79b814	[ARM] Expand ROTL and ROTR of vector value types Summary: After D13851 landed, we saw backend crashes when compiling the reduced test case included in this patch. The right fix seems to be to allow these vector types for expansion in instruction selection. Reviewers: rengolin, t.p.northover Subscribers: RKSimon, t.p.northover, aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14082 llvm-svn: 251401	2015-10-27 10:25:20 +00:00
Mehdi Amini	891c0973df	Do not use "else" when both branches return (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 251398	2015-10-27 08:12:08 +00:00
David Majnemer	dd9a815746	[ScalarEvolutionExpander] Properly insert no-op casts + EH Pads We want to insert no-op casts as close as possible to the def. This is tricky when the cast is of a PHI node and the BasicBlocks between the def and the use cannot hold any instructions. Iteratively walk EH pads until we hit a non-EH pad. This fixes PR25326. llvm-svn: 251393	2015-10-27 07:36:42 +00:00
Michael Kuperstein	e1194bdb4f	[X86] Make elfiamcu an OS, not an environment. GNU tools require elfiamcu to take up the entire OS field, so, e.g. i?86-*-linux-elfiamcu is not considered a legal triple. Make us compatible. Differential Revision: http://reviews.llvm.org/D14081 llvm-svn: 251390	2015-10-27 07:23:59 +00:00
Davide Italiano	c692688cbd	[SimplifyLibCalls] Use range-based loop. No functional change. llvm-svn: 251383	2015-10-27 04:17:51 +00:00
Craig Topper	ee0c859788	Convert cost table lookup functions to return a pointer to the entry or nullptr instead of the index. This avoid mentioning the table name an extra time and allows the lookup to be done directly in the ifs by relying on the bool conversion of the pointer. While there make use of ArrayRef and std::find_if. llvm-svn: 251382	2015-10-27 04:14:24 +00:00
Chandler Carruth	69798fb5ec	[function-attrs] Refactor code to handle shorter code with early exits. No functionality changed here, but the indentation is substantially reduced and IMO the code is much easier to read. I've also added some helpful comments. This is just a clean-up I wrote while studying the code, and that has been in my backlog for a while. llvm-svn: 251381	2015-10-27 01:41:43 +00:00
Sanjoy Das	63d2b77961	[ValueTracking] Don't special case wrapped ConstantRanges; NFCI Use `getUnsignedMax` directly instead of special casing a wrapped ConstantRange. The previous code would have been "buggy" (and this would have been a semantic change) if LLVM allowed !range metadata to denote full ranges. E.g. in %val = load i1, i1* %ptr, !range !{i1 1, i1 1} ;; == full set ValueTracking would conclude that the high bit (IOW the only bit) in %val was zero. Since !range metadata does not allow empty or full ranges, this change is just a minor stylistic improvement. llvm-svn: 251380	2015-10-27 01:36:06 +00:00
Sanjay Patel	309c4f93e5	[x86] replace integer logic ops with packed SSE FP logic ops If we have an operand to a bitwise logic op that's already in an XMM register and the result is going to be sent to an XMM register, then use an SSE logic op to avoid moves between the integer and vector register files. Related commits: http://reviews.llvm.org/rL248395 http://reviews.llvm.org/rL248399 http://reviews.llvm.org/rL248404 http://reviews.llvm.org/rL248409 http://reviews.llvm.org/rL248415 This should solve PR22428: https://llvm.org/bugs/show_bug.cgi?id=22428 llvm-svn: 251378	2015-10-27 01:28:07 +00:00
Sanjoy Das	49edd3b3a8	[SCEV] Refactor out ScalarEvolution::getDataLayout; NFC llvm-svn: 251375	2015-10-27 00:52:09 +00:00
Steve King	fee370be72	Fix llc crash processing S/UREM for -Oz builds caused by rL250825. When taking the remainder of a value divided by a constant, visitREM() attempts to convert the REM to a longer but faster sequence of instructions. This conversion calls combine() on a speculative DIV instruction. Commit rL250825 may cause this combine() to return a DIVREM, corrupting nearby nodes. Flow eventually hits unreachable(). This patch adds a test case and a check to prevent visitREM() from trying to convert the REM instruction in cases where a DIVREM is possible. See http://reviews.llvm.org/D14035 llvm-svn: 251373	2015-10-27 00:14:06 +00:00
Daniel Sanders	5bf6eab6b8	[mips][ias] Fold needsExpansion() and expandInstruction() together. NFC. Summary: Previously we maintained two separate switch statements that had to be kept in sync. This patch merges them into a single switch. Reviewers: vkalintiris Subscribers: llvm-commits, dsanders Differential Revision: http://reviews.llvm.org/D14012 llvm-svn: 251369	2015-10-26 23:50:00 +00:00
Oleksiy Vyalov	6c2403f3fa	Use Twin instead of std::to_string. http://reviews.llvm.org/D14095 llvm-svn: 251365	2015-10-26 22:37:36 +00:00
Ivan Krasin	465fbe25c4	Fix indents. It's a follow up to r251353. llvm-svn: 251364	2015-10-26 22:35:40 +00:00
Alexey Samsonov	0fb6451ade	[LLVMSymbolize] Don't use LLVMSymbolizer::Options in ModuleInfo. NFC. LLVMSymbolizer::Options is mostly used in LLVMSymbolizer class anyway. Let's keep their usage restricted to that class, especially given that it's worth to move ModuleInfo to a different header, independent from the symbolizer class. llvm-svn: 251363	2015-10-26 22:34:56 +00:00
Sanjay Patel	e9b500f722	reorganize logic; NFCI (retry r251349) This is a preliminary step before adding another optimization to PerformBITCASTCombine(). ..and I really hope it's NFC this time! llvm-svn: 251357	2015-10-26 21:54:14 +00:00
Ivan Krasin	298639a5fd	Move imported entities into DwarfCompilationUnit to speed up LTO linking. Summary: In particular, this CL speeds up the official Chrome linking with LTO by 1.8x. See more details in https://crbug.com/542426 Reviewers: dblaikie Subscribers: jevinskie Differential Revision: http://reviews.llvm.org/D13918 llvm-svn: 251353	2015-10-26 21:36:35 +00:00
Tim Northover	939f089242	ARM: make sure VFP loads and stores are properly aligned. Both VLDRS and VLDRD fault if the memory is not 4 byte aligned, which wasn't really being checked before, leading to faults at runtime. llvm-svn: 251352	2015-10-26 21:32:53 +00:00
Sanjay Patel	f29fed423a	revert r251349; it included code for a functional change llvm-svn: 251350	2015-10-26 21:28:02 +00:00
Sanjay Patel	fdf75452e4	reorganize logic; NFCI This is a preliminary step before adding another optimization to PerformBITCASTCombine(). llvm-svn: 251349	2015-10-26 21:24:09 +00:00
Keno Fischer	277bfaefaf	Initialize BasicAAWrapperPass in it's constructor Summary: This idiom is used elsewhere in LLVM, but was overlooked here. Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13628 llvm-svn: 251348	2015-10-26 21:22:58 +00:00
Alexey Samsonov	ff8a80b477	Fix build failure on GCC 4.7 (old libstdc++ doesn't have std::map::emplace). llvm-svn: 251347	2015-10-26 21:20:37 +00:00
David Blaikie	efbb29153e	Remove use of std::map<>::emplace which is not supported on some older versions of libstdc++ llvm-svn: 251346	2015-10-26 21:10:36 +00:00
Diego Novillo	e822b63681	Remove unused local variable. NFC. llvm-svn: 251344	2015-10-26 20:50:26 +00:00
Peter Collingbourne	99fac80db2	ARM/ELF: Restore original (pre-r251322) logic for deciding whether to use GOT. Unbreaks linking with gold, which cannot resolve direct relocations referring to global symbols. llvm-svn: 251342	2015-10-26 20:46:44 +00:00
Alexey Samsonov	f3ecfd3af4	[LLVMSymbolize] Use symbol table only if function linkage name was requested. Now it's enough to just specify -functions=short without additionally providing -use-symbol-table=false. llvm-svn: 251339	2015-10-26 20:12:29 +00:00
Alexey Samsonov	1d3f3271ac	Fix build error by fully qualifying llvm::make_unique. llvm-svn: 251338	2015-10-26 20:12:27 +00:00
Rui Ueyama	df94852a60	Optimize StringTableBuilder. This is a patch to improve StringTableBuilder's performance. That class' finalize function is very hot particularly in LLD because the function does tail-merge strings in string tables or SHF_MERGE sections. Generic std::sort-style sorter is not efficient for sorting strings. The function implemented in this patch seems to be more efficient. Here's a benchmark of LLD to link Clang with or without this patch. The numbers are medians of 50 runs. -O0 real 0m0.455s real 0m0.430s (5.5% faster) -O3 real 0m0.487s real 0m0.452s (7.2% faster) Since that is a benchmark of the whole linker, the speedup of StringTableBuilder itself is much more than that. http://reviews.llvm.org/D14053 llvm-svn: 251337	2015-10-26 19:58:29 +00:00
Alexey Samsonov	7a952e53f9	[LLVMSymbolize] Use std::unique_ptr more extensively to clarify ownership. llvm-svn: 251336	2015-10-26 19:41:23 +00:00
Igor Laevsky	1ef06559f4	[RS4GC] Strip noalias attribute after statepoint rewrite We should remove noalias along with dereference and dereference_or_null attributes because statepoint could potentially touch the entire heap including noalias objects. Differential Revision: http://reviews.llvm.org/D14032 llvm-svn: 251333	2015-10-26 19:06:01 +00:00
Diego Novillo	7963ea1996	SamplePGO - Add optimization reports. This adds a couple of optimization remarks to the SamplePGO transformation. When it decides to inline a hot function (to mimic the inline stack and repeat useful inline decisions in the original build). It will also report branch destinations. For instance, given the code fragment: 6 if (i < 1000) 7 sum -= i; 8 else 9 sum += -i * rand(); If the 'else' branch is taken most of the time, building this code with -Rpass=sample-profile will produce: a.cc:9:14: remark: most popular destination for conditional branches at small.cc:6:9 [-Rpass=sample-profile] sum += -i * rand(); ^ llvm-svn: 251330	2015-10-26 18:52:53 +00:00
David Blaikie	7b54b525cd	Remove assert(false) in favor of asserting the if conditional it is contained within. Also adjust the code to avoid 3 redundant map lookups. llvm-svn: 251327	2015-10-26 18:41:13 +00:00
David Blaikie	94c83370b5	Move the canonical header to the top of its matching cpp file as per coding convention This ensures that the header will be verified to be standalone (and avoid mistakes like the one fixed in r251178) llvm-svn: 251326	2015-10-26 18:40:56 +00:00
Mehdi Amini	5d303285b9	Add an (optional) identification block in the bitcode Processing bitcode from a different LLVM version can lead to unexpected behavior. The LLVM project guarantees autoupdating bitcode from a previous minor revision for the same major, but can't make any promise when reading bitcode generated from a either a non-released LLVM, a vendor toolchain, or a "future" LLVM release. This patch aims at being more user-friendly and allows a bitcode produce to emit an optional block at the beginning of the bitcode that will contains an opaque string intended to describe the bitcode producer information. The bitcode reader will dump this information alongside any error it reports. The optional block also includes an "epoch" number, monotonically increasing when incompatible changes are made to the bitcode. The reader will reject bitcode whose epoch is different from the one expected. Differential Revision: http://reviews.llvm.org/D13666 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 251325	2015-10-26 18:37:00 +00:00
Evgeniy Stepanov	d1aad26589	[safestack] Fast access to the unsafe stack pointer on AArch64/Android. Android libc provides a fixed TLS slot for the unsafe stack pointer, and this change implements direct access to that slot on AArch64 via __builtin_thread_pointer() + offset. This change also moves more code into TargetLowering and its target-specific subclasses to get rid of target-specific codegen in SafeStackPass. This change does not touch the ARM backend because ARM lowers builting_thread_pointer as aeabi_read_tp, which is not available on Android. The previous iteration of this change was reverted in r250461. This version leaves the generic, compiler-rt based implementation in SafeStack.cpp instead of moving it to TargetLoweringBase in order to allow testing without a TargetMachine. llvm-svn: 251324	2015-10-26 18:28:25 +00:00
Peter Collingbourne	97aae40880	ARM/ELF: Better codegen for global variable addresses. In PIC mode we were previously computing global variable addresses (or GOT entry addresses) by adding the PC, the PC-relative GOT displacement and the GOT-relative symbol/GOT entry displacement. Because the latter two displacements are fixed, we ended up performing one more addition than necessary. This change causes us to compute addresses using a single PC-relative displacement, resulting in a shorter code sequence. This reduces code size by about 4% in a recent build of Chromium for Android. As a result of this change we no longer need to compute the GOT base address in the ARM backend, which allows us to remove the Global Base Reg pass and SDAG lowering for the GOT. We also now no longer use the GOT when addressing a symbol which is known to be defined in the same linkage unit. Specifically, the symbol must have either hidden visibility or a strong definition in the current module in order to not use the the GOT. This is a change from the previous behaviour where we would use the GOT to address externally visible symbols defined in the same module. I think the only cases where this could matter are cases involving symbol interposition, but we don't really support that well anyway. Differential Revision: http://reviews.llvm.org/D13650 llvm-svn: 251322	2015-10-26 18:23:16 +00:00
Alexey Samsonov	145b0fd2a0	Refactor: Simplify boolean conditional return statements in lib/Transforms/Instrumentation Summary: Use clang-tidy to simplify boolean conditional return statements. Differential Revision: http://reviews.llvm.org/D9996 Patch by Richard (legalize@xmission.com)! llvm-svn: 251318	2015-10-26 18:06:40 +00:00
Cong Hou	fff8ccf579	Check the case that the numerator and denominator are both zeros when getting edge probabilities in BPI and return 100% in this case. This issue is triggered in PGO mode when bootstrapping LLVM. It seems that it is not guaranteed that edge weights are always greater than zero which are read from profile data. llvm-svn: 251317	2015-10-26 18:00:17 +00:00
Alexey Samsonov	57f8837ada	Move parts of llvm-symbolizer tool into LLVMSymbolize library. Summary: See http://lists.llvm.org/pipermail/llvm-dev/2015-October/091624.html Reviewers: echristo Subscribers: llvm-commits, aizatsky Differential Revision: http://reviews.llvm.org/D13998 llvm-svn: 251316	2015-10-26 17:56:12 +00:00
Jonas Paulsson	83553d0cac	[SystemZ] LTGFR use regclass should be GR32, not GR64. Discovered by testing int-cmp-44.ll with -verify-machineinstrs (added to test run). llvm-svn: 251299	2015-10-26 15:03:49 +00:00
Jonas Paulsson	7da3820882	[SystemZ] Also clear kill flag for index reg in splitMove(). Discovered by running fp-move-05.ll with -verify-machineinstrs (added to test case run). llvm-svn: 251298	2015-10-26 15:03:41 +00:00
Jonas Paulsson	9525b2c0c8	[SystemZ] Don't forget the CC def op on LTEBRCompare pseudos Discovered by running fp-cmp-02.ll with -verify-machineinstrs (now added to test run). llvm-svn: 251297	2015-10-26 15:03:32 +00:00
Jonas Paulsson	dab7407258	[SystemZ] Tie operands in SystemZShorteInst if MI becomes 2-address. Discovered by testing fp-add-02.ll with -verify-machineinstrs. Test case updated to always run with -verify-machineinstrs. llvm-svn: 251296	2015-10-26 15:03:07 +00:00
Vasileios Kalintiris	165121f326	[mips] Check for the correct error message in tests for interrupt attributes. Instead of XFAIL-ing the tests with the wrong usage of the "interrupt" attribute, we should check that we emit the correct error messages to the user. llvm-svn: 251295	2015-10-26 14:24:30 +00:00
James Molloy	493e57de01	[ValueTracking] Extend r251146 to catch a fairly common case Even though we may not know the value of the shifter operand, it's possible we know the shifter operand is non-zero. This can allow us to infer more known bits - for example: %1 = load %p !range {1, 5} %2 = shl %q, %1 We don't know %1, but we do know that it is nonzero so %2[0] is known zero, and importantly %2 is known non-zero. Calling isKnownNonZero is nontrivially expensive so use an Optional to run it lazily and cache its result. llvm-svn: 251294	2015-10-26 14:10:46 +00:00
Elena Demikhovsky	7a77149391	Loop Vectorizer - skipping "bitcast" before GEP Vectorization of memory instruction (Load/Store) is possible when the pointer is coming from GEP. The GEP analysis allows to estimate the profit. In some cases we have a "bitcast" between GEP and memory instruction. I added code that skips the "bitcast". http://reviews.llvm.org/D13886 llvm-svn: 251291	2015-10-26 13:42:41 +00:00
Igor Breger	e4ddc3f4cd	AVX512: Enabled VPBROADCASTB lowering for v64i8 vectors. Differential Revision: http://reviews.llvm.org/D13896 llvm-svn: 251287	2015-10-26 13:01:02 +00:00
Vasileios Kalintiris	43dff0c033	[mips] Interrupt attribute support for mips32r2+. Summary: This patch adds support for using the "interrupt" attribute on Mips for interrupt handling functions. At this time only mips32r2+ with the o32 ABI with the static relocation model is supported. Unsupported configurations will be rejected Patch by Simon Dardis (+ clang-format & some trivial changes to follow the LLVM coding standards by me). Reviewers: mpf, dsanders Subscribers: dsanders, vkalintiris, llvm-commits Differential Revision: http://reviews.llvm.org/D10768 llvm-svn: 251286	2015-10-26 12:38:43 +00:00
Igor Breger	684af8156c	AVX-512: Use correct extract vector length. Bug https://llvm.org/bugs/show_bug.cgi?id=25318 Differential Revision: http://reviews.llvm.org/D14062 llvm-svn: 251285	2015-10-26 12:26:34 +00:00
Silviu Baranga	b892e35520	[InstCombine] Teach instcombine not to create extra PHI nodes when folding GEPs Summary: InstCombine tries to transform GEP(PHI(GEP1, GEP2, ..)) into GEP(GEP(PHI(...)) when possible. However, this may leave the old PHI node around. Even if we do end up folding the GEPs, having an extra PHI node might not be beneficial. This change makes the transformation more conservative. We now only do this if the PHI has only one use, and can therefore be removed after the transformation. Reviewers: jmolloy, majnemer Subscribers: mcrosier, mssimpso, llvm-commits Differential Revision: http://reviews.llvm.org/D13887 llvm-svn: 251281	2015-10-26 10:25:05 +00:00
James Molloy	72222f5dca	[ARM] Handle the inline asm constraint type 'o' This means "memory with offset" and requires very little plumbing to get working. This fixes PR25317. llvm-svn: 251280	2015-10-26 10:04:52 +00:00
Benjamin Kramer	8604457f2e	Drop code after unreachable. No functionality change. llvm-svn: 251278	2015-10-26 09:55:45 +00:00
Igor Breger	f8e461f920	AVX512: Add AVX-512 not materializable instructions. Otherwise value can be reused , despite its value could be changed - produces incorrect assembler. https://llvm.org/bugs/show_bug.cgi?id=25270 Differential Revision: http://reviews.llvm.org/D14057 llvm-svn: 251275	2015-10-26 08:37:12 +00:00
Lang Hames	4df7ba7a16	[Orc] Add license header to OrcTargetSupport. llvm-svn: 251274	2015-10-26 06:40:28 +00:00
David Majnemer	0993e0b8a1	[MC] Add support for GNU as-compatible binary operator precedence GNU as and Darwin give the various binary operators different precedence. LLVM's MC supported the Darwin semantics but not the GNU semantics. This fixes PR25311. llvm-svn: 251271	2015-10-26 03:15:34 +00:00
David Majnemer	a375b26144	[MC] Don't crash when .word is given bogus values We didn't validate that the .word directive was given a sane value, leading to crashes when we attempt to write out the object file. Instead, perform some validation and issue a diagnostic pointing at the start of the diagnostic. llvm-svn: 251270	2015-10-26 02:45:50 +00:00
Benjamin Kramer	8ceb323bb4	Convert assert(false) into llvm_unreachable where it makes sense. llvm-svn: 251266	2015-10-25 22:28:27 +00:00
Davide Italiano	f04d89bdb4	[ScalarEvolution] Throw away dead code. llvm-svn: 251256	2015-10-25 20:00:49 +00:00
Davide Italiano	2071f4cc5a	[ScalarEvolution] Get rid of NDEBUG in header (correctly this time). llvm-svn: 251255	2015-10-25 19:55:24 +00:00
Sanjoy Das	15c4c4604f	[LCSSA] Unbreak build, don't reuse L; NFC The build broke in r251248. llvm-svn: 251251	2015-10-25 19:27:17 +00:00
Davide Italiano	0c34243ac1	[ScalarEvolution] Get rid of NDEBUG in header. llvm-svn: 251249	2015-10-25 19:13:36 +00:00
Sanjoy Das	331521c688	[LCSSA] Use range for loops; NFC llvm-svn: 251248	2015-10-25 19:08:32 +00:00
Simon Pilgrim	ec6db262e0	[X86][SSE4A] Fix for EXTRQI shuffle lowering. Incorrect range test - found during fuzz testing. llvm-svn: 251245	2015-10-25 17:40:54 +00:00
Elena Demikhovsky	092858588a	Scalarizer for masked.gather and masked.scatter intrinsics. When the target does not support these intrinsics they should be converted to a chain of scalar load or store operations. If the mask is not constant, the scalarizer will build a chain of conditional basic blocks. I added isLegalMaskedGather() isLegalMaskedScatter() APIs. Differential Revision: http://reviews.llvm.org/D13722 llvm-svn: 251237	2015-10-25 15:37:55 +00:00
Michael Kuperstein	eaa16005af	[X86] Use correct calling convention for MCU psABI libcalls When using the MCU psABI, compiler-generated library calls should pass some parameters in-register. However, since inreg marking for x86 is currently done by the front end, it will not be applied to backend-generated calls. This is a workaround for PR3997, which describes a similar issue for -mregparm. Differential Revision: http://reviews.llvm.org/D13977 llvm-svn: 251223	2015-10-25 08:14:05 +00:00
Michael Kuperstein	fe897623f3	[X86] Add support for elfiamcu triple This adds support for the i?86-*-elfiamcu triple, which indicates the IAMCU psABI is used. Differential Revision: http://reviews.llvm.org/D13977 llvm-svn: 251222	2015-10-25 08:07:37 +00:00
Craig Topper	eda02a905e	Remove two unnecessary conversions from MVT to EVT. NFC llvm-svn: 251219	2015-10-25 03:15:29 +00:00
Craig Topper	7bf52c9d26	Use MVT::SimpleValueType instead of MVT in template parameter. NFC llvm-svn: 251217	2015-10-25 00:27:14 +00:00
Rafael Espindola	84921b9860	Refactor: Simplify boolean conditional return statements in lib/CodeGen. Patch by Richard. llvm-svn: 251213	2015-10-24 23:11:13 +00:00
Simon Pilgrim	53c2bff5fe	[X86][SSE] Use lowerVectorShuffleWithUNPCK instead of custom matches. Most 128-bit and 256-bit shuffles were manually matching UNPCK patterns - use lowerVectorShuffleWithUNPCK to be more thorough. llvm-svn: 251211	2015-10-24 22:45:04 +00:00
Simon Pilgrim	fdfed5143c	[X86][SSE] lowerVectorShuffleWithUNPCK - use equivalent shuffle mask test. Use isShuffleEquivalent to match UNPCK shuffles - better support for build vector inputs. llvm-svn: 251207	2015-10-24 20:48:08 +00:00
Michael Zolotukhin	1eeb2da7d4	Refactor: Simplify boolean conditional return statements in lib/Transforms/Vectorize (NFC). Summary: Use clang-tidy to simplify boolean conditional return statements Differential Revision: http://reviews.llvm.org/D10003 Patch by Richard<legalize@xmission.com> llvm-svn: 251206	2015-10-24 20:16:42 +00:00
Simon Pilgrim	3448cbcc51	[DAGCombiner] Tidy up ConstantFP commutation. NFCI Move ConstantFP canonicalization of commutative instructions to start of 2-op node creation (matches integer) - simplifies constant folding code. llvm-svn: 251203	2015-10-24 20:06:18 +00:00
Benjamin Kramer	5611561e99	Use all_of to simplify control flow. NFC. llvm-svn: 251202	2015-10-24 19:30:37 +00:00
Yaron Keren	57fa135b40	Add libuuid to required system libraries list for mingw. This list is produced by llvm-config --system-libs to be used by external programs using the llvm libraries, such as creduce. In r250501 llvm/Support/Windows/Path.inc started to use the constant FOLDERID_Profile from libuuid. llvm-svn: 251201	2015-10-24 19:27:28 +00:00
Benjamin Kramer	74b6d3b967	Use find_if to simplify control flow. NFC. llvm-svn: 251200	2015-10-24 19:03:15 +00:00
Simon Pilgrim	7430804fe1	[DAGCombiner] Generalize masking of constant rotates. We don't need a mask of a rotation result to be a constant splat - any constant scalar/vector can be usefully folded. Followup to D13851. llvm-svn: 251197	2015-10-24 18:44:52 +00:00
Craig Topper	272d6a57bb	Call the version of ConvertCostTableLookup that takes a statically sized array rather than pointer and size. NFC llvm-svn: 251196	2015-10-24 18:40:22 +00:00
Hans Wennborg	34d40434a7	X86ISelLowering: Support tail calls to/from callee pop functions This enables tail calls with thiscall, stdcall, vectorcall and fastcall functions. Differential Revision: http://reviews.llvm.org/D13999 llvm-svn: 251190	2015-10-24 16:47:10 +00:00
Simon Pilgrim	e379fe0ddb	Fix unused variable warning. NFC. llvm-svn: 251189	2015-10-24 13:41:45 +00:00
Simon Pilgrim	d5ef318b5b	[X86][XOP] Add support for lowering vector rotations This patch adds support for lowering to the XOP VPROT / VPROTI vector bit rotation instructions. This has required changes to the DAGCombiner rotation pattern matching to support vector types - so far I've only changed it to support splat vectors, but generalising this further is feasible in the future. Differential Revision: http://reviews.llvm.org/D13851 llvm-svn: 251188	2015-10-24 13:17:26 +00:00
Benjamin Kramer	7ecf8c22cf	[TblGen] ArrayRefize TGParser. No functional change intended. llvm-svn: 251186	2015-10-24 12:46:45 +00:00
Benjamin Kramer	557b601b08	[BasicAliasAnalysis] Simplify expression, no functional change. (-1) - x + 1 is the same as -x. llvm-svn: 251185	2015-10-24 11:38:01 +00:00
NAKAMURA Takumi	26c3872666	ScalarReplAggregates.cpp: Try to appease clash of anonymous::SROA in modules build. llvm-svn: 251181	2015-10-24 06:42:42 +00:00
Sanjoy Das	a7e13782f1	Extract out getConstantRangeFromMetadata; NFC The loop idiom creating a ConstantRange is repeated twice in the codebase, time to give it a name and a home. The loop is also repeated in `rangeMetadataExcludesValue`, but using `getConstantRangeFromMetadata` there would not be an NFC -- the range returned by `getConstantRangeFromMetadata` may contain a value that none of the subranges did. llvm-svn: 251180	2015-10-24 05:37:35 +00:00
Sanjoy Das	bb5ffc50b7	Fix whitespace issues in two places; NFC llvm-svn: 251179	2015-10-24 05:37:28 +00:00
Kostya Serebryany	9cc3b0ddb6	[libFuzzer] add -merge flag to merge corpora llvm-svn: 251168	2015-10-24 01:16:40 +00:00
Matt Arsenault	2ea0a23f18	AMDGPU: Print modifiers when dumping AMDGPUOperand llvm-svn: 251160	2015-10-24 00:12:56 +00:00
Igor Laevsky	dde0029a25	[RS4GC] Rename stripDereferenceabilityInfo into stripNonValidAttributes. llvm-svn: 251157	2015-10-23 22:42:44 +00:00
Rafael Espindola	21956e4007	Add a RAW mode to StringTableBuilder. In this mode it just tries to tail merge the strings without imposing any other format constrains. It will not, for example, add a null byte between them. Also add support for keeping a tentative size and offset if we decide to not optimize after all. This will be used shortly in lld for merging SHF_STRINGS sections. llvm-svn: 251153	2015-10-23 21:48:05 +00:00
Chen Li	7009cd3554	Revert rL251061 [SimplifyCFG] Extend SimplifyResume to handle phi of trivial landing pad. llvm-svn: 251149	2015-10-23 21:13:01 +00:00
Hal Finkel	f2199b2178	Handle non-constant shifts in computeKnownBits, and use computeKnownBits for constant folding in InstCombine/Simplify First, the motivation: LLVM currently does not realize that: ((2072 >> (L == 0)) >> 7) & 1 == 0 where L is some arbitrary value. Whether you right-shift 2072 by 7 or by 8, the lowest-order bit is always zero. There are obviously several ways to go about fixing this, but the generic solution pursued in this patch is to teach computeKnownBits something about shifts by a non-constant amount. Previously, we would give up completely on these. Instead, in cases where we know something about the low-order bits of the shift-amount operand, we can combine (and together) the associated restrictions for all shift amounts consistent with that knowledge. As a further generalization, I refactored all of the logic for all three kinds of shifts to have this capability. This works well in the above case, for example, because the dynamic shift amount can only be 0 or 1, and thus we can say a lot about the known bits of the result. This brings us to the second part of this change: Even when we know all of the bits of a value via computeKnownBits, nothing used to constant-fold the result. This introduces the necessary code into InstCombine and InstSimplify. I've added it into both because: 1. InstCombine won't automatically pick up the associated logic in InstSimplify (InstCombine uses InstSimplify, but not via the API that passes in the original instruction). 2. Putting the logic in InstCombine allows the resulting simplifications to become part of the iterative worklist 3. Putting the logic in InstSimplify allows the resulting simplifications to be used by everywhere else that calls SimplifyInstruction (inlining, unrolling, and many others). And this requires a small change to our definition of an ephemeral value so that we don't break the rest case from r246696 (where the icmp feeding the @llvm.assume, is also feeding a br). Under the old definition, the icmp would not be considered ephemeral (because it is used by the br), but this causes the assume to remove itself (in addition to simplifying the branch structure), and it seems more-useful to prevent that from happening. llvm-svn: 251146	2015-10-23 20:37:08 +00:00
Tim Northover	d4f55c0b1b	GVN: don't try to replace instruction with itself. After some look-ahead PRE was added for GEPs, an instruction could end up in the table of candidates before it was actually inspected. When this happened the pass might decide it was the best candidate to replace itself. This didn't go well. Should fix PR25291 llvm-svn: 251145	2015-10-23 20:30:02 +00:00
Rafael Espindola	a9b3944c0e	Fix the variable names to match the LLVM style. llvm-svn: 251143	2015-10-23 20:15:35 +00:00
Sanjoy Das	52f7b08b4a	[SCEV] Fix stylistic issue in MatchBinaryAddToConst; NFCI Instead of checking `(FlagsPresent & ExpectedFlags) != 0`, check `(FlagsPresent & ExpectedFlags) == ExpectedFlags`. Right now they're equivalent since `ExpectedFlags` can only be either `FlagNUW` or `FlagNSW`, but if we ever pass in `ExpectedFlags` as `FlagNUW \| FlagNSW` then checking `(FlagsPresent & ExpectedFlags) != 0` would be wrong. llvm-svn: 251142	2015-10-23 20:09:57 +00:00
Sanjoy Das	0a1bee8a80	[Inliner] Don't inline through callsites with operand bundles Summary: This change teaches the LLVM inliner to not inline through callsites with unknown operand bundles. Currently all operand bundles are "unknown" operand bundles but in the near future we will add support for inlining through some select kinds of operand bundles. Reviewers: reames, chandlerc, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14001 llvm-svn: 251141	2015-10-23 20:09:55 +00:00

1 2 3 4 5 ...

83997 Commits