llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	85a1f5c20c	[AVX-512] Add tests for masked palignr/valignd/valignq shuffles, many of which show failures to fold the masking into the operation. Many of these problems are because shuffle lowering widens element size and reduces element count when possible. This causes the shuffle to become separated from the select by a bitcast. Future patches will work to improve these cases by rewriting the shuffle back to a narrow element type if we think it can result in folding the mask. llvm-svn: 287503	2016-11-20 19:50:32 +00:00
Coby Tayree	99a6639047	The 'vpmultishiftqb' instruction was implemented falsely, this patch amend it. More specifically - (MS dialect) broadcasting variants were implemented falsely. Differential Revision: https://reviews.llvm.org/D26257 llvm-svn: 287501	2016-11-20 17:19:55 +00:00
Coby Tayree	97e9cf62f4	Some instructions were missing, other implemented falsely. this patch aims at amending those issues. full list: vcvtps2pd vcvtudq2pd vcvtps2qq vcvttps2qq vcvtps2uqq vcvttps2uqq variants are: [Dst]XMM(zero-masked/merge-masked/unmasked) [Src]Mem64 Differential Revision: https://reviews.llvm.org/D26799 llvm-svn: 287500	2016-11-20 17:09:56 +00:00
Simon Pilgrim	5fadce4a3f	[X86][AVX512] Combine unary + zero target shuffles to VPERMV3 with a zero vector where possible llvm-svn: 287497	2016-11-20 16:11:36 +00:00
Simon Pilgrim	5401bae523	[X86][AVX512] Add support for VBMI VPERMV3 target shuffle combines llvm-svn: 287496	2016-11-20 15:24:38 +00:00
Simon Pilgrim	3f40412e0f	[X86][AVX512] Add support for VBMI VPERMV target shuffle combines llvm-svn: 287495	2016-11-20 15:05:45 +00:00
Simon Pilgrim	9e3f5cc015	[X86][AVX512] Add some initial VBMI target shuffle combine tests llvm-svn: 287494	2016-11-20 14:45:46 +00:00
Simon Pilgrim	c17e1b74b8	[X86][AVX512VL] Removed duplicate operation action Basic AVX512F already declared uint_to_fp v4i32 as legal llvm-svn: 287493	2016-11-20 14:19:29 +00:00
Simon Pilgrim	3f10e9953d	Strip trailing whitespace llvm-svn: 287492	2016-11-20 14:05:23 +00:00
Simon Pilgrim	096b6d4f81	[X86][AVX512F] Add support for uint_to_fp v2i32 to v2f64 on AVX512F-only targets Use 512-bit instructions (we already do something similar for uint_to_fp v4i32 to v4f64) llvm-svn: 287491	2016-11-20 14:03:23 +00:00
Simon Pilgrim	f2fbf43704	Fix comment typos. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287490	2016-11-20 13:47:59 +00:00
Simon Pilgrim	dae11f7aab	Fix spelling mistakes in Tools/Tests comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287489	2016-11-20 13:31:13 +00:00
Simon Pilgrim	7d18a70dac	Fix spelling mistakes in Transforms comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287488	2016-11-20 13:19:49 +00:00
Simon Pilgrim	7a6b6d5656	Fix spelling mistakes in SelectionDAG comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287487	2016-11-20 13:14:57 +00:00
Simon Pilgrim	fbd2221de5	Fix comment typos. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287486	2016-11-20 13:10:51 +00:00
Oren Ben Simhon	c0f073b67f	[X86] RegCall - Handling long double arguments The change is part of RegCall calling convention support for LLVM. Long double (f80) requires special treatment as the first f80 parameter is saved in FP0 (floating point stack). This review present the change and the corresponding tests. Differential Revision: https://reviews.llvm.org/D26151 llvm-svn: 287485	2016-11-20 11:06:07 +00:00
Coby Tayree	179ff0e541	[X86][InlineAsm]Test commit. Fixing a wrong comment on X86AsmParser.cpp::ParseZ: "true" --> "false" Differential Revision: https://reviews.llvm.org/D26797 llvm-svn: 287484	2016-11-20 09:31:11 +00:00
Serge Pavlov	f258ff1fa9	Fix file name resolution in nested response files If a response file in construct `@file` was specified by relative name, constructs `@file` nested within it were resolved incorrectly if the flag RelativeNames in call to ExpandResponseFile was set to true. This feature is used in configuration files, tests for it are in respective change (see D24933). llvm-svn: 287482	2016-11-20 06:25:07 +00:00
Saleem Abdulrasool	b14fc390dc	ExceptionDemo: remove some undefined behaviour The casting based reading of the LSDA could attempt to read unsuitably aligned data. Avoid that case by explicitly using a memcpy. A similar approach is used in libc++abi to address the same UB. llvm-svn: 287479	2016-11-20 02:36:38 +00:00
Saleem Abdulrasool	c0e4e7d990	ExceptionDemo: prefer headers over redeclarations Rather than redeclaring the interfaces for exceptions, prefer using the `unwind.h` header. This is vended by at least gcc and clang, and can also be found by an external unwinding library (e.g. libunwind). Doing this simplifies the example to the exception handling itself. Minor tweaks are the result of _Unwind_Context_t not being defined, which is just a typedef for struct _Unwind_Context *. NFC. llvm-svn: 287478	2016-11-20 02:36:36 +00:00
Alexei Starovoitov	e6ddac0def	[bpf] add BPF disassembler add BPF disassembler, so tools like llvm-objdump can be used: $ llvm-objdump -d -no-show-raw-insn ./sockex1_kern.o ./sockex1_kern.o: file format ELF64-BPF Disassembly of section socket1: bpf_prog1: 0: r6 = r1 8: r0 = (u8 )skb[23] 10: (u32 )(r10 - 4) = r0 18: r1 = (u32 )(r6 + 4) 20: if r1 != 4 goto 8 28: r2 = r10 30: r2 += -4 ld_imm64 (the only 16-byte insn) and special ld_abs/ld_ind instructions had to be treated in a special way. The decoders for the rest of the insns are automatically generated. Add tests to cover new functionality. Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 287477	2016-11-20 02:25:00 +00:00
Rui Ueyama	e5669cecde	Attempt to fix big-endian buildbots. llvm-svn: 287476	2016-11-20 01:41:28 +00:00
Rui Ueyama	567d9c4b8f	Style fix. NFC. llvm-svn: 287475	2016-11-20 01:15:56 +00:00
Rui Ueyama	218072a989	Fix buildbot. llvm-svn: 287474	2016-11-20 01:13:22 +00:00
Rui Ueyama	fe33661ab0	SHA1: unroll loop in hashBlock. This code is taken from public domain. https://github.com/jsonn/src/blob/trunk/common/lib/libc/hash/sha1/sha1.c I wrote a sha1 command and ran it on my Xeon E5-2680 v2 2.80GHz machine. Here is a result. The new hash function is 37% faster than before. Performance counter stats for './llvm-sha1-old /ssd/build/bin/lld' (10 runs): 6640.503687 task-clock (msec) # 1.001 CPUs utilized ( +- 0.03% ) 54 context-switches # 0.008 K/sec ( +- 5.03% ) 5 cpu-migrations # 0.001 K/sec ( +- 31.73% ) 183,803 page-faults # 0.028 M/sec ( +- 0.00% ) 18,527,954,113 cycles # 2.790 GHz ( +- 0.03% ) 4,993,237,485 stalled-cycles-frontend # 26.95% frontend cycles idle ( +- 0.11% ) <not supported> stalled-cycles-backend 50,217,149,423 instructions # 2.71 insns per cycle # 0.10 stalled cycles per insn ( +- 0.00% ) 6,094,322,337 branches # 917.750 M/sec ( +- 0.00% ) 11,778,239 branch-misses # 0.19% of all branches ( +- 0.01% ) 6.634017401 seconds time elapsed ( +- 0.03% ) Performance counter stats for './llvm-sha1-new /ssd/build/bin/lld' (10 runs): 4167.062720 task-clock (msec) # 1.001 CPUs utilized ( +- 0.02% ) 52 context-switches # 0.012 K/sec ( +- 16.45% ) 7 cpu-migrations # 0.002 K/sec ( +- 32.20% ) 183,804 page-faults # 0.044 M/sec ( +- 0.00% ) 11,626,611,958 cycles # 2.790 GHz ( +- 0.02% ) 4,491,897,976 stalled-cycles-frontend # 38.63% frontend cycles idle ( +- 0.05% ) <not supported> stalled-cycles-backend 24,320,180,617 instructions # 2.09 insns per cycle # 0.18 stalled cycles per insn ( +- 0.00% ) 1,574,674,576 branches # 377.886 M/sec ( +- 0.00% ) 11,769,693 branch-misses # 0.75% of all branches ( +- 0.00% ) 4.163251552 seconds time elapsed ( +- 0.02% ) Differential Revision: https://reviews.llvm.org/D26890 llvm-svn: 287473	2016-11-20 01:03:22 +00:00
Saleem Abdulrasool	a577509f0a	Demangle: remove references to allocator for default allocator The demangler had stopped using a custom allocator but had not been updated to remove the use of the explicit allocator passing. This removes that as we do not need to do anything special here anymore. This just makes the code more compact. NFC. llvm-svn: 287472	2016-11-20 00:20:27 +00:00
Saleem Abdulrasool	54ec3f9cf8	Demangle: remove unnecessary typedef for std::vector We could create a local typedef for std::vector called Vector. Inline the use of std::vector rather than use the typedef. NFC. llvm-svn: 287471	2016-11-20 00:20:25 +00:00
Saleem Abdulrasool	be1fd54f85	Demangle: replace custom typedef for std::string with std::string We created a local typedef for `std::basic_string<char, std::char_traits<char>>` which is just `std::string`. Remove the local typedef and propagate the type information through the rest of the demangler. NFC. llvm-svn: 287470	2016-11-20 00:20:23 +00:00
Saleem Abdulrasool	0da9050976	Demangle: use direct member initialization (NFC) Prefer direct member initialization over the explicit out-of-line initialization for the construction of the local type. NFC. llvm-svn: 287469	2016-11-20 00:20:20 +00:00
Benjamin Kramer	ffd3715d16	Give some helper classes/functions internal linkage. NFC. llvm-svn: 287462	2016-11-19 20:44:26 +00:00
Simon Pilgrim	a14e0cb852	[X86][SSE] Improve PSHUFB lowering from either input Canonicalization may leave the zeroable vector in the first input. llvm-svn: 287461	2016-11-19 20:41:48 +00:00
Simon Pilgrim	623a7c57b5	[X86][AVX512] Add VPERMV/VPERMV3 v64i8 byte shuffles on avx512vbmi targets llvm-svn: 287459	2016-11-19 20:12:34 +00:00
Mehdi Amini	fec2158292	[ThinLTO] Fix crash when importing an opaque type It seems that because ThinLTO does not import the full module, some invariant of the type mapper are broken. In Monolithic LTO, we import every globals: when calling IRLinker::copyFunctionProto() on @foo(), we end-up calling TypeMapTy::get(FTy) on the type of @foo(), which will map %0 and record the destination as opaque. ThinLTO skips this because @foo is not imported and goes directly to the next stage. Next we call computeTypeMapping() that map the types for each globals, and ends up checking for type isomorphism, and may add type mapping. However it doesn't record if there was an opaque destination type that was resolved. Instead of lazily "discovering" opaque type in the destination module on the go, we change the TypeFinder to eagerly record all types and not only the named ones. Differential Revision: https://reviews.llvm.org/D26840 llvm-svn: 287453	2016-11-19 18:44:16 +00:00
Mehdi Amini	19f176b982	[ThinLTO] Implement -pass-remarks-output in ThinLTOCodeGenerator Summary: This will also be added to the LTO API, right now this will bring ThinLTO on par with Monolithic LTO on Darwin. Reviewers: anemet Subscribers: tejohnson, llvm-commits Differential Revision: https://reviews.llvm.org/D26886 llvm-svn: 287450	2016-11-19 18:20:05 +00:00
Mehdi Amini	6f40836823	Change setDiagnosticsOutputFile to take a unique_ptr from a raw pointer (NFC) Summary: This makes it explicit that ownership is taken. Also replace all `new` with make_unique<> at call sites. Reviewers: anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26884 llvm-svn: 287449	2016-11-19 18:19:41 +00:00
Simon Pilgrim	f011f7e160	[X86][AVX512] Add avx512vbmi tests llvm-svn: 287447	2016-11-19 18:12:48 +00:00
Simon Pilgrim	28f1e0dab9	[X86][AVX512] Added some more complex v64i8 shuffles llvm-svn: 287444	2016-11-19 17:50:14 +00:00
Craig Topper	893ea9fb2c	[X86] Simplify some code a little by removing a dulicate variable and combinining two if statements. NFCI llvm-svn: 287443	2016-11-19 17:33:17 +00:00
Daniel Sanders	c95590bc45	Try again to fix unused variable warning on lld-x86_64-darwin13 after r287439. The previous attempt didn't work. I assume LLVM_ATTRIBUTE_UNUSED isn't available on that machine. llvm-svn: 287442	2016-11-19 14:47:41 +00:00
Daniel Sanders	c6d1986a84	Try to fix unused variable warning on lld-x86_64-darwin13 after r287439. Whether the variable is used or not depends on NDEBUG. llvm-svn: 287440	2016-11-19 13:50:32 +00:00
Daniel Sanders	72db2a390a	Check that emitted instructions meet their predicates on all targets except ARM, Mips, and X86. Summary: * ARM is omitted from this patch because this check appears to expose bugs in this target. * Mips is omitted from this patch because this check either detects bugs or deliberate emission of instructions that don't satisfy their predicates. One deliberate use is the SYNC instruction where the version with an operand is correctly defined as requiring MIPS32 while the version without an operand is defined as an alias of 'SYNC 0' and requires MIPS2. * X86 is omitted from this patch because it doesn't use the tablegen-erated MCCodeEmitter infrastructure. Patches for ARM and Mips will follow. Depends on D25617 Reviewers: tstellarAMD, jmolloy Subscribers: wdng, jmolloy, aemerson, rengolin, arsenm, jyknight, nemanjai, nhaehnle, tstellarAMD, llvm-commits Differential Revision: https://reviews.llvm.org/D25618 llvm-svn: 287439	2016-11-19 13:05:44 +00:00
Daniel Sanders	ca89f3a19b	[tablegen] Merge duplicate definitions of getMinimalTypeForRange. NFC. Summary: Depends on D25614 Reviewers: qcolombet Subscribers: qcolombet, beanz, mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D25617 llvm-svn: 287438	2016-11-19 12:21:34 +00:00
Chris Bieneman	671a1279b6	[CMake] llvm-lto2 depends on intrinsics_gen llvm-lto2.cpp has the following include chain: llvm/LTO/Caching.h llvm/LTO/LTO.h llvm/CodeGen/Analysis.h llvm/IR/CallSite.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-lto2 needs to depend on intrinsics_gen. llvm-svn: 287434	2016-11-19 03:19:58 +00:00
Chris Bieneman	367cf3c22c	[CMake] opt depends on intrinsics_gen AnalysisWrappers.cpp has the following include chain: llvm/Analysis/CallGraph.h llvm/IR/CallSite.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means opt needs to depend on intrinsics_gen. llvm-svn: 287433	2016-11-19 03:18:50 +00:00
Chris Bieneman	458796ddf8	[CMake] llvm-nm depends on intrinsics_gen llvm-nm.cpp has the following include chain: llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-nm needs to depend on intrinsics_gen. llvm-svn: 287432	2016-11-19 03:16:33 +00:00
Chris Bieneman	b2b18d2ada	[CMake] llvm-link depends on intrinsics_gen llvm-link.cpp has the following include chain: llvm/Bitcode/BitcodeWriter.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-link needs to depend on intrinsics_gen. llvm-svn: 287431	2016-11-19 02:36:28 +00:00
Chris Bieneman	1bc4fab8cd	[CMake] llvm-extract depends on intrinsics_gen llvm-extract.cpp has the following include chain: llvm/Bitcode/BitcodeWriterPass.h llvm/IR/PassManager.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-extract needs to depend on intrinsics_gen. llvm-svn: 287430	2016-11-19 02:33:57 +00:00
Chris Bieneman	4e826e0986	[CMake] llvm-dwp depends on intrinsics_gen llvm-dwp.cpp has the following include chain: llvm/CodeGen/AsmPrinter.h llvm/CodeGen/MachineFunctionPass.h llvm/CodeGen/MachineFunction.h llvm/CodeGen/MachineBasicBlock.h llvm/CodeGen/MachineInstr.h llvm/Analysis/AliasAnalysis.h llvm/IR/CallSite.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-dwp needs to depend on intrinsics_gen. llvm-svn: 287429	2016-11-19 02:33:42 +00:00
Chris Bieneman	e525ef9cd1	[CMake] llvm-dis depends on intrinsics_gen llvm-dis.cpp has the following include chain: llvm/Bitcode/BitcodeReader.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-dis needs to depend on intrinsics_gen. llvm-svn: 287428	2016-11-19 02:31:14 +00:00
Chris Bieneman	041c1102eb	[CMake] llvm-diff depends on intrinsics_gen llvm-diff.cpp has the following include chain: llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-diff needs to depend on intrinsics_gen. llvm-svn: 287427	2016-11-19 02:28:18 +00:00
Chris Bieneman	70390f5d22	[CMake] llvm-stress depends on intrinsics_gen llvm-stress.cpp has the following include chain: llvm/Analysis/CallGraphSCCPass.h llvm/Analysis/CallGraph.h llvm/IR/CallSite.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-stress needs to depend on intrinsics_gen. llvm-svn: 287426	2016-11-19 02:25:54 +00:00
Chris Bieneman	d9d28a74b5	[CMake] bugpoint-passes depends on intrinsics_gen TestPasses.cpp has the following include chain: llvm/IR/InstVisitor.h llvm/IR/CallSite.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means bugpoint-passes needs to depend on intrinsics_gen. llvm-svn: 287425	2016-11-19 02:20:59 +00:00
Chris Bieneman	a3acfaa5cd	[CMake] llvm-bcanalyzer depends on intrinsics_gen llvm-bcanalyzer.cpp has the following include chain: llvm/Bitcode/BitcodeReader.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-bcanalyzer needs to depend on intrinsics_gen. llvm-svn: 287424	2016-11-19 02:17:12 +00:00
Chris Bieneman	ac6ab6fdb6	[CMake] llvm-as depends on intrinsics_gen llvm-as.cpp has the following include chain: llvm/Bitcode/BitcodeWriter.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-as needs to depend on intrinsics_gen. llvm-svn: 287423	2016-11-19 02:15:04 +00:00
Chris Bieneman	e5cb14cf27	[CMake] llc depends on intrinsics_gen llc.cpp has the following include chain: llvm/Analysis/TargetLibraryInfo.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llc needs to depend on intrinsics_gen. llvm-svn: 287422	2016-11-19 02:12:03 +00:00
Chris Bieneman	d7f71b5187	[CMake] lli-child-target depends on intrinsics gen Messed up in r287420, it isn't just lli, but also but lli-child-target that need to depend on intrinsics_gen. llvm-svn: 287421	2016-11-19 02:09:51 +00:00
Chris Bieneman	3bd0191c5b	[CMake] lli depends on intrinsics_gen ChildTarget.cpp has the following include chain: llvm/ExecutionEngine/Orc/OrcABISupport.h llvm/ExecutionEngine/Orc/IndirectionUtils.h llvm/IR/IRBuilder.h llvm/IR/ConstantFolder.h llvm/IR/InstrTypes.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means lli needs to depend on intrinsics_gen. llvm-svn: 287420	2016-11-19 02:05:19 +00:00
Chris Bieneman	d22fa5091c	[CMake] llvm-dsymutil depends on intrinsics_gen DwarfLinker.cpp has the following include chain: llvm/CodeGen/AsmPrinter.h llvm/CodeGen/MachineFunctionPass.h llvm/CodeGen/MachineFunction.h llvm/CodeGen/MachineBasicBlock.h llvm/CodeGen/MachineInstr.h llvm/Analysis/AliasAnalysis.h llvm/IR/CallSite.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-dsymutil needs to depend on intrinsics_gen. llvm-svn: 287419	2016-11-19 02:02:46 +00:00
Dylan McKay	1a55f201ef	[AVR] Remove a bunch of unused variables llvm-svn: 287416	2016-11-19 01:33:42 +00:00
Chris Bieneman	958edcbdd6	[CMake] Apply sandbox profile to target not directory When LLVM_DEPENDENCY_DEBUGGING=On we should apply the sandbox only on the target, not the directory. This is important for directories that create more than one target, or for nested directories. llvm-svn: 287415	2016-11-19 01:32:09 +00:00
Dylan McKay	19270f3438	[AVR] Remove a variable which was unused in release mode In release mode where assertions are not enabled, this caused an 'unused variable' warning. llvm-svn: 287414	2016-11-19 01:14:44 +00:00
Chris Bieneman	9c520d75b9	[CMake] verify-uselistorder depends on intrinsics_gen verify-uselistorder.cpp has the following include chain: llvm/Bitcode/BitcodeReader.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means verify-uselistorder needs to depend on intrinsics_gen. llvm-svn: 287405	2016-11-18 23:30:58 +00:00
Chris Bieneman	585b4a3e39	[CMake] sanstats depends on intrinsics_gen sanstats.cpp has the following include chain: llvm/Transforms/Utils/SanitizerStats.h llvm/IR/IRBuilder.h llvm/IR/ConstantFolder.h llvm/IR/InstrTypes.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means sanstats needs to depend on intrinsics_gen. llvm-svn: 287404	2016-11-18 23:30:39 +00:00
Kuba Mracek	fe16c1ff14	[lit] When setting SDKROOT on Darwin, use '--sdk macosx' to find the right SDK path. This will make sure that we find an actual path in case you have Command Line Tools installed. llvm-svn: 287403	2016-11-18 23:25:57 +00:00
Chris Bieneman	6cc58e09c8	[CMake] bugpoint depends on intrinsics_gen CrashDebugger.cpp has the following include chain: llvm/Analysis/TargetTransformInfo.h llvm/IR/IntrinsicInst.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means bugpoint needs to depend on intrinsics_gen. llvm-svn: 287402	2016-11-18 23:25:30 +00:00
Sanjay Patel	47e577eb92	[InstCombine] add tests to show likely unwanted select widening; NFC This is a prerequisite patch for D26556: https://reviews.llvm.org/D26556 ...because there was no direct coverage for these folds (which in some cases are adding instructions). llvm-svn: 287400	2016-11-18 23:22:00 +00:00
Chris Bieneman	93fa1860d1	[CMake] llvm-split depends on intrinsics_gen llvm-split.cpp has the following include chain: llvm/Bitcode/BitcodeWriter.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-split needs to depend on intrinsics_gen. llvm-svn: 287399	2016-11-18 23:20:38 +00:00
Chris Bieneman	26df11770e	[CMake] llvm-lto depends on intrinsics_gen llvm-lto.cpp has the following include chain: llvm/Bitcode/BitcodeReader.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-lto needs to depend on intrinsics_gen. llvm-svn: 287398	2016-11-18 23:20:35 +00:00
Chris Bieneman	13c963916f	[CMake] llvm-ar depends on intrinsics_gen llvm-ar.cpp has the following include chain: llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-ar needs to depend on intrinsics_gen. llvm-svn: 287395	2016-11-18 23:04:27 +00:00
Chris Bieneman	8e47604975	[CMake] llvm-profdata depends on intrinsics_gen llvm-profdata.cpp has the following include chain: llvm/ProfileData/SampleProfReader.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means llvm-profdata needs to depend on intrinsics_gen. llvm-svn: 287394	2016-11-18 23:04:15 +00:00
Chris Bieneman	caf299ffe1	[CMake] LTO depends on intrinsics_gen lto.cpp has the following include chain: llvm/Bitcode/BitcodeReader.h llvm/IR/ModuleSummaryIndex.h llvm/IR/Module.h llvm/IR/Function.h llvm/IR/Argument.h llvm/IR/Attributes.h llvm/IR/Attributes.gen This means LTO needs to depend on intrinsics_gen. llvm-svn: 287393	2016-11-18 23:03:51 +00:00
Konstantin Zhuravlyov	aefee42e0f	[AMDGPU] Change frexp.exp intrinsic to return i16 for f16 input Differential Revision: https://reviews.llvm.org/D26862 llvm-svn: 287389	2016-11-18 22:31:08 +00:00
Simon Pilgrim	e40900dddd	[SelectionDAG] Add knowbits support for CONCAT_VECTOR opcode llvm-svn: 287387	2016-11-18 22:21:22 +00:00
Simon Pilgrim	3a5328ecdd	[X86] Add knownbits concat_vector test Support coming in a future patch llvm-svn: 287385	2016-11-18 21:59:38 +00:00
Eugene Zelenko	ae7ac95cc9	[Examples] Fix some Clang-tidy modernize-use-default and Include What You Use warnings; other minor fixes. Differential revision: https://reviews.llvm.org/D26433 llvm-svn: 287384	2016-11-18 21:57:58 +00:00
Michael Zolotukhin	5020c9971b	[LoopSimplify] Preserve LCSSA when removing edges from unreachable blocks. This fixes PR30454. llvm-svn: 287379	2016-11-18 21:01:12 +00:00
Geoff Berry	de50acc31e	[MIRPrinter] XFAIL test for powerpc This test introduced in r287368 is failing on powerpc for reasons unrelated to branch probabilities. See PR31062. llvm-svn: 287375	2016-11-18 20:08:05 +00:00
Mehdi Amini	bf4d8d033b	Revert "Add link-time detection of LLVM_ABI_BREAKING_CHECKS mismatch" This reverts commit r287352, LLDB CI is broken. llvm-svn: 287374	2016-11-18 20:02:34 +00:00
Matthias Braun	db39fd6c53	Statistic/Timer: Include timers in PrintStatisticsJSON(). Differential Revision: https://reviews.llvm.org/D25588 llvm-svn: 287370	2016-11-18 19:43:24 +00:00
Matthias Braun	9f15a79e5d	Timer: Track name and description. The previously used "names" are rather descriptions (they use multiple words and contain spaces), use short programming language identifier like strings for the "names" which should be used when exporting to machine parseable formats. Also removed a unused TimerGroup from Hexxagon. Differential Revision: https://reviews.llvm.org/D25583 llvm-svn: 287369	2016-11-18 19:43:18 +00:00
Geoff Berry	b51774ac8c	[MIRPrinter] Print raw branch probabilities as expected by MIRParser Fixes PR28751. Reviewers: MatzeB, qcolombet Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D26775 llvm-svn: 287368	2016-11-18 19:37:24 +00:00
Matt Arsenault	afe614cb38	AMDGPU: Fix unused variable warning llvm-svn: 287362	2016-11-18 18:33:36 +00:00
Hans Wennborg	105e05a2a4	Fix test from r287353: don't use /dev/null llvm-svn: 287360	2016-11-18 18:27:31 +00:00
Adam Nemet	e9bd022c41	[LTO] Add option to generate optimization records It is used to drive this from the clang driver via -mllvm. Same option name is used as in opt. Differential Revision: https://reviews.llvm.org/D26832 llvm-svn: 287356	2016-11-18 18:06:28 +00:00
Eugene Zelenko	23d071ef87	[DebugInfo] Fix some Clang-tidy modernize-use-default, modernize-use-equal-delete and Include What You Use warnings; other minor fixes (NFC). Per Zachary Turner and Mehdi Amini suggestion to make only post-commit reviews. llvm-svn: 287355	2016-11-18 18:00:19 +00:00
Hans Wennborg	aeacdc258b	IRMover: Avoid accidentally mapping types from the destination module (PR30799) During Module linking, it's possible for SrcM->getIdentifiedStructTypes(); to return types that are actually defined in the destination module (DstM). Depending on how the bitcode file was read, getIdentifiedStructTypes() might do a walk over all values, including metadata nodes, looking for types. In my case, a debug info metadata node was shared between the two modules, and it referred to a type defined in the destination module (see test case). Differential Revision: https://reviews.llvm.org/D26212 llvm-svn: 287353	2016-11-18 17:33:05 +00:00
Mehdi Amini	c311528516	Add link-time detection of LLVM_ABI_BREAKING_CHECKS mismatch Summary: LLVM will define a symbol, either EnableABIBreakingChecks or DisableABIBreakingChecks depending on the configuration setting for LLVM_ABI_BREAKING_CHECKS. The llvm-config.h header will add weak references to these symbols in every clients that includes this header. This should ensure that a mismatch triggers a link failure (or a load time failure for DSO). On MSVC, the pragma "detect_mismatch" is used instead. Reviewers: rnk, jroelofs Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D26841 llvm-svn: 287352	2016-11-18 17:28:10 +00:00
Ehsan Amiri	395be572f0	[PPC] limit line width to 80 characters NFC. Forgot to fix this in the original commit. llvm-svn: 287350	2016-11-18 16:24:27 +00:00
Simon Dardis	0e2ee3b4b9	[mips][msa] Implement f16 support The MIPS MSA ASE provides instructions to convert to and from half precision floating point. This patch teaches the MIPS backend to treat f16 as a legal type and how to promote such values to f32 for the usual set of operations. As a result of this, the fexup[lr].w intrinsics no longer crash LLVM during type legalization. Reviewers: zoran.jovanvoic, vkalintiris Differential Revision: https://reviews.llvm.org/D26398 llvm-svn: 287349	2016-11-18 16:17:44 +00:00
Simon Pilgrim	7bde5df5f0	[X86][AVX512] Split AVX512F/AVX512VL tests to demonstrate missed int2fp opportunities without AVX512VL llvm-svn: 287348	2016-11-18 15:31:36 +00:00
Tom Stellard	df613198c0	GlobalISel: Fix unconditional fallback with global isel abort is disabled Reviewers: t.p.northover, ab, qcolombet Subscribers: mehdi_amini, vkalintiris, wdng, dberris, llvm-commits, rovka Differential Revision: https://reviews.llvm.org/D26765 llvm-svn: 287344	2016-11-18 14:14:35 +00:00
Tom Stellard	01e65d2cfc	AMDGPU/SI: Remove zero_extend patterns for i16 ops selected to 32-bit insts Summary: The 32-bit instructions don't zero the high 16-bits like the 16-bit instructions do. Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, llvm-commits, tony-tye Differential Revision: https://reviews.llvm.org/D26828 llvm-svn: 287342	2016-11-18 13:53:34 +00:00
Florian Hahn	77382be56b	[simplifycfg][loop-simplify] Preserve loop metadata in 2 transformations. insertUniqueBackedgeBlock in lib/Transforms/Utils/LoopSimplify.cpp now propagates existing llvm.loop metadata to newly the added backedge. llvm::TryToSimplifyUncondBranchFromEmptyBlock in lib/Transforms/Utils/Local.cpp now propagates existing llvm.loop metadata to the branch instructions in the predecessor blocks of the empty block that is removed. Differential Revision: https://reviews.llvm.org/D26495 llvm-svn: 287341	2016-11-18 13:12:07 +00:00
Simon Pilgrim	7938bd666e	Cleanup function with clang-format. NFCI. llvm-svn: 287340	2016-11-18 12:16:18 +00:00
Nicolai Haehnle	ce2b589df5	AMDGPU: Fix legalization of MUBUF instructions in shaders Summary: The addr64-based legalization is incorrect for MUBUF instructions with idxen set as well as for BUFFER_LOAD/STORE_FORMAT_* instructions. This affects e.g. shaders that access buffer textures. Since we never actually need the addr64-legalization in shaders, this patch takes the easy route and keys off the calling convention. If this ever affects (non-OpenGL) compute, the type of legalization needs to be chosen based on some TSFlag. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=98664 Reviewers: arsenm, tstellarAMD Subscribers: kzhuravl, wdng, yaxunl, tony-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D26747 llvm-svn: 287339	2016-11-18 11:55:52 +00:00
Simon Pilgrim	dcd8433597	Fix spelling mistakes in MIPS target comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287338	2016-11-18 11:53:36 +00:00
Ehsan Amiri	ff0942e6ea	[Power9] Add patterns for vnegd, vnegw Exploit new instructions by adding patterns to .td file. https://reviews.llvm.org/D26551 llvm-svn: 287334	2016-11-18 11:05:55 +00:00
Simon Pilgrim	e995a8088d	Fix spelling mistakes in AMDGPU target comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287333	2016-11-18 11:04:02 +00:00
Simon Pilgrim	3e5045e8f1	[X86][AVX2] Add v8i32->v8i64 mul test (PR30845) llvm-svn: 287332	2016-11-18 11:00:36 +00:00
Simon Pilgrim	fd8bf984f4	Fix typo in comment. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287331	2016-11-18 10:52:12 +00:00
Ehsan Amiri	85818684c6	[PPC][DAGCombine] Convert SETCC to subtract when the result is zero extended When we see a SETCC whose only users are zero extend operations, we can replace it with a subtraction. This results in doing all calculations in GPRs and avoids CR use. Currently we do this only for ULT, ULE, UGT and UGE condition codes. There are ways that this can be extended. For example for signed condition codes. In that case we will be introducing additional sign extend instructions, so more careful profitability analysis may be required. Another direction to extend this is for equal, not equal conditions. Also when users of SETCC are any_ext or sign_ext, we might be able to do something similar. llvm-svn: 287329	2016-11-18 10:41:44 +00:00
Amaury Sechet	c00c9a9f61	Fix go binding to adapt the new attribute API https://reviews.llvm.org/D26339 llvm-svn: 287328	2016-11-18 10:11:02 +00:00
Craig Topper	1de753f7f5	[InstCombine][AVX-512] Teach InstCombineCalls how to handle the intrinsics for variable shift with 16-bit elements. This is a straightforward extension of the existing support for 32/64-bit element types. Just needed to add the additional instrinsics to the switches. llvm-svn: 287316	2016-11-18 06:04:33 +00:00
Craig Topper	02b5a1b50f	[AVX-512] Replace masked 16-bit element variable shift intrinsics with new unmasked versions and selects. The same thing was done to 32-bit and 64-bit element sizes previously. This will allow us to support these shuffls in InstCombineCalls along with the other variable shift intrinsics. llvm-svn: 287312	2016-11-18 05:04:44 +00:00
Matt Arsenault	eff1ad8d8e	AMDGPU: Move redundant setting of inst properties llvm-svn: 287311	2016-11-18 04:42:59 +00:00
Matt Arsenault	742deb2495	AMDGPU: Fix crash on illegal type for inlineasm There are still crashes on non-MVT types in other places. llvm-svn: 287310	2016-11-18 04:42:57 +00:00
Peter Collingbourne	63e10c9c96	Object: Simplify; remove unnecessary use of unique_ptr. llvm-svn: 287305	2016-11-18 03:20:36 +00:00
Matthias Braun	637488dbf8	MachineOperand: Add dump() method llvm-svn: 287302	2016-11-18 02:40:40 +00:00
Alexei Starovoitov	8f9f8210c1	convert bpf assembler to look like kernel verifier output since bpf instruction set was introduced people learned to read and understand kernel verifier output whereas llvm asm output stayed obscure and unknown. Convert llvm to emit assembler text similar to kernel to avoid this discrepancy Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 287300	2016-11-18 02:32:35 +00:00
Craig Topper	faad4c30fa	[Docs][TableGen] Remove reference to tablegen supporting octal integers. It doesn't and hasn't for at least 9 years. llvm-svn: 287299	2016-11-18 02:28:50 +00:00
Craig Topper	07f1c15995	[AVX-512] Support FCOPYSIGN for v16f32 and v8f64 Summary: This extends FCOPYSIGN support to 512-bit vectors. I've also added tests to show what the 128-bit and 256-bit cases look like with broadcast loads. Reviewers: delena, zvi, RKSimon, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26791 llvm-svn: 287298	2016-11-18 02:25:34 +00:00
Yichao Yu	4497a28bd1	Add an option to disable libedit Summary: This should provide the function similar to `--disable-libedit` with the autotools build system, which seems to be missing from the commit (r200595) that adds this. Reviewers: pcc, beanz Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D26550 llvm-svn: 287293	2016-11-18 01:25:49 +00:00
Justin Lebar	2d2292009f	[CUDA] Update docs to indicate that MacOS is now supported. llvm-svn: 287290	2016-11-18 00:42:00 +00:00
Justin Lebar	7880141d2b	[CUDA] Update docs; CUDA 8.0 is supported as of a while ago. llvm-svn: 287289	2016-11-18 00:41:40 +00:00
Davide Italiano	8651144353	[lli] Prefer `exit(1)` to `return 1` for consistency. llvm-svn: 287277	2016-11-17 22:59:13 +00:00
Davide Italiano	da8e6b2ec7	[lli] Factor out error handling. NFCI. llvm-svn: 287276	2016-11-17 22:58:13 +00:00
Dylan McKay	7293f9f7cc	[ReleaseNotes] Mention the completion of the upstreaming of the AVR backend llvm-svn: 287273	2016-11-17 22:26:09 +00:00
Petr Hosek	e4521c3523	[CMake] Error when LTO and lld are enabled on Darwin lld on Darwin does not currently support LTO. Differential Revision: https://reviews.llvm.org/D26715 llvm-svn: 287256	2016-11-17 20:22:49 +00:00
Simon Pilgrim	6ba672e542	Fix spelling mistakes in Hexagon target comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287248	2016-11-17 19:21:20 +00:00
Simon Pilgrim	9d15fb3c10	Fix spelling mistakes in X86 target comments. NFC. Identified by Pedro Giffuni in PR27636. llvm-svn: 287247	2016-11-17 19:03:05 +00:00
Eugene Zelenko	35a5fe9f07	[CodeView] Fix some Clang-tidy modernize-use-default, modernize-use-override and Include What You Use warnings; other minor fixes (NFC). Per Zachary Turner and Mehdi Amini suggestion to make only post-commit reviews. llvm-svn: 287243	2016-11-17 18:11:21 +00:00
Kostya Serebryany	97ff7672aa	[libFuzzer] better documentation for -fsanitize-coverage=trace-cmp llvm-svn: 287240	2016-11-17 17:31:54 +00:00
Anna Zaks	9cd5ed1241	[asan] Turn on Mach-O global metadata liveness tracking by default This patch turns on the metadata liveness tracking since all known issues have been resolved. The future has been implemented in https://reviews.llvm.org/D16737 and enables support of dead code stripping option on Mach-O platforms. As part of enabling the feature, I also plan on reverting the following patch to compiler-rt: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20160704/369910.html Differential Revision: https://reviews.llvm.org/D26772 llvm-svn: 287235	2016-11-17 16:55:40 +00:00
Konstantin Zhuravlyov	0a1a7b6b23	Revert "AMDGPU: Enable ConstrainCopy DAG mutation" This reverts commit r287146. This breaks few conformance tests. llvm-svn: 287233	2016-11-17 16:41:49 +00:00
Daniil Fukalov	4c3322cc84	[SCEV] limit recursion depth of CompareSCEVComplexity Summary: CompareSCEVComplexity goes too deep (50+ on a quite a big unrolled loop) and runs almost infinite time. Added cache of "equal" SCEV pairs to earlier cutoff of further estimation. Recursion depth limit was also introduced as a parameter. Reviewers: sanjoy Subscribers: mzolotukhin, tstellarAMD, llvm-commits Differential Revision: https://reviews.llvm.org/D26389 llvm-svn: 287232	2016-11-17 16:07:52 +00:00
Simon Pilgrim	67ef3b984a	Wdocumentation fix llvm-svn: 287224	2016-11-17 12:21:45 +00:00
Simon Pilgrim	8eca5520dc	[X86][SSE] Improve lowering of vXi64 multiply with known zero 32-bit halves vXi64 multiplication is lowered into 3 calls of vpmuludq with the upper/lower 32-bit halves. If any of these halves are zero then we can remove individual calls. Although there was isBuildVectorAllZeros code to do this I don't think it ever worked (maybe just for constant folded cases that don't seem to be tested for any longer). This requires additional X86ISD support for computeKnownBitsForTargetNode, so far I've just added support for X86ISD::VZEXT (VPMOVZX* - helping the AVX2+ cases). Partial fix for PR30845 Differential Revision: https://reviews.llvm.org/D26590 llvm-svn: 287223	2016-11-17 12:14:49 +00:00
Simon Pilgrim	c4d733cd6a	Fix spelling in comment. NFC. llvm-svn: 287222	2016-11-17 12:03:05 +00:00
Pavel Labath	10849a81f3	[cmake] Move LLVM_BUILD_STATIC check to an earlier point Summary: The motivation for this is to enable correct detection of dlopen() on Android. Android does not provide a static version of libdl, so if we add the -static flag after performing the check, it will succeed even though subsequent link steps will fail. With this change we correctly detect the absence of libdl in a LLVM_BUILD_STATIC build on Android. The link itself still does not succeed because the code does not check the result of this check properly, but I plan to fix that in a separate change. Reviewers: beanz Subscribers: danalbert, mgorny, srhines, tberghammer, llvm-commits Differential Revision: https://reviews.llvm.org/D26463 llvm-svn: 287220	2016-11-17 11:22:23 +00:00
Pablo Barrio	c41e856f53	[ARM] Relax restriction on variadic functions for tailcall optimization Summary: Variadic functions can be treated in the same way as normal functions with respect to the number and types of parameters. Reviewers: grosbach, olista01, t.p.northover, rengolin Subscribers: javed.absar, aemerson, llvm-commits Differential Revision: https://reviews.llvm.org/D26748 llvm-svn: 287219	2016-11-17 10:56:58 +00:00
Oren Ben Simhon	489d6eff4f	[X86] RegCall - Handling v64i1 in 32/64 bit target Register Calling Convention defines a new behavior for v64i1 types. This type should be saved in GPR. However for 32 bit machine we need to split the value into 2 GPRs (because each is 32 bit). Differential Revision: https://reviews.llvm.org/D26181 llvm-svn: 287217	2016-11-17 09:59:40 +00:00
Sanjoy Das	43ccb38bb5	Delete dead code and add asserts instead; NFC llvm-svn: 287214	2016-11-17 07:29:43 +00:00
Sanjoy Das	4a8fe09040	[ImplicitNullCheck] Fix an edge case where we were hoisting incorrectly ImplicitNullCheck keeps track of one instruction that the memory operation depends on that it also hoists with the memory operation. When hoisting this dependency, it would sometimes clobber a live-in value to the basic block we were hoisting the two things out of. Fix this by explicitly looking for such dependencies. I also noticed two redundant checks on `MO.isDef()` in IsMIOperandSafe. They're redundant since register MachineOperands are either Defs or Uses -- there is no third kind. I'll change the checks to asserts in a later commit. llvm-svn: 287213	2016-11-17 07:29:40 +00:00
Craig Topper	05b0fcd168	[X86] Fix formatting. NFC llvm-svn: 287211	2016-11-17 05:59:55 +00:00
Craig Topper	dfaf9201cb	[X86] Add a test case where, due to a bug in selectScalarSSELoad, we fold the same load twice. llvm-svn: 287210	2016-11-17 05:37:39 +00:00
Dean Michael Berris	3234d3a4bd	[XRay] Support AArch64 in LLVM This patch adds XRay support in LLVM for AArch64 targets. This patch is one of a series: Clang: https://reviews.llvm.org/D26415 compiler-rt: https://reviews.llvm.org/D26413 Author: rSerge Reviewers: rengolin, dberris Subscribers: amehsan, aemerson, llvm-commits, iid_iunknown Differential Revision: https://reviews.llvm.org/D26412 llvm-svn: 287209	2016-11-17 05:15:37 +00:00
Chris Bieneman	8036e0b21a	[CMake] [Darwin] Add support for debugging tablegen dependencies This patch adds an option to the build system LLVM_DEPENDENCY_DEBUGGING. Over time I plan to extend this to do more complex verifications, but the initial patch causes compile errors wherever there is missing a dependency on intrinsics_gen. Because intrinsics_gen is a compile-time dependency not a link-time dependency, everything that relies on the headers generated in intrinsics_gen needs an explicit dependency. llvm-svn: 287207	2016-11-17 04:36:59 +00:00
Chris Bieneman	05c279fc4b	[CMake] NFC. Updating CMake dependency specifications This patch updates a bunch of places where add_dependencies was being explicitly called to add dependencies on intrinsics_gen to instead use the DEPENDS named parameter. This cleanup is needed for a patch I'm working on to add a dependency debugging mode to the build system. llvm-svn: 287206	2016-11-17 04:36:50 +00:00
Konstantin Zhuravlyov	20ba24e231	[AMDGPU] Add missing test for rL287203 llvm-svn: 287204	2016-11-17 04:33:20 +00:00
Konstantin Zhuravlyov	d709efb0da	[AMDGPU] Custom lower f16 = fp_round f64 llvm-svn: 287203	2016-11-17 04:28:37 +00:00
Konstantin Zhuravlyov	3f0cdc7a11	[AMDGPU] Promote f16/i16 conversions to f32/i32 llvm-svn: 287201	2016-11-17 04:00:46 +00:00
Konstantin Zhuravlyov	662e01dfbe	[AMDGPU] Expand `br_cc` for f16 Differential Revision: https://reviews.llvm.org/D26732 llvm-svn: 287199	2016-11-17 03:49:01 +00:00
Lang Hames	fd264f7e84	[Orc] Clang-format the recent RPC update (r286620 and related). llvm-svn: 287195	2016-11-17 02:33:47 +00:00
Dehao Chen	41d72a8632	Use profile info to adjust loop unroll threshold. Summary: For flat loop, even if it is hot, it is not a good idea to unroll in runtime, thus we set a lower partial unroll threshold. For hot loop, we set a higher unroll threshold and allows expensive tripcount computation to allow more aggressive unrolling. Reviewers: davidxl, mzolotukhin Subscribers: sanjoy, mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D26527 llvm-svn: 287186	2016-11-17 01:17:02 +00:00
Justin Lebar	be0cfcc28a	[CUDA] Update docs to indicate that clang now supports std::complex in CUDA mode. The last remaining necessary change was D25403, landed as r287012. llvm-svn: 287184	2016-11-17 01:03:42 +00:00
Lang Hames	e3b74d3c4d	Remove a stale test case. llvm-svn: 287183	2016-11-17 01:02:52 +00:00
Peter Collingbourne	bda4498543	llvm-dis: Remove dead code. llvm-svn: 287182	2016-11-17 00:42:08 +00:00
Dylan McKay	48c26b2b12	[AVR] Remove some accidentally-commited code that broke the bots This is a remnant of an on-chip unit testing tool that has since been moved out-of-tree. It was accidentally committed in r287162. llvm-svn: 287180	2016-11-17 00:09:38 +00:00
Peter Collingbourne	f72a8d4e08	Introduce GlobalSplit pass. This pass splits globals into elements using inrange annotations on getelementptr indices. Differential Revision: https://reviews.llvm.org/D22295 llvm-svn: 287178	2016-11-16 23:40:26 +00:00
Dylan McKay	017a55b092	[AVR] Wrap all methods in the pseudo expansion pass in an anon namespace The '-fpermissive' compiler flag complains if the template specializations used in the class are used in a different namespace. llvm-svn: 287176	2016-11-16 23:06:14 +00:00
Dylan McKay	6dd69032c9	[AVR] Fix basic block naming in ctlz and cttz tests The branch selector would change the names. llvm-svn: 287174	2016-11-16 22:48:38 +00:00
Dylan McKay	5810c7ee6e	[AVR] Remove unused method from AVRTargetMachine llvm-svn: 287173	2016-11-16 22:48:30 +00:00
Dylan McKay	9701c42de9	[AVR] Add tests for counting leading/trailing zeros This adds two test files that verify the 'cttz' and 'ctlz' operations. llvm-svn: 287172	2016-11-16 22:38:43 +00:00
Sanjay Patel	066139a3ec	[x86] allow FP-logic ops when one operand is FP and result is FP We save an inter-register file move this way. If there's any CPU where the FP logic is slower, we could transform this back to int-logic in MachineCombiner. This helps, but doesn't solve, PR6137: https://llvm.org/bugs/show_bug.cgi?id=6137 The 'andn' test shows that we're missing a pattern match to recognize the xor with -1 constant as a 'not' op. llvm-svn: 287171	2016-11-16 22:34:05 +00:00
Ahmed Bougacha	f33f91af24	[AsmParser] Avoid recursing when lexing ';'. NFC. This should prevent stack overflows in non-optimized builds on .ll files with lots of consecutive commented-out lines. Instead of recursing into LexToken(), continue into a 'while (true)'. llvm-svn: 287170	2016-11-16 22:25:05 +00:00
Ahmed Bougacha	bd6ce9a247	[CodeGen] Pass references, not pointers, to MMI helpers. NFC. While there, rename them to follow the coding style. llvm-svn: 287169	2016-11-16 22:25:03 +00:00
Ahmed Bougacha	996961a461	Revert "Get GlobalISel to build on Linux after r286407" This reverts commit r286962. We want to avoid depending on SelectionDAG, and AddLandingPadInfo lives in CodeGen now. llvm-svn: 287168	2016-11-16 22:24:59 +00:00
Ahmed Bougacha	456dce8a84	[CodeGen] Pull MMI helpers from FunctionLoweringInfo to MMI. NFC. They're not SelectionDAG- or FunctionLoweringInfo-specific. They are, however, specific to building MMI from IR. We could make them members, but it's nice having MMI be a "simple" data structure and this logic kept separate. This also lets us reuse them from GlobalISel. llvm-svn: 287167	2016-11-16 22:24:56 +00:00
Ahmed Bougacha	2b4c127531	[CodeGen] Cleanup MachineModuleInfo doxygen comments. NFC. Remove redundant names and only keep header comments. llvm-svn: 287166	2016-11-16 22:24:53 +00:00
Ahmed Bougacha	74f8fcb369	[CodeGen] Sort MMI forward declarations. NFC. llvm-svn: 287165	2016-11-16 22:24:46 +00:00
Kevin Enderby	7fa40c9f2b	General clean up of error handling in llvm-objdump to remove its use of report_fatal_error(). No real functional change with this commit. The problem with report_fatal_error() is it does not include the tool name and the file name the for which the error message was generated. Uses of report_fatal_error() were change to report_error() or error() to get a better error and to make the code smaller and cleaner. Also changed things like error(errorToErrorCode(SOrErr.takeError())) to use report_error() with a file name and the llvm::Error (as well as the ArchitectureName if available) so the error message is printed. llvm-svn: 287163	2016-11-16 22:17:38 +00:00
Dylan McKay	a789f40002	[AVR] Add the pseudo instruction expansion pass Summary: A lot of the pseudo instructions are required because LLVM assumes that all integers of the same size as the pointer size are legal. This means that it will not currently expand 16-bit instructions to their 8-bit variants because it thinks 16-bit types are legal for the operations. This also adds all of the CodeGen tests that required the pass to run. Reviewers: arsenm, kparzysz Subscribers: wdng, mgorny, modocache, llvm-commits Differential Revision: https://reviews.llvm.org/D26577 llvm-svn: 287162	2016-11-16 21:58:04 +00:00
Vitaly Buka	e596986a44	Fix "isn't a prototype" warning llvm-svn: 287161	2016-11-16 21:51:39 +00:00
Peter Collingbourne	7d0c869b86	X86: Simplify X86ISD::Wrapper operand checks. NFCI. We only ever create TargetConstantPool, TargetJumpTable, TargetExternalSymbol, TargetGlobalAddress, TargetGlobalTLSAddress, MCSymbol and TargetBlockAddress nodes as operands of X86ISD::Wrapper nodes, so we can remove one check and invert the other. Also update the documentation comment for X86ISD::Wrapper. Differential Revision: https://reviews.llvm.org/D26731 llvm-svn: 287160	2016-11-16 21:48:59 +00:00
Sanjoy Das	df4b162e4d	[ImplicitNullChecks] Do not not handle call MachineInstrs We don't track callee clobbered registers correctly, so avoid hoisting across calls. Note: for this bug to trigger we need a `readonly` call target, since we already have logic to not hoist across potentially storing instructions either. llvm-svn: 287159	2016-11-16 21:45:22 +00:00
Peter Collingbourne	7a74803abf	Bitcode: Introduce initial multi-module reader API. Implement getLazyBitcodeModule() and parseBitcodeFile() in terms of it. Differential Revision: https://reviews.llvm.org/D26719 llvm-svn: 287156	2016-11-16 21:44:45 +00:00
Tim Northover	397f9d9d05	ARM: fix CodeGen for 64-bit shifts. One half of the shifts obviously needed conditional selection based on whether the shift amount is more than 32-bits, but leaving the other half as the natural shift isn't acceptable either: it's undefined behaviour to shift a 32-bit value by more than 31. llvm-svn: 287149	2016-11-16 20:54:28 +00:00
Rong Xu	66827427e1	Make block placement deterministic We fail to produce bit-to-bit matching stage2 and stage3 compiler in PGO bootstrap build. The reason is because LoopBlockSet is of SmallPtrSet type whose iterating order depends on the pointer value. This patch fixes this issue by changing to use SmallSetVector. Differential Revision: http://reviews.llvm.org/D26634 llvm-svn: 287148	2016-11-16 20:50:06 +00:00
Sanjay Patel	80baf69cb5	[InstCombine] replace unreachable with assert and remove unreachable code; NFCI llvm-svn: 287147	2016-11-16 20:40:02 +00:00
Matt Arsenault	3b36bb1d87	AMDGPU: Enable ConstrainCopy DAG mutation This fixes a probably unintended divergence from the default scheduler behavior. llvm-svn: 287146	2016-11-16 20:35:23 +00:00
Sanjay Patel	1b9560ffd6	[InstCombine] fix formatting and add FIXMEs to foldOperationIntoSelectOperand(); NFC llvm-svn: 287145	2016-11-16 20:18:34 +00:00
Geoff Berry	8301c645c8	[AArch64] Handle vector types in replaceZeroVectorStore. Summary: Extend replaceZeroVectorStore to handle more vector type stores, floating point zero vectors and set alignment more accurately on split stores. This is a follow-up change to r286875. This change fixes PR31038. Reviewers: MatzeB Subscribers: mcrosier, aemerson, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D26682 llvm-svn: 287142	2016-11-16 19:35:19 +00:00
Mandeep Singh Grang	000ce9a686	[LoopVectorize] Fix for non-determinism in codegen Summary: This patch fixes issues in codegen uncovered due to https://reviews.llvm.org/D26718 Reviewers: mssimpso Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D26727 llvm-svn: 287135	2016-11-16 18:53:17 +00:00
Tom Stellard	0d162b1c4f	AMDGPU/SI: Avoid creating unnecessary copies in the SIFixSGPRCopies pass Summary: 1. Don't try to copy values to and from the same register class. 2. Replace copies with of registers with immediate values with v_mov/s_mov instructions. The main purpose of this change is to make MachineSink do a better job of determining when it is beneficial to split a critical edge, since the pass assumes that copies will become move instructions. This prevents a regression in uniform-cfg.ll if we enable critical edge splitting for AMDGPU. Reviewers: arsenm Subscribers: arsenm, kzhuravl, llvm-commits Differential Revision: https://reviews.llvm.org/D23408 llvm-svn: 287131	2016-11-16 18:42:17 +00:00
Eugene Zelenko	caf280330f	[ExecutionEngine] Fix examples build broken in r287126 and other Include What You Use warnings. llvm-svn: 287130	2016-11-16 18:32:58 +00:00
Sanjay Patel	4ce99d4d24	fix comment formatting; NFC llvm-svn: 287127	2016-11-16 18:09:44 +00:00
Eugene Zelenko	cecb0183b2	[ExecutionEngine] Fix some Clang-tidy modernize-use-default, modernize-use-equals-delete and Include What You Use warnings; other minor fixes. Differential revision: https://reviews.llvm.org/D26729 llvm-svn: 287126	2016-11-16 18:07:33 +00:00
Sanjay Patel	7f3d51f840	[x86] add fake scalar FP logic instructions to ReplaceableInstrs to save some bytes We can replace "scalar" FP-bitwise-logic with other forms of bitwise-logic instructions. Scalar SSE/AVX FP-logic instructions only exist in your imagination and/or the bowels of compilers, but logically equivalent int, float, and double variants of bitwise-logic instructions are reality in x86, and the float variant may be a shorter instruction depending on which flavor (SSE or AVX) of vector ISA you have...so just prefer float all the time. This is a preliminary step towards solving PR6137: https://llvm.org/bugs/show_bug.cgi?id=6137 Differential Revision: https://reviews.llvm.org/D26712 llvm-svn: 287122	2016-11-16 17:42:40 +00:00
Lang Hames	d47588986e	[Orc] Re-enable the RPC unit test disabled in r286917. This unit test infinite-looped on s390x due to a thread_yield being optimized out. I've updated the QueueChannel class (where thread_yield was called) to use a condition variable instead. This should cause the unit test to behave correctly. llvm-svn: 287121	2016-11-16 17:31:09 +00:00
Reid Kleckner	3a83e76811	[sancov] Name the global containing the main source file name If the global name doesn't start with __sancov_gen, ASan will insert unecessary red zones around it. llvm-svn: 287117	2016-11-16 16:50:43 +00:00
Daniil Fukalov	e870398e48	test commit, changed tab to spaces, NFC llvm-svn: 287116	2016-11-16 16:41:40 +00:00
Pekka Jaaskelainen	8483cf0ae8	Add a little endian variant of TCE. llvm-svn: 287111	2016-11-16 15:22:23 +00:00
Simon Pilgrim	79416ea76a	[X86] Add integer division test for PR23590 Shows missed opportunity to recognise reduced integer division result size llvm-svn: 287110	2016-11-16 14:54:34 +00:00
Simon Pilgrim	b57dd17142	[X86][AVX512] Autoupgrade lossless i32/u32 to f64 conversion intrinsics with generic IR Both the (V)CVTDQ2PD (i32 to f64) and (V)CVTUDQ2PD (u32 to f64) conversion instructions are lossless and can be safely represented as generic SINT_TO_FP/UINT_TO_FP calls instead of x86 intrinsics without affecting final codegen. LLVM counterpart to D26686 Differential Revision: https://reviews.llvm.org/D26736 llvm-svn: 287108	2016-11-16 14:48:32 +00:00
Simon Pilgrim	9e355bc5bb	[X86][AVX512] Added some mask/maskz tests for sitofp/uitofp i32 to f64 llvm-svn: 287106	2016-11-16 14:24:04 +00:00
Simon Pilgrim	c223aa52b1	[X86] Regenerated integer divide tests to test on 32 and 64 bit targets llvm-svn: 287104	2016-11-16 14:12:11 +00:00
Simon Pilgrim	dd8c71c646	[X86][SSE] Added PSUBUS from SELECT tests from D25987 llvm-svn: 287103	2016-11-16 13:59:03 +00:00
Simon Dardis	8ca1cbccc6	[mips] Fix unsigned/signed type error MipsFastISel uses a a class to represent addresses with a signed member to represent the offset. MipsFastISel::emitStore, emitLoad and computeAddress all treated the offset as being positive. In cases where the offset was actually negative and a frame pointer was used, this would cause the constant synthesis routine to crash as it would generate an unexpected instruction sequence when frame indexes are replaced. Reviewers: vkalintiris Differential Revision: https://reviews.llvm.org/D26192 llvm-svn: 287099	2016-11-16 11:29:07 +00:00
Simon Dardis	7b7cb8d9dd	[mips] not instruction alias This patch adds the single operand form of the not alias to microMIPS and MIPS along with additional tests. This partially resolves PR/30381. Thanks to Sean Bruno for reporting the issue! llvm-svn: 287097	2016-11-16 11:04:49 +00:00
Pavel Labath	0c20e05e89	Remove TimeValue class Summary: All uses have been replaced by appropriate std::chrono types, and the class is now unused. Reviewers: zturner, mehdi_amini Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D26447 llvm-svn: 287094	2016-11-16 10:46:48 +00:00
Ayman Musa	4d60243bfd	[X86][AVX512] Removing llvm x86 intrinsics for _mm_mask_move_{ss\|sd} intrinsics. Differential Revision: https://reviews.llvm.org/D26128 llvm-svn: 287087	2016-11-16 09:00:28 +00:00
Craig Topper	6910fa0ef4	[X86] Remove the scalar intrinsics for fadd/fsub/fdiv/fmul Summary: These intrinsics have been unused for clang for a while. This patch removes them. We auto upgrade them to extractelements, a scalar operation and then an insertelement. This matches the sequence used by clangs intrinsic file. Reviewers: zvi, delena, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26660 llvm-svn: 287083	2016-11-16 05:24:10 +00:00
Davide Italiano	6cf09265f9	[ELF] Convert ELF.h to Expected<T>. This has two advantages: 1) We slowly move away from ErrorOr to the new handling interface, in the hope of having an uniform error handling in LLVM, eventually. 2) We're starting to have meaningful error messages for invalid object ELF files, rather than a generic "parse error". At some point we should include also the offset to improve the quality of the diagnostic. llvm-svn: 287081	2016-11-16 05:10:28 +00:00
Saleem Abdulrasool	d05c5aea47	test: use separate input file for test Rather than using sed to generate the input and pipe the result to strings, use the static input instead. llvm-svn: 287079	2016-11-16 04:08:46 +00:00
Konstantin Zhuravlyov	bf998c7003	[AMDGPU] Refactor v_mac_{f16, f32} patterns into a class NFC Differential Revision: https://reviews.llvm.org/D26711 llvm-svn: 287077	2016-11-16 03:39:12 +00:00
Matthias Braun	3d51cf0a2c	AArch64: Use DeadRegisterDefinitionsPass before regalloc. Doing this before register allocation reduces register pressure as we do not even have to allocate a register for those dead definitions. Differential Revision: https://reviews.llvm.org/D26111 llvm-svn: 287076	2016-11-16 03:38:27 +00:00
Richard Smith	6b335d1948	Fix build break when the host C compiler is C89. llvm-svn: 287075	2016-11-16 03:36:29 +00:00
Konstantin Zhuravlyov	2a87a42035	[AMDGPU] Handle f16 select{_cc} - Select `select` to `v_cndmask_b32` - Expand `select_cc` - Refactor patterns Differential Revision: https://reviews.llvm.org/D26714 llvm-svn: 287074	2016-11-16 03:16:26 +00:00
Dean Michael Berris	6eec7d4158	[XRay][docs] Define requirements on installed log handlers. Summary: We update the documentation to define what the requirements are for the provided XRay log handler. This is to make it clear that the function pointer provided must do internal synchronisation and that there are no guarantees provided by XRay on when the function shall be invoked once it has been installed as a log handler. Reviewers: rSerge, rengolin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26651 llvm-svn: 287073	2016-11-16 02:18:23 +00:00
Quentin Colombet	fb9b0cdcfe	[RegAllocGreedy] Record missed hint for late recoloring. In https://reviews.llvm.org/D25347, Geoff noticed that we still have useless copy that we can eliminate after register allocation. At the time the allocation is chosen for those copies, they are not useless but, because of changes in the surrounding code, later on they might become useless. The Greedy allocator already has a mechanism to deal with such cases with a late recoloring. However, we missed to record the some of the missed hints. This commit fixes that. llvm-svn: 287070	2016-11-16 01:07:12 +00:00

... 2 3 4 5 6 ...

141124 Commits