llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	be37ab864c	[AArch64] fix typos in test assertions llvm-svn: 315203	2017-10-09 01:29:54 +00:00
Craig Topper	c88883b07d	[X86] Remove a setLoadExtAction from the AVX512 section that uses an AVX512BW type and is alraedy present in the AVX512BW section. llvm-svn: 315202	2017-10-09 01:05:16 +00:00
Craig Topper	4f8656a7af	[X86] Enable extended comparison predicate support for SETUEQ/SETONE when targeting AVX instructions. We believe that despite AMD's documentation, that they really do support all 32 comparision predicates under AVX. Differential Revision: https://reviews.llvm.org/D38609 llvm-svn: 315201	2017-10-09 01:05:15 +00:00
Davide Italiano	b06e8fa223	[DWARFDIE] Rewrite `operator !=` using `operator ==`. NFCI. llvm-svn: 315200	2017-10-09 00:18:45 +00:00
Davide Italiano	54bb5ea21a	[SymbolFile/DWARF] Simplify two functions. NFCI. llvm-svn: 315199	2017-10-09 00:11:49 +00:00
Benjamin Kramer	e33eb771ca	Certain versions of clang require an explicit initialization for literal const members. include/clang/Lex/PreprocessorLexer.h:79:3: error: constructor for 'clang::PreprocessorLexer' must explicitly initialize the const member 'FID' llvm-svn: 315197	2017-10-08 21:28:47 +00:00
Benjamin Kramer	c24fb0718d	Remove unused variables. No functionality change. llvm-svn: 315196	2017-10-08 21:23:02 +00:00
Simon Pilgrim	2c742f919a	[X86][SSE] Don't call combineTo inside combineX86ShufflesRecursively. NFCI. Return the combined shuffle from combineX86ShufflesRecursively and perform the combineTo in the caller. Makes it easier for future patches to use this in functions that aren't actually shuffles themselves. llvm-svn: 315195	2017-10-08 20:58:14 +00:00
Benjamin Kramer	8a1d633fe2	Make SourceLocation, QualType and friends have constexpr constructors. No functionality change intended. llvm-svn: 315194	2017-10-08 20:53:36 +00:00
Jan Vesely	3c51ae5bd9	travis: Make sure we report failure even if only earlier checked files fail for loop would only report status of the last command v2: return '1' call test instead of '[' Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315193	2017-10-08 20:07:58 +00:00
Jan Vesely	136381dc38	check_external_calls.sh: Print number of calls in tested file. Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315192	2017-10-08 20:07:56 +00:00
Jan Vesely	80bb52ae75	ptx: Use __clc_nextafter to implement nextafter using clang builtin results in external library call Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315191	2017-10-08 19:34:00 +00:00
Jan Vesely	1de1444d62	Do not include clc_nextafter header globally Drop unused clc/math/clc_nextafter.h header Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315190	2017-10-08 19:33:58 +00:00
Jan Vesely	6a5c8ddb3a	math/nextafter: Use custom declaration inc file Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315189	2017-10-08 19:33:55 +00:00
Jan Vesely	72be1cc0be	math/binary_decl.inc: Do not declare mixed float/double functions fmin/fmax only need vector/scalar mix Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315188	2017-10-08 19:33:53 +00:00
Simon Pilgrim	6abbd33ec0	Tidyup with clang-format. NFCI. llvm-svn: 315187	2017-10-08 19:24:30 +00:00
Simon Pilgrim	135a2639f4	[X86][SSE] Add test case for PR27708 llvm-svn: 315186	2017-10-08 19:18:10 +00:00
Benjamin Kramer	16610028ea	Remove unused variables. No functionality change. llvm-svn: 315185	2017-10-08 19:11:02 +00:00
Craig Topper	977c546b0c	[X86] Regenerate fast-isel-select-pseudo-cmov.ll to prepare for D38609. llvm-svn: 315184	2017-10-08 17:54:50 +00:00
Javed Absar	f45d0b9849	[TableGen] Simplify, add range_loop in CodeGenSchedule llvm-svn: 315183	2017-10-08 17:23:30 +00:00
Simon Pilgrim	dc32c844f9	[X86] getTargetConstantBitsFromNode - add support for decoding scalar constants llvm-svn: 315182	2017-10-08 17:21:18 +00:00
Craig Topper	c97775c03c	[X86] Prefer MOVSS/SD over BLENDI during legalization. Remove BLENDI versions of scalar arithmetic patterns Summary: We currently disable some converting of shuffles to MOVSS/MOVSD during legalization if SSE41 is enabled. But later during shuffle combining we go back to prefering MOVSS/MOVSD. Additionally we have patterns that look for BLENDIs to detect scalar arithmetic operations. I believe due to the combining using MOVSS/MOVSD these are unnecessary. Interestingly, we still codegen blend instructions even though lowering/isel emit movss/movsd instructions. Turns out machine CSE commutes them to blend, and then commuting those blends back into blends that are equivalent to the original movss/movsd. This patch fixes the inconsistency in legalization to prefer MOVSS/MOVSD. The one test change was caused by this change. The problem is that we have integer types and are mostly selecting integer instructions except for the shufps. This shufps forced the execution domain, but the vpblendw couldn't have its domain changed with a naive instruction swap. We could fix this by special casing VPBLENDW based on the immediate to widen the element type. The rest of the patch is removing all the excess scalar patterns. Long term we should probably add isel patterns to make MOVSS/MOVSD emit blends directly instead of relying on the double commute. We may also want to consider emitting movss/movsd for optsize. I also wonder if we should still use the VEX encoded blendi instructions even with AVX512. Blends have better throughput, and that may outweigh the register constraint. Reviewers: RKSimon, zvi Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38023 llvm-svn: 315181	2017-10-08 16:57:23 +00:00
Benjamin Kramer	3f8c1189c7	Make more constructors constexpr or use =default. This lets the compiler reason about the type more easily. No functionality change intended. llvm-svn: 315180	2017-10-08 15:59:35 +00:00
Amara Emerson	1f83777bd3	[AArch64][GlobalISel] Add a test case for G_PHI of p0 instruction selection. llvm-svn: 315179	2017-10-08 15:29:35 +00:00
Amara Emerson	e205cab66f	[AArch64][GlobalISel] Add a test case for G_PHI of p0 regbank selection. llvm-svn: 315178	2017-10-08 15:29:31 +00:00
Amara Emerson	1cd89ca669	[AArch64][GlobalISel] Make G_PHI of p0 types legal. Differential Revision: https://reviews.llvm.org/D38621 llvm-svn: 315177	2017-10-08 15:29:11 +00:00
Simon Pilgrim	6410ca70aa	[X86][XOP] Add XOP oddshuffles tests XOP codegen is often different to generic AVX - thank you vpperm! llvm-svn: 315176	2017-10-08 12:58:15 +00:00
Gadi Haber	684944b822	[X86][SKX] Adding the scheduling information for the SKX target. Adding the scheduling information for the SkylakeServer (SKX) target. This patch adds the instruction scheduling information for the SkylakeServer (SKX) architecture target by adding the file X86SchedSkylakeServer.td located under the X86 Target. We used the scheduling information retrieved from the Skylake architects in order to create the file. The scheduling information includes latency, number of micro-Ops and used ports by each SKL instruction. The patch continues the scheduling replacement and insertion effort started with the SNB target in r310792, the HSW target in r311879 and the SkylakeClient (SKL) target in rL313613. Please expect some performance fluctuations due to code alignment effects. Reviewers: zvi, RKSimon, craig.topper, chandlerc, aymanmu Differential Revision: https://reviews.llvm.org/D38443 Change-Id: I5c228fcc09e9e5a99b6116e62b356c4f9b971185 llvm-svn: 315175	2017-10-08 12:52:54 +00:00
Ayman Musa	1170deb9c8	[X86] Add missing entries in 'MemoryFoldTable2Addr' to get complete form of the table. Get the folding table 'MemoryFoldTable2Addr' to a complete state as part of the process explained in https://reviews.llvm.org/D38028 Differential Revision: https://reviews.llvm.org/D38500 llvm-svn: 315174	2017-10-08 09:46:50 +00:00
Ayman Musa	993339b941	[X86][TableGen] Recommitting the X86 memory folding tables TableGen backend while disabling it by default. After the original commit ([[ https://reviews.llvm.org/rL304088 \| rL304088 ]]) was reverted, a discussion in llvm-dev was opened on 'how to accomplish this task'. In the discussion we concluded that the best way to achieve our goal (which is to automate the folding tables and remove the manually maintained tables) is: # Commit the tablegen backend disabled by default. # Proceed with an incremental updating of the manual tables - while checking the validity of each added entry. # Repeat previous step until we reach a state where the generated and the manual tables are identical. Then we can safely remove the manual tables and include the generated tables instead. # Schedule periodical (1 week/2 weeks/1 month) runs of the pass: - if changes appear (new entries): - make sure the entries are legal - If they are not, mark them as illegal to folding - Commit the changes (if there are any). CMake flag added for this purpose is "X86_GEN_FOLD_TABLES". Building with this flags will run the pass and emit the X86GenFoldTables.inc file under build/lib/Target/X86/ directory which is a good reference for any developer who wants to take part in the effort of completing the current folding tables. Differential Revision: https://reviews.llvm.org/D38028 llvm-svn: 315173	2017-10-08 09:20:32 +00:00
Craig Topper	bbca2f2978	[X86] Stop LowerSIGN_EXTEND_AVX512 from creating v8i16/v16i16/v16i8 vselects with a v8i1/v16i1 condition when BWI is not available. Some of the tests in vector-shuffle-v1.ll would get into an infinite loop without this. llvm-svn: 315172	2017-10-08 08:50:59 +00:00
Ayman Musa	5fc6dc58d7	[X86] Add new attribute to X86 instructions to enable marking them as "not memory foldable" This attribute will be used in a tablegen backend that generated the X86 memory folding tables which will be added in a future pass. Instructions with this attribute unset will be excluded from the full set of X86 instructions available for the pass. Differential Revision: https://reviews.llvm.org/D38027 llvm-svn: 315171	2017-10-08 08:32:56 +00:00
Jan Vesely	beb6591753	ldexp: Fix double precision function return type Fixes ~1200 external calls from nvtpx library. Reviewer: Jeroen Ketema Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315170	2017-10-08 06:56:14 +00:00
Rui Ueyama	e03ba02348	Rename ignoreInterpSection -> needsInterpSection. This should improve consistency. Also added comment. llvm-svn: 315169	2017-10-08 03:52:15 +00:00
Rui Ueyama	0ae2c24c5d	Use llvm::Optional instead of UINT_MAX to represent a null value. llvm-svn: 315168	2017-10-08 03:45:49 +00:00
Rui Ueyama	656be31174	Make a helper function a non-member function. NFC. llvm-svn: 315167	2017-10-08 02:54:42 +00:00
Rui Ueyama	872235743d	Use llvm::Optional instead of a magic number -1 to represent "no result". llvm-svn: 315166	2017-10-08 02:44:08 +00:00
Rui Ueyama	617e2f98a0	Make ScriptParser::checkSection a non-member function. This patch also make its return type to `void` because it always returns a given value as-is. llvm-svn: 315165	2017-10-08 02:27:02 +00:00
Rui Ueyama	9b18f50f79	Remove a trivial function. llvm-svn: 315164	2017-10-08 02:06:03 +00:00
Rui Ueyama	e8936bca72	Add comment. llvm-svn: 315163	2017-10-08 01:55:29 +00:00
Craig Topper	9563cab961	[X86] Simplify some code in getInsertVINSERTImmediate and getExtractVEXTRACTImmediate. NFC Replace one of the divides with a multiply. llvm-svn: 315162	2017-10-08 01:33:42 +00:00
Craig Topper	27170fee8d	[X86] If we see an insert of a bitcast into zero vector, canonicalize it to move the bitcast to the other side of the insert. This improves detection of zeroing of upper bits during isel. llvm-svn: 315161	2017-10-08 01:33:41 +00:00
Craig Topper	f7a19db649	[X86] Remove ISD::INSERT_SUBVECTOR handling from combineBitcastForMaskedOp. Add isel patterns to make up for it. This will allow for some flexibility in canonicalizing bitcasts around insert_subvector. llvm-svn: 315160	2017-10-08 01:33:40 +00:00
Craig Topper	16f2044fa8	[X86] Use getConstantOperandVal to simplify some code. NFC llvm-svn: 315159	2017-10-08 01:33:38 +00:00
Rui Ueyama	10f5867135	Remove redundant initialization code. llvm-svn: 315158	2017-10-08 00:45:34 +00:00
Vedant Kumar	2465e64846	cmake: Fix one more usage of append() append() isn't available with some cmake versions, so I need to use a different construct. I missed this case in r315144. http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA/39355 llvm-svn: 315157	2017-10-07 20:20:42 +00:00
Simon Pilgrim	9508fe7924	[X86][SSE] Match bitcasted BUILD_VECTOR of constants for v2i64 shifts on 64-bit targets (PR34855) Extension to rL315155, generate constant shifts on 64-bits as well as 32-bits. llvm-svn: 315156	2017-10-07 17:57:22 +00:00
Simon Pilgrim	70e1db78db	[X86][SSE] Match bitcasted v4i32 BUILD_VECTORS for v2i64 shifts on 64-bit targets (PR34855) We were already doing this for 32-bit targets, but we can generate these on 64-bits as well. llvm-svn: 315155	2017-10-07 17:42:17 +00:00
Craig Topper	90b76211d3	[SelectionDAG} Use KnownBits::isUnknown and hasConflict. NFC llvm-svn: 315154	2017-10-07 17:07:48 +00:00
Craig Topper	2f60295364	[X86] Add X86ISD::CMOV to computeKnownBitsForTargetNode and ComputeNumSignBitsForTargetNode. Summary: Implementations based on ISD::SELECT. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38663 llvm-svn: 315153	2017-10-07 16:51:19 +00:00

1 2 3 4 5 ...

273510 Commits All Branches Search

273510 Commits

All Branches