llvm-project

Commit Graph

Author	SHA1	Message	Date
Michal Gorny	62dc83b9b7	Reland r311836 - [Driver] Use arch type to find compiler-rt libraries (on Linux) Use llvm::Triple::getArchTypeName() when looking for compiler-rt libraries, rather than the exact arch string from the triple. This is more correct as it matches the values used when building compiler-rt (builtin-config-ix.cmake) which are the subset of the values allowed in triples. For example, this fixes an issue when the compiler set for i686-pc-linux-gnu triple would not find an i386 compiler-rt library, while this is the exact arch that is detected by compiler-rt. The same applies to any other i?86 variant allowed by LLVM. This also makes the special case for MSVC unnecessary, since now i386 will be used reliably for all 32-bit x86 variants. Differential Revision: https://reviews.llvm.org/D26796 llvm-svn: 311923	2017-08-28 20:29:52 +00:00
Davide Italiano	20cb7e887f	[LoopUnroll] Properly update loop structure in case of successful peeling. When peeling kicks in, it updates the loop preheader. Later, a successful full unroll of the loop needs to update a PHI which i-th argument comes from the loop preheader, so it'd better look at the correct block. Fixes PR33437. Differential Revision: https://reviews.llvm.org/D37153 llvm-svn: 311922	2017-08-28 20:29:33 +00:00
Joerg Sonnenberger	0f76a35c5e	Fix ARMv4 support ARMv4 doesn't support the "BX" instruction, which has been introduced with ARMv4t. Adjust the call lowering and tail call implementation accordingly. Further changes are necessary to ensure that presence of the v4t feature is correctly set. Most importantly, the "generic" CPU for thumb-* triples should include ARMv4t, since thumb mode without thumb support would naturally be pointless. Add a couple of asserts to ensure thumb instructions are not emitted without CPU support. Differential Revision: https://reviews.llvm.org/D37030 llvm-svn: 311921	2017-08-28 20:20:47 +00:00
Tobias Grosser	ee8ad1c0ff	[IslAst] Do not compare arrays in alias check which are known to be identical This possibly helps to avoid run-time check failures in the COSMO kernels. llvm-svn: 311920	2017-08-28 20:17:02 +00:00
Matthias Braun	3923b319ad	Try to fix compilation problem with libstdc++ llvm-svn: 311918	2017-08-28 20:11:28 +00:00
Matthias Braun	abbc4a7f1a	Address r311914 review comments llvm-svn: 311917	2017-08-28 20:11:27 +00:00
George Karpenkov	52a3e48e9f	[libFuzzer] Exclude a test failing on OS X. llvm-svn: 311916	2017-08-28 20:10:30 +00:00
Davide Italiano	9a09ae448d	[LoopUnroll] Add a cl::opt to force peeling, for testing purposes. Will be used to test the patch proposed in D37153. llvm-svn: 311915	2017-08-28 19:50:55 +00:00
Matthias Braun	afcff2d0d9	TableGen: Fix subreg composition/concatenation This fixes 2 problems in subregister hierarchies with multiple levels and tuples: 1) For bigger tuples computing secondary subregs would miss 2nd order effects. In the test case a register like `S10_S11_S12_S13_S14` with D5 = S10_S11, D6 = S12_S13 we would correctly compute sub0 = D5, sub1 = D6 but would miss the fact that we could now form ssub0_ssub1_ssub2_ssub3 (aka sub0_sub1) = D5_D6. This is fixed by changing computeSecondarySubRegs() to compute a fixpoint. 2) Fixing 1) exposed a problem where TableGen would create multiple names for effectively the same subregister index. In the test case the subregister index sub0 is composed from ssub0 and ssub1, and sub1 is composed from ssub2 and ssub3. TableGen should not create both sub0_sub1 and ssub0_ssub1_ssub2_ssub3 as infered subregister indexes. This changes the code to build a transitive closure of the subregister components before forming new concatenated subregister indexes. This fix was developed for an out of tree target. For the in-tree targets the only change is in the register information computed for ARM. There is a slight chance this fixed/improved some register coalescing around the QQQQ/QQ register classes there but I couldn't see/provoke any code generation differences. Differential Revision: https://reviews.llvm.org/D36913 llvm-svn: 311914	2017-08-28 19:48:42 +00:00
Matthias Braun	3fdc099a6d	TableGen: Add -gen-register-info-debug-dump Adds a new --gen-register-info-debug-dump mode to tablegen that dumps various register related information: - List of register classes with super and subclasses - List of subregister indexes with lanemasks - List of registers with subregisters I will use this in an upcoming commit to create a test. It may also be useful for target developers wanting to get an overview of all the register related information, esp. the things inferred by tablegen and not directly visible in the .td file. Differential Revision: https://reviews.llvm.org/D36911 llvm-svn: 311913	2017-08-28 19:48:40 +00:00
George Karpenkov	dacfd66dfd	[libFuzzer] Fix libFuzzer flag propagation for standalone builds. Under the previous configurations, flags from SANITIZER_COMMON were not propagated for standalone builds. Differential Revision: https://reviews.llvm.org/D37225 llvm-svn: 311912	2017-08-28 19:44:19 +00:00
George Karpenkov	16a05f2dfe	Proper dependency check for clang in compiler_rt. - Not having a dependency does not work in standalone build, as Clang does not exist. - if (TARGET clang) check is useless, as it is order-dependent, and Clang may not be registered yet. Differential Revision: https://reviews.llvm.org/D37228 llvm-svn: 311911	2017-08-28 19:39:05 +00:00
Alexey Bataev	f32ec1ab31	[OPENMP] Remove unused header files, NFC. llvm-svn: 311908	2017-08-28 19:26:54 +00:00
Geoff Berry	75c4ae3066	[ARM] Fix bug in ARMLoadStoreOptimizer when kill flags are missing. Summary: ARMLoadStoreOpt::FixInvalidRegPairOp() was only checking if one of the load destination registers to be split overlapped with the base register if the base register was marked as killed. Since kill flags may not always be present, this can lead to incorrect code. This bug was exposed by my MachineCopyPropagation change D30751 breaking the sanitizer-x86_64-linux-android buildbot. Also clean up some dead code and add an assert that a register offset is never encountered by this code, since it does not handle them correctly. Reviewers: MatzeB, qcolombet, t.p.northover Subscribers: aemerson, javed.absar, kristof.beyls, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D37164 llvm-svn: 311907	2017-08-28 19:03:45 +00:00
Taewook Oh	572f45a3c8	Create PHI node for the return value only when the return value has uses. Summary: Currently, a phi node is created in the normal destination to unify the return values from promoted calls and the original indirect call. This patch makes this phi node to be created only when the return value has uses. This patch is necessary to generate valid code, as compiler crashes with the attached test case without this patch. Without this patch, an illegal phi node that has no incoming value from `entry`/`catch` is created in `cleanup` block. I think existing implementation is good as far as there is at least one use of the original indirect call. `insertCallRetPHI` creates a new phi node in the normal destination block only when the original indirect call dominates its use and the normal destination block. Otherwise, `fixupPHINodeForNormalDest` will handle the unification of return values naturally without creating a new phi node. However, if there's no use, `insertCallRetPHI` still creates a new phi node even when the original indirect call does not dominate the normal destination block, because `getCallRetPHINode` returns false. Reviewers: xur, davidxl, danielcdh Reviewed By: xur Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37176 llvm-svn: 311906	2017-08-28 18:57:00 +00:00
Erich Keane	0a539b5859	Change Diagnostic Category size error from runtime to compiletime Diagnostic Categories are fairly annoying, and are only enforced by a runtime-debug-only assert. This puts in a touch more work to get this all done at compile-time with static asserts Differential Revision: https://reviews.llvm.org/D37122 llvm-svn: 311905	2017-08-28 18:53:17 +00:00
Zachary Turner	a7b041748d	[CodeView] Don't output S_UDT symbols for forward decls. S_UDT symbols are the debugger's "index" for all the structs, typedefs, classes, and enums in a program. If any of those structs/classes don't have a complete declaration, or if there is a typedef to something that doesn't have a complete definition, then emitting the S_UDT is unhelpful because it doesn't give the debugger enough information to do anything useful. On the other hand, it results in a huge size blow-up in the resulting PDB, which is exacerbated by an order of magnitude when linking with /DEBUG:FASTLINK. With this patch, we drop S_UDT records for types that refer either directly or indirectly (e.g. through a typedef, pointer, etc) to a class/struct/union/enum without a complete definition. This brings us about 50% of the way towards parity with /DEBUG:FASTLINK PDBs generated from cl-compiled object files. Differential Revision: https://reviews.llvm.org/D37162 llvm-svn: 311904	2017-08-28 18:49:04 +00:00
Stefan Pintilie	c35e4de388	[Power9] Add new instructions for floating point status and control registers. Added the following P9 instructions: mffsce, mffscdrn, mffscdrni, mffscrn, mffscrni, mffsl Differential Revision: https://reviews.llvm.org/D37167 llvm-svn: 311903	2017-08-28 18:46:01 +00:00
Craig Topper	3763f0e00d	[InstCombine] Call hasNoSignedWrap instead of hasNoUnsignedWrap to get the NSW flag when handling Add in SimplifyDemandedUseBits. This is a typo from r311789. This should fix PR34349. llvm-svn: 311902	2017-08-28 18:44:28 +00:00
Krzysztof Parzyszek	2164a271a3	[Hexagon] Check for potential bank conflicts in post-RA scheduling Insert artificial edges between loads that could cause a cache bank conflict. llvm-svn: 311901	2017-08-28 18:36:21 +00:00
Stanislav Mekhanoshin	312c557b3b	[AMDGPU] Fix regression in AMDGPULibCalls allowing native for doubles Under -cl-fast-relaxed-math we could use native_sqrt, but f64 was allowed to produce HSAIL's nsqrt instruction. HSAIL is not here and we stick with non-existing native_sqrt(double) as a result. Add check for f64 to not return native functions and also remove handling of f64 case for fold_sqrt. Differential Revision: https://reviews.llvm.org/D37223 llvm-svn: 311900	2017-08-28 18:00:08 +00:00
Reid Kleckner	ea2683ecb8	Fix inaccurate comment about -fdelayed-template-parsing and MSVC llvm-svn: 311899	2017-08-28 17:59:24 +00:00
Hans Wennborg	edd66ab9dc	Revert r311857 "Emit static constexpr member as available_externally definition" It caused PR759744. > Emit static constexpr member as available_externally definition > > By exposing the constant initializer, the optimizer can fold many > of these constructs. > > Differential Revision: https://reviews.llvm.org/D34992 llvm-svn: 311898	2017-08-28 17:53:00 +00:00
Stanislav Mekhanoshin	dad7cf62de	[AMDGPU] computeKnownBitsForTargetNode for 24 bit mul Differential Revision: https://reviews.llvm.org/D37168 llvm-svn: 311896	2017-08-28 16:35:37 +00:00
Krzysztof Parzyszek	95da97ec56	[Hexagon] Break up DAG mutations into separate classes, move to subtarget llvm-svn: 311895	2017-08-28 16:24:22 +00:00
Krzysztof Parzyszek	697297afa9	[Hexagon] Move pre-RA DAG mutations to scheduler constructor llvm-svn: 311894	2017-08-28 15:52:54 +00:00
Craig Topper	fa86fd928e	[X86] Make 128/256-bit extract_subvector Legal instead of Custom. Move combining with BUILD_VECTOR from Legalization to DAG combine EXTRACT_SUBVECTOR was marked Custom solely so we could combine it with BUILD_VECTOR operations to create smaller BUILD_VECTORS during Legalization. But that sort of combining should really be done by the DAG combiner. This patch adds the last piece of needed supported DAG combine to handle this. Once that's done we can make the EXTRACT_SUBVECTOR operations Legal. Differential Revision: https://reviews.llvm.org/D37197 llvm-svn: 311893	2017-08-28 15:32:50 +00:00
Craig Topper	029a21dfdc	[DAGCombiner] Teach visitEXTRACT_SUBVECTOR to turn extracts of BUILD_VECTOR into smaller BUILD_VECTORs Only do this before operations are legalized of BUILD_VECTOR is Legal for the target. Differential Revision: https://reviews.llvm.org/D37186 llvm-svn: 311892	2017-08-28 15:28:33 +00:00
Kostya Kortchinsky	476f21d87e	[sanitizer] Re-introduce kUseSeparateSizeClassForBatch for the 32-bit Primary Summary: Currently `TransferBatch` are located within the same memory regions as "regular" chunks. This is not ideal for security: they make for an interesting target to overwrite, and are not protected by the frontend (namely, Scudo). To solve this, we re-introduce `kUseSeparateSizeClassForBatch` for the 32-bit Primary allowing for `TransferBatch` to end up in their own memory region. Currently only Scudo would use this new feature, the default behavior remains unchanged. The separate `kBatchClassID` was used for a brief period of time previously but removed when the 64-bit ended up using the "free array". Reviewers: alekseyshl, kcc, eugenis Reviewed By: alekseyshl Subscribers: llvm-commits, kubamracek Differential Revision: https://reviews.llvm.org/D37082 llvm-svn: 311891	2017-08-28 15:20:02 +00:00
Ilya Biryukov	867e36de38	Changed Dockerfiles to install LLVM into /usr/local Summary: Previously, the installation path was simply '/'. Using '/usr/local' would ensure that LLVM installation does not conflict with software installed via package managers. Reviewers: mehdi_amini, klimek Reviewed By: klimek Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37213 llvm-svn: 311890	2017-08-28 15:12:24 +00:00
Simon Atanasyan	3876077ac7	[MIPS] Remove unused function's argument. NFC llvm-svn: 311889	2017-08-28 14:40:53 +00:00
Michael Kruse	a4f447c2a4	[PM] Properly require and preserve OptimizationRemarkEmitter. NFCI. Properly require and preserve the OptimizationRemarkEmitter for use in ScopPass. Previously one had to get the ORE from ScopDetection because CodeGeneration did not mark it as preserved. It would need to be recomputed which results in the legacy PM to throw away all previous SCoP analysis. This also changes the implementation of ScopPass::getAnalysisUsage to not unconditionally preserve all passes, but only those needed to be preserved by any SCoP pass (at least when using the legacy PM). This allows invalidating DependenceInfo (and IslAstInfo) in case the pass would cause them to change (e.g. OpTree, DeLICM, MaximalArrayExpansion) JSONImporter should also invalidate the DependenceInfo. In this patch it marks DependenceInfo as preserved anyway because some regression tests depend on it. Differential Revision: https://reviews.llvm.org/D37010 llvm-svn: 311888	2017-08-28 14:07:33 +00:00
Evgeny Mankov	1587086f88	[Support][CommandLine] Add cl::Option::setDefault() Add abstract virtual method setDefault() to class Option and implement it in its inheritors in order to be able to set all the options to its default values in user's code without actually knowing all these options. For instance: for (auto &OM : cl::getRegisteredOptions(cl::TopLevelSubCommand)) { cl::Option O = OM.second; O->setDefault(); } Reviewed by: rampitec, Eugene.Zelenko, kasaurov Differential Revision: http://reviews.llvm.org/D36877 llvm-svn: 311887	2017-08-28 13:39:43 +00:00
Alex Lorenz	595c644f84	Avoid missing std error code in RefactoringActionRulesTest.cpp This should fix this bot: http://bb.pgr.jp/builders/i686-mingw32-RA-on-linux llvm-svn: 311886	2017-08-28 12:03:08 +00:00
Michael Kruse	e983e6b1c5	[ZoneAlgo] Print rejection reasons to llvm::dbgs(). NFC. llvm-svn: 311885	2017-08-28 11:22:23 +00:00
Alex Lorenz	1586fa70a6	[refactor] initial support for refactoring action rules This patch implements the initial support for refactoring action rules. The first rule that's supported is a "source change" rule that returns a set of atomic changes. This patch is based on the ideas presented in my RFC: http://lists.llvm.org/pipermail/cfe-dev/2017-July/054831.html The following pieces from the RFC are added by this patch: - `createRefactoringRule` (known as `apply` in the RFC) - `requiredSelection` refactoring action rule requirement. - `selection::SourceSelectionRange` selection constraint. Differential Revision: https://reviews.llvm.org/D36075 llvm-svn: 311884	2017-08-28 11:12:05 +00:00
Peter Szecsi	0db84863d0	[StaticAnalyzer] LoopUnrolling: Keep track the maximum number of steps for each loop This way the unrolling can be restricted for loops which will take at most a given number of steps. It is defined as 128 in this patch and it seems to have a good number for that purpose. Differential Revision: https://reviews.llvm.org/D37181 llvm-svn: 311883	2017-08-28 10:50:28 +00:00
Andrew V. Tischenko	574962a3b3	The current version of LLVM X86 disassembler incorrectly interprets some possible sets of x86 prefixes. This patch is the first step to close PR7709 and PR17697. There will be next patch(es) to close relative PRs. Differential Revision: https://reviews.llvm.org/D36788 M lib/Target/X86/Disassembler/X86DisassemblerDecoder.cpp M lib/Target/X86/Disassembler/X86DisassemblerDecoder.h A test/MC/Disassembler/X86/prefixes-i386.s A test/MC/Disassembler/X86/prefixes-x86_64.s M test/MC/Disassembler/X86/prefixes.txt llvm-svn: 311882	2017-08-28 10:43:14 +00:00
Peter Szecsi	d0604acd8e	[StaticAnalyzer] LoopUnrolling: Excluding loops which splits the state Added check if the execution of the last step of the given unrolled loop has generated more branches. If yes, than treat it as a normal (non-unrolled) loop in the remaining part of the analysis. Differential Revision: https://reviews.llvm.org/D36962 llvm-svn: 311881	2017-08-28 10:34:50 +00:00
Peter Szecsi	d91554bc91	[StaticAnalyzer] LoopUnrolling fixes 1. The LoopUnrolling feature needs the LoopExit included in the CFG so added this dependency via the config options 2. The LoopExit element can be encountered even if we haven't encountered the block of the corresponding LoopStmt. So the asserts were not right. 3. If we are caching out the Node then we get a nullptr from generateNode which case was not handled. Differential Revision: https://reviews.llvm.org/D37103 llvm-svn: 311880	2017-08-28 10:21:24 +00:00
Gadi Haber	d76f7b824e	[X86][Haswell] Updating HSW instruction scheduling information This patch completely replaces the instruction scheduling information for the Haswell architecture target by modifying the file X86SchedHaswell.td located under the X86 Target. We used the scheduling information retrieved from the Haswell architects in order to replace and modify the existing scheduling. The patch continues the scheduling replacement effort started with the SNB target in r307529 and r310792. Information includes latency, number of micro-Ops and used ports by each HSW instruction. Please expect some performance fluctuations due to code alignment effects. Reviewers: RKSimon, zvi, aymanmus, craig.topper, m_zuckerman, igorb, dim, chandlerc, aaboud Differential Revision: https://reviews.llvm.org/D36663 llvm-svn: 311879	2017-08-28 10:04:16 +00:00
George Rimar	60608a8ae5	[ELF] - Simplify (use llvm::erase_if). NFC. llvm-svn: 311878	2017-08-28 09:28:15 +00:00
Gabor Horvath	857ccd2919	[analyzer][GSoC] Re-implemente current virtual calls checker in a path-sensitive way Patch by: Xin Wang Differential Revision: https://reviews.llvm.org/D34275 llvm-svn: 311877	2017-08-28 08:44:43 +00:00
NAKAMURA Takumi	ac08375ad5	Prune whitespaces in blank lines. llvm-svn: 311876	2017-08-28 07:48:37 +00:00
NAKAMURA Takumi	a1e97a77f5	Untabify. llvm-svn: 311875	2017-08-28 06:47:47 +00:00
Craig Topper	2c03e53f4e	[X86] Add support for __builtin_cpu_init This adds builtin_cpu_init which will emit a call to cpu_indicator_init in libgcc or compiler-rt. This is needed to support builtin_cpu_supports/builtin_cpu_is in an ifunc resolver. Differential Revision: https://reviews.llvm.org/D36336 llvm-svn: 311874	2017-08-28 05:43:23 +00:00
Craig Topper	33681161c4	[X86] Use getUnpackl helper to create an ISD::VECTOR_SHUFFLE instead of using X86ISD::UNPCKL in reduceVMULWidth. This runs fairly early, we should use target independent nodes if possible. llvm-svn: 311873	2017-08-28 05:14:38 +00:00
Craig Topper	2c77011d15	[X86] Add an early out to combineLoopMAddPattern and combineLoopSADPattern when SSE2 is disabled. Without this the madd.ll and sad.ll test cases both throw assertions if you run them with SSE2 disabled. llvm-svn: 311872	2017-08-28 04:29:08 +00:00
Dean Michael Berris	66faaceeb1	[XRay][compiler-rt] Return the pointer associated with the function instead of the sled Summary: XRay has erroneously been returning the address of the first sled in the instrumentation map for a function id instead of the (runtime-relocated) functison address. This causes confusion and issues for applications where: - The first sled in the function may not be an entry sled (due to re-ordering or some other reason). - The caller attempts to find a symbol associated with the pointer at runtime, because the sled may not be exactly where the function's known address is (in case of inlined functions or those that have an external definition for symbols). This fixes http://llvm.org/PR34340. Reviewers: eizan Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D37202 llvm-svn: 311871	2017-08-28 03:58:23 +00:00
Lang Hames	5d06c23d4c	[Error] Add a handleExpected utility. handleExpected is similar to handleErrors, but takes an Expected<T> as its first input value and a fallback functor as its second, followed by an arbitary list of error handlers (equivalent to the handler list of handleErrors). If the first input value is a success value then it is returned from handleErrors unmodified. Otherwise the contained error(s) are passed to handleErrors, along with the handlers. If handleErrors returns success (indicating that all errors have been handled) then handleExpected runs the fallback functor and returns its result. If handleErrors returns a failure value then the failure value is returned and the fallback functor is never run. This simplifies the process of re-trying operations that return Expected values. Without this utility such retry logic is cumbersome as the internal Error must be explicitly extracted from the Expected value, inspected to see if its handleable and then consumed: enum FooStrategy { Aggressive, Conservative }; Expected<Foo> tryFoo(FooStrategy S); Expected<Foo> Result; (void)!!Result; // "Check" Result so that it can be safely overwritten. if (auto ValOrErr = tryFoo(Aggressive)) Result = std::move(ValOrErr); else { auto Err = ValOrErr.takeError(); if (Err.isA<HandleableError>()) { consumeError(std::move(Err)); Result = tryFoo(Conservative); } else return std::move(Err); } with handleExpected, this can be re-written as: auto Result = handleExpected( tryFoo(Aggressive), []() { return tryFoo(Conservative); }, [](HandleableError&) { /* discard to handle */ }); llvm-svn: 311870	2017-08-28 03:36:46 +00:00

1 2 3 4 5 ...

270362 Commits All Branches Search

270362 Commits

All Branches