llvm-project

Commit Graph

Author	SHA1	Message	Date
George Rimar	ec02b8d4c0	[ELF] - Partial support of --gdb-index command line option (Part 3). Patch continues work started in D24706 and D25821. In this patch, symbol table and constant pool areas were added to .gdb_index section output. This one finishes the implementation of --gdb-index functionality in LLD. Differential revision: https://reviews.llvm.org/D26283 llvm-svn: 289810	2016-12-15 12:07:53 +00:00
Alexey Bataev	70f090d568	[TESTS] Initial commit of tests, by Andrew Tischenko llvm-svn: 289809	2016-12-15 12:06:27 +00:00
Roman Gareev	15db81ef71	[NFC] Fix typos in getMacroKernelParams. llvm-svn: 289808	2016-12-15 12:00:57 +00:00
Alexey Bataev	67c90c7d95	[TESTS] Initial commit of tests, by Andrew Tischenko llvm-svn: 289807	2016-12-15 11:48:24 +00:00
Roman Gareev	8babe1a216	The order of the loops defines the data reused in the BLIS implementation of gemm ([1]). In particular, elements of the matrix B, the second operand of matrix multiplication, are reused between iterations of the innermost loop. To keep the reused data in cache, only elements of matrix A, the first operand of matrix multiplication, should be evicted during an iteration of the innermost loop. To provide such a cache replacement policy, elements of the matrix A can, in particular, be loaded first and, consequently, be least-recently-used. In our case, matrices are stored in row-major order instead of the column-major order used in the BLIS implementation ([1]). One way to address this is to change the order of the loops of the loop nest accordingly. However, this causes elements of the matrix A to be reused in the innermost loop and, consequently, requires loading elements of the matrix B first. Since the LLVM vectorizer always generates loads from the matrix A before loads from the matrix B, we cannot provide this. Consequently, we only change the BLIS micro kernel and the computation of its parameters instead. In particular, reused elements of the matrix B are successively multiplied by specific elements of the matrix A. Refs.: [1] - http://www.cs.utexas.edu/users/flame/pubs/TOMS-BLIS-Analytical.pdf Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: https://reviews.llvm.org/D25653 llvm-svn: 289806	2016-12-15 11:47:38 +00:00
Nemanja Ivanovic	552c8e960e	[Power9] Allow AnyExt immediates for XXSPLTIB In some situations, the BUILD_VECTOR node that builds a v16i8 vector by a splat of an i8 constant will end up with signed 8-bit values and in other situations, it'll end up with unsigned ones. Handle both situations. Fixes PR31340. llvm-svn: 289804	2016-12-15 11:16:20 +00:00
Dylan McKay	4f590f28e7	[AVR] Support floats in the instrumentation pass This also refactors some common code into the 'GetTypeName' method. llvm-svn: 289803	2016-12-15 11:02:41 +00:00
Eric Fiselier	f34964bdd7	Fix XFAILS for is_trivially_destructible trait llvm-svn: 289802	2016-12-15 11:00:07 +00:00
Pavel Labath	1f2c1b6ccd	Remove linux/personality.h wrapper This code is currently unused. Removing it should make porting of the linux plugin to NetBSD easier, and we can always add it later if needed. llvm-svn: 289801	2016-12-15 10:47:40 +00:00
Simon Pilgrim	9ebeac3eed	[CostModel][X86] Add tests for reverse shuffle costs llvm-svn: 289800	2016-12-15 10:45:53 +00:00
Eric Liu	26cf68af3a	[change-namespace] handling templated type aliases correctly. Summary: This fixes templated type aliases and templated type aliases in classes. Reviewers: hokein Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D27801 llvm-svn: 289799	2016-12-15 10:42:35 +00:00
Prakhar Bahuguna	bc35f21f70	Add missing triple target for numeric section flag test llvm-svn: 289798	2016-12-15 10:20:48 +00:00
Malcolm Parsons	8e67aa9a9b	[clang-tidy] Enhance modernize-use-auto to templated function casts Summary: Use auto when declaring variables that are initialized by calling a templated function that returns its explicit first argument. Fixes PR26763. Reviewers: aaron.ballman, alexfh, staronj, Prazek Subscribers: Eugene.Zelenko, JDevlieghere, cfe-commits Differential Revision: https://reviews.llvm.org/D27166 llvm-svn: 289797	2016-12-15 10:19:56 +00:00
George Rimar	232d11cb54	[ELF] - Attempt to fix ubuntu 64x buildbot (2). Fixed inaccurate member type: uint32_t -> size_t (http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/2984/steps/build/logs/stdio). llvm-svn: 289796	2016-12-15 09:59:18 +00:00
Pavel Labath	08c2e86802	Simplify format member detection in FormatVariadic Summary: This replaces the format member search, which was quite complicated, with a more direct approach to detecting whether a class should be formatted using the format-member method. Instead we use a special type llvm::format_adapter, which every adapter must inherit from. Then the search can be simply implemented with the is_base_of type trait. Aside from the simplification, I like this way more because it makes it more explicit that you are supposed to use this type only for adapter-like formattings, and the other approach (format_provider overloads) should be used as a default (a mistake I made when first trying to use this library). The only slight change in behaviour here is that we now choose the format-adapter branch even if the format member invocation will fail to compile (e.g. because it is a non-const member function and we are passing a const adapter), whereas previously we would have gone on to search for format_providers for the type. However, I think that is actually a good thing, as it probably means the programmer did something wrong. Reviewers: zturner, inglorion Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27679 llvm-svn: 289795	2016-12-15 09:40:27 +00:00
Sjoerd Meijer	96e10b5a9e	[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently This is essentially a recommit of r285893, but with a correctness fix. The problem of the original commit was that this: bic r5, r7, #31 cbz r5, .LBB2_10 got rewritten into: lsrs r5, r7, #5 beq .LBB2_10 The result in destination register r5 is not the same and this is incorrect when r5 is not dead. So this fix includes checking the uses of the AND destination register. And also, compared to the original commit, some regression tests didn't need changing anymore because of this extra check. For completeness, this was the original commit message: For the common pattern (CMPZ (AND x, #bitmask), #0), we can do some more efficient instruction selection if the bitmask is one consecutive sequence of set bits (32 - clz(bm) - ctz(bm) == popcount(bm)). 1) If the bitmask touches the LSB, then we can remove all the upper bits and set the flags by doing one LSLS. 2) If the bitmask touches the MSB, then we can remove all the lower bits and set the flags with one LSRS. 3) If the bitmask has popcount == 1 (only one set bit), we can shift that bit into the sign bit with one LSLS and change the condition query from NE/EQ to MI/PL (we could also implement this by shifting into the carry bit and branching on BCC/BCS). 4) Otherwise, we can emit a sequence of LSLS+LSRS to remove the upper and lower zero bits of the mask. 1-3 require only one 16-bit instruction and can elide the CMP. 4 requires two 16-bit instructions but can elide the CMP and doesn't require materializing a complex immediate, so is also a win. Differential Revision: https://reviews.llvm.org/D27761 llvm-svn: 289794	2016-12-15 09:38:59 +00:00
Dylan McKay	4b028e2ee1	[AVR] Add argument indices to the instrumentation hook functions This allows the instrumentation hook functions to do better pretty-printing. llvm-svn: 289793	2016-12-15 09:38:09 +00:00
George Rimar	aff2530cf8	[ELF] - Attempt to fix ubuntu bot. (http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-ubuntu-fast/builds/2982) llvm-svn: 289792	2016-12-15 09:30:07 +00:00
Michael Kruse	7037fde427	Remove references to AssumptionCache. NFC. The AssumptionCache was removed in r289756 after being replaced by an additional operand list of affected values in r289755. The absence of that cache means that we now have to manually search for llvm.assume intrinsics as now done by other passes (LazyValueInfo, CodeMetrics) do not take into account an llvm::Instruction's user lists (ScalarEvolution). llvm-svn: 289791	2016-12-15 09:25:14 +00:00
George Rimar	8b54739328	[ELF] - Partial support of --gdb-index command line option (Part 2). Patch continues work started in D24706. In this patch, the address area was added to .gdb_index section output. Differential revision: https://reviews.llvm.org/D25821 llvm-svn: 289790	2016-12-15 09:08:13 +00:00
Dean Michael Berris	76e56c6777	[XRay][compiler-rt][NFC] Deduplicate code in x86-64 trampolines. Summary: The layout of all registers saved on stack shouldn't deviate and will be reused in future trampolines as well. While there, fix whitespace and clarify comments. Author: mpel Reviewers: dberris Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27799 llvm-svn: 289789	2016-12-15 09:04:05 +00:00
Prakhar Bahuguna	13e9921ccc	Fix for build warning in execute-only support llvm-svn: 289788	2016-12-15 08:42:04 +00:00
Yaxun Liu	402804b6d6	Re-commit r289252 and r289285, and fix PR31374 llvm-svn: 289787	2016-12-15 08:09:08 +00:00
Prakhar Bahuguna	61ef150d53	[ARM] Implement execute-only support in CodeGen Summary: This implements execute-only support for ARM code generation, which prevents the compiler from generating data accesses to code sections. The following changes are involved: * Add the CodeGen option "-arm-execute-only" to the ARM code generator. * Add the clang flag "-mexecute-only" as well as the GCC-compatible alias "-mpure-code" to enable this option. * When enabled, literal pools are replaced with MOVW/MOVT instructions, with VMOV used in addition for floating-point literals. As the MOVT instruction is required, execute-only support is only available in Thumb mode for targets supporting ARMv8-M baseline or Thumb2. * Jump tables are placed in data sections when in execute-only mode. * The execute-only text section is assigned section ID 0, and is marked as unreadable with the SHF_ARM_PURECODE flag with symbol 'y'. This also overrides selection of ELF sections for globals. Reviewers: t.p.northover, rengolin Subscribers: llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D27450 llvm-svn: 289786	2016-12-15 07:59:24 +00:00
Prakhar Bahuguna	e640c6f765	Allow ELF section flags to be specified numerically Summary: GAS already allows flags for sections to be specified directly as a numeric value. This functionality is particularly useful for setting processor or application-specific values that may not be directly supported or understood by LLVM. This patch allows LLVM to use numeric section flag values verbatim if specified by the assembly file. Reviewers: grosbach, rafael, t.p.northover, rengolin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27451 llvm-svn: 289785	2016-12-15 07:59:15 +00:00
Prakhar Bahuguna	52a7dd7d78	[ARM] Implement execute-only support in CodeGen This implements execute-only support for ARM code generation, which prevents the compiler from generating data accesses to code sections. The following changes are involved: * Add the CodeGen option "-arm-execute-only" to the ARM code generator. * Add the clang flag "-mexecute-only" as well as the GCC-compatible alias "-mpure-code" to enable this option. * When enabled, literal pools are replaced with MOVW/MOVT instructions, with VMOV used in addition for floating-point literals. As the MOVT instruction is required, execute-only support is only available in Thumb mode for targets supporting ARMv8-M baseline or Thumb2. * Jump tables are placed in data sections when in execute-only mode. * The execute-only text section is assigned section ID 0, and is marked as unreadable with the SHF_ARM_PURECODE flag with symbol 'y'. This also overrides selection of ELF sections for globals. llvm-svn: 289784	2016-12-15 07:59:08 +00:00
Saleem Abdulrasool	342beeb91e	CodeGen: force builtins to be local Unfortunately _setjmp3 can be both import or local. The ASAN tests try to emulate the flags which makes this harder to detect. Rely on the linker creating or using thunks here instead. Should repair the ASAN windows bots. llvm-svn: 289783	2016-12-15 07:29:04 +00:00
George Rimar	14460e0216	[ELF] - Do not crash when moving the location counter backward. PR31335 shows that we do that in the following case: SECTIONS { .text 0x2000 : {. = 0x100 ; *(.text) } } though the documentation says that "If . is used inside a section description however, it refers to the byte offset from the start of that section, not an absolute address. " It looks like this does not work as documented in bfd (as mentioned in comments for PR31335). Until we find out the expected behavior, it was suggested to at least not 'crash', which is what we do after trying to generate a huge file. Differential revision: https://reviews.llvm.org/D27712 llvm-svn: 289782	2016-12-15 07:27:28 +00:00
Eric Fiselier	7dfa62687c	Fix typo llvm-svn: 289781	2016-12-15 07:23:44 +00:00
Eric Fiselier	f4d7c18628	Add tests for LWG 2796 llvm-svn: 289780	2016-12-15 07:15:39 +00:00
Sanjoy Das	93b1de0f8c	Add missing -mtriple to MIR test case llvm-svn: 289779	2016-12-15 07:13:50 +00:00
Eric Fiselier	3fede1c9c0	Add more test cases for PR31384 llvm-svn: 289778	2016-12-15 07:05:19 +00:00
Yaxun Liu	6f8d90999e	Attempt to fix llvm-readobj crash on ppc64 due to r289674 llvm-svn: 289777	2016-12-15 06:59:23 +00:00
Saleem Abdulrasool	6cb0744934	CodeGen: fix runtime function dll storage Properly attribute DLL storage to runtime functions. When generating the runtime function, scan for an existing declaration which may provide an explicit declaration (local storage) or a DLL import or export storage from the user. Honour that if available. Otherwise, if building with a local visibility of the public or standard namespaces (-flto-visibility-public-std), give the symbols local storage (it indicates a /MT[d] link, so static runtime). Otherwise, assume that the link is dynamic, and give the runtime function dllimport storage. This allows for implementations to get the correct storage as long as they are properly declared, the user to override the import storage, and in case no explicit storage is given, use of the import storage. llvm-svn: 289776	2016-12-15 06:59:05 +00:00
Daniel Jasper	befe7a3fc4	Fix go bindings after r289702 (hopefully, don't really know how to build them, build.sh seems to be broken). llvm-svn: 289775	2016-12-15 06:54:29 +00:00
Eric Fiselier	9ce1745464	Add test case for PR31384 llvm-svn: 289774	2016-12-15 06:38:07 +00:00
Eric Fiselier	347a1cc221	Revert r289727 due to PR31384 This patch reverts the changes to tuple which fixed construction from types derived from tuple. It breaks the code mentioned in llvm.org/PR31384. I'll follow this commit up with a test case. llvm-svn: 289773	2016-12-15 06:34:54 +00:00
Kostya Serebryany	628b43aab6	[libFuzzer] enable the failure-resistant merge by default (with trace-pc-guard only) llvm-svn: 289772	2016-12-15 06:21:21 +00:00
Dylan McKay	dc58eb543f	[AVR] Whitelist the avrlit config environment variables This allows us to use `lit` to run on-target execution tests. llvm-svn: 289769	2016-12-15 06:04:53 +00:00
Hal Finkel	f19e114237	Revert part of r289765 that is not necessary CS.doesNotAccessMemory(ArgNo) and CS.onlyReadsMemory(ArgNo) calls dataOperandHasImpliedAttr, so revert this part of r289765 because it should not be necessary. llvm-svn: 289768	2016-12-15 05:50:45 +00:00
Eric Fiselier	a0620a1c45	XFAIL test for more apple-clang versions llvm-svn: 289767	2016-12-15 05:41:07 +00:00
Hal Finkel	34f9d6ac11	Trying to fix NDEBUG build after r289764 llvm-svn: 289766	2016-12-15 05:33:19 +00:00
Hal Finkel	39fed399e1	Fix argument attribute queries with bundle operands When iterating over data operands in AA, don't make argument-attribute-specific queries on bundle operands. Trying to fix self hosting... llvm-svn: 289765	2016-12-15 05:09:15 +00:00
Sanjoy Das	d7389d6261	[MachineBlockPlacement] Don't make blocks "uneditable" Summary: This fixes an issue with MachineBlockPlacement due to a badly timed call to `analyzeBranch` with `AllowModify` set to true. The timeline is as follows: 1. `MachineBlockPlacement::maybeTailDuplicateBlock` calls `TailDup.shouldTailDuplicate` on its argument, which in turn calls `analyzeBranch` with `AllowModify` set to true. 2. This `analyzeBranch` call edits the terminator sequence of the block based on the physical layout of the machine function, turning an unanalyzable non-fallthrough block to a unanalyzable fallthrough block. Normally MBP bails out of rearranging such blocks, but this block was unanalyzable non-fallthrough (and thus rearrangeable) the first time MBP looked at it, and so it goes ahead and decides where it should be placed in the function. 3. When placing this block MBP fails to analyze and thus update the block in keeping with the new physical layout. Concretely, before (1) we have something like: ``` LBL0: < unknown terminator op that may branch to LBL1 > jmp LBL1 LBL1: ... A LBL2: ... B ``` In (2), analyze branch simplifies this to ``` LBL0: < unknown terminator op that may branch to LBL2 > ;; jmp LBL1 <- redundant jump removed LBL1: ... A LBL2: ... B ``` In (3), MachineBlockPlacement goes ahead with its plan of putting LBL2 after the first block since that is profitable. ``` LBL0: < unknown terminator op that may branch to LBL2 > ;; jmp LBL1 <- redundant jump LBL2: ... B LBL1: ... A ``` and the program now has incorrect behavior (we no longer fall-through from `LBL0` to `LBL1`) because MBP can no longer edit LBL0. There are several possible solutions, but I went with removing the teeth off of the `analyzeBranch` calls in TailDuplicator. That makes thinking about the result of these calls easier, and breaks nothing in the lit test suite. I've also added some bookkeeping to the MachineBlockPlacement pass and used that to write an assert that would have caught this. Reviewers: chandlerc, gberry, MatzeB, iteratee Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D27783 llvm-svn: 289764	2016-12-15 05:08:57 +00:00
Mehdi Amini	ba80a837ab	Revert "Fix printf specifier handling: invalid specifier should not be marked as "consuming data arguments"" This reverts commit r289762, wasn't ready to be pushed, it broke the printf tests. llvm-svn: 289763	2016-12-15 04:58:51 +00:00
Mehdi Amini	0dcbcb7eb8	Fix printf specifier handling: invalid specifier should not be marked as "consuming data arguments" llvm-svn: 289762	2016-12-15 04:51:22 +00:00
Mehdi Amini	ab11d83048	Fix os_log formatting with arbitrary precision and field width llvm-svn: 289761	2016-12-15 04:02:31 +00:00
Peter Collingbourne	6ee0b4e9f5	COFF: Open and map input files asynchronously on Windows. Profiling revealed that the majority of lld's execution time on Windows was spent opening and mapping input files. We can reduce this cost significantly by performing these operations asynchronously. This change introduces a queue for all operations on input file data. When we discover that we need to load a file (for example, when we find a lazy archive for an undefined symbol, or when we read a linker directive to load a file from disk), the file operation is launched using a future and the symbol resolution operation is enqueued. This implies another change to symbol resolution semantics, but it seems to be harmless ("ninja All" in Chromium still succeeds). To measure the perf impact of this change I linked Chromium's chrome_child.dll with both thin and fat archives. Thin archives: Before (median of 5 runs): 19.50s After: 10.93s Fat archives: Before: 12.00s After: 9.90s On Linux I found that doing this asynchronously had a negative effect on performance, probably because the cost of mapping a file is small enough that it becomes outweighed by the cost of managing the futures. So on non-Windows platforms I use the deferred execution strategy. Differential Revision: https://reviews.llvm.org/D27768 llvm-svn: 289760	2016-12-15 04:02:23 +00:00
Craig Topper	ab5f355d8c	[AVX-512][InstCombine] Add masked scalar FMA intrinsics to SimplifyDemandedVectorElts. llvm-svn: 289759	2016-12-15 03:49:45 +00:00
Rui Ueyama	fd7ed23ee7	Rename functions as per post commit review for r289072. llvm-svn: 289758	2016-12-15 03:31:53 +00:00
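
The r289794 message above (96e10b5a9e) states that a bitmask is one consecutive sequence of set bits when 32 - clz(bm) - ctz(bm) == popcount(bm), and then lists four lowering cases. A minimal sketch of that check and case split follows. It is not the code from the patch: the names `clz32`, `ctz32`, `popcount32`, `isContiguousMask`, `MaskCase` and `classifyMask` are hypothetical, and testing the cases in the order the message lists them is an assumption.

```cpp
#include <cstdint>

// Portable bit counts; hypothetical helpers, not LLVM APIs.
inline unsigned popcount32(std::uint32_t v) {
  unsigned n = 0;
  for (; v != 0; v &= v - 1) // clear the lowest set bit each round
    ++n;
  return n;
}

inline unsigned clz32(std::uint32_t v) {
  if (v == 0)
    return 32;
  unsigned n = 0;
  while ((v & 0x80000000u) == 0) { // shift left until the MSB is set
    v <<= 1;
    ++n;
  }
  return n;
}

inline unsigned ctz32(std::uint32_t v) {
  if (v == 0)
    return 32;
  unsigned n = 0;
  while ((v & 1u) == 0) { // shift right until the LSB is set
    v >>= 1;
    ++n;
  }
  return n;
}

// True when the set bits of bm form one consecutive run:
// 32 - clz(bm) - ctz(bm) == popcount(bm), as quoted in the commit message.
inline bool isContiguousMask(std::uint32_t bm) {
  if (bm == 0)
    return false; // an empty mask has no run of set bits
  return 32 - clz32(bm) - ctz32(bm) == popcount32(bm);
}

// The four cases from the message, tested in the order they are listed
// (that ordering is an assumption of this sketch).
enum class MaskCase { NotContiguous, TouchesLSB, TouchesMSB, SingleBit, Interior };

inline MaskCase classifyMask(std::uint32_t bm) {
  if (!isContiguousMask(bm))
    return MaskCase::NotContiguous;
  if (bm & 1u)
    return MaskCase::TouchesLSB; // case 1: one LSLS
  if (bm & 0x80000000u)
    return MaskCase::TouchesMSB; // case 2: one LSRS
  if (popcount32(bm) == 1)
    return MaskCase::SingleBit;  // case 3: shift the bit into the sign bit
  return MaskCase::Interior;     // case 4: LSLS followed by LSRS
}
```

For instance, `classifyMask(0x00000FF0u)` falls into the two-instruction case, while `0x00000505u` is rejected because its set bits are not consecutive.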

249859 Commits All Branches Search