llvm-project

Commit Graph

Author	SHA1	Message	Date
Pierre Gousseau	e3014fc144	The test added in r275267 does not work on read-only checkouts because of the use of touch -m -t. Following Tom Rybka suggestion, the test files are now copied to a temporary directory first. llvm-svn: 275415	2016-07-14 13:58:27 +00:00
Nico Weber	ecdf45b1e6	Teach fast isel calls and rets about stdcall. stdcall is callee-pop like thiscall, so the thiscall changes already did most of the work for this. This change only opts stdcall in and adds tests. llvm-svn: 275414	2016-07-14 13:54:26 +00:00
Simon Pilgrim	bed37ccd54	[X86][AVX] Added an additional vperm2f128 memory folding test llvm-svn: 275413	2016-07-14 13:40:53 +00:00
Simon Pilgrim	534e3240e8	Remove trailing whitespace. llvm-svn: 275412	2016-07-14 13:29:23 +00:00
Simon Pilgrim	3ecb6bdd5f	[X86][AVX2] Allow VPERMPD/VPERMQ shuffles to call combineShuffle This improves the situation discussed in D19228 where we were forcing VPERMPD/VPERMQ where VPERM2F128/VPERM2I128 would have been better. llvm-svn: 275411	2016-07-14 13:28:43 +00:00
Daniel Sanders	46fe6550ac	[mips] SelectionDAGISel subclasses now follow the optimization level. Summary: It was recently discovered that, for Mips's SelectionDAGISel subclasses, all optimization levels caused SelectionDAGISel to behave like -O2. This change adds the necessary plumbing to initialize the optimization level. Reviewers: andrew.w.kaylor Subscribers: andrew.w.kaylor, sdardis, dean, llvm-commits, vradosavljevic, petarj, qcolombet, probinson, dsanders Differential Revision: https://reviews.llvm.org/D14900 llvm-svn: 275410	2016-07-14 13:25:22 +00:00
Benjamin Kramer	56a46bc680	Upgrade all the .arcconfigs to https. llvm-svn: 275409	2016-07-14 13:15:37 +00:00
Aaron Ballman	977daf307d	Speculatively fix the sphinx build, which does not think the original code was valid nasm (http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/11854/steps/docs-llvm-html/logs/stdio ). llvm-svn: 275408	2016-07-14 13:08:16 +00:00
Aaron Ballman	c337fafa36	This is a malformed :option: tag -- we don't have an option directive that matches it, so turning it actual text instead of a markup tag. This will hopefully fix the clang docs build (http://lab.llvm.org:8011/builders/clang-sphinx-docs/builds/15194/steps/docs-clang-html/logs/stdio ) llvm-svn: 275407	2016-07-14 13:01:00 +00:00
Simon Pilgrim	053d32906f	[X86][AVX] Add support for narrowing 128-bit+ shuffle mask elements to 64-bits to allow combining Primarily this is to allow blend with zero instead of having to use vperm2f128, but we can use this in the future to deal with AVX512 cases where we need to keep the original element size to correctly fold masked operations. llvm-svn: 275406	2016-07-14 12:58:04 +00:00
Benjamin Kramer	69c476ccd2	[OpenCL] Actually activate Frontend/opencl.cl test and fix test bugs rL275318 added the test Frontend/opencl.cl test, but that test was never actually run because Frontend/lit.local.cfg doesn't contain the '.cl' file suffix. Once the test is activated, it fails with (unintended) compile errors in the newly added CHECK_INVALID_OPENCL_VERSION checks. This patch adds the '.cl' file suffix to Frontend/lit.local.cfg to activate the test and fixes the test bug by adding '-fblocks' to the relevant command lines. Patch by Martin Böhme! Differential Revision: http://reviews.llvm.org/D22349 llvm-svn: 275405	2016-07-14 12:56:21 +00:00
Aaron Ballman	745e752725	Correct the attribute documentation for the new XRay attributes. Fixes the documentation build. llvm-svn: 275404	2016-07-14 12:35:00 +00:00
Sjoerd Meijer	716abbb2f5	This converts a signed remainder instruction to unsigned remainder, which enables the code size optimisation to fold a rem and div into a single aeabi_uidivmod call. This was not happening before because sdiv was converted but srem not, and instructions with different signedness are not combined. Differential Revision: http://reviews.llvm.org/D22214 llvm-svn: 275403	2016-07-14 12:23:48 +00:00
Simon Pilgrim	700e4a1ab8	[X86][AVX] Add 128-bit wide shuffle tests that should combine to blend-with-zero llvm-svn: 275402	2016-07-14 12:21:40 +00:00
Sebastian Pop	63847d04e7	code hoisting pass based on GVN This pass hoists duplicated computations in the program. The primary goal of gvn-hoist is to reduce the size of functions before inline heuristics to reduce the total cost of function inlining. Pass written by Sebastian Pop, Aditya Kumar, Xiaoyu Hu, and Brian Rzycki. Important algorithmic contributions by Daniel Berlin under the form of reviews. Differential Revision: http://reviews.llvm.org/D19338 llvm-svn: 275401	2016-07-14 12:18:53 +00:00
Simon Pilgrim	a76a8e50e5	[X86][AVX] Add VBROADCASTF128/VBROADCASTI128 shuffle comments support llvm-svn: 275400	2016-07-14 12:07:43 +00:00
Dean Michael Berris	086639a6d0	Remove extra ';' to appease -Wpedantic Summary: Reviewers: dok Subscribers: llvm-commits llvm-svn: 275399	2016-07-14 11:46:41 +00:00
Simon Pilgrim	9e812169cc	[X86][AVX] Regenerate broadcast upgrade tests llvm-svn: 275398	2016-07-14 11:05:43 +00:00
Tobias Grosser	bd81a7eebc	Fix formatting llvm-svn: 275397	2016-07-14 10:53:00 +00:00
Tobias Grosser	aef5196f75	GPGPU: Map initial schedule to GPU schedule This change now applies ppcg's GPU mapping on our initial schedule. For this to work, we need to also initialize the set of all names (isl_ids) used in the scop as well as the program context. llvm-svn: 275396	2016-07-14 10:51:52 +00:00
Tobias Grosser	681bd5688f	GPGPU: Do not dump schedule by default llvm-svn: 275395	2016-07-14 10:51:47 +00:00
Pavel Labath	c54f9c4851	mark newly failing tests as XFAIL llvm-svn: 275394	2016-07-14 10:43:24 +00:00
Pavel Labath	fa3d652d26	[test] [linux] define PR_SET_PTRACER constants if the system does not provide them Android API <= 16 header do not have these symbols defined, but the kernel does support the relevant calls. And in general, since these calls are on a best-effort basis, it won't hurt even if we try to run in on a really ancient kernel. llvm-svn: 275393	2016-07-14 10:43:21 +00:00
Roman Gareev	6cf195b6d5	[NFC] Add full title/author information to "Apply the BLIS matmul optimization pattern" llvm-svn: 275392	2016-07-14 10:40:15 +00:00
Simon Pilgrim	b8c261c931	[X86][AVX2] VBROADCASTSSrr/VBROADCASTSSYrr require AVX2 not AVX llvm-svn: 275391	2016-07-14 10:37:14 +00:00
Tobias Grosser	f384594d5e	GPGPU: compute new schedule from polly scop To do so we copy the necessary information to compute an initial schedule from polly::Scop to ppcg's scop. Most of the necessary information is directly available and only needs to be passed on to ppcg, with the exception of 'tagged' access relations, access relations that additionally carry information about which memory access an access relation originates from. We could possibly perform the construction of tagged accesses as part of ScopInfo, but as this format is currently specific to ppcg we do not do this yet, but keep this functionality local to our GPU code generation. After the scop has been initialized, we compute data dependences and ask ppcg to compute an initial schedule. Some of this functionality is already available in polly::DependenceInfo and polly::ScheduleOptimizer, but to keep differences to ppcg small we use ppcg's functionality here. We may later investiage if a closer integration of these tools makes sense. llvm-svn: 275390	2016-07-14 10:22:25 +00:00
Tobias Grosser	e938517e37	GPGPU: create default initialized PPCG scop and gpu program At this stage, we do not yet modify the IR but just generate a default initialized ppcg_scop and gpu_prog and free both immediately. Both will later be filled with data from the polly::Scop and are needed to use PPCG for GPU schedule generation. This commit does not yet perform any GPU code generation, but ensures that the basic infrastructure has been put in place. We also add a simple test case to ensure the new code is run and use this opportunity to verify that GPU_CODEGEN tests are only run if GPU code generation has been enabled in cmake. llvm-svn: 275389	2016-07-14 10:22:19 +00:00
Benjamin Kramer	b67e1e2dd7	[clang-rename] add documentation clang-rename needs at least to have a minimum documentation to provide a small introduction for new users Patch by Kirill Bobyrev! Differential Revision: http://reviews.llvm.org/D22129 llvm-svn: 275388	2016-07-14 09:46:07 +00:00
Benjamin Kramer	1afefc0da3	[clang-rename] exit code-related bugfix and code cleanup This patch does the following: * enforces proper formatting for few files (i.e. deals with 80 linewidth violations and few other things) * ensures '\n' chars are passed to the output streams instead of "\n" strings * fixes a bug caused by calling cl::PrintHelpMessage(), which occasionally calls exit(0), so that exit(1) (which is right after cl::PrintHelpMessage line) becomes dead code Patch by Kirill Bobyrev! Differential Revision: http://reviews.llvm.org/D22091 llvm-svn: 275387	2016-07-14 09:46:03 +00:00
Haojian Wu	0c05e2e4b6	[include-fixer] Correct an incorrecst judgement about prefix scoped qualifiers. Summary: The judgement that checks whether the fully-qualified name has scoped qualifiers prefix is incorrect. Should always check whether the first matched postion is the beginning position. Reviewers: bkramer Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D22343 llvm-svn: 275386	2016-07-14 09:39:12 +00:00
Eugene Leviant	b030411414	[ELF] r275383 reverted due to buildbot failure llvm-svn: 275385	2016-07-14 09:21:24 +00:00
Asaf Badouh	a0b6f8fb56	[X86][AVX512F] minor fix of the parameter names add "__" prefix llvm-svn: 275384	2016-07-14 08:40:30 +00:00
Eugene Leviant	219d9b2b18	[ELF] Allow overriding reserved symbols in linker scripts llvm-svn: 275383	2016-07-14 08:26:41 +00:00
Sjoerd Meijer	38c2cd0c14	This implements a more optimal algorithm for selecting a base constant in constant hoisting. It not only takes into account the number of uses and the cost of expressions in which constants appear, but now also the resulting integer range of the offsets. Thus, the algorithm maximizes the number of uses within an integer range that will enable more efficient code generation. On ARM, for example, this will enable code size optimisations because less negative offsets will be created. Negative offsets/immediates are not supported by Thumb1 thus preventing more compact instruction encoding. Differential Revision: http://reviews.llvm.org/D21183 llvm-svn: 275382	2016-07-14 07:44:20 +00:00
Ilia K	beb1aa907d	Fix -break-enable/-break-disable commands (MI) * Previously -break-enable mistakenly set BP's enabled flag to false. * These commands print fake =breakpoint-modified messages, what's not needed anymore because that events are come in normal way. * Add tests for -break-enable/-break-disable commands Initial patch from xuefangliang@hotmail.com. The test case was improved by me. Differential Revision: http://reviews.llvm.org/D21757 llvm-svn: 275381	2016-07-14 07:43:14 +00:00
David Majnemer	666aa945a5	[InstCombine] Masked loads with undef masks can fold to normal loads We were able to fold masked loads with an all-ones mask to a normal load. However, we couldn't turn a masked load with a mask with mixed ones and undefs into a normal load. llvm-svn: 275380	2016-07-14 06:58:42 +00:00
David Majnemer	17a95aaa7b	Simplify llvm.masked.load w/ undef masks We can always pick the passthru value if the mask is undef: we are permitted to treat the mask as-if it were filled with zeros. llvm-svn: 275379	2016-07-14 06:58:37 +00:00
Craig Topper	6840f1150f	[AVX512] Implement EXTLOAD lowering with patterns to select existing VPMOVZX instructions instead of creating CodeGenOnly instructions. llvm-svn: 275378	2016-07-14 06:41:34 +00:00
Dean Michael Berris	25a1564e6c	Use hasFlag instead of hasArg Summary: Fix the build to use hasFlag instead of hasArg for checking some flags. Reviewers: echristo Subscribers: mehdi_amini, cfe-commits Differential Revision: http://reviews.llvm.org/D22338 llvm-svn: 275377	2016-07-14 06:37:46 +00:00
Eli Friedman	17e8ea18e9	[X86] Fix stupid typo in isel lowering. Apparently someone miscounted the number of zeros in the immediate. Fixes https://llvm.org/bugs/show_bug.cgi?id=28544 . llvm-svn: 275376	2016-07-14 05:48:25 +00:00
Matt Arsenault	ca7f5701f8	AMDGPU/R600: Delete/rename intrinsics no longer used by mesa Use the replacement pass to update the tests, and delete old names. llvm-svn: 275375	2016-07-14 05:47:17 +00:00
Rui Ueyama	3b04d833c4	Set sh_addralign to 1 instead of 0. ELF spec says that alignment of 0 is equivalent to 1. Previously, we arbitrary set to 0 or 1, but always setting to 1 makes our program simpler. llvm-svn: 275374	2016-07-14 05:46:24 +00:00
Rui Ueyama	0fad6ea551	Attempt to unbreak msan bot. r275301 made .got section be aligned on Target->GotEntrySize, so GotEntrySize must have been initialized. We didn't initialize it for AMDGPU. llvm-svn: 275373	2016-07-14 05:46:22 +00:00
Matt Arsenault	648e422bd9	AMDGPU/R600: Remove intrinsics with no tests and no users Mesa removed this path, so nothing is using these anymore. llvm-svn: 275372	2016-07-14 05:23:23 +00:00
Matt Arsenault	897eee4187	AMDGPU: Remove unused intrinsics llvm-svn: 275371	2016-07-14 05:23:19 +00:00
Matt Arsenault	aa94c1e7ee	AMDGPU: Fix test not actually testing anything It wasn't actually running the pass, and since it is missing the llvm prefix, the eh intrinsic was not really an IntrinsicInst. Also add missing test for lifetime markers. llvm-svn: 275370	2016-07-14 05:23:15 +00:00
Matt Arsenault	0bf9984bc8	AMDGPU: Remove dead code llvm-svn: 275369	2016-07-14 05:23:08 +00:00
Dean Michael Berris	39baab9326	Add C++ dependencies to xray runtime Summary: Depends on D21982 which implements the in-memory logging implementation of the XRay runtime. These additional changes also depends on D20352 which adds the bulk of XRay flags/dependencies when using the `-fxray-instrument` flag from Clang. Reviewers: echristo, rnk, aaron.ballman Subscribers: mehdi_amini, cfe-commits Differential Revision: http://reviews.llvm.org/D21983 llvm-svn: 275368	2016-07-14 04:58:44 +00:00
Dean Michael Berris	52735fc435	XRay: Add entry and exit sleds Summary: In this patch we implement the following parts of XRay: - Supporting a function attribute named 'function-instrument' which currently only supports 'xray-always'. We should be able to use this attribute for other instrumentation approaches. - Supporting a function attribute named 'xray-instruction-threshold' used to determine whether a function is instrumented with a minimum number of instructions (IR instruction counts). - X86-specific nop sleds as described in the white paper. - A machine function pass that adds the different instrumentation marker instructions at a very late stage. - A way of identifying which return opcode is considered "normal" for each architecture. There are some caveats here: 1) We don't handle PATCHABLE_RET in platforms other than x86_64 yet -- this means if IR used PATCHABLE_RET directly instead of a normal ret, instruction lowering for that platform might do the wrong thing. We think this should be handled at instruction selection time to by default be unpacked for platforms where XRay is not availble yet. 2) The generated section for X86 is different from what is described from the white paper for the sole reason that LLVM allows us to do this neatly. We're taking the opportunity to deviate from the white paper from this perspective to allow us to get richer information from the runtime library. Reviewers: sanjoy, eugenis, kcc, pcc, echristo, rnk Subscribers: niravd, majnemer, atrick, rnk, emaste, bmakam, mcrosier, mehdi_amini, llvm-commits Differential Revision: http://reviews.llvm.org/D19904 llvm-svn: 275367	2016-07-14 04:06:33 +00:00
Davide Italiano	ed4d5ea82a	[SCCP] Pass a Value * instead of templating this function. NFC. Thanks to Eli for the suggestion! llvm-svn: 275366	2016-07-14 03:02:34 +00:00

1 2 3 4 5 ...

236515 Commits All Branches Search

236515 Commits

All Branches