llvm-project

Commit Graph

Author	SHA1	Message	Date
Alina Sbirlea	6f937b1144	LoadStoreVectorizer: Remove TargetBaseAlign. Keep alignment for stack adjustments. Summary: TargetBaseAlign is no longer required since LSV checks if target allows misaligned accesses. A constant defining a base alignment is still needed for stack accesses where alignment can be adjusted. Previous patch (D22936) was reverted because tests were failing. This patch also fixes the cause of those failures: - x86 failing tests either did not have the right target, or the right alignment. - NVPTX failing tests did not have the right alignment. - AMDGPU failing test (merge-stores) should allow vectorization with the given alignment but the target info considers <3xi32> a non-standard type and gives up early. This patch removes the condition and only checks for a maximum size allowed and relies on the next condition checking for %4 for correctness. This should be revisited to include 3xi32 as a MVT type (on arsenm's non-immediate todo list). Note that checking the sizeInBits for a MVT is undefined (leads to an assertion failure), so we need to create an EVT, hence the interface change in allowsMisaligned to include the Context. Reviewers: arsenm, jlebar, tstellarAMD Subscribers: jholewinski, arsenm, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D23068 llvm-svn: 277735	2016-08-04 16:38:44 +00:00
Adrian Prantl	98d78405b0	Shamelessly add myself to CREDITS.TXT llvm-svn: 277734	2016-08-04 16:28:22 +00:00
Bruno Cardoso Lopes	4e786cf3de	[ASAN] Mark test/asan/TestCases/ill.cc as unsupported on darwin Introduced in r277621, this test is currently failing all around in public bots: http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_check/20787 and internal bots. Mark it as unsupported on darwin until we figure out how it should behave. llvm-svn: 277733	2016-08-04 15:57:30 +00:00
Daniel Sanders	5dcbac57c5	[mips] Set Personality and LSDA encoding for FreeBSD Reviewers: seanbruno, sdardis Subscribers: tberghammer, danalbert, srhines, dsanders, sdardis, llvm-commits, seanbruno Differential Revision: https://reviews.llvm.org/D23113 llvm-svn: 277732	2016-08-04 15:36:03 +00:00
Sanjay Patel	9d591d15ec	[InstCombine] use m_APInt to allow icmp eq (sub C1, X), C2 folds for splat constant vectors llvm-svn: 277731	2016-08-04 15:19:25 +00:00
Jonas Hahnfeld	d1f4b8f6e8	Add test case for nested creation of tasks For discussion in D23115 llvm-svn: 277730	2016-08-04 14:55:56 +00:00
Alexander Kornienko	6b2a4d5e8f	[clang-tidy] misc-argument-comment non-strict mode Summary: The misc-argument-comment check now ignores leading and trailing underscores and case. The new `StrictMode` local/global option can be used to switch back to strict checking. Add getLocalOrGlobal version for integral types, minor cleanups. Reviewers: hokein, aaron.ballman Subscribers: aaron.ballman, Prazek, cfe-commits Differential Revision: https://reviews.llvm.org/D23135 llvm-svn: 277729	2016-08-04 14:54:54 +00:00
Simon Pilgrim	c2370b810d	[X86][SSE] Split off shuffle mask canonicalization from lowerVectorShuffle. NFCI. The new function now returns true if the shuffle should be commuted. This will allow target shuffle combines to share the code. llvm-svn: 277728	2016-08-04 14:21:32 +00:00
Krzysztof Parzyszek	7773c58458	[Hexagon] Clear kill flags from modified registers in peephole optimizer llvm-svn: 277727	2016-08-04 14:17:16 +00:00
Tobias Grosser	f919d8b360	GPGPU: Support scalars that are mapped to shared memory llvm-svn: 277726	2016-08-04 13:57:29 +00:00
Nikolai Bozhenov	f679530ba1	[X86] Heuristic to selectively build Newton-Raphson SQRT estimation On modern Intel processors hardware SQRT in many cases is faster than RSQRT followed by Newton-Raphson refinement. The patch introduces a simple heuristic to choose between hardware SQRT instruction and Newton-Raphson software estimation. The patch treats scalars and vectors differently. The heuristic is that for scalars the compiler should optimize for latency while for vectors it should optimize for throughput. It is based on the assumption that throughput bound code is likely to be vectorized. Basically, the patch disables scalar NR for big cores and disables NR completely for Skylake. Firstly, scalar SQRT has shorter latency than NR code in big cores. Secondly, vector SQRT has been greatly improved in Skylake and has better throughput compared to NR. Differential Revision: https://reviews.llvm.org/D21379 llvm-svn: 277725	2016-08-04 12:47:28 +00:00
Tobias Grosser	8950cead7f	GPGPU: Disable verbose debug output llvm-svn: 277724	2016-08-04 12:44:03 +00:00
Tobias Grosser	b0dd95bcd2	Remove leftover debug output llvm-svn: 277723	2016-08-04 12:41:28 +00:00
Tobias Grosser	130ca30f92	GPGPU: Add private memory support llvm-svn: 277722	2016-08-04 12:39:03 +00:00
Tobias Grosser	b513b4916b	GPGPU: Add support for shared memory llvm-svn: 277721	2016-08-04 12:18:14 +00:00
Rafael Espindola	a4b41dca31	Remove redundant argument. But always set Script<ELFT>::X->OutputSections. llvm-svn: 277720	2016-08-04 12:13:05 +00:00
Hrvoje Varga	846bdb746d	[mips][microMIPS] Implement CFC1, CFC2, CTC1 and CTC2 instructions Differential Revision: https://reviews.llvm.org/D22347 llvm-svn: 277719	2016-08-04 11:22:52 +00:00
Simon Pilgrim	c8fe132756	[X86] Dropped XOP ctbits checks - they match the AVX checks llvm-svn: 277718	2016-08-04 11:04:13 +00:00
Jonas Hahnfeld	20236611d4	kmp_taskdeps.cpp: Fix debugging output node->dn.task is only filled after the dependencies are already processed. This currently leads to unhelpful output from KA_TRACE or even a crash if one enables KMP_SUPPORT_GRAPH_OUTPUT. llvm-svn: 277717	2016-08-04 11:03:47 +00:00
Simon Pilgrim	5d5ca9c0cb	[X86][SSE] Add initial costs for vector CTTZ/CTLZ llvm-svn: 277716	2016-08-04 10:51:41 +00:00
Ying Yi	0ef31b7960	[LLVM-COV] Replace tabs with space indentation in the HTML coverage report. When using orbis-llvm-cov.exe to generate the HTML report, the HTML report can look quite different to the source file if it includes tabs. The default tab size is 2 spaces instead of 8 spaces. A command line switch is added to set the tab size. Differential Revision: https://reviews.llvm.org/D23087 llvm-svn: 277715	2016-08-04 10:39:43 +00:00
Jonas Hahnfeld	3d88f0c3fb	Remove LLVM_ENABLE_LIBCXXABI libc++.so is now a linker script that includes -lc++abi if necessary. Differential Revision: https://reviews.llvm.org/D22861 llvm-svn: 277714	2016-08-04 10:24:48 +00:00
Simon Pilgrim	8ae6dad49b	[X86][SSE] Don't decide when to scalarize CTTZ/CTLZ for performance at lowering - this is what cost models are for Improved CTTZ/CTLZ costings will be added shortly llvm-svn: 277713	2016-08-04 10:14:39 +00:00
Benjamin Kramer	87e6d99487	Make isExternC work on VarDecls too. llvm-svn: 277712	2016-08-04 10:02:03 +00:00
George Rimar	54a5486918	[ELF] - Attempt to fix buildbot. http://lab.llvm.org:8011/builders/lld-x86_64-darwin13/builds/25733/steps/test_lld/logs/stdio Fix: removed excessive whitespace. llvm-svn: 277711	2016-08-04 09:49:26 +00:00
George Rimar	eefa758ee2	[ELF] - Linkerscript: implemented ASSERT() keyword. ASSERT(exp, message) Ensure that exp is non-zero. If it is zero, then exit the linker with an error code, and print message. ASSERT is useful and was seen in a few projects in the wild. Differential revision: https://reviews.llvm.org/D22912 llvm-svn: 277710	2016-08-04 09:29:31 +00:00
Kirill Bobyrev	8100940b8b	[clang-rename] add missing clang-format improvements r277702 introduced clang-format changes so that later commits wouldn't introduce non-functional changes while running clang-format before committing. However, a few changes made by clang-format weren't in the patch. llvm-svn: 277709	2016-08-04 09:23:30 +00:00
Simon Dardis	57f4ae4625	[mips] Enable tail calls by default Enable tail calls by default for (micro)MIPS(64). microMIPS is slightly more tricky than doing it for MIPS(R6) or microMIPSR6. microMIPS has two instruction encodings: 16bit and 32bit along with some restrictions on the size of the instruction that can fill the delay slot. For safe tail calls for microMIPS, the delay slot filler attempts to find a correct size instruction for the delay slot of TAILCALL pseudos. Reviewers: dsanders, vkalintris Subscribers: jfb, dsanders, sdardis, llvm-commits Differential Revision: https://reviews.llvm.org/D21138 llvm-svn: 277708	2016-08-04 09:17:07 +00:00
Tobias Grosser	b187515784	GPGPU: Cache PTX kernels We always keep a number of already compiled kernels available to avoid costly recompilation. llvm-svn: 277707	2016-08-04 09:15:58 +00:00
George Rimar	9e5386ceae	[ELF] - Linkerscript: Fixed SORT_BY_ALIGNMENT sorting order. According to spec: "SORT_BY_ALIGNMENT will sort sections into descending order by alignment before placing them in the output file" Previously they were sorted into ascending order. llvm-svn: 277706	2016-08-04 08:56:17 +00:00
George Rimar	b32733423f	[ELF] - Remove trailing whitespaces. NFC. llvm-svn: 277705	2016-08-04 08:26:02 +00:00
Diana Picus	ddddbc2440	Typo fix in comment. NFC llvm-svn: 277704	2016-08-04 08:25:08 +00:00
Eugene Leviant	c7611fc567	[ELF] Linkerscript: remove repeated sections in filter() llvm-svn: 277703	2016-08-04 08:20:23 +00:00
Miklos Vajna	0c07f0cb0b	Run clang-format on clang-rename code So that later commits don't introduce non-functional changes when running clang-format before committing. Reviewers: klimek Differential Revision: https://reviews.llvm.org/D23153 llvm-svn: 277702	2016-08-04 07:43:29 +00:00
Dean Michael Berris	7e9abea2ae	[XRay] Align entry and return sleds to 2 byte boundaries This should ensure that we can atomically write two bytes (on top of the retq and the one past it) and have those two bytes not straddle cache lines. We also move the label past the alignment instruction so that we can refer to the actual first instruction, as opposed to potential padding before the aligned instruction. Update the tests to allow us to reflect the new order of assembly. Reviewers: rSerge, echristo, majnemer Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D23101 llvm-svn: 277701	2016-08-04 07:37:28 +00:00
Matt Arsenault	b0e32f1ba1	AMDGPU: Fix a slow test by using basic regalloc This just tests that the register limit isn't exceeded, so the register allocation doesn't need to be great. The critically slow part is all in greedy RA, so switch to basic. llvm-svn: 277700	2016-08-04 07:04:54 +00:00
Tobias Grosser	00bb5a99f5	GPGPU: Handle scalar array references Pass the content of scalar array references to the alloca on the kernel side and do not additionally pass them as normal LLVM scalar values. llvm-svn: 277699	2016-08-04 06:55:59 +00:00
Tobias Grosser	3216f8546c	BlockGenerator: Assert that we do not get alloca of array access llvm-svn: 277698	2016-08-04 06:55:53 +00:00
Tobias Grosser	576932728d	GPGPU: Pass subtree values correctly to the kernel llvm-svn: 277697	2016-08-04 06:55:49 +00:00
Eric Christopher	abb2b54ad3	After PR28761 use -Wall with -Werror in builtins tests to identify possible problems in headers. llvm-svn: 277696	2016-08-04 06:02:50 +00:00
Amaury Sechet	bf3adfdbfb	Fix intrinsics.ll test llvm-svn: 277695	2016-08-04 05:35:25 +00:00
Amaury Sechet	6bea674c43	Add popcount(n) == bitsize(n) -> n == -1 transformation. Summary: As per title. Reviewers: majnemer, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23139 llvm-svn: 277694	2016-08-04 05:27:20 +00:00
David Majnemer	4eefd6bca4	Forgot the dyn_cast_or_null intended for r277691. llvm-svn: 277693	2016-08-04 04:47:18 +00:00
Bruno Cardoso Lopes	3076db8da0	[Darwin] Exclude interception union tests on Darwin and Android Since the directory is empty on Darwin, disable the inclusion and avoid the warning below. Exclude on Android as well to match the behavior from lib/interception/tests/CMakeLists.txt lit.py: /Users/buildslave/jenkins/sharedspace/clang-R_master@2/llvm/utils/lit/lit/discovery.py:224: warning: input '/Users/buildslave/jenkins/sharedspace/clang-R_master@2/clang-build/Build/tools/clang/runtime/compiler-rt-bins/test/interception/Unit' contained no tests This fixes the above warning in some of public bots, like http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_check/8686 Differential Revision: https://reviews.llvm.org/D23128 rdar://problem/27581108 llvm-svn: 277692	2016-08-04 04:46:39 +00:00
David Majnemer	909793fa63	Reinstate "[CloneFunction] Don't remove side effecting calls" This reinstates r277611 + r277614 and reverts r277642. A cast_or_null should have been a dyn_cast_or_null. llvm-svn: 277691	2016-08-04 04:24:02 +00:00
Bruno Cardoso Lopes	bd887581fc	Revert "GVN-hoist: enable by default" & "Make GVN Hoisting obey optnone/bisect." This reverts commits r277685 & r277688. r277685 broke compiler-rt compilation http://lab.llvm.org:8080/green/job/clang-stage1-configure-RA_build/23335 and r277688 is a followup from it. llvm-svn: 277690	2016-08-04 04:16:24 +00:00
Chandler Carruth	a053a88df5	[PM] Change the name of the repeating utility to something less overloaded (and simpler). Sean rightly pointed out in code review that we've started using "wrapper pass" as a specific part of the old pass manager, and in fact it is more applicable there. Here, we really have a pass template to build a repeated pass, so call it that. llvm-svn: 277689	2016-08-04 03:52:53 +00:00
Sebastian Pop	b33bfa198c	Make GVN Hoisting obey optnone/bisect. Differential Revision: https://reviews.llvm.org/D23136 llvm-svn: 277688	2016-08-04 02:05:08 +00:00
Rui Ueyama	c163318b21	Remove buggy PROVIDE-in-output-description command. With the previous change, it is now obvious that readProvide in this context appended new commands to the wrong command list. It was mistakenly adding new commands to the top level. Thus, all commands inside output section descriptions were interpreted as if they were written at the top level. PROVIDE command naturally requires symbol assignment support in the output section description. We don't have that one yet. I removed the implementation because there's no way to fix it now. We can resurrect the test once we support the symbol assignment (with a modification to detect errors that we failed to find as described.) llvm-svn: 277687	2016-08-04 02:03:29 +00:00
Rui Ueyama	104165643e	Make ScriptParser::read* functions more functional style. Previously, many read* functions created new command objects and added them directly to the top-level data structure. This does not work for some commands because some commands, such as the assignment, can appear inside and outside of the output section description. This patch no longer appends objects to the top-level data structure. Callers are now responsible for doing that. llvm-svn: 277686	2016-08-04 02:03:27 +00:00

238579 Commits

All Branches