llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	980b280f50	[LibCallSimplifier] fold memset(malloc(x), 0, x) --> calloc(1, x) This is a step towards solving PR25892: https://llvm.org/bugs/show_bug.cgi?id=25892 It won't handle the reported case. As noted by the 'TODO' comments in the patch, we need to relax the hasOneUse() constraint and also match patterns that include memset_chk() and the llvm.memset() intrinsic in addition to memset(). Differential Revision: http://reviews.llvm.org/D16337 llvm-svn: 258816	2016-01-26 16:17:24 +00:00
Chad Rosier	f662fb3dc8	Revert "[Driver] Make sure -fno-math-builtin option is being passed by the driver." This reverts commit r258814. llvm-svn: 258815	2016-01-26 16:16:53 +00:00
Chad Rosier	17d2e8789c	[Driver] Make sure -fno-math-builtin option is being passed by the driver. Support for the -fno-math-builtin option was added in r186899. The codegen side is being tested in test/CodeGen/nomathbuiltin.c. The missing part was just passing the option through the driver. PR26317 llvm-svn: 258814	2016-01-26 15:52:05 +00:00
Chad Rosier	38fd54edc5	[Driver] Update FIXME comment now that PR4941 has been addressed. The actual fix should be addressed by someone who can test on Darwin. llvm-svn: 258813	2016-01-26 15:46:29 +00:00
Matthew Simpson	61d5a18469	Revert "Reapply commit r258404 with fix" This commit exposes a crash in computeKnownBits on the Chromium buildbots. Reverting to investigate. Reference: https://llvm.org/bugs/show_bug.cgi?id=26307 llvm-svn: 258812	2016-01-26 15:45:49 +00:00
Igor Laevsky	03a670c0ec	Re-submit r256008 "Improve DWARFDebugFrame::parse to also handle __eh_frame." Originally this change was causing failures on windows buildbots. But those problems were fixed in r258806. llvm-svn: 258811	2016-01-26 15:09:42 +00:00
Dan Gohman	fb619e9686	[WebAssembly] Fix a typo in a comment. llvm-svn: 258810	2016-01-26 14:55:17 +00:00
Michael Kruse	ee6a4fc680	Unique phi write accesses Ensure that there is at most one phi write access per PHINode and ScopStmt. In particular, this would be possible for non-affine subregions with multiple exiting blocks. We replace multiple MAY_WRITE accesses by one MUST_WRITE access. The written value is constructed using a PHINode of all exiting blocks. The interpretation of the PHI WRITE's "accessed value" changed from the incoming value to the PHI like for PHI READs since there is no unique incoming value. Because region simplification shuffles around PHI nodes -- particularly with exit node PHIs -- the PHINodes at analysis time does not always exist anymore in the code generation pass. We instead remember the incoming block/value pair in the MemoryAccess. Differential Revision: http://reviews.llvm.org/D15681 llvm-svn: 258809	2016-01-26 13:33:27 +00:00
Michael Kruse	ad28e5a589	Unique value read accesses Keep at most one value read MemoryAccess per value and statement; multiple generated loads do not have any additional effect. As one such MemoryAccess can cater multiple uses within the statement, the AccessInstruction property is not unique any more and set to nullptr. Differential Revision: http://reviews.llvm.org/D15510 llvm-svn: 258808	2016-01-26 13:33:15 +00:00
Michael Kruse	436db620e7	Unique value write accesses Ensure there is at most one write access per definition of an llvm::Value. Keep track of already created value write access by using a (dense) map. Replace addValueWriteAccess by ensureValueStore which can be uses more liberally without worrying to add redundant accesses. It will be used, e.g. in a logical correspondant for value reads -- ensureValueReload -- to ensure that the expected definition has been written when loading it. Differential Revision: http://reviews.llvm.org/D15483 llvm-svn: 258807	2016-01-26 13:33:10 +00:00
Igor Laevsky	0e1605a3b4	[DebugInfo] Fix DWARFDebugFrame instruction operand ordering We can't rely on the evalution order of function arguments. Differential Revision: http://reviews.llvm.org/D16509 llvm-svn: 258806	2016-01-26 13:31:11 +00:00
Alexey Bataev	1189bd0205	[OPENMP 4.5] Allow arrays in 'reduction' clause. OpenMP 4.5, alogn with array sections, allows to use variables of array type in reductions. llvm-svn: 258804	2016-01-26 12:20:39 +00:00
Johannes Doerfert	6f50c29ab2	[FIX] Domain generation error due to loops in non-affine regions llvm-svn: 258803	2016-01-26 11:03:25 +00:00
Johannes Doerfert	432658d7b8	[FIX] Build correct domain for non-affine region SCoPs llvm-svn: 258802	2016-01-26 11:01:41 +00:00
Alexander Kornienko	dc84150e4f	Fix crashing on user-defined conversion. Summary: Fix the assertion failure for the user-defined conversion method. e.g.: operator bool() Reviewers: alexfh, aaron.ballman Subscribers: aaron.ballman, cfe-commits Patch by Cong Liu! Differential Revision: http://reviews.llvm.org/D16536 llvm-svn: 258801	2016-01-26 10:56:27 +00:00
Ewan Crawford	b649b0053b	[RenderScript] Provide option to specify a single allocation to print Patch replaces the 'renderscript allocation list' command flag --refresh, with a new option --id <ID>. This new option only prints the details of a single allocation with a given id, rather than printing all the allocations. Functionality from the removed '--refresh' flag will be moved into its own command in a subsequent commit. llvm-svn: 258800	2016-01-26 10:41:08 +00:00
Tobias Grosser	f2cdd144e5	BlockGenerators: Replace getNewScalarValue with getNewValue Both functions implement the same functionality, with the difference that getNewScalarValue assumes that globals and out-of-scop scalars can be directly reused without loading them from their corresponding stack slot. This is correct for sequential code generation, but causes issues with outlining code e.g. for OpenMP code generation. getNewValue handles such cases correctly. Hence, we can replace getNewScalarValue with getNewValue. This is not only more future proof, but also eliminates a bunch of code. The only functionality that was available in getNewScalarValue that is lost is the on-demand creation of scalar values. However, this is not necessary any more as scalars are always loaded at the beginning of each basic block and will consequently always be available when scalar stores are generated. As this was not the case in older versions of Polly, it seems the on-demand loading is just some older code that has not yet been removed. Finally, generateScalarLoads also generated loads for values that are loop invariant, available in GlobalMap and which are preferred over the ones loaded in generateScalarLoads. Hence, we can just skip the code generation of such scalar values, avoiding the generation of dead code. Differential Revision: http://reviews.llvm.org/D16522 llvm-svn: 258799	2016-01-26 10:01:35 +00:00
Simon Pilgrim	46696ef93c	[X86][SSE] Add zero element and general 64-bit VZEXT_LOAD support to EltsFromConsecutiveLoads This patch adds support for trailing zero elements to VZEXT_LOAD loads (and checks that no zero elts occur within the consecutive load). It also generalizes the 64-bit VZEXT_LOAD load matching to work for loads other than 2x32-bit loads. After this patch it will also be easier to add support for other basic load patterns like 32-bit VZEXT_LOAD loads, PMOVZX and subvector load insertion. Differential Revision: http://reviews.llvm.org/D16217 llvm-svn: 258798	2016-01-26 09:30:08 +00:00
Ismail Donmez	c9655d9bd5	Fix compilations with msvc's /Zc:strictStrings llvm-svn: 258797	2016-01-26 08:24:57 +00:00
Rui Ueyama	231b5e23c5	Simplify. NFC. llvm-svn: 258796	2016-01-26 07:17:29 +00:00
Rui Ueyama	3ae28a4758	Simplify. NFC. llvm-svn: 258795	2016-01-26 07:17:27 +00:00
Matt Arsenault	cf70cb9d00	AMDGPU: Add amdgcn cube builtins llvm-svn: 258794	2016-01-26 06:37:54 +00:00
Craig Topper	b9c932f26e	[X86] Mark LDS/LES as not being allowed in 64-bit mode. Their opcodes are used as part of the VEX prefix in 64-bit mode. Clearly the disassembler implicitly decoded them as AVX instructions in 64-bit mode, but I think the AsmParser would have encoded them. llvm-svn: 258793	2016-01-26 06:10:15 +00:00
Rui Ueyama	d6cea14cbb	Simplify. NFC. This new code should be logically equivalent to the previous code. llvm-svn: 258792	2016-01-26 04:58:58 +00:00
Enrico Granata	dd54a3a887	Reverting r258759 as it is breaking the OSX build llvm-svn: 258791	2016-01-26 04:53:10 +00:00
Matt Arsenault	bee7575e1a	AMDGPU: Move AMDGPU intrinsics only used by R600 llvm-svn: 258790	2016-01-26 04:49:24 +00:00
Matt Arsenault	382d945d16	AMDGPU: Tidy minor td file issues Make comments and indentation more consistent. Rearrange a few things to be in a more consistent order, such as organizing subtarget features from those describing an actual device property, and those used as options. llvm-svn: 258789	2016-01-26 04:49:22 +00:00
Matt Arsenault	c5f6152911	AMDGPU: Make v32i8/v64i8 illegal types Old intrinsics were forcing these, but they have now all been removed. This fixes large i8 vector operations generally being broken. llvm-svn: 258788	2016-01-26 04:43:48 +00:00
Matt Arsenault	018179fc46	AMDGPU: Remove old sample intrinsics I did my best to try to update all the uses in tests that just happened to use the old ones to the newer intrinsics. I'm not sure I got all of the immediate operand conversions correct, since the value seems to have been ignored by the old pattern but I don't think it really matters. llvm-svn: 258787	2016-01-26 04:38:08 +00:00
Matt Arsenault	051d6f9fde	AMDGPU: Add new amdgcn intrinsics for cube instructions More cleanup to try to get all intrinsics using the correct amdgcn prefix that are as close to the instruction as possible. llvm-svn: 258786	2016-01-26 04:29:56 +00:00
Matt Arsenault	9a10cea7fb	AMDGPU: Implement read_register and write_register intrinsics Some of the special intrinsics now that now correspond to a instruction also have special setting of some registers, e.g. llvm.SI.sendmsg sets m0 as well as use s_sendmsg. Using these explicit register intrinsics may be a better option. Reading the exec mask and others may be useful for debugging. For this I'm not sure this is entirely correct because we would want this to be convergent, although it's possible this is already treated sufficently conservatively. llvm-svn: 258785	2016-01-26 04:29:24 +00:00
Matt Arsenault	cee02ccc1d	AMDGPU: Note mesa version in release notes llvm-svn: 258784	2016-01-26 04:29:15 +00:00
Matt Arsenault	0c3e2338fe	AMDGPU: Restore AMDGPU prefixed rsq intrinsic for now Also move into backend intrinsics to discourage use of the old name. llvm-svn: 258783	2016-01-26 04:14:16 +00:00
Xiuli Pan	bb4d8d30b1	Recommit: R258773 [OpenCL] Pipe builtin functions Fix arc patch fuzz error. Summary: Support for the pipe built-in functions for OpenCL 2.0. The pipe builtin functions may have infinite kinds of element types, one approach would be to just generate calls that would always use generic types such as void*. This patch is based on bader's opencl support patch on SPIR-V branch. Reviewers: Anastasia, pekka.jaaskelainen Subscribers: keryell, bader, cfe-commits Differential Revision: http://reviews.llvm.org/D15914 llvm-svn: 258782	2016-01-26 04:03:48 +00:00
Dan Gohman	bdf08d5da6	[WebAssembly] Optimize memcpy/memmove/memcpy calls. These calls return their first argument, but because LLVM uses an intrinsic with a void return type, they can't use the returned attribute. Generalize the store results pass to optimize these calls too. llvm-svn: 258781	2016-01-26 04:01:11 +00:00
Dan Gohman	be6f196bff	[WebAssembly] Remove a completed entry from the README.txt. llvm-svn: 258780	2016-01-26 03:43:48 +00:00
Dan Gohman	bb3722430f	[WebAssembly] Implement unaligned loads and stores. Differential Revision: http://reviews.llvm.org/D16534 llvm-svn: 258779	2016-01-26 03:39:31 +00:00
Richard Trieu	3a5c958182	Fix -Wnull-conversion for long macros. Move the function to get a macro name from DiagnosticRenderer.cpp to Lexer.cpp so that other files can use it. Lexer now has two functions to get the immediate macro name, the newly added one is better for diagnostic purposes. Make -Wnull-conversion use this function for better NULL macro detection. llvm-svn: 258778	2016-01-26 02:51:55 +00:00
Haicheng Wu	f1c00a22be	[LIR] Add support for structs and hand unrolled loops This is a recommit of r258620 which causes PR26293. The original message: Now LIR can turn following codes into memset: typedef struct foo { int a; int b; } foo_t; void bar(foo_t f, unsigned n) { for (unsigned i = 0; i < n; ++i) { f[i].a = 0; f[i].b = 0; } } void test(foo_t f, unsigned n) { for (unsigned i = 0; i < n; i += 2) { f[i] = 0; f[i+1] = 0; } } llvm-svn: 258777	2016-01-26 02:27:47 +00:00
Ehsan Akhgari	fefe300a62	Recommit the test for r258720 using -### llvm-svn: 258776	2016-01-26 02:23:05 +00:00
David Majnemer	747f168e8d	Revert "[OpenCL] Pipe builtin functions" This reverts commit r258773, it broke the build bots: http://bb.pgr.jp/builders/cmake-clang-x86_64-linux/builds/43853 llvm-svn: 258775	2016-01-26 02:22:31 +00:00
Reid Kleckner	0b5220d0aa	Use binary search for intrinsic ID lookups This improves compile time of Function.cpp from 57s to 37s for me locally. Intrinsic IDs are cached on the Function object, so this shouldn't regress performance. llvm-svn: 258774	2016-01-26 02:06:41 +00:00
Xiuli Pan	3a9952c9e7	[OpenCL] Pipe builtin functions Summary: Support for the pipe built-in functions for OpenCL 2.0. The pipe builtin functions may have infinite kinds of element types, one approach would be to just generate calls that would always use generic types such as void*. This patch is based on bader's opencl support patch on SPIR-V branch. Reviewers: Anastasia, pekka.jaaskelainen Subscribers: keryell, bader, cfe-commits Differential Revision: http://reviews.llvm.org/D15914 llvm-svn: 258773	2016-01-26 02:06:04 +00:00
Ehsan Akhgari	09a8a8a59d	Revert the test for r258720 temporarily This test is failing on a bot for reasons that are unclear to me. Reverting for now... llvm-svn: 258772	2016-01-26 01:51:47 +00:00
Matthias Braun	db320773e1	LiveIntervalAnalysis: Improve some comments As recommended by Justin. llvm-svn: 258771	2016-01-26 01:40:48 +00:00
David Majnemer	fedb8d4bab	[Sema] Remove stray semicolons. No functional change is intended. llvm-svn: 258769	2016-01-26 01:39:17 +00:00
David Majnemer	d3d91bd17f	[Sema] Incomplete types are OK for covariant returns Per C++14 [class.virtual]p8, it is OK for the return type's class type to be incomplete so long as the return type is the same between the base and complete classes. This fixes PR26297. llvm-svn: 258768	2016-01-26 01:37:01 +00:00
Rui Ueyama	5ec41f3b74	Add missing template instantiations. llvm-svn: 258767	2016-01-26 01:32:00 +00:00
Rafael Espindola	cc3ae413ce	Fix MSVC build. llvm-svn: 258766	2016-01-26 01:30:07 +00:00
Zachary Turner	a37eac51de	Fix TestRerun.py on Windows. This is another example of a test that was looking for the thread at index 0 instead of requesting the thread that was stopped at the created breakpoint. This assumption isn't true on Windows 10. llvm-svn: 258764	2016-01-26 01:19:50 +00:00

1 2 3 4 5 ...

220918 Commits All Branches Search

220918 Commits

All Branches