llvm-project

Commit Graph

Author	SHA1	Message	Date
Nick Lewycky	6ca07ca618	If a variable template is inside a context with template arguments that is being instantiated, and that instantiation fails, fail our instantiation instead of crashing. Errors have already been emitted. llvm-svn: 244515	2015-08-10 21:54:08 +00:00
Oleksiy Vyalov	9dcdd2ee03	Revert r244308 since it's introducing test regressions on Linux: - TestLldbGdbServer.py both clang & gcc, i386 and x86_64 - TestConstVariables.py gcc, i386 and x86_64 - 112 failures clang, i386 llvm-svn: 244514	2015-08-10 21:49:50 +00:00
Alex Lorenz	e5101e2016	MachineVerifier: Handle the optional def operand in a PATCHPOINT instruction. The PATCHPOINT instructions have a single optional defined register operand, but the machine verifier can't verify the optional defined register operands. This commit makes sure that the machine verifier won't report an error when a PATCHPOINT instruction doesn't have its optional defined register operand. This change will allow us to enable the machine verifier for the code generation tests for the patchpoint intrinsics. Reviewers: Juergen Ributzka llvm-svn: 244513	2015-08-10 21:47:36 +00:00
Reid Kleckner	c25c7944f0	[llvm-symbolizer] Remove underscores and other C mangling on Windows Summary: This makes it so that reports symbolized after the fact with llvm-symbolizer are more similar to the ones we generate at runtime with in-process dbghelp. Reviewers: samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11785 llvm-svn: 244512	2015-08-10 21:47:11 +00:00
Rafael Espindola	b61d67cf89	Update for llvm api change. llvm-svn: 244511	2015-08-10 21:30:13 +00:00
Rafael Espindola	aae5541455	Don't iterate over all sections in the ELFFile constructor. With this we finally have an ELFFile that is O(1) to construct. This is helpful for programs like lld which have to do their own section walk. llvm-svn: 244510	2015-08-10 21:29:35 +00:00
Sanjay Patel	cc6554361c	remove function names from comments; NFC llvm-svn: 244509	2015-08-10 21:28:16 +00:00
Alex Lorenz	2f43dd5a12	StackMap: FastISel: Add an appropriate number of immediate operands to the frame setup instruction. This commit ensures that the stack map lowering code in FastISel adds an appropriate number of immediate operands to the frame setup instruction. The previous code added just one immediate operand, which was fine for a target like AArch64, but on X86 the ADJCALLSTACKDOWN64 instruction needs two explicit operands. This caused the machine verifier to report an error when the old code added just one. Reviewers: Juergen Ributzka Differential Revision: http://reviews.llvm.org/D11853 llvm-svn: 244508	2015-08-10 21:27:03 +00:00
Rafael Espindola	0f2517314a	Rename improperly named variable. NFC. llvm-svn: 244507	2015-08-10 21:25:44 +00:00
Tyler Nowicki	40e5d08a74	Remove non-ascii characters. llvm-svn: 244506	2015-08-10 21:18:01 +00:00
Tyler Nowicki	655e573dc5	Make fp vectorization test X86 specified to avoid cost-model related problems on arm-thumb and hexagon. llvm-svn: 244505	2015-08-10 21:14:38 +00:00
Rafael Espindola	3db2273861	Add a test showing that objdump (and so ObjectFIle) can handle shndx. It was already passing, we were just not testing the code. llvm-svn: 244504	2015-08-10 21:00:15 +00:00
JF Bastien	fa9746dc8d	x86: Emit LAHF/SAHF instead of PUSHF/POPF NaCl's sandbox doesn't allow PUSHF/POPF out of security concerns (priviledged emulators have forgotten to mask system bits in the past, and EFLAGS's DF bit is a constant source of hilarity). Commit r220529 fixed PR20376 by saving cmpxchg's flags result using EFLAGS, this commit now generated LAHF/SAHF instead, for all of x86 (not just NaCl) because it leads to an overall performance gain over PUSHF/POPF. As with the previous patch this code generation is pretty bad because it occurs very later, after register allocation, and in many cases it rematerializes flags which were already available (e.g. already in a register through SETE). Fortunately it's somewhat rare that this code needs to fire. I did [[ https://github.com/jfbastien/benchmark-x86-flags \| a bit of benchmarking ]], the results on an Intel Haswell E5-2690 CPU at 2.9GHz are: \| Time per call (ms) \| Runtime (ms) \| Benchmark \| \| 0.000012514 \| 6257 \| sete.i386 \| \| 0.000012810 \| 6405 \| sete.i386-fast \| \| 0.000010456 \| 5228 \| sete.x86-64 \| \| 0.000010496 \| 5248 \| sete.x86-64-fast \| \| 0.000012906 \| 6453 \| lahf-sahf.i386 \| \| 0.000013236 \| 6618 \| lahf-sahf.i386-fast \| \| 0.000010580 \| 5290 \| lahf-sahf.x86-64 \| \| 0.000010304 \| 5152 \| lahf-sahf.x86-64-fast \| \| 0.000028056 \| 14028 \| pushf-popf.i386 \| \| 0.000027160 \| 13580 \| pushf-popf.i386-fast \| \| 0.000023810 \| 11905 \| pushf-popf.x86-64 \| \| 0.000026468 \| 13234 \| pushf-popf.x86-64-fast \| Clearly `PUSHF`/`POPF` are suboptimal. It doesn't really seems to be worth teaching LLVM about individual flags, at least not for this purpose. Reviewers: rnk, jvoung, t.p.northover Subscribers: llvm-commits Differential revision: http://reviews.llvm.org/D6629 llvm-svn: 244503	2015-08-10 20:59:36 +00:00
Chih-Hung Hsieh	00b6f74935	Fix test case to work with -Asserts builds. When clang is built with -DLLVM_ENABLE_ASSERTIONS=Off, it does not create names for IR values. Differential Revision: http://reviews.llvm.org/D11437 llvm-svn: 244502	2015-08-10 20:58:54 +00:00
Artem Belevich	b7e4aab40c	[CUDA] Add implicit __attribute__((used)) to all __global__ functions. This allows emitting kernels that were instantiated from the host code and which would never be explicitly referenced otherwise. Differential Revision: http://reviews.llvm.org/D11666 llvm-svn: 244501	2015-08-10 20:57:02 +00:00
Rafael Espindola	a01ff22bb1	Use higher level functions in llvm-objdump. This matches the rest of llvm-objdump better and isolates it from upcoming changes to ELFFile. llvm-svn: 244500	2015-08-10 20:50:40 +00:00
Sanjay Patel	d09391c8cd	fix minsize detection: minsize attribute implies optimizing for size llvm-svn: 244499	2015-08-10 20:45:44 +00:00
Sanjay Patel	178f8cba51	[x86, SSE]]add missing tests for load folding with partial register update The minsize case is wrong; that will be fixed in the next commit. llvm-svn: 244498	2015-08-10 20:34:34 +00:00
Artem Belevich	194ba60fe2	[CUDA] Added stubs for new attributes used by CUDA headers. The main purpose is to avoid errors and warnings while parsing CUDA header files. The attributes are currently unused otherwise. Differential version: http://reviews.llvm.org/D11690 llvm-svn: 244497	2015-08-10 20:33:56 +00:00
Rafael Espindola	821a64c7f3	Delete getDotSymtabSec. Another step in avoiding iterating over all sections in the ELFFile constructor. llvm-svn: 244496	2015-08-10 20:25:04 +00:00
Simon Pilgrim	a3a72b41de	[InstCombine] Move SSE2/AVX2 arithmetic vector shift folding to instcombiner As discussed in D11760, this patch moves the (V)PSRA(WD) arithmetic shift-by-constant folding to InstCombine to match the logical shift implementations. Differential Revision: http://reviews.llvm.org/D11886 llvm-svn: 244495	2015-08-10 20:21:15 +00:00
Tyler Nowicki	8e7661ec05	Removed unused and incorrectly implemented classof() on Optimization Remark base class. llvm-svn: 244494	2015-08-10 20:13:32 +00:00
Colin LeMahieu	3d9057470f	[TableGen] NFC improving comments about what the tokenized identifiers will contain. llvm-svn: 244493	2015-08-10 19:58:06 +00:00
Tyler Nowicki	8a0925cb62	Append options for floating-point commutivity when related diagnostics are produced. With this patch clang appends the command line options that would allow vectorization when floating-point commutativity is required. Specifically those are enabling fast-math or specifying a loop hint. llvm-svn: 244492	2015-08-10 19:56:40 +00:00
Jonathan Roelofs	f45295c366	Fix a few more cases of 'CHECK[^:]*$'. NFCI llvm-svn: 244491	2015-08-10 19:56:39 +00:00
Nick Lewycky	00a5d21803	Fix typo. llvm-svn: 244490	2015-08-10 19:54:11 +00:00
Tyler Nowicki	c1a86f5866	Late evaluation of the fast-math vectorization requirement. This patch moves the verification of fast-math to just before vectorization is done. This way we can tell clang to append the command line options would that allow floating-point commutativity. Specifically those are enableing fast-math or specifying a loop hint. llvm-svn: 244489	2015-08-10 19:51:46 +00:00
Reid Kleckner	c2e3ba48e3	[dllimport] A non-imported class with an imported key can't have a key Summary: The vtable takes its DLL storage class from the class, not the key function. When they disagree, the vtable won't be exported by the DLL that defines the key function. The easiest way to ensure that importers of the class emit their own vtable is to say that the class has no key function. Reviewers: hans, majnemer Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D11913 llvm-svn: 244488	2015-08-10 19:39:01 +00:00
Jonathan Roelofs	5dcf157443	Fix another case of 'CHECK[^:]*$'. NFCI llvm-svn: 244486	2015-08-10 19:22:55 +00:00
Tyler Nowicki	4d62f2e039	Modify diagnostic messages to clearly indicate the why interleaving wasn't done. Sometimes interleaving is not beneficial, as determined by the cost-model and sometimes it is disabled by a loop hint (by the user). This patch modifies the diagnostic messages to make it clear why interleaving wasn't done. llvm-svn: 244485	2015-08-10 19:14:16 +00:00
James Y Knight	3994be87de	[Sparc] Implement i64 load/store support for 32-bit sparc. The LDD/STD instructions can load/store a 64bit quantity from/to memory to/from a consecutive even/odd pair of (32-bit) registers. They are part of SparcV8, and also present in SparcV9. (Although deprecated there, as you can store 64bits in one register). As recommended on llvmdev in the thread "How to enable use of 64bit load/store for 32bit architecture" from Apr 2015, I've modeled the 64-bit load/store operations as working on a v2i32 type, rather than making i64 a legal type, but with few legal operations. The latter does not (currently) work, as there is much code in llvm which assumes that if i64 is legal, operations like "add" will actually work on it. The same assumption does not hold for v2i32 -- for vector types, it is workable to support only load/store, and expand everything else. This patch: - Adds a new register class, IntPair, for even/odd pairs of registers. - Modifies the list of reserved registers, the stack spilling code, and register copying code to support the IntPair register class. - Adds support in AsmParser. (note that in asm text, you write the name of the first register of the pair only. So the parser has to morph the single register into the equivalent paired register). - Adds the new instructions themselves (LDD/STD/LDDA/STDA). - Hooks up the instructions and registers as a vector type v2i32. Adds custom legalizer to transform i64 load/stores into v2i32 load/stores and bitcasts, so that the new instructions can actually be generated, and marks all operations other than load/store on v2i32 as needing to be expanded. - Copies the unfortunate SelectInlineAsm hack from ARMISelDAGToDAG. This hack undoes the transformation of i64 operands into two arbitrarily-allocated separate i32 registers in SelectionDAGBuilder. and instead passes them in a single IntPair. (Arbitrarily allocated registers are not useful, asm code expects to be receiving a pair, which can be passed to ldd/std.) Also adds a bunch of test cases covering all the bugs I've added along the way. Differential Revision: http://reviews.llvm.org/D8713 llvm-svn: 244484	2015-08-10 19:11:39 +00:00
Rafael Espindola	fe0e4e4c87	rename toELFShdrIter to getSection and move it closer to getSymbol. NFC. llvm-svn: 244483	2015-08-10 19:10:37 +00:00
Rafael Espindola	1904667846	toELFSymIter and getSymbol are now the same thing. Merge them. llvm-svn: 244482	2015-08-10 19:07:56 +00:00
Jonathan Roelofs	49e46ce8e2	Fix a bunch of trivial cases of 'CHECK[^:]*$' in the tests. NFCI I looked into adding a warning / error for this to FileCheck, but there doesn't seem to be a good way to avoid it triggering on the instances of it in RUN lines. llvm-svn: 244481	2015-08-10 19:01:27 +00:00
Rafael Espindola	fc2b6fa31c	Use continue to reduce indentation. NFC. llvm-svn: 244480	2015-08-10 18:57:42 +00:00
Chad Rosier	c56a9132d0	[AArch64] Convert a conditional check that will always be true to an assert. NFC. llvm-svn: 244479	2015-08-10 18:42:45 +00:00
Michael Kruse	874b5c2197	Correct non-existing past participle of split in filename llvm-svn: 244478	2015-08-10 18:37:34 +00:00
Rafael Espindola	904c81dc9e	Add a test for our handling of shndx. It was already working, but missing a test. llvm-svn: 244477	2015-08-10 18:28:24 +00:00
Yaron Keren	2ad3b336f1	Recommit r244470+ r244471 together, the bot failed between them. llvm-svn: 244476	2015-08-10 18:27:51 +00:00
Filipe Cabecinhas	2bbdbcb835	Fix typo. llvm-svn: 244475	2015-08-10 18:26:29 +00:00
Igor Laevsky	4709c03715	[IndVarSimplify] Make cost estimation in RewriteLoopExitValues smarter Differential Revision: http://reviews.llvm.org/D11687 llvm-svn: 244474	2015-08-10 18:23:58 +00:00
David Majnemer	3a4f95867f	[clang-cl] Add support for CL and _CL_ environment variables cl uses 'CL' and '_CL_' to prepend and append command line options to the given argument vector. There is an additional quirk whereby '#' is transformed into '='. Differential Revision: http://reviews.llvm.org/D11896 llvm-svn: 244473	2015-08-10 18:16:32 +00:00
Yaron Keren	1a1e1ca949	Revert r244470 and 244471 while looking into it. llvm-svn: 244472	2015-08-10 18:14:56 +00:00
Yaron Keren	b27259b224	Second part of r244470 (source file was unsaved in editor). llvm-svn: 244471	2015-08-10 18:06:01 +00:00
Yaron Keren	f850d9846e	Really implement David Blaikie suggestion in full of seperating variable initialization from its usage in the push_back making collapse of the two statements unlikely even without a comment. llvm-svn: 244470	2015-08-10 18:03:35 +00:00
Zachary Turner	38e64175db	Allow dosep.py to print dotest.py output on success. Previously all test output was reported by each individual instance of dotest.py. After a recent patch, dosep gets dotest outptu via a pipe, and selectively decides which output to print. This breaks certain scripts which rely on having full output of each dotest instance to do various parsing and/or log-scraping. While we make no promises about the format of dotest output, it's easy to restore this to the old behavior for now, although it is behind a flag. To re-enable full output, run dosep.py with the -s option. Differential Revision: http://reviews.llvm.org/D11816 Reviewed By: Chaoren Lin llvm-svn: 244469	2015-08-10 17:46:11 +00:00
Chih-Hung Hsieh	241a890bd7	Correct x86_64 fp128 calling convention These changes are for Android x86_64 targets to be compatible with current Android g++ and conform to AMD64 ABI. https://llvm.org/bugs/show_bug.cgi?id=23897 * Return type of long double (fp128) should be fp128, not x86_fp80. * Vararg of long double (fp128) could be in register and overflowed to memory. https://llvm.org/bugs/show_bug.cgi?id=24111 * Return value of long double (fp128) _Complex should be in memory like a structure of {fp128,fp128}. Differential Revision: http://reviews.llvm.org/D11437 llvm-svn: 244468	2015-08-10 17:33:31 +00:00
Mark Heffernan	397a98d86d	Add new llvm.loop.unroll.enable metadata for use with "#pragma unroll". This change adds the new unroll metadata "llvm.loop.unroll.enable" which directs the optimizer to unroll a loop fully if the trip count is known at compile time, and unroll partially if the trip count is not known at compile time. This differs from "llvm.loop.unroll.full" which explicitly does not unroll a loop if the trip count is not known at compile time With this change "#pragma unroll" generates "llvm.loop.unroll.enable" rather than "llvm.loop.unroll.full" metadata. This changes the semantics of "#pragma unroll" slightly to mean "unroll aggressively (fully or partially)" rather than "unroll fully or not at all". The motivating example for this change was some internal code with a loop marked with "#pragma unroll" which only sometimes had a compile-time trip count depending on template magic. When the trip count was a compile-time constant, everything works as expected and the loop is fully unrolled. However, when the trip count was not a compile-time constant the "#pragma unroll" explicitly disabled unrolling of the loop(!). Removing "#pragma unroll" caused the loop to be unrolled partially which was desirable from a performance perspective. llvm-svn: 244467	2015-08-10 17:29:39 +00:00
Mark Heffernan	8939154a22	Add new llvm.loop.unroll.enable metadata. This change adds the unroll metadata "llvm.loop.unroll.enable" which directs the optimizer to unroll a loop fully if the trip count is known at compile time, and unroll partially if the trip count is not known at compile time. This differs from "llvm.loop.unroll.full" which explicitly does not unroll a loop if the trip count is not known at compile time. The "llvm.loop.unroll.enable" is intended to be added for loops annotated with "#pragma unroll". llvm-svn: 244466	2015-08-10 17:28:08 +00:00
Chad Rosier	caed6db51e	Typo. Move comment closer to relevant code. NFC. llvm-svn: 244465	2015-08-10 17:17:19 +00:00

1 2 3 4 5 ...

207425 Commits All Branches Search

207425 Commits

All Branches