llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Smith	00db2f13f8	PR20473: Don't "deduplicate" string literals with the same value but different lengths! In passing, simplify string literal deduplication by relying on LLVM to deduplicate the underlying constant values. llvm-svn: 214222	2014-07-29 21:20:12 +00:00
Tobias Grosser	b3af390087	Revert "Emit column debug information for loads" This broke the following gdb tests: gdb.base__annota1.exp gdb.base__consecutive.exp gdb.python__py-symtab.exp gdb.reverse__consecutive-precsave.exp gdb.reverse__consecutive-reverse.exp I will look into this. This reverts commit 214162. llvm-svn: 214163	2014-07-29 06:53:14 +00:00
Tobias Grosser	01b923d55b	Emit column debug information for loads This allows us to give more precise diagnostics. Diego kindly tested the impact on debug info size: "The increase on average debug sizes is 0.1%. The total file size increase is ~0%." llvm-svn: 214162	2014-07-29 06:10:47 +00:00
Adam Nemet	fce1ad0b99	[AVX512] Add non-masking FP store intrinsics Part of <rdar://problem/17688758> llvm-svn: 214099	2014-07-28 17:14:45 +00:00
Adam Nemet	a3ebe6214b	[AVX512] Add FP add/sub/mul intrinsics Part of <rdar://problem/17688758> llvm-svn: 214098	2014-07-28 17:14:42 +00:00
Adam Nemet	062ba618f5	[AVX512] Add CHECK-LABELs to test/CodeGen/avx512f-builtins.c llvm-svn: 214095	2014-07-28 17:14:36 +00:00
Ulrich Weigand	8afad61a93	[PowerPC] Support ELFv1/ELFv2 ABI selection via -mabi= option While Clang now supports both ELFv1 and ELFv2 ABIs, their use is currently hard-coded via the target triple: powerpc64-linux is always ELFv1, while powerpc64le-linux is always ELFv2. These are of course the most common scenarios, but in principle it is possible to support the ELFv2 ABI on big-endian or the ELFv1 ABI on little-endian systems (and GCC does support that), and there are some special use cases for that (e.g. certain Linux kernel versions could only be built using ELFv1 on LE). This patch implements the Clang side of supporting this, based on the LLVM commit 214072. The command line options -mabi=elfv1 or -mabi=elfv2 select the desired ABI if present. (If not, Clang uses the same default rules as now.) Specifically, the patch implements the following changes based on the presence of the -mabi= option: In the driver: - Pass the appropiate -target-abi flag to the back-end - Select the correct dynamic loader version (/lib64/ld64.so.[12]) In the preprocessor: - Define _CALL_ELF to the appropriate value (1 or 2) In the compiler back-end: - Select the correct ABI in TargetInfo.cpp - Select the desired ABI for LLVM via feature (elfv1/elfv2) llvm-svn: 214074	2014-07-28 13:17:52 +00:00
Ehsan Akhgari	755597c83d	Fix test/CodeGen/ms-inline-asm.c from r213916. llvm-svn: 213919	2014-07-25 02:39:33 +00:00
Ehsan Akhgari	fa2d9aa798	Fix test/CodeGen/ms-inline-asm.cpp from r213916. llvm-svn: 213918	2014-07-25 02:35:50 +00:00
Ehsan Akhgari	2f93b448a8	clang-cl: Merge adjacent single-line __asm blocks Summary: This patch extends the __asm parser to make it keep parsing input tokens as inline assembly if a single-line __asm line is followed by another line starting with __asm too. It also makes sure that we correctly keep matching braces in such situations by separating the notions of how many braces we are matching and whether we are in single-line asm block mode. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4598 llvm-svn: 213916	2014-07-25 02:27:14 +00:00
Mark Heffernan	c888e41c0c	Add support for #pragma nounroll. llvm-svn: 213885	2014-07-24 18:09:38 +00:00
Mark Heffernan	44ca416a64	Rename metadata in test which was missed when renaming loop unroll metadata in r213771. llvm-svn: 213775	2014-07-23 17:59:07 +00:00
Mark Heffernan	450c23843e	In unroll pragma syntax and loop hint metadata, change "enable" forms to a new form using the string "full". llvm-svn: 213771	2014-07-23 17:31:31 +00:00
Tim Northover	18b7512faa	AArch64: use aarch64_be instead of arm64_be in all tests. arm64_be doesn't really exist; it was useful for testing while AArch64 and ARM64 were separate, but now the only real way to refer to the system is aarch64_be. llvm-svn: 213747	2014-07-23 12:57:31 +00:00
Robert Lytton	26149def6e	remove hardcoded metadata numbers from tests llvm-svn: 213659	2014-07-22 14:47:42 +00:00
Elena Demikhovsky	fcc6df310d	AVX-512: Added intrinsics to clang. The set is small, that what I have right now. Everybody is welcome to add more. llvm-svn: 213641	2014-07-22 11:31:39 +00:00
Mark Heffernan	34735af3cb	Rename metadata llvm.loop.vectorize.unroll to llvm.loop.vectorize.interleave. llvm-svn: 213587	2014-07-21 23:10:56 +00:00
Mark Heffernan	bd26f5ea4d	Add support for '#pragma unroll'. llvm-svn: 213574	2014-07-21 18:08:34 +00:00
Ulrich Weigand	601957fa23	[PowerPC] Optimize passing certain aggregates by value In addition to enabling ELFv2 homogeneous aggregate handling, LLVM support to pass array types directly also enables a performance enhancement. We can now pass (non-homogeneous) aggregates that fit fully in registers as direct integer arrays, using an element type to encode the alignment requirement (that would otherwise go to the "byval align" field). This is preferable since "byval" forces the back-end to write the aggregate out to the stack, even if it could be passed fully in registers. This is particularly annoying on ELFv2, if there is no parameter save area available, since we then need to allocate space on the callee's stack just to hold those aggregates. Note that to implement this optimization, this patch does not attempt to fully anticipate register allocation rules as (defined in the ABI and) implemented in the back-end. Instead, the patch is simply passing any aggregate passed by value using the array mechanism if its size is up to 64 bytes. This means that some of those will end up being passed in stack slots anyway, but the generated code shouldn't be any worse either. (Large aggregates remain passed using "byval" to enable optimized copying via memcpy etc.) llvm-svn: 213495	2014-07-21 00:56:36 +00:00
Ulrich Weigand	b712237da6	[PowerPC] Support the ELFv2 ABI This patch implements clang support for the PowerPC ELFv2 ABI. Together with a series of companion patches in LLVM, this makes clang/LLVM fully usable on powerpc64le-linux. Most of the ELFv2 ABI changes are fully implemented on the LLVM side. On the clang side, we only need to implement some changes in how aggregate types are passed by value. Specifically, we need to: - pass (and return) "homogeneous" floating-point or vector aggregates in FPRs and VRs (this is similar to the ARM homogeneous aggregate ABI) - return aggregates of up to 16 bytes in one or two GPRs The second piece is trivial to implement in any case. To implement the first piece, this patch makes use of infrastructure recently enabled in the LLVM PowerPC back-end to support passing array types directly, where the array element type encodes properties needed to handle homogeneous aggregates correctly. Specifically, the array element type encodes: - whether the parameter should be passed in FPRs, VRs, or just GPRs/stack slots (for float / vector / integer element types, respectively) - what the alignment requirements of the parameter are when passed in GPRs/stack slots (8 for float / 16 for vector / the element type size for integer element types) -- this corresponds to the "byval align" field With this support in place, the clang part simply needs to detect whether an aggregate type implements a float / vector homogeneous aggregate as defined by the ELFv2 ABI, and if so, pass/return it as array type using the appropriate float / vector element type. llvm-svn: 213494	2014-07-21 00:48:09 +00:00
Hal Finkel	48d53e2c4c	Use the dereferenceable attribute on C99 array parameters with static In C99, an array parameter declarator might have the form: direct-declarator '[' 'static' type-qual-list[opt] assign-expr ']' where the static keyword indicates that the caller will always provide a pointer to the beginning of an array with at least the number of elements specified by the assignment expression. For constant sizes, we can use the new dereferenceable attribute to pass this information to the optimizer. For VLAs, we don't know the size, but (for addrspace(0)) do know that the pointer must be nonnull (and so we can use the nonnull attribute). llvm-svn: 213444	2014-07-19 01:41:07 +00:00
Oliver Stannard	e022851f3b	[ARM] Fix AAPCS regression caused by r211898 r211898 introduced a regression where a large struct, which would normally be passed ByVal, was causing padding to be inserted to prevent the backend from using some GPRs, in order to follow the AAPCS. However, the type of the argument was not being set correctly, so the backend cannot align 8-byte aligned struct types on the stack. The fix is to not insert the padding arguments when the argument is being passed ByVal. llvm-svn: 213359	2014-07-18 09:09:31 +00:00
Kevin Qin	110db6f2ad	[AArch64] Implement Clang CLI interface proposal about "-march". 1. Revert "Add default feature for CPUs on AArch64 target in Clang" at r210625. Then, all enabled feature will by passed explicitly by -target-feature in -cc1 option. 2. Get "-mfpu" deprecated. 3. Implement support of "-march". Usage is: -march=armv8-a+[no]feature For instance, "-march=armv8-a+neon+crc+nocrypto". Here "armv8-a" is necessary, and CPU names are not acceptable. Candidate features are fp, neon, crc and crypto. Where conflicting feature modifiers are specified, the right-most feature is used. 4. Implement support of "-mtune". Usage is: -march=CPU_NAME For instance, "-march=cortex-a57". This option will ONLY get micro-architectural feature enabled specifying to target CPU, like "+zcm" and "+zcz" for cyclone. Any architectural features WON'T be modified. 5. Change usage of "-mcpu" to "-mcpu=CPU_NAME+[no]feature", which is an alias to "-march={feature of CPU_NAME}+[no]feature" and "-mtune=CPU_NAME" together. Where this option is used in conjunction with -march or -mtune, those options take precedence over the appropriate part of this option. llvm-svn: 213353	2014-07-18 07:03:22 +00:00
Alexey Samsonov	c993933e78	Check-labelize ubsan tests llvm-svn: 213334	2014-07-17 23:53:44 +00:00
NAKAMURA Takumi	0c5f4edba4	clang/test/CodeGen/ms-inline-asm.c: Fix for -Asserts. llvm-svn: 213329	2014-07-17 22:51:49 +00:00
Nico Weber	9a08847e6d	Add a test for PR20343 after llvm r213303. llvm-svn: 213305	2014-07-17 20:25:36 +00:00
Alexey Samsonov	24cad99307	[UBSan] Add !nosanitize metadata to the code generated by UBSan. This is used to mark the instructions emitted by Clang to implement variety of UBSan checks. Generally, we don't want to instrument these instructions with another sanitizers (like ASan). Reviewed in http://reviews.llvm.org/D4544 llvm-svn: 213291	2014-07-17 18:46:27 +00:00
Yi Kong	28d7b02687	ARM: Add ACLE memory barrier intrinsic mapping llvm-svn: 213261	2014-07-17 12:45:17 +00:00
Ehsan Akhgari	d86ca7a9c7	Upstream an MS inline assembly test from Mozilla's inline assembly code Summary: I'm planning on upstreaming some test cases for the inline assembly usage in the Mozilla code base. A lot of these test cases test the recent fixes to this code. Reviewers: rnk Subscribers: cfe-commits Differential Revision: http://reviews.llvm.org/D4508 llvm-svn: 213255	2014-07-17 11:38:22 +00:00
Yi Kong	19a29ac0d0	Port memory barriers intrinsics to AArch64 Memory barrier __builtin_arm_[dmb, dsb, isb] intrinsics are required to implement their corresponding ACLE and MSVC intrinsics. This patch ports ARM dmb, dsb, isb intrinsic to AArch64. Requires LLVM r213247. Differential Revision: http://reviews.llvm.org/D4521 llvm-svn: 213250	2014-07-17 10:52:06 +00:00
Tim Northover	6dbcbac98b	IR: update Clang to use polymorphic __fp16 conversion intrinsics. There should be no change in semantics at this stage. llvm-svn: 213249	2014-07-17 10:51:31 +00:00
Hal Finkel	3e49fda0d4	Add basic (noop) CodeGen support for __assume Clang supports __assume, at least at the semantic level, when MS extensions are enabled. Unfortunately, trying to actually compile code using __assume would result in this error: error: cannot compile this builtin function yet __assume is an optimizer hint, and can be ignored at the IR level. Until LLVM supports assumptions at the IR level, a noop lowering is valid, and that is what is done here. llvm-svn: 213206	2014-07-16 22:44:54 +00:00
David Majnemer	ade4bee761	CodeGen: Let arrays be inputs to inline asm An array showing up in an inline assembly input is accepted in ICC and GCC 4.8 This fixes PR20201. Differential Revision: http://reviews.llvm.org/D4382 llvm-svn: 212954	2014-07-14 16:27:53 +00:00
Yi Kong	472e521cec	ARM: Add NOP intrinsic mapping in arm_acle.h llvm-svn: 212950	2014-07-14 15:32:29 +00:00
Yi Kong	4d5e23f53a	ARM: Implement __builtin_arm_nop intrinsic This patch implements __builtin_arm_nop intrinsic for AArch32 and AArch64, which generates hint 0x0, the alias of NOP instruction. This intrinsic is necessary to implement ACLE __nop intrinsic. Differential Revision: http://reviews.llvm.org/D4495 llvm-svn: 212947	2014-07-14 15:20:09 +00:00
Yi Kong	19222dcb4c	Add test cases for AArch64 hints codegen llvm-svn: 212909	2014-07-13 16:17:30 +00:00
Saleem Abdulrasool	3b165e7dbb	tests: use a more precise target for tests llvm-svn: 212892	2014-07-12 23:40:53 +00:00
Saleem Abdulrasool	572250d60a	CodeGen: support hint intrinsics from ACLE on AArch64 This adds support for the ACLE hint intrinsics on AArch64 similar to ARM. This is required to properly support ACLE on AArch64. llvm-svn: 212890	2014-07-12 23:27:22 +00:00
Yi Kong	4e00ce7d0c	Improve comments of ARM ACLE header file and tests Include section number in ARM ACLE specification for easier navigation. llvm-svn: 212887	2014-07-12 22:48:13 +00:00
Hal Finkel	d8442b1b21	Add nonnull in CodeGen for __attribute__((returns_nonnull)) As a follow-up to r212835, also add the LLVM nonnull function attribute when __attribute__((returns_nonnull)) is provided. llvm-svn: 212874	2014-07-12 04:51:04 +00:00
Alexey Samsonov	15c9669615	[ASan] Collect unmangled names of global variables in Clang to print them in error reports. Currently ASan instrumentation pass creates a string with global name for each instrumented global (to include global names in the error report). Global name is already mangled at this point, and we may not be able to demangle it at runtime (e.g. there is no __cxa_demangle on Android). Instead, create a string with fully qualified global name in Clang, and pass it to ASan instrumentation pass in llvm.asan.globals metadata. If there is no metadata for some global, ASan will use the original algorithm. This fixes https://code.google.com/p/address-sanitizer/issues/detail?id=264. llvm-svn: 212872	2014-07-12 00:42:52 +00:00
Reid Kleckner	f392ec6ecc	Form a CallExpr from __noop without parens MSVC accepts __noop without any trailing parens and treats it like a literal zero. We don't treat __noop as an integer literal, but now at least we can parse a naked __noop expression. Reviewers: rsmith Differential Revision: http://reviews.llvm.org/D4476 llvm-svn: 212860	2014-07-11 23:54:29 +00:00
Reid Kleckner	ed5d4adb36	MS extension: Make __noop be the integer zero, not void We still don't accept '__noop;', and we don't consider __noop to be the integer literal zero. More work is needed. llvm-svn: 212839	2014-07-11 20:22:55 +00:00
Hal Finkel	82504f03ce	Add nonnull in CodeGen for __attribute__((nonnull)) We now have an LLVM-level nonnull attribute that can be applied to function parameters, and we emit it for reference types (as of r209723), but did not emit it when an __attribute__((nonnull)) was provided. Now we will. llvm-svn: 212835	2014-07-11 17:35:21 +00:00
Alexey Samsonov	848560125d	[UBSan] Introduce type-based blacklisting. Teach UBSan vptr checker to ignore technically invalud down-casts on blacklisted types. Based on http://reviews.llvm.org/D4407 by Byoungyoung Lee! llvm-svn: 212770	2014-07-10 22:34:19 +00:00
Ulrich Weigand	b4153254b7	Fix (and reenable) ppc64-align-struct.c test for non-assert builds. llvm-svn: 212757	2014-07-10 19:19:03 +00:00
David Blaikie	cceed090d2	Quick (attempted) fix for non-asserts builds for a test introduced in r212743. llvm-svn: 212752	2014-07-10 18:40:54 +00:00
Ulrich Weigand	581badce4b	[PowerPC] ABI support for aligned by-value aggregates This patch adds support for respecting the ABI and type alignment of aggregates passed by value. Currently, all aggregates are aligned at 8 bytes in the parameter save area. This is incorrect for two reasons: - Aggregates that need alignment of 16 bytes or more should be aligned at 16 bytes in the parameter save area. This is implemented by using an appropriate "byval align" attribute in the IR. - Aggregates that need alignment beyond 16 bytes need to be dynamically realigned by the caller. This is implemented by setting the Realign flag of the ABIArgInfo::getIndirect call. In addition, when expanding a va_arg call accessing a type that is aligned at 16 bytes in the argument save area (either one of the aggregate types as above, or a vector type which is already aligned at 16 bytes), code needs to align the va_list pointer accordingly. Reviewed by Hal Finkel. llvm-svn: 212743	2014-07-10 17:20:07 +00:00
Ulrich Weigand	f4eba98853	[PowerPC] ABI support for non-Altivec vector types This patch adds support for passing arguments of non-Altivec vector type (i.e. defined via attribute ((vector_size (...)))) on powerpc64-linux. While such types are not mentioned in the formal ABI document, this patch implements a calling convention compatible with GCC: - Vectors of size < 16 bytes are passed in a GPR - Vectors of size > 16 bytes are passed via reference Note that vector types with a number of elements that is not a power of 2 are not supported by GCC, so there is no pre-existing ABI to follow. We choose to pass those (of size < 16) as if widened to the next power of two, so they might end up in a vector register or in a GPR. (Sizes > 16 are always passed via reference as well.) Reviewed by Hal Finkel. llvm-svn: 212734	2014-07-10 16:39:01 +00:00
Daniel Sanders	cfbb71dfb6	[mips] clz is defined to give 32 for zero. Similarly, dclz gives 64. Summary: While debugging another issue, I noticed that Mips currently specifies that the count leading zero builtins are undefined when the input is zero. The architecture specifications say that the clz and dclz instructions write 32 or 64 respectively when given zero. This doesn't fix any bugs that I'm aware of but it may improve optimisation in some cases. Differential Revision: http://reviews.llvm.org/D4431 llvm-svn: 212618	2014-07-09 13:43:19 +00:00

1 2 3 4 5 ...

2590 Commits