llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	e8c3721165	ARM: use LLVM's atomicrmw instructions when ldrex/strex are available. Having some kind of weird kernel-assisted ABI for these when the native instructions are available appears to be (and should be) the exception; OSs have been gradually opting in for years and the code was getting silly. So let LLVM decide whether it's possible/profitable to inline them by default. Patch by Phoebe Buckheister. llvm-svn: 212598	2014-07-09 09:24:43 +00:00
Alexey Samsonov	e7a8ccfaad	[Sanitizer] Reduce the usage of sanitizer blacklist in CodeGenModule Get rid of cached CodeGenModule::SanOpts, which was used to turn off sanitizer codegen options if current LLVM Module is blacklisted, and use plain LangOpts.Sanitize instead. 1) Some codegen decisions (turning TBAA or writable strings on/off) shouldn't depend on the contents of blacklist. 2) llvm.asan.globals should always be created, even if the module is blacklisted - soon Clang's CodeGen where we read sanitizer blacklist files, so we should properly report which globals are blacklisted to the backend. llvm-svn: 212499	2014-07-07 23:34:34 +00:00
Nico Weber	9b982078e9	Add an AST node for __leave statements, hook it up. Codegen is still missing (and I won't work on that), but __leave is now as implemented as __try and friends. llvm-svn: 212425	2014-07-07 00:12:30 +00:00
Ehsan Akhgari	0f89fac7a5	Add support for nested blocks in Microsoft inline assembly This fixes http://llvm.org/PR20204. llvm-svn: 212389	2014-07-06 05:26:54 +00:00
Saleem Abdulrasool	e700cab4e9	CodeGen: add support for a few MSVC ARM intrinsics This adds support for simple MSVC compatibility mode intrinsics. These intrinsics are simple in that they are either directly passed through to the annotated MSBuiltin intrinsic or they mirror existing GCC builtins. llvm-svn: 212378	2014-07-05 20:10:05 +00:00
Ehsan Akhgari	3e0dd89adf	Add a test case for the tilde operator in Microsoft inline assembly llvm-svn: 212373	2014-07-05 15:04:06 +00:00
Alexey Bataev	b75daeecbe	Fixed test CodeGen/captured-statements.c for powerpc64-linux. llvm-svn: 212311	2014-07-04 04:06:11 +00:00
Gerolf Hoflehner	2344cfc335	Restore global static array in test case llvm-svn: 212285	2014-07-03 19:30:33 +00:00
Yi Kong	4efadfb0b0	[ARM] Implement ISB memory barrier intrinsic Adds support for __builtin_arm_isb. Also corrects DSB and ISB instructions modelling by adding has-side-effects property. llvm-svn: 212277	2014-07-03 16:01:25 +00:00
Renato Golin	47843efcf6	Add the __qdbl intrinsic to the arm_acle.h header Patch by: Moritz Roth llvm-svn: 212264	2014-07-03 10:14:52 +00:00
Robert Lytton	57765d5347	Move the calling of emitTargetMD() later. Summary: Because a global created by GetOrCreateLLVMGlobal() is not finalised until later viz: extern char a[]; char f(){ return a[5];} char a[10]; Change MangledDeclNames to use a MapVector rather than a DenseMap so that the Metadata is output in order of original declaration, so to make deterministic and improve human readablity. Differential Revision: http://reviews.llvm.org/D4176 llvm-svn: 212263	2014-07-03 09:30:33 +00:00
Christian Pirker	c3d3217525	ARMEB: Fix function result return for composite types Reviewed at http://reviews.llvm.org/D4364 llvm-svn: 212261	2014-07-03 09:28:12 +00:00
Saleem Abdulrasool	ece7217f70	ARM: rename ARM builtins to use __builtin_arm prefix This corrects SVN r212196's naming change to use the proper prefix of `__builtin_arm_` instead of `__builtin_`. Thanks to Yi Kong for pointing out the incorrect naming! llvm-svn: 212253	2014-07-03 02:43:20 +00:00
Saleem Abdulrasool	4bddd9d400	CodeGen: make target builtins support languages This extends the target builtin support to allow language specific annotations (i.e. LANGBUILTIN). This is to allow MSVC compatibility whilst retaining the ability to have EABI targets use a __builtin_ prefix. This is merely to allow uniformity in the EABI case where the unprefixed name is provided as an alias in the header. llvm-svn: 212196	2014-07-02 17:41:27 +00:00
Alexey Samsonov	4f319cca42	[ASan] Print exact source location of global variables in error reports. See https://code.google.com/p/address-sanitizer/issues/detail?id=299 for the original feature request. Introduce llvm.asan.globals metadata, which Clang (or any other frontend) may use to report extra information about global variables to ASan instrumentation pass in the backend. This metadata replaces llvm.asan.dynamically_initialized_globals that was used to detect init-order bugs. llvm.asan.globals contains the following data for each global: 1) source location (file/line/column info); 2) whether it is dynamically initialized; 3) whether it is blacklisted (shouldn't be instrumented). Source location data is then emitted in the binary and can be picked up by ASan runtime in case it needs to print error report involving some global. For example: 0x... is located 4 bytes to the right of global variable 'C::array' defined in '/path/to/file:17:8' (0x...) of size 40 These source locations are printed even if the binary doesn't have any debug info. This is an ABI-breaking change. ASan initialization is renamed to __asan_init_v4(). Pre-built libraries compiled with older Clang will not work with the fresh runtime. llvm-svn: 212188	2014-07-02 16:54:41 +00:00
Tim Northover	3acd6bd0b6	ARM: add support for v8 ldaex/stlex builtins. ARMv8 adds (to both AArch32 and AArch64) acquiring and releasing variants of the exclusive operations, in line with the C++11 memory model. This adds support for two new intrinsics to expose them to C & C++ developers directly: __builtin_arm_ldaex and __builtin_arm_stlex, in direct analogy with the versions with no implicit barrier. rdar://problem/15885451 llvm-svn: 212175	2014-07-02 12:56:02 +00:00
Tim Northover	1471cb17ae	X86: inline all atomic operations up to 128-bits. The backend can cope with all of these now, so Clang should give it the chance. On CPUs without cmpxchg16b (e.g. the original athlon64) LLVM can reform the libcalls. rdar://problem/13496295 llvm-svn: 212173	2014-07-02 10:25:45 +00:00
Alexey Bataev	f94baeb363	Added test for capturing VLA types if the captured variable is a function parameter. llvm-svn: 212170	2014-07-02 07:05:22 +00:00
Gerolf Hoflehner	012dff0b23	Enable test/CodeGen/indirect-goto.c in 64b for local arrays In 32b mode the reference count for block addresses is not zero. This prevents inlining and constant folding and causes the test to fail. Changing the triple allows runnning the test in 64b mode. The array in foo2 is now local instead of static until at lower optimization levels the interprocedural constant propagator is invoked before the global optimizer. llvm-svn: 212092	2014-07-01 05:10:06 +00:00
Bob Wilson	84941b92e6	Temporarily disable the indirect-goto.c test. llvm r212077 causes this test to fail. We need to reorder some passes and possibly make other changes to reenable the optimization being tested here. llvm-svn: 212091	2014-07-01 04:56:06 +00:00
Andrea Di Biagio	eb606a3c27	[x86] Add Clang support for intrinsic __rdpmc. This patch adds intrinsic __rdpmc to header file 'ia32intrin.h'. Intrinsic __rdmpc can be used to read performance monitoring counters. It is implemented as a direct call to __builtin_ia32_rdpmc. It takes as input a value representing the index of the performance counter to read. The value of the performance counter is then returned as a unsigned 64-bit quantity. llvm-svn: 212053	2014-06-30 18:23:58 +00:00
Alexey Bataev	06812bc98b	Second part of fix in CodeGen/captured-statements-nested.c llvm-svn: 212028	2014-06-30 09:14:10 +00:00
Alexey Bataev	e686c1d7ef	Test fix llvm-svn: 212026	2014-06-30 09:05:08 +00:00
Alexey Bataev	41ff27e9a1	Fixed incompatibility in CodeGen/captured-statements-nested.c with MSVC llvm-svn: 212025	2014-06-30 08:37:48 +00:00
Alexey Bataev	be5af7b9c7	Fixed CodeGen/captured-statements-nested.c test llvm-svn: 212024	2014-06-30 08:17:11 +00:00
Alexey Bataev	6de13e86e9	Disable CodeGen/captured-statements-nested.c llvm-svn: 212018	2014-06-30 05:07:42 +00:00
Alexey Bataev	83222d6109	Fixed CodeGen/captured-statements-nested.c test llvm-svn: 212016	2014-06-30 05:02:50 +00:00
Alexey Bataev	8dfca43296	Disable CodeGen/captured-statements-nested.c llvm-svn: 212014	2014-06-30 03:30:41 +00:00
Alexey Bataev	18da16cab5	Temp XFAIL CodeGen/captured-statements-nested.c to fix the test llvm-svn: 212013	2014-06-30 03:14:43 +00:00
Alexey Bataev	aca7fcf276	Using of variable length arrays in captured statements and OpenMP constructs. Differential Revision: http://reviews.llvm.org/D4067 llvm-svn: 212010	2014-06-30 02:55:54 +00:00
Alp Toker	f082e73696	Remove some incorrect test suppressions These don't actually require any registered backend to run. This commit tests the water with a handful of fixes for what is a more widespread problem. llvm-svn: 212008	2014-06-30 01:34:09 +00:00
Saleem Abdulrasool	24bd7da2d2	Basic: fix handling for Windows Itanium environment This corrects the handling for i686-windows-itanium. This environment is nearly identical to Windows MSVC, except it uses the itanium ABI for C++. llvm-svn: 211991	2014-06-28 23:34:11 +00:00
Alp Toker	f76e6d8e6b	Get arm_acle tests from r211962 working llvm-svn: 211979	2014-06-28 06:51:27 +00:00
Yi Kong	a44c4d7173	Introduce arm_acle.h supporting existing LLVM builtin intrinsics Summary: This patch introduces ACLE header file, implementing extensions that can be directly mapped to existing Clang intrinsics. It implements for both AArch32 and AArch64. Reviewers: t.p.northover, compnerd, rengolin Reviewed By: compnerd, rengolin Subscribers: rnk, echristo, compnerd, aemerson, mroth, cfe-commits Differential Revision: http://reviews.llvm.org/D4296 llvm-svn: 211962	2014-06-27 21:25:42 +00:00
Oliver Stannard	3f32b9be7f	[ARM] Fix AAPCS non-compliance caused by very large structs This is a fix to the code in clang which inserts padding arguments to ensure that the ARM backend can emit AAPCS-VFP compliant code. This code needs to track the number of registers which have been allocated in order to do this. When passing a very large struct (>64 bytes) by value, clang emits IR which takes a pointer to the struct, but the backend converts this back to passing the struct in registers and on the stack. The bug was that this was being considered by clang to only use one register, meaning that there were situations in which padding arguments were incorrectly emitted by clang. llvm-svn: 211898	2014-06-27 13:59:27 +00:00
James Molloy	b452f78ad2	[ARM-BE] Generate correct NEON intrinsics for big endian systems. The NEON intrinsics in arm_neon.h are designed to work on vectors "as-if" loaded by (V)LDR. We load vectors "as-if" (V)LD1, so the intrinsics are currently incorrect. This patch adds big-endian versions of the intrinsics that does the "obvious but dumb" thing of reversing all vector inputs and all vector outputs. This will produce extra REVs, but we trust the optimizer to remove them. llvm-svn: 211893	2014-06-27 11:53:35 +00:00
Eli Bendersky	b198b4e864	Rename loop unrolling and loop vectorizer metadata to have a common prefix. [Clang part] These patches rename the loop unrolling and loop vectorizer metadata such that they have a common 'llvm.loop.' prefix. Metadata name changes: llvm.vectorizer.* => llvm.loop.vectorizer.* llvm.loopunroll.* => llvm.loop.unroll.* This was a suggestion from an earlier review (http://reviews.llvm.org/D4090) which added the loop unrolling metadata. Patch by Mark Heffernan. llvm-svn: 211712	2014-06-25 15:42:16 +00:00
James Molloy	b8fd41926c	CHECK-LABEL'ify this test. llvm-svn: 211687	2014-06-25 11:50:56 +00:00
James Molloy	7d64a0eec4	[AArch32] Fix a stupid error in an architectural guard The < 8 instead of <= 8 meant that a bunch of vreinterprets were not available on v8 AArch32. Simplify the guard to just !defined(aarch64) while we're at it, and enable some v8 AArch32 testing. llvm-svn: 211686	2014-06-25 11:46:24 +00:00
David Majnemer	0c43d8077e	AST: Initialization with dllimport functions in C The C++ language requires that the address of a function be the same across all translation units. To make __declspec(dllimport) useful, this means that a dllimported function must also obey this rule. MSVC implements this by dynamically querying the import address table located in the linked executable. This means that the address of such a function in C++ is not constant (which violates other rules). However, the C language has no notion of ODR nor does it permit dynamic initialization whatsoever. This requires implementations to _not_ dynamically query the import address table and instead utilize a wrapper function that will be synthesized by the linker which will eventually query the import address table. The effect this has is, to say the least, perplexing. Consider the following C program: __declspec(dllimport) void f(void); typedef void (*fp)(void); static const fp var = &f; const fp fun() { return &f; } int main() { return fun() == var; } MSVC will statically initialize "var" with the address of the wrapper function and "fun" returns the address of the actual imported function. This means that "main" will return false! Note that LLVM's optimizers are strong enough to figure out that "main" should return true. However, this result is dependent on having optimizations enabled! N.B. This change also permits the usage of dllimport declarators inside of template arguments; they are sufficiently constant for such a purpose. Add tests to make sure we don't regress here. llvm-svn: 211677	2014-06-25 08:15:07 +00:00
Rafael Espindola	0a500af186	Correctly Load Mixed FP-GP Variadic Arguments for x86-64. According to the x86-64 ABI, structures with both floating point and integer members are split between floating-point and general purpose registers, and consecutive 32-bit floats can be packed into a single floating point register. In the case of variadic functions these are stored to memory and the position recorded in the va_list. This was already correctly implemented in llvm.va_start. The problem is that the code in clang for implementing va_arg was reading floating point registers from the wrong location. Patch by Thomas Jablin. Fixes PR20018. llvm-svn: 211626	2014-06-24 20:01:50 +00:00
Ulrich Weigand	bebc55b13b	[PowerPC] Fix small argument stack slot offset for LE When small arguments (structures < 8 bytes or "float") are passed in a stack slot in the ppc64 SVR4 ABI, they must reside in the least significant part of that slot. On BE, this means that an offset needs to be added to the stack address of the parameter, but on LE, the least significant part of the slot has the same address as the slot itself. For the most part, this is handled in the LLVM back-end, where I just fixed the LE case in commit r211368. However, there is one piece of the clang front-end that is also aware of these stack-slot offsets: PPC64_SVR4_ABIInfo::EmitVAArg. This patch updates that routine to take endianness into account. llvm-svn: 211370	2014-06-20 16:37:40 +00:00
Oliver Stannard	e3a4fb6512	Add module flags metadata to record the settings for enum and wchar width Add module flags metadata to record the settings for enum and wchar width, to allow correct ARM build attribute generation llvm-svn: 211354	2014-06-20 12:43:07 +00:00
Oliver Stannard	c8e3b5f849	Improve robustness of tests for module flags metadata Fix clang tests to not break if the ID numbers of module flags metadata nodes change. llvm-svn: 211276	2014-06-19 16:10:21 +00:00
Saleem Abdulrasool	11415c6120	tests: relax ms-intrinsics test Relax the tests to allow for differences between release and debug builds. This should fix the buildbots. Thanks to Benjamin Kramer and Eric Christo for their invaluable tip that this was release build specific issue. llvm-svn: 211227	2014-06-18 21:48:44 +00:00
Saleem Abdulrasool	114efe0dc8	CodeGen: improve ms instrincics support Add support for _InterlockedCompareExchangePointer, _InterlockExchangePointer, _InterlockExchange. These are available as a compiler intrinsic on ARM and x86. These are used directly by the Windows SDK headers without use of the intrin header. llvm-svn: 211216	2014-06-18 20:51:10 +00:00
Tim Northover	831d728f9a	AArch64: re-enable tests that were looking for a non-existent backend. In the final phase of the merge, I managed to disable a bunch of Clang tests accidentally. Fortunately none of them seem to have broken in the interim. llvm-svn: 211149	2014-06-18 08:37:28 +00:00
James Molloy	dee4ab08ba	Rewrite ARM NEON intrinsic emission completely. There comes a time in the life of any amateur code generator when dumb string concatenation just won't cut it any more. For NeonEmitter.cpp, that time has come. There were a bunch of magic type codes which meant different things depending on the context. There were a bunch of special cases that really had no reason to be there but the whole thing was so creaky that removing them would cause something weird to fall over. There was a 1000 line switch statement for code generation involving string concatenation, which actually did lexical scoping to an extent (!!) with a bunch of semi-repeated cases. I tried to refactor this three times in three different ways without success. The only way forward was to rewrite the entire thing. Luckily the testing coverage on this stuff is absolutely massive, both with regression tests and the "emperor" random test case generator. The main change is that previously, in arm_neon.td a bunch of "Operation"s were defined with special names. NeonEmitter.cpp knew about these Operations and would emit code based on a huge switch. Actually this doesn't make much sense - the type information was held as strings, so type checking was impossible. Also TableGen's DAG type actually suits this sort of code generation very well (surprising that...) So now every operation is defined in terms of TableGen DAGs. There are a bunch of operators to use, including "op" (a generic unary or binary operator), "call" (to call other intrinsics) and "shuffle" (take a guess...). One of the main advantages of this apart from making it more obvious what is going on, is that we have proper type inference. This has two obvious advantages: 1) TableGen can error on bad intrinsic definitions easier, instead of just generating wrong code. 2) Calls to other intrinsics are typechecked too. So we no longer need to work out whether the thing we call needs to be the Q-lane version or the D-lane version - TableGen knows that itself! Here's an example: before: case OpAbdl: { std::string abd = MangleName("vabd", typestr, ClassS) + "(__a, __b)"; if (typestr[0] != 'U') { // vabd results are always unsigned and must be zero-extended. std::string utype = "U" + typestr.str(); s += "(" + TypeString(proto[0], typestr) + ")"; abd = "(" + TypeString('d', utype) + ")" + abd; s += Extend(utype, abd) + ";"; } else { s += Extend(typestr, abd) + ";"; } break; } after: def OP_ABDL : Op<(cast "R", (call "vmovl", (cast $p0, "U", (call "vabd", $p0, $p1))))>; As an example of what happens if you do something wrong now, here's what happens if you make $p0 unsigned before the call to "vabd" - that is, $p0 -> (cast "U", $p0): arm_neon.td:574:1: error: No compatible intrinsic found - looking up intrinsic 'vabd(uint8x8_t, int8x8_t)' Available overloads: - float64x2_t vabdq_v(float64x2_t, float64x2_t) - float64x1_t vabd_v(float64x1_t, float64x1_t) - float64_t vabdd_f64(float64_t, float64_t) - float32_t vabds_f32(float32_t, float32_t) ... snip ... This makes it seriously easy to work out what you've done wrong in fairly nasty intrinsics. As part of this I've massively beefed up the documentation in arm_neon.td too. Things still to do / on the radar: - Testcase generation. This was implemented in the previous version and not in the new one, because - Autogenerated tests are not being run. The testcase in test/ differs from the autogenerated version. - There were a whole slew of special cases in the testcase generation that just felt (and looked) like hacks. If someone really feels strongly about this, I can try and reimplement it too. - Big endian. That's coming soon and should be a very small diff on top of this one. llvm-svn: 211101	2014-06-17 13:11:27 +00:00
Jim Grosbach	8ddd66928c	AArch64: Fix silly think-o in tests. rdar://9283021 llvm-svn: 211064	2014-06-16 22:18:26 +00:00
Jim Grosbach	79140826bc	AArch64: Support for __builtin_arm_rbit() and __builtin_arm_rbit64(). __builtin_arm_rbit() and __builtin_arm_rbit64(). rdar://9283021 llvm-svn: 211060	2014-06-16 21:56:02 +00:00
Jim Grosbach	171ec34544	ARM: Support for __builtin_arm_rbit() intrinsic. Reverse the bits in a word. Maps to the RBIT instruction. rdar://9283021 llvm-svn: 211059	2014-06-16 21:55:58 +00:00
Tim Northover	ba2b33b4fe	Fix test for release builds. llvm-svn: 210934	2014-06-13 20:00:38 +00:00
Tim Northover	cadbbe1537	Atomics: emit "cmpxchg weak" where possible Most builtins date from before the "cmpxchg weak" was a gleam in the C++ committee's eye, so fortunately not much needs to change. But a few of them do acknowledge that failure is possible. For these, we'll emit the usual cartesian product of cmpxchg operations if we can't statically determine weakness. CodeGen can sort it out later if the function gets inlined. The only other non-trivial aspect of this is (I think) that we emit the scalar expression for "IsWeak" once, at the beginning, and propagate its value through the successive blocks. There's not much in it, but it's slightly more consistent with the existing handling of FailureOrder. llvm-svn: 210932	2014-06-13 19:43:04 +00:00
Alexey Samsonov	e595e1ade0	Remove top-level Clang -fsanitize= flags for optional ASan features. Init-order and use-after-return modes can currently be enabled by runtime flags. use-after-scope mode is not really working at the moment. The only problem I see is that users won't be able to disable extra instrumentation for init-order and use-after-scope by a top-level Clang flag. But this instrumentation was implicitly enabled for quite a while and we didn't hear from users hurt by it. llvm-svn: 210924	2014-06-13 17:53:44 +00:00
Tim Northover	b49b04bbe0	IR-change: cmpxchg operations now return { iN, i1 }. This is a minimal fix for clang. I'll soon add support for generating weak variants when requested, but that's not really necessary for the LLVM change in isolation. llvm-svn: 210907	2014-06-13 14:24:59 +00:00
Tim Northover	d7756c5a68	Tests: use CHECK-LABEL to help debugging failures llvm-svn: 210906	2014-06-13 14:24:48 +00:00
Brad Smith	378e7f9b78	Use dwarf-2 by default on OpenBSD and FreeBSD. The Tools.cpp part of the patch partially based on a patch from FreeBSD's LLVM tree. llvm-svn: 210883	2014-06-13 03:35:37 +00:00
Eli Bendersky	86483b3a0c	Add loop unroll pragma support http://reviews.llvm.org/D4089 Patch by Mark Heffernan. llvm-svn: 210667	2014-06-11 17:56:26 +00:00
Bill Schmidt	56a6967000	[PPC64LE] Fix vec_sld and vec_vsldoi for little endian The vec_sld and vec_vsldoi interfaces perform a left-shift on vector arguments for both big and little endian. However, because they rely on the vec_perm interface which is endian-dependent, the permutation vector needs to be reversed for LE to get the proper shift direction. I've added some extra testing for these interfaces for LE in the builtins-ppc-altivec.c. llvm-svn: 210657	2014-06-11 15:48:46 +00:00
Reid Kleckner	4173f6aff9	Really fix DOS newlines introduced in r210330 r210369 didn't quite catch all of them. llvm-svn: 210593	2014-06-10 21:35:24 +00:00
Evgeniy Stepanov	2be29929be	Fix line numbers for code inlined from __nodebug__ functions. Instructions from __nodebug__ functions don't have file:line information even when inlined into no-nodebug functions. As a result, intrinsics (SSE and other) from <*intrin.h> clang headers _never_ have file:line information. With this change, an instruction without !dbg metadata gets one from the call instruction when inlined. Fixes PR19001. llvm-svn: 210459	2014-06-09 09:09:19 +00:00
Bill Schmidt	7f6596bb13	[PPC64LE] Implement little-endian semantics for vec_sums The PowerPC vsumsws instruction, accessed via vec_sums, is defined architecturally with a big-endian bias, in that the second input vector and the result always reference big-endian element 3 (little-endian element 0). For ease of porting, the programmer wants elements 3 in both cases. To provide this semantics, for little endian we generate a permute for the second input vector prior to the vsumsws instruction, and generate a permute for the result vector following the vsumsws instruction. The correctness of this code is tested by the new sums.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. llvm-svn: 210449	2014-06-09 03:31:47 +00:00
Joey Gouly	41181d140c	Convert tests I recently add to use -verify instead of FileCheck. This uncovered something strange. Diagnostics for InlineAsm have source locations that don't really map to where they are within the .c source file. llvm-svn: 210440	2014-06-08 21:28:54 +00:00
Bill Schmidt	d7c53a91df	[PPC64LE] Implement little-endian semantics for vec_unpack[hl] The PowerPC vector-unpack-high and vector-unpack-low instructions are defined architecturally with a big-endian bias, in that the vector element numbering is assumed to be "left to right" regardless of whether the processor is in big-endian or little-endian mode. This effectively reverses the meaning of "high" and "low." Such a definition is unnatural for little-endian code generation. To facilitate ease of porting, the vec_unpackh and vec_unpackl interfaces are designed to use natural element ordering, so that elements are numbered according to little-endian design principles when code is generated for a little-endian target. The desired semantics can be achieved by using the opposite instruction for little-endian mode. That is, when a call to vec_unpackh appears in the code, a vector-unpack-low is generated, and when a call to vec_unpackl appears in the code, a vector-unpack-high is generated. The correctness of this code is tested by the new unpack.c test added in a previous patch, as well as the modifications to builtins-ppc-altivec.c in the present patch. Note that these interfaces were originally incorrectly implemented when they take a vector pixel argument. This patch corrects this implementation for both big- and little-endian code generation. llvm-svn: 210391	2014-06-07 02:20:52 +00:00
Bill Schmidt	86f673a005	[PPC64LE] Update test for vec_sum2s interface Commit r210384 prematurely included changes to the little-endian implementation of the vec_sum2s interface. This patch modifies test/CodeGen/builtins-ppc-altivec.c to test those changes. llvm-svn: 210389	2014-06-07 01:47:42 +00:00
Bill Schmidt	7f0a5c5141	[PPC64LE] Update builtins-ppc-altivec.c for PPC64 and PPC64LE The Altivec builtin test case test/CodeGen/builtins-ppc-altivec.c has always been executed only for 32-bit PowerPC. These tests are equally valid for 64-bit PowerPC. This patch updates the test to be run for three targets: powerpc-unknown-unknown, powerpc64-unknown-unknown, and powerpc64le-unknown-unknown. The expected code generation changes for some of the Altivec builtins for little endian, so this patch adds new CHECK-LE variants to the test for the powerpc64le target. These tests satisfy the testing requirements for some previous patches committed over the last couple of days for lib/Headers/altivec.h: r210279 for vec_perm, r210337 for vec_mul[eo], and r210340 for vec_pack. llvm-svn: 210384	2014-06-06 23:12:00 +00:00
Aaron Ballman	b06b15aa28	Adding a new #pragma for the vectorize and interleave optimization hints. Patch thanks to Tyler Nowicki! llvm-svn: 210330	2014-06-06 12:40:24 +00:00
Joey Gouly	5798b26c65	When an inline-asm diagnostic is reported by the backend, report it with the correct severity. Previously all inline-asm diagnostics were reported as errors. llvm-svn: 210286	2014-06-05 21:23:42 +00:00
Renato Golin	0d2f580200	Fix bot for named register test llvm-svn: 210275	2014-06-05 16:52:20 +00:00
Renato Golin	2e31e4e47b	Add pointer types to global named register This patch adds support for pointer types in global named registers variables. It'll be lowered as a pair of read/write_register and inttoptr/ptrtoint calls. Also adds some early checks on types on SemaDecl to avoid the assert. Tests changed accordingly. (PR19837) llvm-svn: 210274	2014-06-05 16:45:22 +00:00
Robert Lytton	6adb20f720	XCore target: Fix 'typestring' binding qualifier to the array and not the type Differential Revision: http://reviews.llvm.org/D3949 llvm-svn: 210250	2014-06-05 09:06:21 +00:00
Rafael Espindola	27c60b512a	Update for llvm API change. Aliases in llvm now hold an arbitrary expression. llvm-svn: 210063	2014-06-03 02:42:01 +00:00
Michael J. Spencer	dd59775f06	[CodeGen] Don't cast and use SizeTy instead of Int32Ty when constructing {extract,insert} vector element instructions. llvm-svn: 209942	2014-05-31 00:22:12 +00:00
Adam Nemet	286ae08e7d	Implement AVX1 vbroadcast intrinsics with vector initializers These intrinsics are special because they directly take a memory operand (AVX2 adds the register counterparts). Typically, other non-memop intrinsics take registers and then it's left to isel to fold memory operands. In order to LICM intrinsics directly reading memory, we require that no stores are in the loop (LICM) or that the folded load accesses constant memory (MachineLICM). When neither is the case we fail to hoist a loop-invariant broadcast. We can work around this limitation if we expose the load as a regular load and then just implement the broadcast using the vector initializer syntax. This exposes the load to LICM and other optimizations. At the IR level this is translated into a series of insertelements. The sequence is already recognized as a broadcast so there is no impact on the quality of codegen. _mm256_broadcast_pd and _mm256_broadcast_ps are not updated by this patch because right now we lack the DAG-combiner smartness to recover the broadcast instructions. This will be tackled in a follow-on. There will be completing changes on the LLVM side to remove the LLVM intrinsics and to auto-upgrade bitcode files. Fixes <rdar://problem/16494520> llvm-svn: 209846	2014-05-29 20:47:29 +00:00
Alexey Samsonov	c054d9813c	[ASan] Hoist blacklisting globals from init-order checking to Clang. Clang knows about the sanitizer blacklist and it makes no sense to add global to the list of llvm.asan.dynamically_initialized_globals if it will be blacklisted in the instrumentation pass anyway. Instead, we should do as much blacklisting as possible (if not all) in the frontend. llvm-svn: 209789	2014-05-29 01:43:53 +00:00
Sanjay Patel	1585fb94ab	added Intel's BMI intrinsic variants (fixes PR19431 - http://llvm.org/bugs/show_bug.cgi?id=19431) llvm-svn: 209769	2014-05-28 20:26:57 +00:00
Warren Hunt	583db1979c	Reverting 209503 - Breaks asan blacklists I opened a discussion on cfe-commits. Ideally we've got a few things that need to happen. CompilerRT should probably have blacklists tests. Asan should probably not depend on that specific field. llvm-svn: 209766	2014-05-28 19:17:45 +00:00
NAKAMURA Takumi	753d70ce53	Let clang/test/CodeGen/pr19841.cpp tolerant of MS mangler. llvm-svn: 209726	2014-05-28 10:53:06 +00:00
Renato Golin	a627a103d0	Fix pr19841, bb are also unnamed llvm-svn: 209668	2014-05-27 17:01:21 +00:00
Renato Golin	345c9cc5f4	Fix pr19841.cpp on release mode llvm-svn: 209666	2014-05-27 16:51:36 +00:00
Renato Golin	e7b3d5dcb4	Revert small change to EmitDeclRefLValue That small change, although it looked harmless, it made emitting the LValue on the PHI node without the proper cast. Reverting it fixes PR19841. llvm-svn: 209663	2014-05-27 16:46:27 +00:00
Nico Rieck	755a36f593	IRGen: Add more tests for dll attributes llvm-svn: 209596	2014-05-25 10:34:16 +00:00
Tim Northover	573cbee543	AArch64/ARM64: rename ARM64 components to AArch64 This keeps Clang consistent with backend naming conventions. llvm-svn: 209579	2014-05-24 12:52:07 +00:00
Tim Northover	25e8a6754e	AArch64/ARM64: update Clang after AArch64 removal. A few (mostly CodeGen) parts of Clang were tightly coupled to the AArch64 backend. Now that it's gone, they will not even compile. I've also deduplicated RUN lines in many of the AArch64 tests. This might improve "make check-all" time noticably: some of those NEON tests were monsters. llvm-svn: 209578	2014-05-24 12:51:25 +00:00
Hans Wennborg	e9277401b7	This test doesn't need -O2 -disable-llvm-optzns I forgot to fix this one in r209145. We use these flags on dllimport tests to make sure we emit code for available_externaly functions and don't inline the IR. llvm-svn: 209564	2014-05-23 23:29:44 +00:00
Nico Rieck	4da7debf7d	Fix broken FileCheck prefix llvm-svn: 209541	2014-05-23 19:07:25 +00:00
Robert Lytton	57dd5cf441	Fix '-main-file-name <name>' so that it is used for the ModuleID. Summary: Previously, you could not specify the original file name when passing a preprocessed file into the compiler Now you can use 'clang -Xclang -main-file-name -Xclang <original file name> ...' Or 'clang -cc1 -main-file-name <original file name> ...' llvm-svn: 209503	2014-05-23 07:34:08 +00:00
Matt Arsenault	328b52e88a	Forgot to add updated datalayout test llvm-svn: 209465	2014-05-22 18:57:49 +00:00
Renato Golin	9258aa5543	Make global named registers internal variables llvm-svn: 209289	2014-05-21 10:40:27 +00:00
Eric Christopher	6c553d6240	Remove test. Replacing it with a backend test with the optimized IR. llvm-svn: 209260	2014-05-21 00:00:01 +00:00
Eric Christopher	bd8652d272	Make this test emit llvm IR rather than assembly. llvm-svn: 209255	2014-05-20 23:23:51 +00:00
Duncan P. N. Exon Smith	6f782b12aa	Fix testcase from r209228 llvm-svn: 209229	2014-05-20 19:20:23 +00:00
Duncan P. N. Exon Smith	d22b97c30b	GlobalValue: Testcase for hidden visibility and local linkage This is a testcase for r209227, a change in LLVM that automatically sets visibility to default when the linkage is changed to local (rather than asserting). What this testcase triggers is hard to reproduce otherwise: the `GlobalValue` is created (with non-local linkage), the visibility is set to hidden, and then the linkage is set to local. PR19760 llvm-svn: 209228	2014-05-20 19:04:31 +00:00
Peter Collingbourne	41af7c2fdc	Implement the flatten attribute. This is a GNU attribute that causes calls within the attributed function to be inlined where possible. It is implemented by giving such calls the alwaysinline attribute. Differential Revision: http://reviews.llvm.org/D3816 llvm-svn: 209217	2014-05-20 17:12:51 +00:00
Robert Lytton	db8c1cb02c	XCore target: sort typestring enum fields alphabetically llvm-svn: 209196	2014-05-20 07:19:33 +00:00
Adrian Prantl	2dbdd20d37	Demote the "Debug Info Version" module flag to llvm::Module::Warning behavior on mismatch. The AutoUpgrader will drop incompatible debug info any way and also emit a warning diagnostic for it. rdar://problem/16926122 llvm-svn: 209182	2014-05-19 23:40:06 +00:00
Renato Golin	c296d951a7	Using SmallString and correct addr var llvm-svn: 209180	2014-05-19 23:25:25 +00:00
Renato Golin	156a853ccb	Fix usage of string when StringRef was needed Also adding a variable to the test, so release bots match %1. This should also calm the gdb buildbot. . llvm-svn: 209171	2014-05-19 22:36:19 +00:00
Peter Collingbourne	b4728c12e8	Implement the no_split_stack attribute. This is a GNU attribute that allows split stacks to be turned off on a per-function basis. Differential Revision: http://reviews.llvm.org/D3817 llvm-svn: 209167	2014-05-19 22:14:34 +00:00
Renato Golin	230c5eb4bd	Non-allocatable Global Named Register This patch implements global named registers in Clang, lowering to the just created intrinsics in LLVM (@llvm.read/write_register). A new type of LValue had to be created (Register), which just adds support to carry the metadata node containing the name of the register. Two new methods to emit loads and stores interoperate with another to emit the named metadata node. No guarantees are being made and only non-allocatable global variable named registers are being supported. Local named register support is unchanged. llvm-svn: 209149	2014-05-19 18:15:42 +00:00

1 2 3 4 5 ...

2590 Commits