llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	6e9110abe9	[PowerPC] Add asm parser support for bitmask forms of rotate-and-mask instructions The asm syntax for the 32-bit rotate-and-mask instructions can take a 32-bit bitmask instead of an (mb, me) pair. This syntax is not specified in the Power ISA manual, but is accepted by GNU as, and is documented in IBM's Assembler Language Reference. The GNU Multiple Precision Arithmetic Library (gmp) contains assembly that uses this syntax. To implement this, I moved the isRunOfOnes utility function from PPCISelDAGToDAG.cpp to PPCMCTargetDesc.h. llvm-svn: 233483	2015-03-28 19:42:41 +00:00
Hal Finkel	cd5553ed39	[ConstantFold] Don't fold ppc_fp128 <-> int bitcasts PPC_FP128 is really the sum of two consecutive doubles, where the first double is always stored first in memory, regardless of the target endianness. The memory layout of i128, however, depends on the target endianness, and so we can't fold this without target endianness information. As a result, we must not do this folding in lib/IR/ConstantFold.cpp (it could be done instead in Analysis/ConstantFolding.cpp, but that's not done now). Fixes PR23026. llvm-svn: 233481	2015-03-28 16:44:57 +00:00
Duncan P. N. Exon Smith	a8b3a1f374	Verifier: Allow subroutine types to have no type array Loosen one check from r233446: as long as `DIBuilder` requires a non-null type for every subprogram, we should allow a null type array. Also add tests for the rest of `MDSubroutineType`, which were somehow missing. llvm-svn: 233468	2015-03-28 02:43:53 +00:00
Duncan P. N. Exon Smith	ce2b983425	DebugInfo: Fix testcases with invalid MDSubprogram nodes Fix testcases that don't pass the verifier after a WIP patch to check `MDSubprogram` operands more effectively. I found the following issues: - When `isDefinition: false`, the `variables:` field might point at `!{i32 786468}`, or at a tuple that pointed at an empty tuple with the comment "previously: invalid DW_TAG_base_type" (I vaguely recall adding those comments during an upgrade script). In these cases, I just dropped the array. - The `variables:` field might point at something like `!{!{!8}}`, where `!8` was an `MDLocation`. I removed the extra layer of indirection. - Invalid `type:` (not an `MDSubroutineType`). llvm-svn: 233466	2015-03-28 02:26:45 +00:00
Akira Hatanaka	ee97475b2e	[ARM] Enable changing instprinter's behavior based on the per-function subtarget. llvm-svn: 233451	2015-03-27 23:41:42 +00:00
Sanjoy Das	fe0e0fff92	[SCEV] Look at backedge dominating conditions. Summary: This change teaches ScalarEvolution::isLoopBackedgeGuardedByCond to look at edges within the loop body that dominate the latch. We don't do an exhaustive search for all possible edges, but only a quick walk up the dom tree. Reviewers: atrick, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8627 llvm-svn: 233447	2015-03-27 23:18:08 +00:00
Duncan P. N. Exon Smith	49e6a70fe3	Verifier: Call verifyModule() from llc and opt Change `llc` and `opt` to run `verifyModule()`. This ensures that we check the full module before `FunctionPass::doInitialization()` ever gets called (I was getting crashes in `DwarfDebug` instead of verifier failures when testing a WIP patch that checks operands of compile units). In `opt`, also move up debug-info-stripping so that it still runs before verification. There was a fair bit of broken code that was sitting in tree. Interestingly, some were cases of a `select` that referred to itself in `-instcombine` tests (apparently an intermediate result). I split them off to `*-noverify.ll` tests with RUN lines like this: opt < %s -S -disable-verify -instcombine \| opt -S \| FileCheck %s This avoids verifying the input file (so we can get the broken code into `-instcombine), but still verifies the output with a second call to `opt` (to verify that `-instcombine` will clean it up like it should). llvm-svn: 233432	2015-03-27 22:04:28 +00:00
Duncan P. N. Exon Smith	988a7f8b79	DebugInfo: Fix bad debug info for compile units and types Fix debug info in these tests, which started failing with a WIP patch to verify compile units and types. The problems look like they were all caused by bitrot. They fell into these categories: - Using `!{i32 0}` instead of `!{}`. - Using `!{null}` instead of `!{}`. - Using `!MDExpression()` instead of `!{}`. - Using `!8` instead of `!{!8}`. - `file:` references that pointed at `MDCompileUnit`s instead of the same `MDFile` as the compile unit. - `file:` references that were numerically off-by-one or (off-by-ten). llvm-svn: 233415	2015-03-27 20:46:33 +00:00
Ahmed Bougacha	91b959352c	[R600/SI] Fix testcase check line. Missing colon, instruction typo. llvm-svn: 233414	2015-03-27 20:41:42 +00:00
Akira Hatanaka	bceb2a5a1c	[AArch64InstPrinter] Use the feature bits of the subtarget passed to the print method. This enables the instprinter to print a different system register name based on the feature bits of the per-function subtarget. Differential Revision: http://reviews.llvm.org/D8668 llvm-svn: 233412	2015-03-27 20:37:20 +00:00
Ahmed Bougacha	faf8065a99	[CodeGen] Don't attempt a tail-call with a non-forwarded explicit sret. Tailcalls are only OK with forwarded sret pointers. With explicit sret, one approximation is to check that the pointer isn't an Instruction, as in that case it might point into some local memory (alloca). That's not OK with tailcalls. Explicit sret counterpart to r233409. Differential Revison: http://reviews.llvm.org/D8510 llvm-svn: 233410	2015-03-27 20:35:49 +00:00
Ahmed Bougacha	e2bd5d36b3	[CodeGen] Don't attempt a tail-call with implicit sret. Tailcalls are only OK with forwarded sret pointers. With sret demotion, they're not, as we'd have a pointer into a soon-to-be-dead stack frame. Differential Revison: http://reviews.llvm.org/D8510 llvm-svn: 233409	2015-03-27 20:28:30 +00:00
Alexei Starovoitov	13cf2cc405	[bpf] add support for bpf pseudo instruction Expose bpf pseudo load instruction via intrinsic. It is used by front-ends that can encode file descriptors directly into IR instead of relying on relocations. llvm-svn: 233396	2015-03-27 18:51:42 +00:00
Quentin Colombet	2e27df717a	[RegisterCoalescer] Refine the terminal rule to still consider the terminal nodes. When a node is terminal it is pushed at the end of the list of the copies to coalesce instead of being completely ignored. In effect, this reduces its priority over non-terminal nodes. Because of that, we do not miss the rematerialization opportunities, nor the copies that can be merged with more complex, than the terminal rule, interference checks. Related to PR22768. llvm-svn: 233395	2015-03-27 18:37:15 +00:00
Duncan P. N. Exon Smith	e2c61d9eec	LLParser: Require non-null scope for MDLocation and MDLocalVariable Change `LLParser` to require a non-null `scope:` field for both `MDLocation` and `MDLocalVariable`. There's no need to wait for the verifier for this check. This also allows their `::getImpl()` methods to assert that the incoming scope is non-null. llvm-svn: 233394	2015-03-27 17:56:39 +00:00
Adrian Prantl	c2401ddf84	Add a -raw option to the -section mode of llvm-objdump. llvm-svn: 233390	2015-03-27 17:31:15 +00:00
Duncan P. N. Exon Smith	3d2afaa29e	Verifier: Check fields of MDVariable subclasses Check fields from `MDLocalVariable` and `MDGlobalVariable` and change the accessors to downcast to the right types. `getType()` still returns `Metadata*` since it could be an `MDString`-based reference. Since local variables require non-null scopes, I also updated `LLParser` to require a `scope:` field. A number of testcases had grown bitrot and started failing with this patch; I committed them separately in r233349. If I just broke your out-of-tree testcases, you're probably hitting similar problems (so have a look there). llvm-svn: 233389	2015-03-27 17:29:58 +00:00
Duncan P. N. Exon Smith	b4499078d1	DebugInfo: Fix another bitrotted testcase Fix another case of a missing `scope:` field on an `MDLocalVariable`. llvm-svn: 233388	2015-03-27 17:29:11 +00:00
Rafael Espindola	2fcfb5e5c7	Work around pr23045 and make it easier to reproduce. Dropping old debug format requires the entire module to be read upfront. This was failing only with the gold plugin, but that is just because llvm-link was not upgrading metadata. The new testcase using llvm-link shows the problem. llvm-svn: 233381	2015-03-27 15:55:06 +00:00
Rafael Espindola	b61beca40c	Close unique sections when switching away from them. It is not possible to switch back to unique secitons, so close them automatically when switching away. llvm-svn: 233380	2015-03-27 15:01:40 +00:00
Yaron Keren	39972f7723	Fix subprogram-linkonce-weak.ll and subprogram-linkonce-weak-odr.ll for Windows. llvm-svn: 233375	2015-03-27 13:52:12 +00:00
James Molloy	0cbb2a8603	Reapply r233175 and r233183: float2int. This re-adds float2int to the tree, after fixing PR23038. It turns out the argument to APSInt() is true-if-unsigned, rather than true-if-signed :(. Added testcase and explanatory comment. llvm-svn: 233370	2015-03-27 10:36:57 +00:00
Andrew Trick	43adfb30d5	Complete the MachineScheduler fix made way back in r210390. "Fix the MachineScheduler's logic for updating ready times for in-order. Now the scheduler updates a node's ready time as soon as it is scheduled, before releasing dependent nodes." This fix was only made in one variant of the ScheduleDAGMI driver. Francois de Ferriere reported the issue in the other bit of code where it was also needed. I never got around to coming up with a test case, but it's an obvious fix that shouldn't be delayed any longer. I'll try to refactor this code a little better. I did verify performance on a wide variety of targets and saw no negative impact with this fix. llvm-svn: 233366	2015-03-27 06:10:13 +00:00
Philip Reames	e1bf27045d	Require a GC strategy be specified for functions which use gc.statepoint This was discussed a while back and I left it optional for migration. Since it's been far more than the 'week or two' that was discussed, time to actually make this manditory. llvm-svn: 233357	2015-03-27 05:09:33 +00:00
Philip Reames	f8f0933b48	Allow explicit spill slots to be specified for a gc.statepoint This patch adds support for explicitly provided spill slots in the GC arguments of a gc.statepoint. This is somewhat analogous to gcroot, but leverages the STATEPOINT MI node and StackMap infrastructure. The motivation for this is: 1) The stack spilling code for gc.statepoints hasn't advanced as fast as I'd like. One major option is to give up on doing spilling in the backend and do it at the IR level instead. We'd give up the ability to have gc values in registers, but that's a minor cost in practice. We are not neccessarily moving in that direction, but having the ability to prototype such a thing cheaply is interesting. 2) I want to port the gcroot lowering to use the statepoint infastructure. Given the metadata printers for gcroot expect a fixed set of stack roots, it's easiest to just reuse the explicit stack slots and pass them directly to the underlying statepoint. I'm holding off on the documentation for the new feature until I'm reasonable sure this is going to stick around. llvm-svn: 233356	2015-03-27 04:52:48 +00:00
Andrew Trick	5533adc117	Reintroduce the SelectionDAG scheduler test for r233351. This test returns nonnative integer types which aren't supported on all targets. The real issue with the SelectionDAG scheduler is with x86 EFLAGS. llvm-svn: 233355	2015-03-27 04:42:52 +00:00
David Majnemer	b919dd693f	WinEH: Create a parent frame alloca for HandlerType xdata tables We don't have any logic to emit those tables yet, so the SDAG lowering of this intrinsic is just a stub. We can see the intrinsic in the prepared IR, though. llvm-svn: 233354	2015-03-27 04:17:07 +00:00
Andrew Trick	46863e5565	This test should have been target specific. I missed that. llvm-svn: 233353	2015-03-27 04:04:35 +00:00
Andrew Trick	e97ff5a2ad	Fix a bug in SelectionDAG scheduling backtracking code: PR22304. It can happen (by line CurSU->isPending = true; // This SU is not in AvailableQueue right now.) that a SUnit is mark as available but is not in the AvailableQueue. For SUnit being selected for scheduling both conditions must be met. This patch mainly defensively protects from invalid removing a node from a queue. Sometimes nodes are marked isAvailable but are not in the queue because they have been defered due to some hazard. Patch by Pawel Bylica! llvm-svn: 233351	2015-03-27 03:44:13 +00:00
Nick Lewycky	ffb0864b44	Revert r233175 and r233183 with it. This pulls float2int back out of the tree, due to PR23038. llvm-svn: 233350	2015-03-27 02:00:11 +00:00
Duncan P. N. Exon Smith	219c8d3876	DebugInfo: Update testcases with invalid variables Fix testcases whose variables are invalid. I'm working on a patch that adds `Verifier` checks for `MDLocalVariable` (and `MDGlobalVariable`), and these failed because: - `scope:` fields need to point at `MDLocalScope` and can't be null. - `file:` fields need to point at `MDFile`. - `inlinedAt:` fields need to point at `MDLocation`. llvm-svn: 233349	2015-03-27 01:58:34 +00:00
Ahmed Bougacha	821880a7a1	[AsmPrinter] Don't assert on GOT equivalent non-constant users. We used to dyn_cast<Constant> in the recursive call, but cast<> in the initial one, and there can be non-Constant initial users. llvm-svn: 233346	2015-03-27 01:40:54 +00:00
Derek Schuff	b051389f04	Use movw/movt instead of constant pool loads to lower byval parameter copies Summary: The ARM backend can use a loop to implement copying byval parameters before a call. In non-thumb2 mode it uses a constant pool load to materialize the trip count. For targets that need movt instead (e.g. Native Client), use the same code as in thumb2 mode to materialize the trip count. Reviewers: jfb, t.p.northover Differential Revision: http://reviews.llvm.org/D8442 llvm-svn: 233324	2015-03-26 22:11:00 +00:00
Duncan P. N. Exon Smith	264899823f	Verifier: Check accessors of MDLocation Check accessors of `MDLocation`, and change them to `cast<>` down to the right types. Also add type-safe factory functions. All the callers that handle broken code need to use the new versions of the accessors (`getRawScope()` instead of `getScope()`) that still return `Metadata*`. This is also necessary for things like `MDNodeKeyImpl<MDLocation>` (in LLVMContextImpl.h) that need to unique the nodes when their operands might still be forward references of the wrong type. In the `Value` hierarchy, consumers that handle broken code use `getOperand()` directly. However, debug info nodes have a ton of operands, and their order (even their existence) isn't stable yet. It's safer and more maintainable to add an explicit "raw" accessor on the class itself. llvm-svn: 233322	2015-03-26 22:05:04 +00:00
Rafael Espindola	aeed3cbce0	Fix PR23025. There is something in link.exe that requires a relocation to use a global symbol. Not doing so breaks the chrome build on windows. This patch sets isWeak for that to work. To compensate, we then need to look past those symbols when not creating relocations. This patch includes an ELF test that matches GNU as behaviour. I am still reducing the chrome build issue and will add a test once that is done. llvm-svn: 233318	2015-03-26 21:11:00 +00:00
Justin Bogner	54dd7df1c0	[ARM] Fix some non-portable shell syntax in r233301's tests The "\|&" operator isn't POSIX, so it can fail depending on the host's default shell. Avoid it. There were also a couple of places that did "2>1", but this creates a file called "1". They clearly meant "2>&1". llvm-svn: 233309	2015-03-26 19:24:13 +00:00
Duncan P. N. Exon Smith	c947892d10	Reapply "Linker: Drop function pointers for overridden subprograms" This reverts commit r233254, effectively reapplying r233164 (and its successors), with an additional testcase for when subprograms match exactly. This fixes PR22792 (again). I'm using the same approach, but I've moved up the call to `stripReplacedSubprograms()`. The function pointers need to be dropped before mapping any metadata from the source module, or else this can drop the function from new subprograms that have merged (via Metadata uniquing) with the old ones. Dropping the pointers first prevents them from merging. ** The original commit message follows. ** Linker: Drop function pointers for overridden subprograms Instead of dropping subprograms that have been overridden, just set their function pointers to `nullptr`. This is a minor adjustment to the stop-gap fix for PR21910 committed in r224487, and fixes the crasher from PR22792. The problem that r224487 put a band-aid on: how do we find the canonical subprogram for a `Function`? Since the backend currently relies on `DebugInfoFinder` (which does a naive in-order traversal of compile units and picks the first subprogram) for this, r224487 tried dropping non-canonical subprograms. Dropping subprograms fails because the backend also builds up a map from subprogram to compile unit (`DwarfDebug::SPMap`) based on the subprogram lists. A missing subprogram causes segfaults later when an inlined reference (such as in this testcase) is created. Instead, just drop the `Function` pointer to `nullptr`, which nicely mirrors what happens when an already-inlined `Function` is optimized out. We can't really be sure that it's the same definition anyway, as the testcase demonstrates. This still isn't completely satisfactory. Two flaws at least that I can think of: - I still haven't found a straightforward way to make this symmetric in the IR. (Interestingly, the DWARF output is already symmetric, and I've tested for that to be sure we don't regress.) - Using `DebugInfoFinder` to find the canonical subprogram for a function is kind of crazy. We should just attach metadata to the function, like this: define weak i32 @foo(i32, i32) !dbg !MDSubprogram(...) { llvm-svn: 233302	2015-03-26 18:35:30 +00:00
Vladimir Sukharev	4b18c727a2	[ARM] Add v8.1a "Rounding Double Multiply Add/Subtract" extension Reviewers: t.p.northover Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8503 llvm-svn: 233301	2015-03-26 18:29:02 +00:00
Sanjoy Das	14598830fe	[SCEV] Revert bailout added in r75511. Summary: With the introduction of MarkPendingLoopPredicates in r157092, I don't think the bailout is needed anymore. Reviewers: atrick, nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8624 llvm-svn: 233296	2015-03-26 17:28:26 +00:00
Benjamin Kramer	7fa8c430f7	InstCombine: fold (A << C) == (B << C) --> ((A^B) & (~0U >> C)) == 0 Anding and comparing with zero can be done in a single instruction on most archs so this is a bit cheaper. llvm-svn: 233291	2015-03-26 17:12:06 +00:00
Vladimir Sukharev	c632cda8b2	[AArch64, ARM] Add v8.1a architecture and generic cpu New architecture and cpu added, following http://community.arm.com/groups/processors/blog/2014/12/02/the-armv8-a-architecture-and-its-ongoing-development Reviewers: t.p.northover Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8505 llvm-svn: 233290	2015-03-26 17:05:54 +00:00
Jingyue Wu	177a81578f	[SLSR] handle candidate form &B[i * S] Summary: This patch enhances SLSR to handle another candidate form &B[i * S]. If we found two candidates S1: X = &B[i * S] S2: Y = &B[i' * S] and S1 dominates S2, we can replace S2 with Y = &X[(i' - i) * S] Test Plan: slsr-gep.ll X86/no-slsr.ll: verify that we do not run SLSR on GEPs that already fit into an addressing mode Reviewers: eliben, atrick, meheff, hfinkel Reviewed By: hfinkel Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D7459 llvm-svn: 233286	2015-03-26 16:49:24 +00:00
Andrea Di Biagio	8f7feec5fd	[X86][FastIsel] Teach how to select vector load instructions. This patch teaches fast-isel how to select 128-bit vector load instructions. Added test CodeGen/X86/fast-isel-vecload.ll Differential Revision: http://reviews.llvm.org/D8605 llvm-svn: 233270	2015-03-26 11:29:02 +00:00
Duncan P. N. Exon Smith	7124230682	Revert "Linker: Drop function pointers for overridden subprograms" This reverts commit r233164 and its testcase follow-ups in r233165, r233207, r233214, and r233221. It apparently unleashed an LTO bootstrap failure, at least on Darwin: http://lab.llvm.org:8080/green/job/clang-stage2-configure-Rlto_build/3376/ I'm reproducing now. llvm-svn: 233254	2015-03-26 05:27:45 +00:00
Duncan P. N. Exon Smith	5fbcc46a4a	bugpoint: Verify input files Like r233229 for `llvm-link`, start verifying input files to `bugpoint`. llvm-svn: 233253	2015-03-26 05:03:10 +00:00
Quentin Colombet	2c6e0597c6	[RegisterCoalescer] Add a rule to consider more profitable copies first when those are in the same basic block. The previous approach was the topological order of the basic block. By default this rule is disabled. Related to PR22768. llvm-svn: 233241	2015-03-26 01:01:48 +00:00
Eric Christopher	9f74ca5e0f	Testcase for r233239. llvm-svn: 233240	2015-03-26 00:57:33 +00:00
Duncan P. N. Exon Smith	041ceb7d57	llvm-link: Verify input modules Otherwise, broken input modules can cause assertions. I've updated two of the testcases that started failing (modules that had `Require` flags but didn't meet their own requirements), but Rafael and I decided that test/Linker/2011-08-22-ResolveAlias.ll should just be deleted outright -- it's a leftover of the way llvm-gcc used to implement weakref. llvm-svn: 233229	2015-03-25 23:22:10 +00:00
Sanjoy Das	e561fee2a4	[ValueTracking] Fix PR23011. Summary: `ComputeNumSignBits` returns incorrect results for `srem` instructions. This change fixes the issue and adds a test case. Reviewers: nadav, nicholas, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8600 llvm-svn: 233225	2015-03-25 22:33:53 +00:00
Simon Pilgrim	09f3ff9a0a	[DAGCombiner] Add support for TRUNCATE + FP_EXTEND vector constant folding This patch adds supports for the vector constant folding of TRUNCATE and FP_EXTEND instructions and tidies up the SINT_TO_FP and UINT_TO_FP instructions to match. It also moves the vector constant folding for the FNEG and FABS instructions to use the DAG.getNode() functionality like the other unary instructions. Differential Revision: http://reviews.llvm.org/D8593 llvm-svn: 233224	2015-03-25 22:30:31 +00:00
Duncan P. N. Exon Smith	bb2958a042	Linker: Stop using -gmlt test/Linker/subprogram-linkonce-weak.ll As dblaikie pointed out, if I stop setting `emissionKind: 2` then the backend won't do magical things on Linux vs. Darwin. I had wrongly assumed that there were stricter requirements on the input if we weren't in line-tables-only mode, but apparently not. With that knowledge, clean up this testcase a little more. - Set `emissionKind: 1`. - Add back checks for the weak version of @foo. - Check more robustly that we have the right subprograms by checking the `DW_AT_decl_file` and `DW_AT_decl_line` which now show up. - Check the line table in isolation (since it's no longer doubling as an indirect test for the subprogram of the weak version of @foo). llvm-svn: 233221	2015-03-25 21:36:41 +00:00
Duncan P. N. Exon Smith	c6d5cfe162	Linker: Loosen checks slightly from r233207 According to at least one bot [1], function prologues aren't always empty for these functions. Skip that part of the follow-up check. llvm-svn: 233214	2015-03-25 20:51:21 +00:00
Reid Kleckner	7e9546b378	WinEH: Create an unwind help alloca for __CxxFrameHandler3 xdata tables We don't have any logic to emit those tables yet, so the sdag lowering of this intrinsic is just a stub. We can see the intrinsic in the prepared IR, though. llvm-svn: 233209	2015-03-25 20:10:36 +00:00
Duncan P. N. Exon Smith	f9dc7036d3	Linker: Rewrite dwarfdump checks from r233164 Rewrite the checks from r233164 that I temporarily disabled in r233165. It turns out that the line-tables only debug info we emit from `llc` is (intentionally) different on Linux than on Darwin. r218129 started skipping emission of subprograms with no inlined subroutines, and r218702 was a spiritual revert of that behaviour for Darwin. I think we can still test this in a platform-neutral way. - Stop checking for the possibly missing `DW_TAG_subprogram` defining the debug info for the real version of `@foo`. - Start checking the line tables, ensuring that the right debug info was used to generate them (grabbing `DW_AT_low_pc` from the compile unit). - I changed up the line numbers used in the "weak" version so it's easier to follow. This should hopefully finish off PR22792. llvm-svn: 233207	2015-03-25 19:57:42 +00:00
Kit Barton	535e69de34	Add Hardware Transactional Memory (HTM) Support This patch adds Hardware Transaction Memory (HTM) support supported by ISA 2.07 (POWER8). The intrinsic support is based on GCC one [1], but currently only the 'PowerPC HTM Low Level Built-in Function' are implemented. The HTM instructions follows the RC ones and the transaction initiation result is set on RC0 (with exception of tcheck). Currently approach is to create a register copy from CR0 to GPR and comapring. Although this is suboptimal, since the branch could be taken directly by comparing the CR0 value, it generates code correctly on both test and branch and just return value. A possible future optimization could be elimitate the MFCR instruction to branch directly. The HTM usage requires a recently newer kernel with PPC HTM enabled. Tested on powerpc64 and powerpc64le. This is send along a clang patch to enabled the builtins and option switch. [1] https://gcc.gnu.org/onlinedocs/gcc/PowerPC-Hardware-Transactional-Memory-Built-in-Functions.html Phabricator Review: http://reviews.llvm.org/D8247 llvm-svn: 233204	2015-03-25 19:36:23 +00:00
Peter Collingbourne	ff2a64cf1b	Simplify missing-file-line.ll test. llvm-svn: 233201	2015-03-25 17:58:09 +00:00
Peter Collingbourne	b736065f78	DebugInfo: Permit DW_TAG_structure_type, DW_TAG_member, DW_TAG_typedef tags with empty file names. Some languages, such as Go, have pre-defined structure types (e.g. "string" is essentially a pointer/length pair) or pre-defined "typedef" types (e.g. "error" is essentially a typedef for a specific interface type). Such types do not have associated source location, so a Go frontend would be correct not to associate a file name with such types. This change relaxes the DIType verifier to permit unlocated types with these tags. Differential Revision: http://reviews.llvm.org/D8588 llvm-svn: 233200	2015-03-25 17:44:49 +00:00
Sanjay Patel	2f8f019daf	[X86, AVX] improve insertion into zero element of 256-bit vector This patch allows AVX blend instructions to handle insertion into the low element of a 256-bit vector for the appropriate data types. For f32, instead of: vblendps $1, %xmm1, %xmm0, %xmm1 ## xmm1 = xmm1[0],xmm0[1,2,3] vblendps $15, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0,1,2,3],ymm0[4,5,6,7] we get: vblendps $1, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0],ymm0[1,2,3,4,5,6,7] For f64, instead of: vmovsd %xmm1, %xmm0, %xmm1 ## xmm1 = xmm1[0],xmm0[1] vblendpd $3, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0,1],ymm0[2,3] we get: vblendpd $1, %ymm1, %ymm0, %ymm0 ## ymm0 = ymm1[0],ymm0[1,2,3] For the hardware-neglected integer data types, I left a TODO comment in the code and added regression tests for a follow-on patch. Differential Revision: http://reviews.llvm.org/D8609 llvm-svn: 233199	2015-03-25 17:36:01 +00:00
Sanjay Patel	defd9b9b4c	use update_llc_test_checks.py to tighten checking in these tests 1. There were no CHECK-LABELs, so we could match instructions from the wrong function. 2. The use of zero operands meant multiple xor instructions could match some CHECKs. 3. The test was over-specified to need a Sandybridge CPU and Darwin triple. llvm-svn: 233198	2015-03-25 17:34:11 +00:00
Daniel Jasper	1c012293c8	Make exit-code test use same mechanism as existing one. The other version doesn't properly work with our internal test runner, which sets pipefail. llvm-svn: 233188	2015-03-25 14:35:40 +00:00
Rafael Espindola	f275ad8af1	Fix fixup evaluation when deciding what to relocate with. The previous logic was to first try without relocations at all and failing that stop on the first defined symbol. That was inefficient and incorrect in the case part of the expression could be simplified and another part could not (see included test). We now stop the evaluation when we get to a variable whose value can change (i.e. is weak). llvm-svn: 233187	2015-03-25 13:16:53 +00:00
Andrea Di Biagio	460948c9ab	[optnone] Skip pass Float2Int on optnone functions. Added test Float2Int/float2int-optnone.ll to verify that pass Float2Int is not run on optnone functions. llvm-svn: 233183	2015-03-25 12:22:37 +00:00
Lang Hames	9528bbaae0	[Orc][lli] Add a very simple Orc-based lazy JIT to lli. This ensures that we're building and testing the CompileOnDemand layer, at least in a basic way. Currently x86-64 only, and with limited to no library calls enabled (depending on host platform). Patches welcome. ;) To enable access to the lazy JIT, this patch replaces the '-use-orcmcjit' lli option with a new option: '-jit-kind={ mcjit \| orc-mcjit \| orc-lazy }'. All regression tests are updated to use the new option, and one trivial test of the new lazy JIT is added. llvm-svn: 233182	2015-03-25 12:11:48 +00:00
Andrea Di Biagio	07a26d6b2f	[X86] Simplify check lines in tests. No functional change. Also, removed unused check lines from test atomic6432.ll. llvm-svn: 233181	2015-03-25 11:44:19 +00:00
James Molloy	cb75d92458	Reapply r233062: "float2int": Add a new pass to demote from float to int where possible. Now with a fix for PR23008 and extra regression test. llvm-svn: 233175	2015-03-25 10:03:42 +00:00
Justin Bogner	35b4b1a4e1	test: Fix the dependencies for the check-llvm-* targets In r233009 we gained specific check-llvm-* build targets for invoking specific parts of the test suite, but they were copying the dependencies for check-all, rather than just listing the dependencies for check-llvm. This moves the creation of these targets next to the check-llvm target, and uses that target's configuration rather than the check-all config. llvm-svn: 233174	2015-03-25 08:07:47 +00:00
Duncan P. N. Exon Smith	419d523d62	Linker: Temporarily disable dwarfdump checks from r233164 At least one Linux bot [1] doesn't like my dwarfdump checks, so I've disable those until I can investigate what's going on there. I'll continue to track this in PR22792. [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/22863 llvm-svn: 233165	2015-03-25 02:43:04 +00:00
Duncan P. N. Exon Smith	004ced3b08	Linker: Drop function pointers for overridden subprograms Instead of dropping subprograms that have been overridden, just set their function pointers to `nullptr`. This is a minor adjustment to the stop-gap fix for PR21910 committed in r224487, and fixes the crasher from PR22792. The problem that r224487 put a band-aid on: how do we find the canonical subprogram for a `Function`? Since the backend currently relies on `DebugInfoFinder` (which does a naive in-order traversal of compile units and picks the first subprogram) for this, r224487 tried dropping non-canonical subprograms. Dropping subprograms fails because the backend also builds up a map from subprogram to compile unit (`DwarfDebug::SPMap`) based on the subprogram lists. A missing subprogram causes segfaults later when an inlined reference (such as in this testcase) is created. Instead, just drop the `Function` pointer to `nullptr`, which nicely mirrors what happens when an already-inlined `Function` is optimized out. We can't really be sure that it's the same definition anyway, as the testcase demonstrates. This still isn't completely satisfactory. Two flaws at least that I can think of: - I still haven't found a straightforward way to make this symmetric in the IR. (Interestingly, the DWARF output is already symmetric, and I've tested for that to be sure we don't regress.) - Using `DebugInfoFinder` to find the canonical subprogram for a function is kind of crazy. We should just attach metadata to the function, like this: define weak i32 @foo(i32, i32) !dbg !MDSubprogram(...) { llvm-svn: 233164	2015-03-25 02:26:32 +00:00
Rafael Espindola	dbb4021b64	Produce an error instead of asserting on invalid .sleb128/.uleb128. llvm-svn: 233155	2015-03-25 00:25:37 +00:00
Paul Robinson	284f0451cf	'optnone' should not disable DAG combiner. Reverts the code change from r221168 and the relevant test. It was a mistake to disable the combiner, and based on the ultimate definition of 'optnone' we shouldn't have considered the test case as failing in the first place. llvm-svn: 233153	2015-03-25 00:10:24 +00:00
Philip Reames	4dbd88f3b4	!invariant.load semantics with potentially clobbering calls A load from an invariant location is assumed to not alias any otherwise potentially aliasing stores. Our implementation only applied this rule to store instructions themselves whereas they it should apply for any memory accessing instruction. This results in both FRE and PRE becoming more effective at eliminating invariant loads. Note that as a follow on change I will likely move this into AliasAnalysis itself. That's where the TBAA constant flag is handled and the semantics are essentially the same. I'd like to separate the semantic change from the refactoring and thus have extended the hack that's already in MemoryDependenceAnalysis for this change. Differential Revision: http://reviews.llvm.org/D8591 llvm-svn: 233140	2015-03-24 23:54:54 +00:00
Rafael Espindola	c9e7068cdd	Don't be over eager in evaluating a subtraction with a weak symbol. In a subtraction of the form A - B, if B is weak, there is no way to represent that on ELF since all relocations add the value of a symbol. llvm-svn: 233139	2015-03-24 23:48:44 +00:00
Reid Kleckner	11470c48d0	X86: Fix frameescape when not using an FP We can't use TargetFrameLowering::getFrameIndexOffset directly, because Win64 really wants the offset from the stack pointer at the end of the prologue. Instead, use X86FrameLowering::getFrameIndexOffsetFromSP(), which is a pretty close approximiation of that. It fails to handle cases with interestingly large stack alignments, which is pretty uncommon on Win64 and is TODO. llvm-svn: 233137	2015-03-24 23:46:01 +00:00
Justin Bogner	96966c7bf5	Update a test I missed in r233132 llvm-svn: 233134	2015-03-24 23:44:03 +00:00
Justin Bogner	5a07bb8b5f	llvm-cov: Require a subcommand when invoked as llvm-cov A while ago llvm-cov gained support for clang's instrumentation based profiling in addition to its gcov support, and subcommands were added to choose which behaviour to use. When no subcommand was specified, we fell back to gcov compatibility with a warning that a subcommand would be required in the future. Now, we require the subcommand. Note that if the basename of llvm-cov is gcov (via symlink or hardlink, for example), we still use the gcov compatible behaviour with no subcommand required. llvm-svn: 233132	2015-03-24 23:34:36 +00:00
Frederic Riss	10e728a48c	[dsymutil] Temporarily disable some tests on windows. It seems one windows bot fails since I added ilne table linking to llvm-dsymutil (see r232333 commit thread). Disable the affected tests until I can figure out what's happening. llvm-svn: 233130	2015-03-24 23:11:07 +00:00
Sanjay Patel	e304bea010	optimize the AVX2 (integer) version of vperm2 into a shuffle ...because this is what happens when an instruction set puts its underwear on after its pants. This is an extension of r232852, r233100, and 233110: http://llvm.org/viewvc/llvm-project?view=revision&revision=232852 http://llvm.org/viewvc/llvm-project?view=revision&revision=233100 http://llvm.org/viewvc/llvm-project?view=revision&revision=233110 llvm-svn: 233127	2015-03-24 22:39:29 +00:00
Philip Reames	2b969d7010	Merge empty landing pads in SimplifyCFG This patch tries to merge duplicate landing pads when they branch to a common shared target. Given IR that looks like this: lpad1: %exn = landingpad {i8, i32} personality i32 (...) @__gxx_personality_v0 cleanup br label %shared_resume lpad2: %exn2 = landingpad {i8, i32} personality i32 (...) @__gxx_personality_v0 cleanup br label %shared_resume shared_resume: call void @fn() ret void } We can rewrite the users of both landing pad blocks to use one of them. This will generally allow the shared_resume block to be merged with the common landing pad as well. Without this change, tail duplication would likely kick in - creating N (2 in this case) copies of the shared_resume basic block. Differential Revision: http://reviews.llvm.org/D8297 llvm-svn: 233125	2015-03-24 22:28:45 +00:00
Rafael Espindola	14ebbc6d99	Add -m -m elf_x86_64 to gold invocations. Otherwise the tests would fail if the default was not elf_x86_64. This fixes PR22966. Patch by H.J. Lu! llvm-svn: 233124	2015-03-24 22:20:19 +00:00
David Blaikie	1a6bb9fcf6	Revert "Remove an InstCombine that seems to have become redundant." Assertion fires in compiler-rt. Guess it does fire.. This reverts commit r233116. llvm-svn: 233121	2015-03-24 21:50:35 +00:00
Rafael Espindola	8b4817b5f7	Reset the CFA offset at the start of every FDE. This fixes PR21515. llvm-svn: 233120	2015-03-24 21:47:31 +00:00
Peter Collingbourne	2f69d0fe1d	MC: Add more stringent symbol checking to test. llvm-svn: 233118	2015-03-24 21:47:00 +00:00
David Blaikie	e37e10dc57	Remove an InstCombine that seems to have become redundant. Assert that this doesn't fire - I'll remove all of this later, but just leaving it in for a while in case this is firing & we just don't have test coverage. llvm-svn: 233116	2015-03-24 21:31:31 +00:00
Sanjay Patel	43a87fdc79	[X86, AVX] instcombine vperm2 intrinsics with zero inputs into shuffles This is the IR optimizer follow-on patch for D8563: the x86 backend patch that converts this kind of shuffle back into a vperm2. This is also a continuation of the transform that started in D8486. In that patch, Andrea suggested that we could convert vperm2 intrinsics that use zero masks into a single shuffle. This is an implementation of that suggestion. Differential Revision: http://reviews.llvm.org/D8567 llvm-svn: 233110	2015-03-24 20:36:42 +00:00
Rafael Espindola	8448a24380	[llvm-readobj] add support for macho universal binary. Patch by Keyue Hu (Chilledheart)! llvm-svn: 233107	2015-03-24 20:26:55 +00:00
Hans Wennborg	e42c64551a	Revert r233062 ""float2int": Add a new pass to demote from float to int where possible." This caused PR23008, compiles failing with: "Use still stuck around after Def is destroyed: %.sroa.speculated" Also reverting follow-up r233064. llvm-svn: 233105	2015-03-24 20:07:08 +00:00
Sanjoy Das	337d46b36f	[IRCE] Fix a regression introduced in r232444. IRCE should not try to eliminate range checks that check an induction variable against a loop-varying length. llvm-svn: 233101	2015-03-24 19:29:18 +00:00
Sanjay Patel	99d246d7d7	[X86, AVX] recognize shufflevector with zero input as a vperm2 (PR22984) vperm2x128 instructions have the special ability (aka free hardware capability) to shuffle zero values into a vector. This patch recognizes that type of shuffle and generates the appropriate control byte. https://llvm.org/bugs/show_bug.cgi?id=22984 Differential Revision: http://reviews.llvm.org/D8563 llvm-svn: 233100	2015-03-24 19:19:07 +00:00
Duncan P. N. Exon Smith	fc25da101c	Verifier: Start recursing into !dbg attachments The main verifier already recurses through the other entry points, so we might as well descend here too. This temporarily duplicates some work already done in `verifyDebugInfo()`, but eventually I'll be removing the other side. llvm-svn: 233095	2015-03-24 17:32:19 +00:00
Daniel Sanders	c676f2a8bb	[mips] Support 16-bit offsets for 'm' inline assembly memory constraint. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8435 llvm-svn: 233086	2015-03-24 15:19:14 +00:00
Marek Olsak	949f5dab95	R600/SI: Select V_BFE_U32 for and+shift with a non-literal offset llvm-svn: 233079	2015-03-24 13:40:34 +00:00
Marek Olsak	9b72868d17	R600/SI: Custom-select 32-bit S_BFE from bitwise opcodes llvm-svn: 233078	2015-03-24 13:40:27 +00:00
Marek Olsak	63a7b084eb	R600/SI: Improve BFM support llvm-svn: 233077	2015-03-24 13:40:21 +00:00
Marek Olsak	7d77728c97	R600/SI: Use V_FRACT_F64 for faster 64-bit floor on SI Other f64 opcodes not supported on SI can be lowered in a similar way. v2: use complex VOP3 patterns llvm-svn: 233076	2015-03-24 13:40:15 +00:00
Marek Olsak	43650e45c3	R600/SI: Expand fract to floor, then only select V_FRACT on CI V_FRACT is buggy on SI. R600-specific code is left intact. v2: drop the multiclass, use complex VOP3 patterns llvm-svn: 233075	2015-03-24 13:40:08 +00:00
Daniel Sanders	a73d8fe2ad	[mips] Distinguish 'R', 'ZC', and 'm' inline assembly memory constraint. Summary: Previous behaviour of 'R' and 'm' has been preserved for now. They will be improved in subsequent commits. The offset permitted by ZC varies according to the subtarget since it is intended to match the restrictions of the pref, ll, and sc instructions. The restrictions on these instructions are: * For microMIPS: 12-bit signed offset. * For Mips32r6/Mips64r6: 9-bit signed offset. * Otherwise: 16-bit signed offset. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8414 llvm-svn: 233063	2015-03-24 11:26:34 +00:00
James Molloy	408df5160c	"float2int": Add a new pass to demote from float to int where possible. It is possible to have code that converts from integer to float, performs operations then converts back, and the result is provably the same as if integers were used. This can come from different sources, but the most obvious is a helper function that uses floats but the arguments given at an inlined callsites are integers. This pass considers all integers requiring a bitwidth less than or equal to the bitwidth of the mantissa of a floating point type (23 for floats, 52 for doubles) as exactly representable in floating point. To reduce the risk of harming efficient code, the pass only attempts to perform complete removal of inttofp/fptoint operations, not just move them around. llvm-svn: 233062	2015-03-24 11:15:23 +00:00
Simon Pilgrim	481f4146cd	[SelectionDAG] Fixed issue with uitofp vector constant folding being treated as sitofp While the uitofp scalar constant folding treats an integer as an unsigned value (from lang ref): %X = sitofp i8 -1 to double ; yields double:-1.0 %Y = uitofp i8 -1 to double ; yields double:255.0 The vector constant folding was always using sitofp: %X = sitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double -1.0, double -1.0> %Y = uitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double -1.0, double -1.0> This patch fixes this so that the correct opcode is used for sitofp and uitofp. %X = sitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double -1.0, double -1.0> %Y = uitofp <2 x i8> <i8 -1, i8 -1> to <2 x double> ; yields <double 255.0, double 255.0> Differential Revision: http://reviews.llvm.org/D8560 llvm-svn: 233033	2015-03-23 22:44:55 +00:00
Duncan P. N. Exon Smith	9b9cc2dad4	DebugInfo: Overload get() in DIDescriptor subclasses Continue to simplify the `DIDescriptor` subclasses, so that they behave more like raw pointers. Remove `getRaw()`, replace it with an overloaded `get()`, and overload the arrow and cast operators. Two testcases started to crash on the arrow operators with this change because of `scope:` references that weren't real scopes. I fixed them. Soon I'll add verifier checks for them too. This also adds explicit dereference operators. Previously, the builtin dereference against `operator MDNode *()` would have worked, but now the builtins are ambiguous. llvm-svn: 233030	2015-03-23 21:54:07 +00:00
Rafael Espindola	f2b408c64e	Refactor how passes get a symbol at the end of a section. There is now a canonical symbol at the end of a section that different passes can request. This also allows us to assert that we don't switch back to a section whose end symbol has already been printed. llvm-svn: 233026	2015-03-23 21:22:04 +00:00
Ahmed Bougacha	d1655cb1c0	[AArch64, ARM] Enable GlobalMerge with -O3 rather than -O1. The pass used to be enabled by default with CodeGenOpt::Less (-O1). This is too aggressive, considering the pass indiscriminately merges all globals together. Currently, performance doesn't always improve, and, on code that uses few globals (e.g., the odd file- or function- static), more often than not is degraded by the optimization. Lengthy discussion can be found on llvmdev (AArch64-focused; ARM has similar problems): http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-February/082800.html Also, it makes tooling and debuggers less useful when dealing with globals and data sections. GlobalMerge needs to better identify those cases that benefit, and this will be done separately. In the meantime, move the pass to run with -O3 rather than -O1, on both ARM and AArch64. llvm-svn: 233024	2015-03-23 21:17:36 +00:00
Chad Rosier	384ade9b11	[AArch64] Add FileCheck that was missing from test in r232967. llvm-svn: 233013	2015-03-23 20:25:15 +00:00
Matt Arsenault	f5b2cd891a	R600/SI: Allow commuting compares This enables very common cases to switch to the smaller encoding. All of the standard LLVM canonicalizations of comparisons are the opposite of what we want. Compares with constants are moved to the RHS, but the first operand can be an inline immediate, literal constant, or SGPR using the 32-bit VOPC encoding. There are additional bad canonicalizations that should also be fixed, such as canonicalizing ge x, k to gt x, (k + 1) if this makes k no longer an inline immediate value. llvm-svn: 232988	2015-03-23 18:45:30 +00:00
Chad Rosier	affe181b39	[AArch64] Enable rematerialization of float 0 values. Patch by Geoff Berry<gberry@codeaurora.org>. llvm-svn: 232967	2015-03-23 17:19:34 +00:00
Bradley Smith	ae0ad9c95d	Revert "[ARM] Add more pattern matching for f16 <-> f64 conversions" This change is incorrect since it converts double rounding into single rounding, which can produce different results. Instead this optimization will be done by modifying Clang's codegen to not produce double rounding in the first place. This reverts commit r232954. llvm-svn: 232962	2015-03-23 16:52:52 +00:00
Tom Stellard	f0a575f6be	R600/SI: Fix crash in SIInstrInfo::areLoadsFromSameBasePtr() This function assumed that SMRD instructions always have immediate offsets, which is not always the case. llvm-svn: 232957	2015-03-23 16:06:01 +00:00
Bradley Smith	bc0f0d8c49	[ARM] Add more pattern matching for f16 <-> f64 conversions Specifically when the conversion is done in two steps, f16 -> f32 -> f64. For example: %1 = tail call float @llvm.convert.from.fp16.f32(i16 %0) %conv = fpext float %1 to double to: vcvtb.f64.f16 llvm-svn: 232954	2015-03-23 15:59:54 +00:00
Petar Jovanovic	5b4362276b	Fix sign extension for MIPS64 in makeLibCall function Fixing sign extension in makeLibCall for MIPS64. In MIPS64 architecture all 32 bit arguments (int, unsigned int, float 32 (soft float)) must be sign extended. This fixes test "MultiSource/Applications/oggenc/". Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D7791 llvm-svn: 232943	2015-03-23 12:28:13 +00:00
Hal Finkel	8f7c5a7f18	[SDAG] Don't widen VSETCC during type legalization for split operands Because the operands of a vector SETCC node can be of a different type from the result (and often are), it can happen that even if we'd prefer to widen the result type of the SETCC, the operands have been split instead. In this case, the SETCC result also must be split. This mirrors what is done in WidenVecRes_SELECT, and should be NFC elsewhere because if the operands are not widened the following calls to GetWidenedVector will assert (which is what was happening in the test case). llvm-svn: 232935	2015-03-23 08:22:43 +00:00
Lang Hames	1565992679	[Orc] Add missing -use-orcmcjit flag to a number of Orc regression tests. llvm-svn: 232931	2015-03-23 06:02:49 +00:00
Duncan P. N. Exon Smith	03c37c9099	Prevent CHECK-NOTs from matching file paths A build directory with a name like `build-Werror` would hit a false positive on these `CHECK-NOT`s before, since the actual error line looks like: .../build-Werror/bin/llvm-as <stdin>:1:2: error: ... Switch to using: CHECK-NOT: error: (note the trailing semi-colon) to avoid matching almost any file path. llvm-svn: 232917	2015-03-22 15:58:21 +00:00
Benjamin Kramer	d6aa0ec737	[SimplifyLibCalls] Fix negative shifts being produced by the memchr -> bitfield transform. llvm-svn: 232903	2015-03-21 22:04:26 +00:00
Benjamin Kramer	7857d723f1	[SimplifyLibCalls] Turn memchr(const, C, const) into a bitfield check. strchr("123!", C) != nullptr is a common pattern to check if C is one of 1, 2, 3 or !. If the largest element of the string is smaller than the target's register size we can easily create a bitfield and just do a simple test for set membership. int foo(char C) { return strchr("123!", C) != nullptr; } now becomes cmpl $64, %edi ## range check sbbb %al, %al movabsq $0xE000200000001, %rcx btq %rdi, %rcx ## bit test sbbb %cl, %cl andb %al, %cl ## and the two conditions andb $1, %cl movzbl %cl, %eax ## returning an int ret (imho the backend should expand this into a series of branches, but that's a different story) The code is currently limited to bit fields that fit in a register, so usually 64 or 32 bits. Sadly, this misses anything using alpha chars or {}. This could be fixed by just emitting a i128 bit field, but that can generate really ugly code so we have to find a better way. To some degree this is also recreating switch lowering logic, but we can't simply emit a switch instruction and thus change the CFG within instcombine. llvm-svn: 232902	2015-03-21 21:09:33 +00:00
Matt Arsenault	da5ece8e35	R600: Cleanup test with multiple check prefixes llvm-svn: 232901	2015-03-21 19:15:46 +00:00
Benjamin Kramer	691363e7f2	SimplifyLibCalls: Add basic optimization of memchr calls. This is just memchr(x, y, 0) -> nullptr and constant folding. llvm-svn: 232896	2015-03-21 15:36:21 +00:00
Simon Pilgrim	307cb8fe5d	Tidied up vec_zero_cse.ll test. NFCI. Added target triple and refactored the CHECKs to be per function. llvm-svn: 232894	2015-03-21 14:05:12 +00:00
David Majnemer	e165502ed7	MemoryDependenceAnalysis: Don't miscompile atomics r216771 introduced a change to MemoryDependenceAnalysis that allowed it to reason about acquire/release operations. However, this change does not ensure that the acquire/release operations pair. Unfortunately, this leads to miscompiles as we won't see an acquire load as properly memory effecting. This largely reverts r216771. This fixes PR22708. llvm-svn: 232889	2015-03-21 06:19:17 +00:00
Tim Northover	000f994633	AArch64: simplify test case llvm-svn: 232886	2015-03-21 04:37:08 +00:00
Eric Christopher	faad620569	Remove the bare getSubtargetImpl call from the AArch64 port. As part of this add a test that shows we can generate code for functions that specifically enable a subtarget feature. llvm-svn: 232884	2015-03-21 04:04:50 +00:00
Eric Christopher	83eb13c967	Remove the bare getSubtargetImpl call from the PPC port. As part of this add a test that shows we can generate code with for functions that differ by subtarget feature. llvm-svn: 232882	2015-03-21 03:36:02 +00:00
Eric Christopher	c5a85af3b2	Cache the Function dependent subtarget on the MachineFunction. As preparation for removing the getSubtargetImpl() call from TargetMachine go ahead and flip the switch on caching the function dependent subtarget and remove the bare getSubtargetImpl call from the X86 port. As part of this add a few tests that show we can generate code and assemble on X86 based on features/cpu on the Function. llvm-svn: 232879	2015-03-21 03:13:10 +00:00
Kostya Serebryany	f4e35cc47d	[sanitizer] experimental tracing for cmp instructions llvm-svn: 232873	2015-03-21 01:29:36 +00:00
Ahmed Bougacha	7173b669b4	[CodeGen][IfCvt] Don't re-ifcvt blocks with unanalyzable terminators. If we couldn't analyze its terminator (i.e., it's an indirectbr, or some other weirdness), we can't safely re-if-convert a predicated block, because we can't tell whether the predicated terminator can fallthrough (it does). Currently, we would completely ignore the fallthrough successor. In the added testcase, this means we used to generate: ... @ %entry: cmp r5, #21 ittt ne @ %cc1f: cmpne r7, #42 @ %cc2t: strne.w r5, [r8] movne pc, r10 @ %cc1t: ... Whereas the successor of %cc1f was originally %bb1. With the fix, we get the correct: ... @ %entry: cmp r5, #21 itt eq @ %cc1t: streq.w r5, [r11] moveq pc, r0 @ %cc1f: cmp r7, #42 itt ne @ %cc2t: strne.w r5, [r8] movne pc, r10 @ %bb1: ... rdar://20192768 Differential Revision: http://reviews.llvm.org/D8509 llvm-svn: 232872	2015-03-21 01:23:15 +00:00
Ahmed Bougacha	e6bb09ac3f	[AArch64] Prefer UZP for concat_vector of illegal truncs. Follow-up to r232459: prefer a UZP shuffle to the intermediate truncs. llvm-svn: 232871	2015-03-21 01:08:39 +00:00
Yunzhong Gao	f44c06b96f	Tell lit.cfg about more Windows triples. For example, the host triple on my 64-bit PC is x86_64-pc-windows-msvc. llvm-svn: 232854	2015-03-20 22:08:40 +00:00
Sanjay Patel	ccf5f24b7b	[X86, AVX] instcombine common cases of vperm2* intrinsics into shuffles vperm2* intrinsics are just shuffles. In a few special cases, they're not even shuffles. Optimizing intrinsics in InstCombine is better than handling this in the front-end for at least two reasons: 1. Optimizing custom-written SSE intrinsic code at -O0 makes vector coders really angry (and so I have regrets about some patches from last week). 2. Doing mask conversion logic in header files is hard to write and subsequently read. There are a couple of TODOs in this patch to complete this optimization. Differential Revision: http://reviews.llvm.org/D8486 llvm-svn: 232852	2015-03-20 21:47:56 +00:00
Andrew Kaylor	3170e5620e	Fixing a bug with WinEH PHI handling llvm-svn: 232851	2015-03-20 21:42:54 +00:00
Sanjay Patel	c88f724fed	[X86] Prefer blendps over insertps codegen for one special case With this patch, for this one exact case, we'll generate: blendps %xmm0, %xmm1, $1 instead of: insertps %xmm0, %xmm1, $0 If there's a memory operand available for load folding and we're optimizing for size, we'll still generate the insertps. The detailed performance data motivation for this may be found in D7866; in summary, blendps has 2-3x throughput vs. insertps on widely used chips. Differential Revision: http://reviews.llvm.org/D8332 llvm-svn: 232850	2015-03-20 21:19:52 +00:00
Rafael Espindola	36a15cb975	Don't declare all text sections at the start of the .s The code this patch removes was there to make sure the text sections went before the dwarf sections. That is necessary because MachO uses offsets relative to the start of the file, so adding a section can change relaxations. The dwarf sections were being printed at the start just to produce symbols pointing at the start of those sections. The underlying issue was fixed in r231898. The dwarf sections are now printed when they are about to be used, which is after we printed the text sections. To make sure we don't regress, the patch makes the MachO streamer assert if CodeGen puts anything unexpected after the DWARF sections. llvm-svn: 232842	2015-03-20 20:00:01 +00:00
Duncan P. N. Exon Smith	1de3dc5731	Bugpoint: Fix invalid 'inlinedAt:' references in testcase These are causing crashes in `DebugInfoFinder` after a WIP patch to increase strictness of `DIDescriptor` accessors. llvm-svn: 232839	2015-03-20 19:51:34 +00:00
Rafael Espindola	bdfbde56e0	Reorganize the x86 ELF relocation selection logic. The main differences are: * Split in 32 and 64 bit functions. * First switch on the Modifier so that we have only one non fully covered switch. * Map the fixup kind first to a x86_64 (or i386) specific enum, to make it easy to handle cases like X86::reloc_riprel_4byte_movq_load. * Switch on IsPCRel last, which reduces code duplication. Fixes pr22308. llvm-svn: 232837	2015-03-20 19:48:54 +00:00
Duncan P. N. Exon Smith	a3bdc328a5	Verifier: Check that !dbg attachments have the right type A WIP patch makes `DIDescriptor` accessors more strict, which in turn causes the `DebugInfoFinder` to crash on wrongly typed `!dbg` attachments. Catch that error up front in `Verifier::visitInstruction()`. Also remove a test that we "handle" invalid `!dbg` attachments, added back in r99938. We don't want to handle those anymore. Note: I'm not recursing and verifying the debug info graph reachable from this node; that work is already done by `verifyDebugInfo()`. llvm-svn: 232834	2015-03-20 19:26:58 +00:00
Duncan P. N. Exon Smith	541133b79d	Rewrite test/Feature/md_on_instruction.ll This test is supposed to be testing whether metadata attachments to instructions work, but it was using invalid debug info to do so. (This was causing assertion failures in the `DebugInfoFinder` with a WIP patch to be more strict about `DIDescriptor` accessors.) Rather than fix the debug info -- which is better tested elsewhere -- just test the IR feature directly. llvm-svn: 232828	2015-03-20 18:34:53 +00:00
Wei Mi	6c428d6ff6	Correctly estimate SROA savings for store operands in inline cost analysis. When estimating SROA savings, we want to see if an address is derived off an alloca in the caller. For store instructions, operand 1 is the address operand, but the current code uses operand 0. Use getPointerOperand for loads and stores to fix this. Patch by Easwaran Raman. http://reviews.llvm.org/D8425 llvm-svn: 232827	2015-03-20 18:33:12 +00:00
John Brawn	1f26a47630	[ARM] Fix handling of thumb1 out-of-range frame offsets LocalStackSlotPass assumes that isFrameOffsetLegal doesn't change its answer when the base register changes. Unfortunately this isn't true in thumb1, where SP-based loads allow a larger offset than non-SP-based loads, and this causes the base register reuse code to generate instructions that are unencodable, causing an assertion failure. Solve this by adding a BaseReg parameter to isFrameOffsetLegal, which ARMBaseRegisterInfo can then make use of to give the correct answer. Differential Revision: http://reviews.llvm.org/D8419 llvm-svn: 232825	2015-03-20 17:20:07 +00:00
Daniel Jasper	214997c63b	[MBP] Don't outline short optional branches With the option -outline-optional-branches, LLVM will place optional branches out of line (more details on r231230). With this patch, this is not done for short optional branches. A short optional branch is a branch containing a single block with an instruction count below a certain threshold (defaulting to 3). Still everything is guarded under -outline-optional-branches). Outlining a short branch can't significantly improve code locality. It can however decrease performance because of the additional jmp and in cases where the optional branch is hot. This fixes a compile time regression I have observed in a benchmark. Review: http://reviews.llvm.org/D8108 llvm-svn: 232802	2015-03-20 10:00:37 +00:00
Tom Stellard	8d19f9b681	R600/SI: Add missing CHECK-LABEL lines to a test llvm-svn: 232797	2015-03-20 03:12:42 +00:00
Nick Lewycky	be8af48824	When simplifying a SCEV truncate by distributing, consider it a simplification to replace a cast, even if we end up with a trunc around the term. Fixes PR22960! llvm-svn: 232794	2015-03-20 02:25:00 +00:00
Peter Collingbourne	bbdc26ba1b	test: Make a start on a test suite for libLTO. This works in a similar way to the gold plugin tests. We search for a compatible linker on $PATH and use it to run tests against our just-built libLTO. To start with, test the just added opt level functionality. Differential Revision: http://reviews.llvm.org/D8472 llvm-svn: 232785	2015-03-19 23:55:38 +00:00
Owen Anderson	db4201235b	Fix a nasty bug in DAGCombine of STORE nodes. This is very related to the bug fixed in r174431. The problem is that SelectionDAG does not include alignment in the uniquing of loads and stores. When an otherwise no-op DAGCombine would increase the alignment of a load or store, the original node would be returned (with the alignment increased), which would cause the node not to be processed by any further DAGCombines. I don't have a direct testcase for this that manifests on an in-tree target, but I did see some noise in the tests for other targets and have updated them for it. llvm-svn: 232780	2015-03-19 22:48:57 +00:00
Reid Kleckner	c759fe90bc	WinEH: Make llvm.eh.actions emission match the EH docs This switches the sense of the i32 values and updates the test cases. We can also use CHECK-SAME to clean up some tests, and reduce the visual noise from bitcasts. llvm-svn: 232774	2015-03-19 22:31:02 +00:00
Sanjay Patel	d5c2d287f9	[X86, AVX] use blends instead of insert128 with index 0 Another case of x86-specific shuffle strength reduction: avoid generating insert*128 instructions with index 0 because they are slower than their non-lane-changing blend equivalents. Shuffle lowering already catches most of these cases, but the zero vector case and some other paths such as in the modified test in vector-shuffle-256-v32.ll were getting through. Differential Revision: http://reviews.llvm.org/D8366 llvm-svn: 232773	2015-03-19 22:29:40 +00:00
Peter Collingbourne	994ba3d29c	LowerBitSets: Avoid reusing byte set addresses. Each use of the byte array uses a different alias. This makes the backend less likely to reuse previously computed byte array addresses, improving the security of the CFI mechanism based on this pass. Differential Revision: http://reviews.llvm.org/D8455 llvm-svn: 232770	2015-03-19 22:02:10 +00:00
Peter Collingbourne	070843d60b	libLTO, llvm-lto, gold: Introduce flag for controlling optimization level. This change also introduces a link-time optimization level of 1. This optimization level runs only the globaldce pass as well as cleanup passes for passes that run at -O0, specifically simplifycfg which cleans up lowerbitsets. http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20150316/266951.html llvm-svn: 232769	2015-03-19 22:01:00 +00:00
Krzysztof Parzyszek	02fd29452c	Unxfail test/CodeGen/Generic/vector.ll now passing on Hexagon llvm-svn: 232758	2015-03-19 20:22:17 +00:00
Peter Collingbourne	152b936683	gold: Make powerpc support optional for the tests. Differential Revision: http://reviews.llvm.org/D8400 llvm-svn: 232744	2015-03-19 18:23:31 +00:00
Artem Belevich	9e8a039318	Add support for __nvvm_reflect changes in libdevice in CUDA-7.0 Summary: CUDA 7.0's libdevice uses slightly different IR to call __nvvm_reflect and that triggers an assertion in nvvm_reflect optimization pass. This change allows nvvm_reflect pass to deal with both old and new ways to pass an argument to __nvvm_reflect. Test Plan: ninja check-all Reviewers: eliben, echristo Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D8399 llvm-svn: 232732	2015-03-19 17:05:35 +00:00
Krzysztof Parzyszek	421133470f	[Hexagon] Add support for vector instructions llvm-svn: 232728	2015-03-19 16:33:08 +00:00
Daniel Jasper	5add63f21e	[InstCombine] Don't fold a GEP into itself through a PHI node This can only occur (I think) through the back-edge of the loop. However, folding a GEP into itself means that the value of the previous iteration needs to be stored in the meantime, thus requiring an additional register variable to be live, but not actually achieving anything (the gep still needs to be executed once per loop iteration). The attached test case is derived from: typedef unsigned uint32; typedef unsigned char uint8; inline uint8 f(uint32 value, uint8 target) { while (value >= 0x80) { value >>= 7; ++target; } ++target; return target; } uint8 g(uint32 b, uint8 target) { target = f(b, f(42, target)); return target; } What happens is that the GEP stored in incptr2 is folded into itself through the loop's back-edge and the phi-node stored in loopptr, effectively incrementing the ptr by "2" in each iteration instead of "1". In this case, it is actually increasing the number of GEPs required as the GEP before the loop can't be folded away anymore. For comparison: With this patch: define i8* @test4(i32 %value, i8* %buffer) { entry: %cmp = icmp ugt i32 %value, 127 br i1 %cmp, label %loop.header, label %exit loop.header: ; preds = %entry br label %loop.body loop.body: ; preds = %loop.body, %loop.header %buffer.pn = phi i8* [ %buffer, %loop.header ], [ %loopptr, %loop.body ] %newval = phi i32 [ %value, %loop.header ], [ %shr, %loop.body ] %loopptr = getelementptr inbounds i8, i8* %buffer.pn, i64 1 %shr = lshr i32 %newval, 7 %cmp2 = icmp ugt i32 %newval, 16383 br i1 %cmp2, label %loop.body, label %loop.exit loop.exit: ; preds = %loop.body br label %exit exit: ; preds = %loop.exit, %entry %0 = phi i8* [ %loopptr, %loop.exit ], [ %buffer, %entry ] %incptr3 = getelementptr inbounds i8, i8* %0, i64 2 ret i8* %incptr3 } Without this patch: define i8* @test4(i32 %value, i8* %buffer) { entry: %incptr = getelementptr inbounds i8, i8* %buffer, i64 1 %cmp = icmp ugt i32 %value, 127 br i1 %cmp, label %loop.header, label %exit loop.header: ; preds = %entry br label %loop.body loop.body: ; preds = %loop.body, %loop.header %0 = phi i8* [ %buffer, %loop.header ], [ %loopptr, %loop.body ] %loopptr = phi i8* [ %incptr, %loop.header ], [ %incptr2, %loop.body ] %newval = phi i32 [ %value, %loop.header ], [ %shr, %loop.body ] %shr = lshr i32 %newval, 7 %incptr2 = getelementptr inbounds i8, i8* %0, i64 2 %cmp2 = icmp ugt i32 %newval, 16383 br i1 %cmp2, label %loop.body, label %loop.exit loop.exit: ; preds = %loop.body br label %exit exit: ; preds = %loop.exit, %entry %ptr2 = phi i8* [ %incptr2, %loop.exit ], [ %incptr, %entry ] %incptr3 = getelementptr inbounds i8, i8* %ptr2, i64 1 ret i8* %incptr3 } Review: http://reviews.llvm.org/D8245 llvm-svn: 232718	2015-03-19 11:05:08 +00:00
Rafael Espindola	7f45440bd9	Note that we don't support COFF on PPC. Should bring back the windows bots. llvm-svn: 232701	2015-03-19 02:40:56 +00:00
Justin Bogner	cfb53e49b6	llvm-cov: Only emit colour by default if the output is a tty This replaces the -no-color flag with a -color={auto\|always\|never} option, with auto as the default, which is much saner. llvm-svn: 232693	2015-03-19 00:02:23 +00:00
Simon Pilgrim	cf1d7df2e3	Fixed failing test due to missing target triple causing different results on different buildbots. llvm-svn: 232685	2015-03-18 22:51:45 +00:00
Rafael Espindola	242548906d	Teach getDefaultFormat that we only support ELF on some architectures. This should bring the windows bots back. It is a bit ugly, but it is better than what we had before: The triple would say that the object format was COFF, but llc/llvm-mc would produce an ELF. llvm-svn: 232683	2015-03-18 22:19:16 +00:00
Simon Pilgrim	5ec5c9cafe	[X86][SSE] Avoid scalarization of v2i64 vector shifts (REAPPLIED) Fixed broken tests. Differential Revision: http://reviews.llvm.org/D8416 llvm-svn: 232682	2015-03-18 22:18:51 +00:00
Eric Christopher	050f590a0c	Revert "[X86][SSE] Avoid scalarization of v2i64 vector shifts" as it appears to have broken tests/bots. This reverts commit r232660. llvm-svn: 232670	2015-03-18 21:01:00 +00:00
Reid Kleckner	0f9e27a371	Use WinEHPrepare to outline SEH finally blocks No outlining is necessary for SEH catch blocks. Use the blockaddr of the handler in place of the usual outlined function. Reviewers: majnemer, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D8370 llvm-svn: 232664	2015-03-18 20:26:53 +00:00
Simon Pilgrim	5c837edc2a	[X86][SSE] Avoid scalarization of v2i64 vector shifts Currently v2i64 vectors shifts (non-equal shift amounts) are scalarized, costing 4 x extract, 2 x x86-shifts and 2 x insert instructions - and it gets even more awkward on 32-bit targets. This patch separately shifts the vector by both shift amounts and then shuffles the partial results back together, costing 2 x shuffles and 2 x sse-shifts instructions (+ 2 movs on pre-AVX hardware). Note - this patch only improves the SHL / LSHR logical shifts as only these are supported in SSE hardware. Differential Revision: http://reviews.llvm.org/D8416 llvm-svn: 232660	2015-03-18 19:35:31 +00:00
Matthias Braun	3b36533112	TableGen: Fix register class lane masks being too conservative. When calculating the lanemask of a register class we have to include the masks of subregisters supported by any of the class members, not just the ones supported by all class members. This fixes problems when coalescing towards a subclass with additional subregisters available. The attached testcase works fine as is, but does crash if you enable subregister liveness on x86 without this change applied. llvm-svn: 232652	2015-03-18 17:56:09 +00:00
Rafael Espindola	38438bae21	Handle X86::reloc_riprel_4byte in 32 bits mode. We can get there with .code64. Fixes pr22349. llvm-svn: 232651	2015-03-18 17:33:40 +00:00
Sanjay Patel	e4cdb8fcb3	Use utils/update_llc_test_checks.py to update all CHECKs The checks here were so vague that we could nuke intrinsics from existence and still pass the test because we'd match the function name. llvm-svn: 232647	2015-03-18 16:38:44 +00:00
Krzysztof Parzyszek	47ab1f2007	[Hexagon] Intrinsics for circular and bit-reversed loads and stores llvm-svn: 232645	2015-03-18 16:23:44 +00:00
Sanjay Patel	e90d0387d9	fixed to test features, not CPU model The 'vmovntdq' was only passing due to a fluke in SandyBridge codegen that splits 32-byte stores in half, but that meant that the test was not correctly checking for the 32-byte store that we thought we were generating. The lax checking in this file will be addressed in another commit. There are bigger problems here. llvm-svn: 232644	2015-03-18 16:07:10 +00:00
Krzysztof Parzyszek	78cc36fed7	[Hexagon] Handle ENDLOOP0 in InsertBranch and RemoveBranch llvm-svn: 232643	2015-03-18 15:56:43 +00:00
Sid Manning	51c3560c48	Add support for .ifnes psuedo-op. llvm-svn: 232636	2015-03-18 14:20:54 +00:00
Daniel Jasper	9ec834036e	Change test to accept an additional critical edge split. The two hot blocks are right next to each other and I verified that there is no performance regression by compressing/uncompressing some files with a minigzip built with the different options. llvm-svn: 232629	2015-03-18 12:45:45 +00:00
John Brawn	0dbcd65442	[ARM] Align stack objects passed to memory intrinsics Memcpy, and other memory intrinsics, typically tries to use LDM/STM if the source and target addresses are 4-byte aligned. In CodeGenPrepare look for calls to memory intrinsics and, if the object is on the stack, 4-byte align it if it's large enough that we expect that memcpy would want to use LDM/STM to copy it. Differential Revision: http://reviews.llvm.org/D7908 llvm-svn: 232627	2015-03-18 12:01:59 +00:00
John Brawn	2063cb6305	Add missing newline to end of test file. llvm-svn: 232626	2015-03-18 10:45:12 +00:00
Josh Magee	89f5dec0bb	Add testcases for BEXTR. These BEXTR cases are a check for the 64-bit load form and two negative cases where the bitrange is non-contiguous. From a private patch equivalent to r189742/PR17028. llvm-svn: 232580	2015-03-18 01:34:06 +00:00
Krzysztof Parzyszek	d5972bdaf8	Missed testcase for r232577 llvm-svn: 232578	2015-03-18 00:44:46 +00:00
Sanjoy Das	cb8bca1777	[SCEV] Make isImpliedCond smarter. Summary: This change teaches isImpliedCond to infer things like "X sgt 0" => "X - 1 sgt -1". The `ConstantRange` class has the logic to do the heavy lifting, this change simply gets ScalarEvolution to exploit that when reasonable. Depends on D8345 Reviewers: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8346 llvm-svn: 232576	2015-03-18 00:41:29 +00:00
David Majnemer	e48237df95	DAGCombiner: fold (xor (shl 1, x), -1) -> (rotl ~1, x) Targets which provide a rotate make it possible to replace a sequence of (XOR (SHL 1, x), -1) with (ROTL ~1, x). This saves an instruction on architectures like X86 and POWER(64). Differential Revision: http://reviews.llvm.org/D8350 llvm-svn: 232572	2015-03-18 00:03:36 +00:00
David Majnemer	7db449a6e7	COFF: Let globals with private linkage reside in their own section COFF COMDATs (for selection kinds other than 'select any') require at least one non-section symbol in the symbol table. Satisfy this by morally enhancing the linkage from private to internal. Differential Revision: http://reviews.llvm.org/D8394 llvm-svn: 232570	2015-03-17 23:54:51 +00:00
Pirama Arumuga Nainar	12aeefc63b	Fix bug while building FP16 constant vectors for AArch64 Summary: Building FP16 constant vectors caused the FP16 data to be bitcast to i64. This patch creates a BITCAST node with the correct value, and adds a test to verify correct handling. Reviewers: mcrosier Reviewed By: mcrosier Subscribers: mcrosier, jmolloy, ab, srhines, llvm-commits, rengolin, aemerson Differential Revision: http://reviews.llvm.org/D8369 llvm-svn: 232562	2015-03-17 23:10:29 +00:00
Kevin Enderby	8e29ec9ec2	Add the option -no-symbolic-operands to llvm-objdump used with -macho and -disassemble to not symbolic operands when disassembling. llvm-svn: 232558	2015-03-17 22:26:11 +00:00
Rafael Espindola	7fce7e62db	Emit the offset directly instead of creating a dummy expression. We were creating an expression of the form (S+C)-S which is just C. Patch by Frédéric Riss. I just added the testcase. llvm-svn: 232549	2015-03-17 21:30:21 +00:00
Kevin Enderby	ab5e6c9925	Add the option, -no-leading-addr llvm-objdump used with -macho and -disassemble or -section to not print the leading addresses on each line. llvm-svn: 232547	2015-03-17 21:07:39 +00:00
David Majnemer	63b1d99943	Revert "COFF: Let globals with private linkage reside in their own section" This reverts commit r232539. This was committed accidently. llvm-svn: 232543	2015-03-17 20:41:11 +00:00
David Majnemer	47e3842982	COFF: Let globals with private linkage reside in their own section Summary: COFF COMDATs (for selection kinds other than 'select any') require at least one non-section symbol in the symbol table. Satisfy this by morally enhancing the linkage from private to internal. Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8374 llvm-svn: 232539	2015-03-17 20:39:25 +00:00
Rafael Espindola	9ab09237dc	Centralize the handling of unique ids for temporary labels. Before this patch code wanting to create temporary labels for a given entity (function, cu, exception range, etc) had to keep its own counter to have stable symbol names. createTempSymbol would still add a suffix to make sure a new symbol was always returned, but it kept a single counter. Because of that, if we were to use just createTempSymbol("cu_begin"), the label could change from cu_begin42 to cu_begin43 because some other code started using temporary labels. Simplify this by just keeping one counter per prefix and removing the various specialized counters. llvm-svn: 232535	2015-03-17 20:07:06 +00:00
Michael Zolotukhin	6d8a2aa976	TLI: Add addVectorizableFunctionsFromVecLib. Also, add several entries to vectorizable functions table, and corresponding tests. The table isn't complete, it'll be populated later. Review: http://reviews.llvm.org/D8131 llvm-svn: 232531	2015-03-17 19:50:55 +00:00
Michael Zolotukhin	c3d60efb1d	TTI: Honour cost model for estimating cost of vector-intrinsic and calls. Review: http://reviews.llvm.org/D8096 llvm-svn: 232528	2015-03-17 19:37:28 +00:00
Richard Barton	30934c0926	[ARM] Fix offset calculation in ARMBaseRegisterInfo::needsFrameBaseReg The input offset to needsFrameBaseReg is a negative value below the top of the stack frame, but when converting to a positive offset from the bottom of the stack frame this value was negated, causing the final offset to be too large by twice the input offset's magnitude. Fix that by not negating the offset. Patch by John Brawn Differential Revision: http://reviews.llvm.org/D8316 llvm-svn: 232513	2015-03-17 18:20:47 +00:00
Michael Liao	24fcae8fa0	[SwitchLowering] Remove incoming values in the reverse order - To prevent invalidating successive indices. llvm-svn: 232510	2015-03-17 18:03:10 +00:00
Kevin Enderby	6a22175d59	Add the option, -dis-symname to llvm-objdump used with -macho and -disassemble to disassemble just one symbol’s instructions. llvm-svn: 232503	2015-03-17 17:10:57 +00:00
Dmitry Vyukov	618d580ec9	asan: optimization experiments The experiments can be used to evaluate potential optimizations that remove instrumentation (assess false negatives). Instead of completely removing some instrumentation, you set Exp to a non-zero value (mask of optimization experiments that want to remove instrumentation of this instruction). If Exp is non-zero, this pass will emit special calls into runtime (e.g. __asan_report_exp_load1 instead of __asan_report_load1). These calls make runtime terminate the program in a special way (with a different exit status). Then you run the new compiler on a buggy corpus, collect the special terminations (ideally, you don't see them at all -- no false negatives) and make the decision on the optimization. The exact reaction to experiments in runtime is not implemented in this patch. It will be defined and implemented in a subsequent patch. http://reviews.llvm.org/D8198 llvm-svn: 232502	2015-03-17 16:59:19 +00:00
Samuel Antao	f68156015d	Fix R0 use in PowerPC VSX store for FastIsel. The VSX stores are sometimes generated with a undefined index register, causing %noreg to be used and R0 to be emitted later on. The semantics of the VSX store (e.g. stdsdx) requires R0 to be used as base if we want zero to be used in the computation of the effective address instead of the content of R0. This patch checks if no index register was generated and forces R0 to be used as base address. llvm-svn: 232486	2015-03-17 15:00:57 +00:00
Rafael Espindola	ccfbbd5596	Use createTempSymbol to avoid collisions instead of an ad hoc method. llvm-svn: 232483	2015-03-17 14:50:32 +00:00
Rafael Espindola	ec8da3de01	Call EmitFunctionHeader just before EmitFunctionBody. This avoids switching to .AMDGPU.config and back and hardcoding the section it switches back to. llvm-svn: 232479	2015-03-17 14:34:42 +00:00
Rafael Espindola	dc4263c760	Move the EH symbol to the asm printer and use it for the SJLJ case too. llvm-svn: 232475	2015-03-17 13:57:48 +00:00
Toma Tabacu	dcebf5b901	[mips] [IAS] Add support for the XOR $reg,imm pseudo-instruction. Summary: This adds a MipsInstAlias which expands to XORi $reg,$reg,imm. For example, "xor $6, 0x3A" should be expanded to "xori $6, $6, 58". This should work for all MIPS ISAs. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D8284 llvm-svn: 232473	2015-03-17 13:17:44 +00:00
Rafael Espindola	ba41539548	Replace a use of GetTempSymbol with createTempSymbol. This is cleaner and avoids a crash in a corner case. llvm-svn: 232471	2015-03-17 12:54:04 +00:00
Renato Golin	1235060734	[ARM] Add support for ARMV6K subtarget (LLVM) ARMv6K is another layer between ARMV6 and ARMV6T2. This is the LLVM side of the changes. ARMV6 family LLVM implementation. +-------------------------------------+ \| ARMV6 \| +----------------+--------------------+ \| ARMV6M (thumb) \| ARMV6K (arm,thumb) \| <- From ARMV6K and ARMV6M processors +----------------+--------------------+ have support for hint instructions \| ARMV6T2 (arm,thumb,thumb2) \| (SEV/WFE/WFI/NOP/YIELD). They can +-------------------------------------+ be either real or default to NOP. \| ARMV7 (arm,thumb,thumb2) \| The two processors also use +-------------------------------------+ different encoding for them. Patch by Vinicius Tinti. llvm-svn: 232468	2015-03-17 11:55:28 +00:00
Ahmed Bougacha	e0afb1fe6c	[AArch64] Use intermediate step for concat_vectors of illegal truncs. Optimize concat_vectors of truncated vectors, where the intermediate type is illegal, to avoid said illegality, e.g., (v4i16 (concat_vectors (v2i16 (truncate (v2i64))), (v2i16 (truncate (v2i64))))) -> (v4i16 (truncate (v4i32 (concat_vectors (v2i32 (truncate (v2i64))), (v2i32 (truncate (v2i64))))))) This isn't really target-specific, and, as such, would best go in the DAGCombiner. However, ISD::TRUNCATE legality isn't keyed on both input and result type, so we might generate worse code when we don't know better. On AArch64 we know it's fine for v2i64->v4i16 and v4i32->v8i8. rdar://20022387 llvm-svn: 232459	2015-03-17 03:23:09 +00:00
Sanjoy Das	1ac93a5d89	[IRCE] Re-commit tests cases. Re-commit the test cases added in r232444. These now use -irce-print-changed-loops and -irce-print-range-checks so they run correctly on a without asserts build of llvm. llvm-svn: 232452	2015-03-17 01:40:24 +00:00
Sanjoy Das	95d041215a	[IRCE] Delete two tests. I accidentally checked in two tests that used -debug-only -- these fail on a release LLVM build. Temporarily delete these from the repo to keep the bots green while I fix this locally. llvm-svn: 232446	2015-03-17 00:54:50 +00:00
Sanjoy Das	e2cde6f195	[IRCE] Support half-range checks. This change to IRCE gets it to recognize "half" range checks. Half range checks are range checks that only either check if the index is `slt` some positive integer ("length") or if the index is `sge` `0`. The range solver does not try to be clever / aggressive about solving half-range checks -- it transforms "I < L" to "0 <= I < L" and "0 <= I" to "0 <= I < INT_SMAX". This is safe, but not always optimal. llvm-svn: 232444	2015-03-17 00:42:13 +00:00
Justin Bogner	d0a6243529	llvm-cov: Warn instead of error if a .gcda has arcs from an exit block Patch by Vanderson M. Rosario. Thanks! llvm-svn: 232443	2015-03-17 00:18:51 +00:00
Justin Bogner	3faa76bfab	GCOV: Make the exit block placement from r223193 optional By default we want our gcov emission to stay 4.2 compatible, which means we need to continue emit the exit block last by default. We add an option to emit it before the body for users that need it. llvm-svn: 232438	2015-03-16 23:52:03 +00:00
Peter Collingbourne	ad0bdcd238	LowerBitSets: do not use private aliases at all on Darwin. LLVM currently turns these into linker-private symbols, which can be dead stripped by the Darwin linker. llvm-svn: 232435	2015-03-16 23:36:24 +00:00
David Blaikie	12cf5d70e8	Add testing for mismatched explicit type on a gep operator when loading from bitcode llvm-svn: 232427	2015-03-16 22:03:50 +00:00

... 2 3 4 5 6 ...

29484 Commits