llvm-project

Commit Graph

Author	SHA1	Message	Date
Quentin Colombet	6674b095b8	[PeepholeOptimizer] Enable the advanced copy optimization by default. The advanced copy optimization does not yield any difference on the whole llvm test-suite + SPECs, either in compile time or runtime (binaries are identical), but has a big potential when data go back and forth between register files as demonstrated with test/CodeGen/ARM/adv-copy-opt.ll. Note: This was measured for both Os and O3 for armv7s, arm64, and x86_64. <rdar://problem/12702965> llvm-svn: 216236	2014-08-21 22:23:52 +00:00
Philip Reames	4e8cb79425	Whitespace change to reduce diff in future patch. Patch 2 of 11 in 'Add a "probe-stack" attribute' review thread Patch by: john.kare.alsaker@gmail.com llvm-svn: 216235	2014-08-21 22:19:16 +00:00
Philip Reames	34fcca723b	[X86] Split out the logic to select the stack probe function (NFC) Patch 1 of 11 in 'Add a "probe-stack" attribute' review thread. Patch by: <john.kare.alsaker@gmail.com> llvm-svn: 216233	2014-08-21 22:15:20 +00:00
Robin Morisset	26b808922b	Add hooks for emitLeading/TrailingFence llvm-svn: 216232	2014-08-21 22:09:25 +00:00
Robin Morisset	59c23cd946	Rename AtomicExpandLoadLinked into AtomicExpand AtomicExpandLoadLinked is currently rather ARM-specific. This patch is the first of a group that aim at making it more target-independent. See http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-August/075873.html for details The command line option is "atomic-expand" llvm-svn: 216231	2014-08-21 21:50:01 +00:00
Quentin Colombet	6b36337c09	[PeepholeOptimizer] Update the kill flags when extending the live-range of the source of a copy. <rdar://problem/12702965> llvm-svn: 216229	2014-08-21 21:34:06 +00:00
Justin Bogner	c8dc50fa51	Fix a URL (NFC) llvm-svn: 216228	2014-08-21 21:09:24 +00:00
Juergen Ributzka	addb75a4f3	[FastISel][AArch64] Use the correct register class to make the MI verifier happy. This is mostly achieved by providing the correct register class manually, because getRegClassFor always returns the GPRAllRegClass for MVT::i32 and MVT::i64. Also cleanup the code to use the FastEmitInst_ method whenever possible. This makes sure that the operands' register class is properly constrained. For all the remaining cases this adds the missing constrainOperandRegClass calls for each operand. llvm-svn: 216225	2014-08-21 20:57:57 +00:00
David Blaikie	1961f14cf9	Explicitly pass ownership of the MemoryBuffer to AddNewSourceBuffer using std::unique_ptr llvm-svn: 216223	2014-08-21 20:44:56 +00:00
Tom Stellard	745f2eddef	R600/SI: Teach moveToVALU how to handle more S_LOAD_* instructions llvm-svn: 216220	2014-08-21 20:41:00 +00:00
Tom Stellard	162a947160	R600/SI: Make sure SCRATCH_WAVE_OFFSET is added as Live-In to the function This fixes a crash in an ocl conformance test. llvm-svn: 216219	2014-08-21 20:40:58 +00:00
Tom Stellard	8e52375bb5	R600/SI: Remove unused SGPR spilling code llvm-svn: 216218	2014-08-21 20:40:56 +00:00
Tom Stellard	c5cf2f04d9	R600/SI: Use eliminateFrameIndex() to expand SGPR spill pseudos This will simplify the SGPR spilling and also allow us to use MachineFrameInfo for calculating offsets, which should be more reliable than our custom code. This fixes a crash in some cases where a register would be spilled in a branch such that the VGPR defined for spilling did not dominate all the uses when restoring. This fixes a crash in an ocl conformance test. The test requries register spilling and is too big to include. llvm-svn: 216217	2014-08-21 20:40:54 +00:00
Tom Stellard	11aa80cc4a	R600/SI: Handle VCC in SIRegisterInfo::getPhysRegSubReg() This fixes a crash in an ocl conformance test. The test requries register spilling and is too big to include. llvm-svn: 216216	2014-08-21 20:40:50 +00:00
Rafael Espindola	33466a745e	Rewrite the gold plugin to fix pr19901. There is a fundamental difference between how the gold API and lib/LTO view the LTO process. The gold API talks about a particular symbol in a particular file. The lib/LTO API talks about a symbol in the merged module. The merged module is then defined in terms of the IR semantics. In particular, a linkonce_odr GV is only copied if it is used, since it is valid to drop unused linkonce_odr GVs. In the testcase in pr19901 both properties collide. What happens is that gold asks us to keep a particular linkonce_odr symbol, but the IR linker doesn't copy it to the merged module and we never have a chance to ask lib/LTO to keep it. This patch fixes it by having a more direct implementation of the gold API. If it asks us to keep a symbol, we change the linkage so it is not linkonce. If it says we can drop a symbol, we do so. All of this before we even send the module to lib/Linker. Since now we don't have to produce LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN, during symbol resolution we can use a temporary LLVMContext and do lazy module loading. This allows us to keep the minimum possible amount of allocated memory around. This should also allow as much parallelism as we want, since there is no shared context. llvm-svn: 216215	2014-08-21 20:28:55 +00:00
Jonathan Roelofs	f00e7e143e	Satiate the sanitizer build bot This fixes a missing initializer from r216182 llvm-svn: 216212	2014-08-21 20:09:15 +00:00
Rafael Espindola	7cebf36a95	Move some logic to populateLTOPassManager. This will avoid code duplication in the next commit which calls it directly from the gold plugin. llvm-svn: 216211	2014-08-21 20:03:44 +00:00
Adam Nemet	5ed17dad95	[AVX512] Add class to group common template arguments related to vector type We discussed the issue of generality vs. readability of the AVX512 classes recently. I proposed this approach to try to hide and centralize the mappings we commonly perform based on the vector type. A new class X86VectorVTInfo captures these. The idea is to pass an instance of this class to classes/multiclasses instead of the corresponding ValueType. Then the class/multiclass can use its field for things that derive from the type rather than passing all those as separate arguments. I modified avx512_valign to demonstrate this new approach. As you can see instead of 7 related template parameters we now have one. The downside is that we have to refer to fields for the derived values. I named the argument '_' in order to make this as invisible as possible. Please let me know if you absolutely hate this. (Also once we allow local initializations in multiclasses we can recover the original version by assigning the fields to local variables.) Another possible use-case for this class is to directly map things, e.g.: RegisterClass KRC = X86VectorVTInfo<32, i16>.KRC llvm-svn: 216209	2014-08-21 19:50:07 +00:00
Alex Lorenz	936b99c942	Coverage Mapping: add function's hash to coverage function records. The profile data format was recently updated and the new indexing api requires the code coverage tool to know the function's hash as well as the function's name to get the execution counts for a function. Differential Revision: http://reviews.llvm.org/D4994 llvm-svn: 216207	2014-08-21 19:23:25 +00:00
Rafael Espindola	40bfd6db57	llvm-gcc is dead. llvm-svn: 216206	2014-08-21 19:22:24 +00:00
Eric Fiselier	5b0e0e9436	[LIT] Remove documentation for method since it does not exist llvm-svn: 216204	2014-08-21 18:52:58 +00:00
Rafael Espindola	216e0c0617	Respect LibraryInfo in populateLTOPassManager and use it. NFC. llvm-svn: 216203	2014-08-21 18:49:52 +00:00
Rafael Espindola	df1836f750	Remove dead code. NFC. llvm-svn: 216201	2014-08-21 18:11:21 +00:00
Quentin Colombet	0c740d4b9a	[AArch64] Run a peephole pass right after AdvSIMD pass. The AdvSIMD pass may produce copies that are not coalescer-friendly. The peephole optimizer knows how to fix that as demonstrated in the test case. <rdar://problem/12702965> llvm-svn: 216200	2014-08-21 18:10:07 +00:00
Juergen Ributzka	c83265a6c5	[FastISel][AArch64] Factor out ANDWri instruction generation into a helper function. NFCI. llvm-svn: 216199	2014-08-21 18:02:25 +00:00
Moritz Roth	dfdda0d41c	Thumb1 load/store optimizer: Improve code to materialize new base register. There are two add-immediate instructions in Thumb1: tADDi8 and tADDi3. Only the latter supports using different source and destination registers, so whenever we materialize a new base register (at a certain offset) we'd do so by moving the base register value to the new register and then adding in place. This patch changes the code to use a single tADDi3 if the offset is small enough to fit in 3 bits. Differential Revision: http://reviews.llvm.org/D5006 llvm-svn: 216193	2014-08-21 17:11:03 +00:00
Hans Wennborg	f4cb573268	Use returns_nonnull in BumpPtrAllocator and MallocAllocator to avoid null-check in placement new In both Clang and LLVM, this is a common pattern: Size = sizeof(DeclRefExpr) + SomeExtraStuff; void *Mem = Context.Allocate(Size, llvm::alignOf<DeclRefExpr>()); return new (Mem) DeclRefExpr(...); The annoying thing is that because the default placement-new operator has a nothrow specification, the compiler will insert a null check of Mem before calling the DeclRefExpr constructor. This null check is redundant for us, because we expect the allocation functions to never return null. By annotating the allocator functions with returns_nonnull, we can optimize away these checks. Compiling clang with a recent version of Clang and measuring with: $ perf stat -r20 bin/clang.patch -fsyntax-only -w gcc.c && perf stat -r20 bin/clang.orig -fsyntax-only -w gcc.c Shows a 2.4% speed-up (+- 0.8%). The pattern occurs in LLVM too. Measuring with -O3 (and now using bzip2.c instead, because it's smaller): $ perf stat -r20 bin/clang.patch -O3 -w bzip2.c && perf stat -r20 bin/clang.orig -O3 -w bzip2.c Shows 4.4 % speed-up (+- 1%). If anyone knows of a similar attribute we can use for MSVC, or some other technique to get rid off the null check there, please let me know. Differential Revision: http://reviews.llvm.org/D4989 llvm-svn: 216192	2014-08-21 17:10:00 +00:00
Juergen Ributzka	95c0f153e4	[FastISel][AArch64] Remove redundant test. These tests and many more are already covered by fast-isel-addressing-modes.ll. llvm-svn: 216186	2014-08-21 16:40:05 +00:00
Jonathan Roelofs	5e98ff967b	Add a thread-model knob for lowering atomics on baremetal & single threaded systems http://reviews.llvm.org/D4984 llvm-svn: 216182	2014-08-21 14:35:47 +00:00
Rafael Espindola	e07caad9e7	Handle inlining in populateLTOPassManager like in populateModulePassManager. No functionality change. llvm-svn: 216178	2014-08-21 13:35:30 +00:00
Zinovy Nis	33406da5f4	[CLNUP] Remove return after llvm_unreachable. Thanks to Hal Finkel for pointing. llvm-svn: 216176	2014-08-21 13:30:05 +00:00
Benjamin Kramer	ff8b883772	DAGCombiner: Make concat_vector combine safe for EVTs and concat_vectors with many arguments. PR20677 llvm-svn: 216175	2014-08-21 13:28:02 +00:00
Rafael Espindola	208bc533cd	Move DisableGVNLoadPRE from populateLTOPassManager to PassManagerBuilder. llvm-svn: 216174	2014-08-21 13:13:17 +00:00
Josh Klontz	fbe17d6a32	X86AsmPrinter MCJIT MSVC bug fix. Summary: This bug was introduced in r213006 which makes an assumption that MCSection is COFF for Windows MSVC. This assumption is broken for MCJIT users where ELF is used instead [1]. The fix is to change the MCSection cast to a dyn_cast. [1] http://lists.cs.uiuc.edu/pipermail/llvmdev/2013-December/068407.html. Reviewers: majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4872 llvm-svn: 216173	2014-08-21 12:55:27 +00:00
Oliver Stannard	51b1d460cb	[ARM] Enable DP copy, load and store instructions for FPv4-SP The FPv4-SP floating-point unit is generally referred to as single-precision only, but it does have double-precision registers and load, store and GPR<->DPR move instructions which operate on them. This patch enables the use of these registers, the main advantage of which is that we now comply with the AAPCS-VFP calling convention. This partially reverts r209650, which added some AAPCS-VFP support, but did not handle return values or alignment of double arguments in registers. This patch also adds tests for Thumb2 code generation for floating-point instructions and intrinsics, which previously only existed for ARM. llvm-svn: 216172	2014-08-21 12:50:31 +00:00
Rafael Espindola	18b2a258c3	Sort declarations. llvm-svn: 216171	2014-08-21 12:39:07 +00:00
Benjamin Kramer	002a1ced06	Make format_object_base's destructor protected and non-virtual. It's not meant to be used with operator delete and this avoids emitting virtual dtors for every derived format object. llvm-svn: 216170	2014-08-21 11:22:05 +00:00
Erik Verbruggen	2b98bd2a80	Reassociate x + -0.1234 * y into x - 0.1234 * y This does not require -ffast-math, and it gives CSE/GVN more options to eliminate duplicate expressions in, e.g.: return ((x + 0.1234 * y) * (x - 0.1234 * y)); Differential Revision: http://reviews.llvm.org/D4904 llvm-svn: 216169	2014-08-21 10:45:30 +00:00
Benjamin Kramer	b791ef21d2	X86: Turn redundant if into an assertion. While there remove noop casts. llvm-svn: 216168	2014-08-21 10:31:37 +00:00
Robert Khasanov	46409eae8e	[x86] Added _addcarry_ and _subborrow_ intrinsics llvm-svn: 216164	2014-08-21 09:43:43 +00:00
Robert Khasanov	86ca6aaf40	[x86] SMAP: added HasSMAP attribute for CLAC/STAC, corrected attributes llvm-svn: 216163	2014-08-21 09:34:12 +00:00
Robert Khasanov	7c5a843646	[x86] Broadwell: ADOX/ADCX. Added _addcarryx_u{32\|64} intrinsics to LLVM. llvm-svn: 216162	2014-08-21 09:27:00 +00:00
Robert Khasanov	98441b6e7f	[x86] Enable Broadwell target. Added FeatureSMAP. Broadwell ISA includes Haswell ISA + ADX + RDSEED + SMAP llvm-svn: 216161	2014-08-21 09:16:12 +00:00
Zinovy Nis	0a36cba29d	[INDVARS] Extend using of widening of induction variables for the cases of "sub nsw" and "mul nsw" instructions. Currently only "add nsw" are widened. This patch eliminates tons of "sext" instructions for 64 bit code (and the corresponding target code) in cases like: int N = 100; float *A; void foo(int x0, int x1) { float A_cur = &A[0][0]; float * A_next = &A[1][0]; for(int x = x0; x < x1; ++x). { // Currently only [x+N] case is widened. Others 2 cases lead to sext. // This patch fixes it, so all 3 cases do not need sext. const float div = A_cur[x + N] + A_cur[x - N] + A_cur[x * N]; A_next[x] = div; } } ... > clang++ test.cpp -march=core-avx2 -Ofast -fno-unroll-loops -fno-tree-vectorize -S -o - Differential Revision: http://reviews.llvm.org/D4695 llvm-svn: 216160	2014-08-21 08:25:45 +00:00
Elena Demikhovsky	08f8596cc0	IntelJITEventListener updates to fix breaks by recent changes to EngineBuilder and DIContext. By Arch Robison. llvm-svn: 216159	2014-08-21 07:01:55 +00:00
Craig Topper	71b7b68b74	Repace SmallPtrSet with SmallPtrSetImpl in function arguments to avoid needing to mention the size. llvm-svn: 216158	2014-08-21 05:55:13 +00:00
David Majnemer	5d1aeba2ea	InstCombine: Fold ((A \| B) & C1) ^ (B & C2) -> (A & C1) ^ B if C1^C2=-1 Adapted from a patch by Richard Smith, test-case written by me. llvm-svn: 216157	2014-08-21 05:14:48 +00:00
Craig Topper	3ced27c835	Remove custom implementations of max/min in StringRef that was originally added to work an old gcc bug. I believe its been fixed by now. llvm-svn: 216156	2014-08-21 04:31:10 +00:00
Eric Fiselier	a4e211edad	add self to credits llvm-svn: 216155	2014-08-21 04:27:11 +00:00
Jiangning Liu	950844fadb	Fix a bug around truncating vector in const prop. In constant folding stage, "TRUNC" can't handle vector data type. llvm-svn: 216149	2014-08-21 02:12:35 +00:00
Jiangning Liu	deb4b5fc37	Revert r216066, "Optimize ZERO_EXTEND and SIGN_EXTEND in both SelectionDAG Builder and type". llvm-svn: 216147	2014-08-21 01:59:30 +00:00
Quentin Colombet	689623009b	[PeepholeOptimizer] Take advantage of the isInsertSubreg property in the advanced copy optimization. This is the final step patch toward transforming: udiv r0, r0, r2 udiv r1, r1, r3 vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 bx lr into: udiv r0, r0, r2 udiv r1, r1, r3 bx lr Indeed, thanks to this patch, this optimization is able to look through vmov.32 d16[0], r0 vmov.32 d16[1], r1 and is able to rewrite the following sequence: vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 into simple generic GPR copies that the coalescer managed to remove. <rdar://problem/12702965> llvm-svn: 216144	2014-08-21 00:19:16 +00:00
Quentin Colombet	84f15bd1b0	[ARM] Mark VSETLNi32 with the InsertSubreg property and implement the related target hook. This patch teaches the compiler that: dX = VSETLNi32 dY, rZ, imm is the same as: dX = INSERT_SUBREG dY, rZ, translateImmToSubIdx(imm) <rdar://problem/12702965> llvm-svn: 216143	2014-08-21 00:10:52 +00:00
James Molloy	a88896b5c0	[LoopVectorize] Up the maximum unroll factor to 4 for AArch64 Only for Cortex-A57 and Cyclone for now, where it has shown wins. llvm-svn: 216141	2014-08-21 00:02:51 +00:00
James Molloy	82c995d450	[LoopVectorizer] Limit unroll factor in the presence of nested reductions. If we have a scalar reduction, we can increase the critical path length if the loop we're unrolling is inside another loop. Limit, by default to 2, so the critical path only gets increased by one reduction operation. llvm-svn: 216140	2014-08-20 23:53:52 +00:00
Quentin Colombet	7e3da6677a	Add isInsertSubreg property. This patch adds a new property: isInsertSubreg and the related target hooks: TargetIntrInfo::getInsertSubregInputs and TargetInstrInfo::getInsertSubregLikeInputs to specify that a target specific instruction is a (kind of) INSERT_SUBREG. The approach is similar to r215394. <rdar://problem/12702965> llvm-svn: 216139	2014-08-20 23:49:36 +00:00
Jonathan Roelofs	44937d98a3	Lower thumbv4t & thumbv5 lo->lo copies through a push-pop sequence On pre-v6 hardware, 'MOV lo, lo' gives undefined results, so such copies need to be avoided. This patch trades simplicity for implementation time at the expense of performance... As they say: correctness first, then performance. See http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-August/075998.html for a few ideas on how to make this better. llvm-svn: 216138	2014-08-20 23:38:50 +00:00
Quentin Colombet	a56749064a	Mention the right target hook in the comment on isExtractSubreg property. llvm-svn: 216137	2014-08-20 23:25:28 +00:00
Quentin Colombet	67639df146	[PeepholeOptimizer] Take advantage of the isExtractSubreg property in the advanced copy optimization. This patch is a step toward transforming: udiv r0, r0, r2 udiv r1, r1, r3 vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 bx lr into: udiv r0, r0, r2 udiv r1, r1, r3 bx lr Indeed, thanks to this patch, this optimization is able to look through vmov r0, r1, d16 but it does not understand yet vmov.32 d16[0], r0 vmov.32 d16[1], r1 Comming patches will fix that and update the related test case. <rdar://problem/12702965> llvm-svn: 216136	2014-08-20 23:13:02 +00:00
Yi Jiang	1a4e73d7bf	New InstCombine pattern: (icmp ult/ule (A + C1), C3) \| (icmp ult/ule (A + C2), C3) to (icmp ult/ule ((A & ~(C1 ^ C2)) + max(C1, C2)), C3) under certain condition llvm-svn: 216135	2014-08-20 22:55:40 +00:00
Alexey Samsonov	e5864c69a8	Don't allow MCStreamer::EmitIntValue to output 0-byte integers. It makes no sense and can hide bugs. In particular, it lead to left shift by 64 bits, which is an undefined behavior, properly reported by UBSan. llvm-svn: 216134	2014-08-20 22:46:38 +00:00
Quentin Colombet	deb82eab3e	[ARM] Mark VMOVRRD with the ExtractSubreg property and implement the related target hook. This patch teaches the compiler that: rX, rY = VMOVRRD dZ is the same as: rX = EXTRACT_SUBREG dZ, ssub_0 rY = EXTRACT_SUBREG dZ, ssub_1 <rdar://problem/12702965> llvm-svn: 216132	2014-08-20 22:16:19 +00:00
Alexey Samsonov	fffd56ecdf	Fix undefined behavior (left shift of negative value) in SystemZ backend. This bug is reported by UBSan. llvm-svn: 216131	2014-08-20 21:56:43 +00:00
Quentin Colombet	7e75cbaf47	Add isExtractSubreg property. This patch adds a new property: isExtractSubreg and the related target hooks: TargetIntrInfo::getExtractSubregInputs and TargetInstrInfo::getExtractSubregLikeInputs to specify that a target specific instruction is a (kind of) EXTRACT_SUBREG. The approach is similar to r215394. <rdar://problem/12702965> llvm-svn: 216130	2014-08-20 21:51:26 +00:00
Alexey Samsonov	e229ec5bfc	Fix null reference creation in SelectionDAG constructor. Store TargetSelectionDAGInfo as a pointer instead of a reference: getSelectionDAGInfo() may not be implemented for certain backends (e.g. it's not currently implemented for R600). This bug is reported by UBSan. llvm-svn: 216129	2014-08-20 21:40:15 +00:00
Alexey Samsonov	2651ae6513	Fix undefined behavior (left shift of negative value) in Hexagon backend. This bug is reported by UBSan. llvm-svn: 216125	2014-08-20 21:22:03 +00:00
Alexey Samsonov	ea0aee622e	Cleanup: Delete seemingly unused reference to MachineDominatorTree from ScheduleDAGInstrs. llvm-svn: 216124	2014-08-20 20:57:26 +00:00
Sanjay Patel	bba72c7c1e	Don't prevent a vselect of constants from becoming a single load (PR20648). Fix for PR20648 - http://llvm.org/bugs/show_bug.cgi?id=20648 This patch checks the operands of a vselect to see if all values are constants. If yes, bail out of any further attempts to create a blend or shuffle because SelectionDAGLegalize knows how to turn this kind of vselect into a single load. This already happens for machines without SSE4.1, so the added checks just send more targets down that path. Differential Revision: http://reviews.llvm.org/D4934 llvm-svn: 216121	2014-08-20 20:34:56 +00:00
Duncan P. N. Exon Smith	7bb10f8a85	X86: Add missing triples from r216119 llvm-svn: 216120	2014-08-20 19:58:59 +00:00
Duncan P. N. Exon Smith	b18263531d	X86: Align the stack on word boundaries in LowerFormalArguments() The goal of the patch is to implement section 3.2.3 of the AMD64 ABI correctly. The controlling sentence is, "The size of each argument gets rounded up to eightbytes. Therefore the stack will always be eightbyte aligned." The equivalent sentence in the i386 ABI page 37 says, "At all times, the stack pointer should point to a word-aligned area." For both architectures, the stack pointer is not being rounded up to the nearest eightbyte or word between the last normal argument and the first variadic argument. Patch by Thomas Jablin! llvm-svn: 216119	2014-08-20 19:40:59 +00:00
Alexey Samsonov	8968e6d1b0	Fix null reference creation in ScheduleDAGInstrs constructor call. Both MachineLoopInfo and MachineDominatorTree may be null in ScheduleDAGMI constructor call. It is undefined behavior to take references to these values. This bug is reported by UBSan. llvm-svn: 216118	2014-08-20 19:36:05 +00:00
Keno Fischer	d750723d29	Do not insert a tail call when returning multiple values on X86 Summary: This fixes http://llvm.org/bugs/show_bug.cgi?id=19530. The problem is that X86ISelLowering erroneously thought the third call was eligible for tail call elimination. It would have been if it's return value was actually the one returned by the calling function, but here that is not the case and additional values are being returned. Test Plan: Test case from the original bug report is included. Reviewers: rafael Reviewed By: rafael Subscribers: rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D4968 llvm-svn: 216117	2014-08-20 19:00:37 +00:00
Alexey Samsonov	314c643b4a	Fix undefined behavior (left shift by 64 bits) in ScaledNumber::toString(). This bug is reported by UBSan. llvm-svn: 216116	2014-08-20 18:30:07 +00:00
Sanjay Patel	f3cfeef2e9	critical-anti-dependency breaker: don't use reg def info from kill insts (PR20308) In PR20308 ( http://llvm.org/bugs/show_bug.cgi?id=20308 ), the critical-anti-dependency breaker caused a miscompile because it broke a WAR hazard using a register that it thinks is available based on info from a kill inst. Until PR18663 is solved, we shouldn't use any def/use info from a kill because they are really just nops. This patch adds guard checks for kills around calls to ScanInstruction() where the DefIndices array is set. For good measure, add an assert in ScanInstruction() so we don't hit this bug again. The test case is a reduced version of the code from the bug report. Differential Revision: http://reviews.llvm.org/D4977 llvm-svn: 216114	2014-08-20 18:03:00 +00:00
Quentin Colombet	03e43f8e68	[PeepholeOptimizer] Refactor the advanced copy optimization to take advantage of the isRegSequence property. This is a follow-up of r215394 and r215404, which respectively introduces the isRegSequence property and uses it for ARM. Thanks to the property introduced by the previous commits, this patch is able to optimize the following sequence: vmov d0, r2, r3 vmov d1, r0, r1 vmov r0, s0 vmov r1, s2 udiv r0, r1, r0 vmov r1, s1 vmov r2, s3 udiv r1, r2, r1 vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 bx lr into: udiv r0, r0, r2 udiv r1, r1, r3 vmov.32 d16[0], r0 vmov.32 d16[1], r1 vmov r0, r1, d16 bx lr This patch refactors how the copy optimizations are done in the peephole optimizer. Prior to this patch, we had one copy-related optimization that replaced a copy or bitcast by a generic, more suitable (in terms of register file), copy. With this patch, the peephole optimizer features two copy-related optimizations: 1. One for rewriting generic copies to generic copies: PeepholeOptimizer::optimizeCoalescableCopy. 2. One for replacing non-generic copies with generic copies: PeepholeOptimizer::optimizeUncoalescableCopy. The goals of these two optimizations are slightly different: one rewrite the operand of the instruction (#1), the other kills off the non-generic instruction and replace it by a (sequence of) generic instruction(s). Both optimizations rely on the ValueTracker introduced in r212100. The ValueTracker has been refactored to use the information from the TargetInstrInfo for non-generic instruction. As part of the refactoring, we switched the tracking from the index of the definition to the actual register (virtual or physical). This one change is to provide better consistency with register related APIs and to ease the use of the TargetInstrInfo. Moreover, this patch introduces a new helper class CopyRewriter used to ease the rewriting of generic copies (i.e., #1). Finally, this patch adds a dead code elimination pass right after the peephole optimizer to get rid of dead code that may appear after rewriting. This is related to <rdar://problem/12702965>. Review: http://reviews.llvm.org/D4874 llvm-svn: 216088	2014-08-20 17:41:48 +00:00
Andrew Trick	2223f8edbd	Tweak CFGPrinter to wrap very long names. I added wrapping to the CFGPrinter a while back so the -view-cfg output is actually viewable. I've since enountered very long mangled names with the same problem, so I'm slightly tweaking this code to work in that case. llvm-svn: 216087	2014-08-20 17:38:12 +00:00
Rafael Espindola	1c509715e6	Remove unused field. llvm-svn: 216086	2014-08-20 17:33:44 +00:00
Juergen Ributzka	e1bb055ed3	[FastISel][AArch64] Don't fold the sign-/zero-extend from i1 into the compare. This fixes a bug I introduced in a previous commit (r216033). Sign-/Zero- extension from i1 cannot be folded into the ADDS/SUBS instructions. Instead both operands have to be sign-/zero-extended with separate instructions. Related to <rdar://problem/17913111>. llvm-svn: 216073	2014-08-20 16:34:15 +00:00
Rafael Espindola	061beab673	Quick fix for an use after free. llvm-svn: 216071	2014-08-20 15:19:37 +00:00
Dan Liew	2661dfc7b9	Add note to LangRef about how function arguments can be unnamed and how this affects the numbering of unnamed temporaries. llvm-svn: 216070	2014-08-20 15:06:30 +00:00
Aaron Ballman	474972585a	Silencing a -Wcast-qual warning. NFC. llvm-svn: 216068	2014-08-20 12:54:13 +00:00
Aaron Ballman	bf6ee22113	Silencing an MSVC C4334 warning ('<<' : result of 32-bit shift implicitly converted to 64 bits (was 64-bit shift intended?)). NFC. llvm-svn: 216067	2014-08-20 12:14:35 +00:00
Jiangning Liu	f841b3b79e	Optimize ZERO_EXTEND and SIGN_EXTEND in both SelectionDAG Builder and type legalization stage. With those two optimizations, fewer signed/zero extension instructions can be inserted, and then we can expose more opportunities to Machine CSE pass in back-end. llvm-svn: 216066	2014-08-20 12:05:15 +00:00
Pavel Chupin	01a4e0a1ef	[x32] Fix FrameIndex check in SelectLEA64_32Addr Summary: Fixes http://llvm.org/bugs/show_bug.cgi?id=20016 reproducible on new lea-5.ll case. Also use RSP/RBP for x32 lea to save 1 byte used for 0x67 prefix in ESP/EBP case. Test Plan: lea tests modified to include x32/nacl and new test added Reviewers: nadav, dschuff, t.p.northover Subscribers: llvm-commits, zinovy.nis Differential Revision: http://reviews.llvm.org/D4929 llvm-svn: 216065	2014-08-20 11:59:22 +00:00
Yi Kong	c655f0c898	ARM: Fix codegen for rbit intrinsic LLVM generates illegal `rbit r0, #352` instruction for rbit intrinsic. According to ARM ARM, rbit only takes register as argument, not immediate. The correct instruction should be rbit <Rd>, <Rm>. The bug was originally introduced in r211057. Differential Revision: http://reviews.llvm.org/D4980 llvm-svn: 216064	2014-08-20 10:40:20 +00:00
Bill Wendling	5d53607392	Update projects lists. llvm-svn: 216048	2014-08-20 07:32:09 +00:00
Bill Wendling	e971d6fd26	Add libcxxabi to the projects. llvm-svn: 216047	2014-08-20 07:30:08 +00:00
David Majnemer	42158f3eea	InstCombine: Annotate sub with nuw when we prove it's safe We can prove that a 'sub' can be a 'sub nuw' if the left-hand side is negative and the right-hand side is non-negative. llvm-svn: 216045	2014-08-20 07:17:31 +00:00
Craig Topper	298f63803b	Fix an off by 1 bug that prevented SmallPtrSet from using all of its 'small' capacity. Then fix the early return in the move constructor that prevented 'small' moves from clearing the NumElements in the moved from object. The directed test missed this because it was always testing large moves due to the off by 1 bug. llvm-svn: 216044	2014-08-20 04:41:36 +00:00
NAKAMURA Takumi	b8083476f1	Constants.h: Fix possible typo in r216015. [-Wdocumentation] llvm-svn: 216043	2014-08-20 04:22:47 +00:00
Peter Collingbourne	f39430bd4a	[dfsan] Treat vararg custom functions like unimplemented functions. Because declarations of these functions can appear in places like autoconf checks, they have to be handled somehow, even though we do not support vararg custom functions. We do so by printing a warning and calling the uninstrumented function, as we do for unimplemented functions. llvm-svn: 216042	2014-08-20 01:40:23 +00:00
Juergen Ributzka	0781b860e4	[FastISel][AArch64] Use the proper FMOV instruction to materialize a +0.0. Use FMOVWSr/FMOVXDr instead of FMOVSr/FMOVDr, which have the proper register class to be used with the zero register. This makes the MachineInstruction verifier happy again. This is related to <rdar://problem/18027157>. llvm-svn: 216040	2014-08-20 01:10:36 +00:00
David Majnemer	57d5bc8849	InstCombine: Annotate sub with nsw when we prove it's safe We can prove that a 'sub' can be a 'sub nsw' under certain conditions: - The sign bits of the operands is the same. - Both operands have more than 1 sign bit. The subtraction cannot be a signed overflow in either case. llvm-svn: 216037	2014-08-19 23:36:30 +00:00
Hans Wennborg	fd1f0f17c5	BumpPtrAllocator: don't accept 0 for the alignment parameter It seems unnecessary to have to use an extra branch to check for this special case. http://reviews.llvm.org/D4945 llvm-svn: 216036	2014-08-19 23:35:33 +00:00
Juergen Ributzka	c0886dd5b0	[FastISel][AArch64] Factor out ADDS/SUBS instruction emission and add support for extensions and shift folding. Factor out the ADDS/SUBS instruction emission code into helper functions and make the helper functions more clever to support most of the different ADDS/SUBS instructions the architecture support. This includes better immedediate support, shift folding, and sign-/zero-extend folding. This fixes <rdar://problem/17913111>. llvm-svn: 216033	2014-08-19 22:29:55 +00:00
Rafael Espindola	3f3d7acbcf	Split parseAssembly into parseAssembly and parseAssemblyInto. This should restore the functionality of parsing new code into an existing module without the confusing interface. llvm-svn: 216031	2014-08-19 22:05:47 +00:00
Alexey Samsonov	96b02d1573	Delete unused argument in AArch64MCInstLower constructor: it doesn't use Mangler, and Mangler is in fact not even created when AArch64MCInstLower is constructed. This bug is reported by UBSan. llvm-svn: 216030	2014-08-19 21:51:08 +00:00
Duncan P. N. Exon Smith	23046653be	LangRef: Move example of function-scope uselistorder to a function Should make the example added in r216025 a little more clear. llvm-svn: 216027	2014-08-19 21:48:04 +00:00
Duncan P. N. Exon Smith	0a448fbca3	IR: Implement uselistorder assembly directives Implement `uselistorder` and `uselistorder_bb` assembly directives, which allow the use-list order to be recovered when round-tripping to assembly. This is the bulk of PR20515. llvm-svn: 216025	2014-08-19 21:30:15 +00:00
Lang Hames	df7be5105a	[MCJIT] Add an i386 RuntimeDyldMachO test case. llvm-svn: 216024	2014-08-19 21:26:36 +00:00
Duncan P. N. Exon Smith	35de5b8ca1	IR: Fix a missed case when threading OnlyIfReduced through ConstantExpr In r216015 I missed propagating `OnlyIfReduced` through the inline versions of `getGetElementPtr()` (I was relying on compile failures on mismatches between the header and source signatures to get them all). llvm-svn: 216023	2014-08-19 21:18:21 +00:00
Duncan P. N. Exon Smith	c8eccd1147	verify-uselistorder: Force -preserve-bc-use-list-order llvm-svn: 216022	2014-08-19 21:08:27 +00:00
Juergen Ributzka	6afeffb078	[FastISel][AArch64] Extend floating-point materialization test. This adds the missing test that I promised for r215753 to test the materialization of the floating-point value +0.0. Related to <rdar://problem/18027157>. llvm-svn: 216019	2014-08-19 20:35:07 +00:00
Rafael Espindola	066b46abcb	fix the gcc build llvm-svn: 216018	2014-08-19 20:06:25 +00:00
Lang Hames	eb99df8ee2	[MCJIT] Allow '$' characters in symbol names in RuntimeDyldChecker. llvm-svn: 216017	2014-08-19 20:04:45 +00:00
Duncan P. N. Exon Smith	33de00cf59	IR: Fix ConstantExpr::replaceUsesOfWithOnConstant() Change `ConstantExpr` to follow the model the other constants are using: only malloc a replacement if it's going to be used. This fixes a subtle bug where if an API user had used `ConstantExpr::get()` already to create the replacement but hadn't given it any users, we'd delete the replacement. This relies on r216015 to thread `OnlyIfReduced` through `ConstantExpr::getWithOperands()`. llvm-svn: 216016	2014-08-19 20:03:35 +00:00
Duncan P. N. Exon Smith	170eb985da	IR: Thread OnlyIfReduced through ConstantExpr::getWithOperands() In order to change `ConstantExpr::replaceUsesOfWithOnConstant()` to work like other constants (e.g., using `ConstantArray::getImpl()`), thread `OnlyIfReduced` through as necessary. When `OnlyIfReduced` is false, there's no functionality change. When it's true, if there's no constant folding or type changes `nullptr` is returned instead of the new constant. `ConstantExpr::replaceUsesOfWithOnConstant()` will be updated to use the "true" version in a follow-up commit. llvm-svn: 216015	2014-08-19 19:45:37 +00:00
Rafael Espindola	a5dbed4936	Fix the MSVC build. llvm-svn: 216014	2014-08-19 19:45:15 +00:00
Juergen Ributzka	b46ea081ad	Reapply [FastISel][AArch64] Add support for more addressing modes (r215597). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: FastISel didn't take much advantage of the different addressing modes available to it on AArch64. This commit allows the ComputeAddress method to recognize more addressing modes that allows shifts and sign-/zero-extensions to be folded into the memory operation itself. For Example: lsl x1, x1, #3 --> ldr x0, [x0, x1, lsl #3] ldr x0, [x0, x1] sxtw x1, w1 lsl x1, x1, #3 --> ldr x0, [x0, x1, sxtw #3] ldr x0, [x0, x1] llvm-svn: 216013	2014-08-19 19:44:17 +00:00
Juergen Ributzka	e3698ab6e3	Reapply [FastISel][X86] Add large code model support for materializing floating-point constants (r215595). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: In the large code model for X86 floating-point constants are placed in the constant pool and materialized by loading from it. Since the constant pool could be far away, a PC relative load might not work. Therefore we first materialize the address of the constant pool with a movabsq and then load from there the floating-point value. Fixes <rdar://problem/17674628>. llvm-svn: 216012	2014-08-19 19:44:13 +00:00
Juergen Ributzka	89d187b387	Reapply [FastISel][X86] Use XOR to materialize the "0" value (r215594). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. llvm-svn: 216011	2014-08-19 19:44:10 +00:00
Juergen Ributzka	4952c35afd	Reapply [FastISel][X86] Emit more efficient instructions for integer constant materialization (r215593). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: This mostly affects the i64 value type, which always resulted in an 15byte mobavsq instruction to materialize any constant. The custom code checks the value of the immediate and tries to use a different and smaller mov instruction when possible. This fixes <rdar://problem/17420988>. llvm-svn: 216010	2014-08-19 19:44:06 +00:00
Juergen Ributzka	7e23f77d82	Reapply [FastISel][AArch64] Make use of the zero register when possible (r215591). Note: This was originally reverted to track down a buildbot error. Reapply without any modifications. Original commit message: This change materializes now the value "0" from the zero register. The zero register can be folded by several instruction, so no materialization is need at all. Fixes <rdar://problem/17924413>. llvm-svn: 216009	2014-08-19 19:44:02 +00:00
Duncan P. N. Exon Smith	349f6982ac	ADT: Unit test for ArrayRef::equals change in r215986 llvm-svn: 216008	2014-08-19 19:18:46 +00:00
Duncan P. N. Exon Smith	909620a33d	IR: De-duplicate code for replacing operands in place This is non-trivial and sits in three places. Move it to ConstantUniqueMap. llvm-svn: 216007	2014-08-19 19:13:30 +00:00
Juergen Ributzka	4bf6c01cdb	Reapply [FastISel] Let the target decide first if it wants to materialize a constant (215588). Note: This was originally reverted to track down a buildbot error. This commit exposed a latent bug that was fixed in r215753. Therefore it is reapplied without any modifications. I run it through SPEC2k and SPEC2k6 for AArch64 and it didn't introduce any new regeressions. Original commit message: This changes the order in which FastISel tries to materialize a constant. Originally it would try to use a simple target-independent approach, which can lead to the generation of inefficient code. On X86 this would result in the use of movabsq to materialize any 64bit integer constant - even for simple and small values such as 0 and 1. Also some very funny floating-point materialization could be observed too. On AArch64 it would materialize the constant 0 in a register even the architecture has an actual "zero" register. On ARM it would generate unnecessary mov instructions or not use mvn. This change simply changes the order and always asks the target first if it likes to materialize the constant. This doesn't fix all the issues mentioned above, but it enables the targets to implement such optimizations. Related to <rdar://problem/17420988>. llvm-svn: 216006	2014-08-19 19:05:24 +00:00
Rafael Espindola	113ba3e6c0	Fix a pair of use after free. Should bring the bots back. llvm-svn: 216005	2014-08-19 18:59:14 +00:00
Rafael Espindola	48af1c2a1a	Don't own the buffer in object::Binary. Owning the buffer is somewhat inflexible. Some Binaries have sub Binaries (like Archive) and we had to create dummy buffers just to handle that. It is also a bad fit for IRObjectFile where the Module wants to own the buffer too. Keeping this ownership would make supporting IR inside native objects particularly painful. This patch focuses in lib/Object. If something elsewhere used to own an Binary, now it also owns a MemoryBuffer. This patch introduces a few new types. * MemoryBufferRef. This is just a pair of StringRefs for the data and name. This is to MemoryBuffer as StringRef is to std::string. * OwningBinary. A combination of Binary and a MemoryBuffer. This is needed for convenience functions that take a filename and return both the buffer and the Binary using that buffer. The C api now uses OwningBinary to avoid any change in semantics. I will start a new thread to see if we want to change it and how. llvm-svn: 216002	2014-08-19 18:44:46 +00:00
Alexey Samsonov	f17f03e00e	Hide two different AlignMode enums in anonymous namespaces. This bug is reported by UBSan. llvm-svn: 216001	2014-08-19 18:40:39 +00:00
Renato Golin	06d601fb3e	Revert "Small refactor on VectorizerHint for deduplication" This reverts commit r215994 because MSVC 2012 can't cope with its C++11 goodness. llvm-svn: 215999	2014-08-19 18:08:50 +00:00
Juergen Ributzka	5460cbfda4	[FastISel][AArch64] Fix a few BuildMI callsites where the result register was added as an operand register. This fixes a few BuildMI callsites where the result register was added by using addReg, which is per default a use and therefore an operand register. Also use the zero register as result register when emitting a compare instruction (SUBS with unused result register). llvm-svn: 215997	2014-08-19 17:41:53 +00:00
Renato Golin	dd6394d833	Small refactor on VectorizerHint for deduplication Previously, the hint mechanism relied on clean up passes to remove redundant metadata, which still showed up if running opt at low levels of optimization. That also has shown that multiple nodes of the same type, but with different values could still coexist, even if temporary, and cause confusion if the next pass got the wrong value. This patch makes sure that, if metadata already exists in a loop, the hint mechanism will never append a new node, but always replace the existing one. It also enhances the algorithm to cope with more metadata types in the future by just adding a new type, not a lot of code. llvm-svn: 215994	2014-08-19 17:30:43 +00:00
Alex Lorenz	6a128331a6	Docs: add documentation for the coverage mapping format. Differential Revision: http://reviews.llvm.org/D4729 llvm-svn: 215990	2014-08-19 17:05:58 +00:00
Rafael Espindola	11c07d7eec	Modernize the .ll parsing interface. * Use StringRef instead of std::string& * Return a std::unique_ptr<Module> instead of taking an optional module to write to (was not really used). * Use current comment style. * Use current naming convention. llvm-svn: 215989	2014-08-19 16:58:54 +00:00
Duncan P. N. Exon Smith	38f556d96d	CodingStandards: Document std::equal misbehaviour I should have included this as part of r215986, which worked around this corner by changing ArrayRef::equals() not to use std::equal. Alas. llvm-svn: 215988	2014-08-19 16:49:40 +00:00
Duncan P. N. Exon Smith	317c139f23	Reapply r215966, r215965, r215964, r215963, r215960, r215959, r215958, and r215957 This reverts commit r215981, which reverted the above commits because MSVC std::equal asserts on nullptr iterators, and thes commits introduced an `ArrayRef::equals()` on empty ArrayRefs. ArrayRef was changed not to use std::equal in r215986. llvm-svn: 215987	2014-08-19 16:39:58 +00:00
Duncan P. N. Exon Smith	16bec8e667	ADT: Avoid using std::equal in ArrayRef::equals MSVC's STL has a bug in `std::equal()`: it asserts on nullptr iterators, causing a block revert in r215981. This works around that by re-writing `ArrayRef::equals()` to do the work itself. llvm-svn: 215986	2014-08-19 16:36:21 +00:00
Aaron Ballman	e4b91dca91	Reverting r215966, r215965, r215964, r215963, r215960, r215959, r215958, and r215957 (these commits all rely on previous commits) due to build breakage. These commits cause failed assertions when testing Clang using MSVC 2013. The asserts are triggered from the std::equal call within ArrayRef::equals due to being passed invalid input (ArrayRef.begin() is returning a nullptr which is problematic). llvm-svn: 215981	2014-08-19 14:59:02 +00:00
Toma Tabacu	85618b3194	[mips] Add assembler support for .set arch=x directive. Summary: This directive is similar to ".set mipsX". It is used to change the CPU target of the assembler, enabling it to accept instructions for a specific CPU. This patch only implements the r4000 CPU (which is treated internally as generic mips3) and the generic ISAs. Contains work done by Matheus Almeida. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4884 llvm-svn: 215978	2014-08-19 14:22:52 +00:00
Mayur Pandey	960507beb4	InstCombine: ((A & ~B) ^ (~A & B)) to A ^ B Proof using CVC3 follows: $ cat t.cvc A, B : BITVECTOR(32); QUERY BVXOR((A & ~B),(~A & B)) = BVXOR(A,B); $ cvc3 t.cvc Valid. Differential Revision: http://reviews.llvm.org/D4898 llvm-svn: 215974	2014-08-19 08:19:19 +00:00
Craig Topper	97ebe53032	Const-correct and prevent a copy of a SmallPtrSet. llvm-svn: 215973	2014-08-19 07:44:27 +00:00
Craig Topper	1674d9988d	Prevent use of the implicit copy constructor on SmallPtrSetImpl. An accidental copy caused my SmallPtrSet->SmallPtrSetImpl conversion commit to fail the other day. llvm-svn: 215971	2014-08-19 06:57:14 +00:00
Mayur Pandey	75b76c6a92	test commit (spelling correction) llvm-svn: 215970	2014-08-19 06:41:55 +00:00
Rafael Espindola	2a8a2795ee	Make it explicit that ExecutionEngine takes ownership of the modules. llvm-svn: 215967	2014-08-19 04:04:25 +00:00
Duncan P. N. Exon Smith	687744d011	IR: Reduce RAUW traffic in ConstantVector Avoid creating a new `ConstantVector` on an RAUW of one of its members. This reduces RAUW traffic on any containing constant. This is part of PR20515. llvm-svn: 215966	2014-08-19 02:24:46 +00:00
Duncan P. N. Exon Smith	b11cd6fbc5	IR: Fix ConstantArray::replaceUsesOfWithOnConstant() Previously, `ConstantArray::replaceUsesOfWithOnConstant()` neglected to check whether it becomes a `ConstantDataArray`. Call `ConstantArray::getImpl()` to check for that. llvm-svn: 215965	2014-08-19 02:21:00 +00:00
Duncan P. N. Exon Smith	d8c60542a4	IR: Factor out replaceUsesOfWithOnConstantImpl(), NFC Factor out common code, and take advantage of the new function to add early returns to the callers. llvm-svn: 215964	2014-08-19 02:16:51 +00:00
Duncan P. N. Exon Smith	1c6a963d34	IR: Split up Constant{Array,Vector}::get(), NFC Introduce `getImpl()` that tries the simplification logic from `get()` and then gives up. This allows the logic to be reused elsewhere in a follow-up commit. llvm-svn: 215963	2014-08-19 02:11:30 +00:00
Akira Hatanaka	452ea6698f	[X86, X87 stackifier] Do not mark an operand of a debug instruction as kill. <rdar://problem/16952634> llvm-svn: 215962	2014-08-19 02:09:57 +00:00
Duncan P. N. Exon Smith	dfd04582ae	IR: Reduce RAUW traffic in ConstantExpr Avoid RAUW-ing `ConstantExpr` when an operand changes unless the new `ConstantExpr` already has users. This prevents the RAUW from rippling up the expression tree unnecessarily. This commit indirectly adds test coverage for r215953 (this is how I came across the bug). This is part of PR20515. llvm-svn: 215960	2014-08-19 01:12:53 +00:00
Duncan P. N. Exon Smith	344ebf7c9a	IR: Replace uses of ConstantAggrUniqueMap with ConstantUniqueMap Now that `ConstantAggrUniqueMap` and `ConstantUniqueMap` work the same way, change the aggregates to use the new one. llvm-svn: 215959	2014-08-19 01:02:18 +00:00
Duncan P. N. Exon Smith	864ccec435	Remove extraneous typenames from r215957 llvm-svn: 215958	2014-08-19 00:55:34 +00:00
Duncan P. N. Exon Smith	8d12558bad	IR: Rewrite ConstantUniqueMap Rewrite `ConstantUniqueMap` to be more similar to `ConstantAggrUniqueMap`. - Use a `DenseMap` with custom MapInfo instead of a `std::map` with linear lookups and deletion. - Don't waste memory explicitly storing (heavyweight) keys. Only `ConstantExpr` and `InlineAsm` actually use this data structure, so I also updated them to use it. This code cleanup is a precursor to reducing RAUW traffic on `ConstantExpr` -- I felt badly adding a new (linear) call to `ConstantUniqueMap::FindExistingKey`, so this designs away the concern. A follow-up commit will transition the users of `ConstantAggrUniqueMap` over. llvm-svn: 215957	2014-08-19 00:42:32 +00:00
Duncan P. N. Exon Smith	3fa26b7661	IR: Declare LookupKey right before its use, NFC llvm-svn: 215956	2014-08-19 00:24:26 +00:00
Duncan P. N. Exon Smith	1cd4aebefe	IR: ArrayRef-ize {Insert,Extract}ValueConstantExpr constructors No functionality change. llvm-svn: 215955	2014-08-19 00:23:17 +00:00
Duncan P. N. Exon Smith	25fb110f21	Prevent clang-format from moving the namespace closing brace, NFC llvm-svn: 215954	2014-08-19 00:21:04 +00:00
Duncan P. N. Exon Smith	7b859ff22c	NVPTX: Use RAUW instead of reinventing the wheel This code had a homemade RAUW that was incorrect when a user was a constant: instead of calling `replaceUsersWithOnConstant()` it would incorrectly update the operand in-place, invalidating `LLVMContextImpl::ExprConstants`. RAUW does the job better. The ValueHandle that `GVMap` is holding onto needs to be removed first, so this commit also removes each variable from the map on-the-fly. Since deletions from `ExprConstants` use a linear search that compares directly on the pointer value (instead of using the key), there isn't an obvious way to expose this with a testcase. llvm-svn: 215953	2014-08-19 00:20:02 +00:00
Duncan P. N. Exon Smith	171699083d	LLParser: Handle BlockAddresses on-the-fly Previously all `blockaddress()` constants were treated as forward references. They were resolved twice: once at the end of the function in question, and again at the end of the module. Furthermore, if the same blockaddress was referenced N times, the parser created N distinct `GlobalVariable`s (one for each reference). Instead, resolve all block addresses at the beginning of the function, creating the standard `BasicBlock` forward references used for all other basic block references. After the function, all references can be resolved immediately. To check for the condition of parsing block addresses from within the same function, I created a reference to the current per-function-state in `BlockAddressPFS`. Also, create only one forward-reference per basic block. Because forward references to block addresses are rare, the data structure here shouldn't matter. If somehow it does someday, this can be pretty easily changed to a `DenseMap<std::pair<ValID, ValID>, GV>`. This is part of PR20515. llvm-svn: 215952	2014-08-19 00:13:19 +00:00
Duncan P. N. Exon Smith	8e3669586b	verify-uselistorder: Call verifyModule() and improve output Call `verifyModule()` after parsing and after every transformation. Also convert some `DEBUG(dbgs())` to `errs()` to increase visibility into what's going on. llvm-svn: 215951	2014-08-18 23:44:14 +00:00
Rafael Espindola	57d9ab648c	Use a range loop. NFC. llvm-svn: 215948	2014-08-18 23:15:59 +00:00

1 2 3 4 5 ...

107065 Commits