llvm-project

Commit Graph

Author	SHA1	Message	Date
Quentin Colombet	57fb040beb	[X86] Fix the value of the low mask for the lowering of MUL_LOHI for v4i32. Found by code inspection. llvm-svn: 215604	2014-08-13 23:49:24 +00:00
Fariborz Jahanian	d288fad374	Objective-C. Handle case of multiple class methods found in global pool as well. rdar://16808765 llvm-svn: 215603	2014-08-13 23:38:04 +00:00
Nick Kledzik	ebf09360ec	[mach-o] Fix stub generation to work for dylibs and bundles Split up the CRuntimeFile into one part for output types that need an entry point and another part for output types that use stubs. Add file 'test/mach-o/Inputs/libSystem.yaml' for use by test cases that use -dylib and therefore may now need the helper symbol in libSystem.dylib. llvm-svn: 215602	2014-08-13 23:31:07 +00:00
Rafael Espindola	9b1e0a83b1	Whitespace fix. Sorry about the noise. llvm-svn: 215601	2014-08-13 23:29:23 +00:00
Akira Hatanaka	b74db09c97	[AArch64, fast-isel] Fall back to SelectionDAG to select tail calls. Certain functions such as objc_autoreleaseReturnValue have to be called as tail-calls even at -O0. Since normal fast-isel doesn't emit calls as tail calls, we have to fall back to SelectionDAG to select calls that are marked as tail. <rdar://problem/17991614> llvm-svn: 215600	2014-08-13 23:23:58 +00:00
Yi Kong	45a09319bf	ARM: Add mappings for ACLE prefetch intrinsics Implement __pld, __pldx, __pli and __plix builtin intrinsics as specified in ARM ACLE 2.0. llvm-svn: 215599	2014-08-13 23:20:15 +00:00
Nick Kledzik	b47683223f	[mach-o] set ordinal in n_desc for undefined symbols Mach-o uses "two-level namespace" where each undefined symbols is associated with a specific dylib. This means at runtime the loader (dyld) does need to search all loaded dylibs for that symbol but rather just the one specified. Now that llvm-nm -m prints out that info, properly set that info, and test it in the hello world test cases. llvm-svn: 215598	2014-08-13 23:11:42 +00:00
Juergen Ributzka	98347d902e	[FastISel][AArch64] Add support for more addressing modes. FastISel didn't take much advantage of the different addressing modes available to it on AArch64. This commit allows the ComputeAddress method to recognize more addressing modes that allows shifts and sign-/zero-extensions to be folded into the memory operation itself. For Example: lsl x1, x1, #3 --> ldr x0, [x0, x1, lsl #3] ldr x0, [x0, x1] sxtw x1, w1 lsl x1, x1, #3 --> ldr x0, [x0, x1, sxtw #3] ldr x0, [x0, x1] llvm-svn: 215597	2014-08-13 22:53:29 +00:00
Han Ming Ong	70d8170711	<rdar://problem/18001677> Use the right mechanism to put the XPC services now that Xcode supports the workflow. llvm-svn: 215596	2014-08-13 22:44:37 +00:00
Juergen Ributzka	0f8bc043c5	[FastISel][X86] Add large code model support for materializing floating-point constants. In the large code model for X86 floating-point constants are placed in the constant pool and materialized by loading from it. Since the constant pool could be far away, a PC relative load might not work. Therefore we first materialize the address of the constant pool with a movabsq and then load from there the floating-point value. Fixes <rdar://problem/17674628>. llvm-svn: 215595	2014-08-13 22:25:35 +00:00
Juergen Ributzka	ba8b79e932	[FastISel][X86] Use XOR to materialize the "0" value. llvm-svn: 215594	2014-08-13 22:22:17 +00:00
Juergen Ributzka	230494b399	[FastISel][X86] Emit more efficient instructions for integer constant materialization. This mostly affects the i64 value type, which always resulted in an 15byte mobavsq instruction to materialize any constant. The custom code checks the value of the immediate and tries to use a different and smaller mov instruction when possible. This fixes <rdar://problem/17420988>. llvm-svn: 215593	2014-08-13 22:18:11 +00:00
NAKAMURA Takumi	b6ac3f93e7	clang/test/Index/index-module.m: Tweak expressions to meet dos path on win32. llvm-svn: 215592	2014-08-13 22:14:49 +00:00
Juergen Ributzka	24080d60fa	[FastISel][AArch64] Make use of the zero register when possible. This change materializes now the value "0" from the zero register. The zero register can be folded by several instruction, so no materialization is need at all. Fixes <rdar://problem/17924413>. llvm-svn: 215591	2014-08-13 22:13:14 +00:00
NAKAMURA Takumi	15fb86c4e6	ClangTidyTests: Suppress FixHeaderGuards on win32 for now. FIXME: It seems this might be incompatible to dos path. Investigating. llvm-svn: 215590	2014-08-13 22:12:38 +00:00
NAKAMURA Takumi	c54e6e833e	[CMake] Update libdeps in clangTidyLLVMModule. llvm-svn: 215589	2014-08-13 22:12:28 +00:00
Juergen Ributzka	7cee768e55	[FastISel] Let the target decide first if it wants to materialize a constant. This changes the order in which FastISel tries to materialize a constant. Originally it would try to use a simple target-independent approach, which can lead to the generation of inefficient code. On X86 this would result in the use of movabsq to materialize any 64bit integer constant - even for simple and small values such as 0 and 1. Also some very funny floating-point materialization could be observed too. On AArch64 it would materialize the constant 0 in a register even the architecture has an actual "zero" register. On ARM it would generate unnecessary mov instructions or not use mvn. This change simply changes the order and always asks the target first if it likes to materialize the constant. This doesn't fix all the issues mentioned above, but it enables the targets to implement such optimizations. Related to <rdar://problem/17420988>. llvm-svn: 215588	2014-08-13 22:08:02 +00:00
Gerolf Hoflehner	fe2c11ffd6	[MachineCombiner] Removal of dangling DBG_VALUES after combining [20598] This is a cleaner solution to the problem described in r215431. When instructions are combined a dangling DBG_VALUE is removed. This resolves bug 20598. llvm-svn: 215587	2014-08-13 22:07:36 +00:00
Juergen Ributzka	2b98e393f2	[FastISel][X86] Refactor constant materialization. NFCI. Split the constant materialization code into three separate helper functions for Integer-, Floating-Point-, and GlobalValue-Constants. llvm-svn: 215586	2014-08-13 22:01:55 +00:00
Justin Bogner	5ea05aed15	test/CodeGen: Don't rely on a value's number in check lines The tests in r215568 hard code a value as %0 in their checks. This isn't correct in asserts builds. llvm-svn: 215585	2014-08-13 21:54:06 +00:00
Juergen Ributzka	a5b083853c	[FastISel][ARM] Use MOVT/MOVW if the subtarget requests it. This change is also in preparation for a future change to make sure that the constant materialization uses MOVT/MOVW when available and not a load from the constant pool. llvm-svn: 215584	2014-08-13 21:42:19 +00:00
Juergen Ributzka	2cbcf7aad9	[FastISel][ARM] Fix a bug in the integer materialization code. getRegClassFor returns the incorrect register class when in Thumb2 mode. This fix simply manually selects the register class as in the code just a few lines above. There is no test case for this code, because the code is currently unreachable. This will be changed in a future commit and existing test cases will exercise this code. llvm-svn: 215583	2014-08-13 21:39:18 +00:00
Juergen Ributzka	5ae43a136b	[FastISel][AArch64] Cleanup constant materialization code. NFCI. Cleanup and prepare constant materialization code for future commits. llvm-svn: 215582	2014-08-13 21:34:04 +00:00
Fariborz Jahanian	0ded42451d	Objective-C. Minor refactoring of my last patch. // rdar://16808765 llvm-svn: 215581	2014-08-13 21:24:14 +00:00
Gerolf Hoflehner	caa8bfd13b	[Cleanup] Utility function to erase instruction and mark DBG_Values New function to erase a machine instruction and mark DBG_VALUE for removal. A DBG_VALUE is marked for removal when it references an operand defined in the instruction. Use the new function to cleanup code in dead machine instruction removal pass. llvm-svn: 215580	2014-08-13 21:15:23 +00:00
Richard Smith	c9cbde7e8b	[modules] Fix a rejects-valid resulting from emitting an inline function recursively within the emission of another inline function. This ultimately led to us emitting the same inline function definition twice, which we then rejected because we believed we had a mangled name conflict. llvm-svn: 215579	2014-08-13 21:15:09 +00:00
Hans Wennborg	d2228c0f72	AArch64: replace __func__ with LLVM_FUNCTION_NAME MSVC doesn't define __func__. llvm-svn: 215578	2014-08-13 21:08:39 +00:00
Fariborz Jahanian	30ae8d4413	Objective-C. This patch is to resolve the method used in method expression to the best method found in global method pools. This is wip. // rdar://16808765 llvm-svn: 215577	2014-08-13 21:07:35 +00:00
Quentin Colombet	abea99f65a	[MachineDominatorTree] Provide a method to inform a MachineDominatorTree that a critical edge has been split. The MachineDominatorTree will when lazy update the underlying dominance properties when require. Context This is a follow-up of r215410. Each time a critical edge is split this invalidates the dominator tree information. Thus, subsequent queries of that interface will be slow until the underlying information is actually recomputed (costly). Problem Prior to this patch, splitting a critical edge needed to query the dominator tree to update the dominator information. Therefore, splitting a bunch of critical edges will likely produce poor performance as each query to the dominator tree will use the slow query path. This happens a lot in passes like MachineSink and PHIElimination. Proposed Solution Splitting a critical edge is a local modification of the CFG. Moreover, as soon as a critical edge is split, it is not critical anymore and thus cannot be a candidate for critical edge splitting anymore. In other words, the predecessor and successor of a basic block inserted on a critical edge cannot be inserted by critical edge splitting. Using these observations, we can pile up the splitting of critical edge and apply then at once before updating the DT information. The core of this patch moves the update of the MachineDominatorTree information from MachineBasicBlock::SplitCriticalEdge to a lazy MachineDominatorTree. Performance Thanks to this patch, the motivating example compiles in 4- minutes instead of 6+ minutes. No test case added as the motivating example as nothing special but being huge! The binaries are strictly identical for all the llvm test-suite + SPECs with and without this patch for both Os and O3. Regarding compile time, I observed only noise, although on average I saw a small improvement. <rdar://problem/17894619> llvm-svn: 215576	2014-08-13 21:00:07 +00:00
Benjamin Kramer	46d1544d70	Fix (re-)creation of unittest lit.site.cfg for clang-tools-extra. This has been hiding really well. Hopefully brings the builders suffering from outdated lit.site.cfg files back to life. llvm-svn: 215575	2014-08-13 20:41:26 +00:00
Jan Vesely	0cd3ec6cfa	utils: Fix segfault in flattencfg v2: continue iterating through the rest of the bb use for loop v3: initialize FlattenCFG pass in ScalarOps add test v4: split off initializing flattencfg to a separate patch add comment Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215574	2014-08-13 20:31:53 +00:00
Jan Vesely	5a956d49f7	Initialize FlattenCFG pass Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215573	2014-08-13 20:31:52 +00:00
Zachary Turner	f818625103	Update clang-format file. llvm-svn: 215572	2014-08-13 20:08:28 +00:00
Alexey Samsonov	3551e311f0	Simplify a few loops over CallArgList/FunctionArgList. NFC llvm-svn: 215571	2014-08-13 20:06:24 +00:00
Tobias Grosser	b5c440f33f	Fix formatting llvm-svn: 215570	2014-08-13 19:39:08 +00:00
Yi Kong	a5548431a5	AArch64: Prefetch intrinsic llvm-svn: 215569	2014-08-13 19:18:20 +00:00
Yi Kong	26d104a9ec	ARM: Prefetch intrinsics llvm-svn: 215568	2014-08-13 19:18:14 +00:00
Rafael Espindola	bb415eac2e	Simplify memory ownership with std::unique_ptr. llvm-svn: 215567	2014-08-13 18:59:01 +00:00
Rafael Espindola	5f2bb7d9b8	Simplify ownership with std::unique_ptr. NFC. llvm-svn: 215566	2014-08-13 18:49:01 +00:00
Ed Maste	6bc862dcf5	Use consistent capitalization for ENABLE_THREADS in tests llvm-svn: 215565	2014-08-13 18:27:12 +00:00
Matt Arsenault	74ef277774	R600: Correctly set the src value offset for scalarized kernel args This for some reason fixes v1i64 kernel arguments on pre-SI. This currently breaks some other cases in the kernel-args.ll test for R600, but I'm not particularly confident in the new output. VTX_READ_* are not used for some of the scalarized cases, and the code reading from the constant buffer doesn't make much sense to me. llvm-svn: 215564	2014-08-13 18:14:11 +00:00
Johannes Doerfert	5aa2194ea5	[Polly] Remove the PoCC and ScopLib support Remove the PoCC and ScopLib support from Polly as we do not have a user/maintainer for it. Differential Revision: http://reviews.llvm.org/D4871 llvm-svn: 215563	2014-08-13 17:49:16 +00:00
Zachary Turner	c7826524ac	Get test executables compiling on Windows. Many of the test executables use pthreads directly. This isn't portable on Windows, so this patch converts these test to use C++11 threads and mutexes. Since Windows' implementation of std::thread classes throw and catch from header files, this patch also disables exceptions when compiling with clang on Windows. Reviewed by: Todd Fiala, Ed Maste Differential Revision: http://reviews.llvm.org/D4816 llvm-svn: 215562	2014-08-13 17:44:53 +00:00
Rafael Espindola	d3d657ca0f	Revert what looks like an unintended change in r215557. Should fix test ulibc driver tests. llvm-svn: 215561	2014-08-13 17:15:42 +00:00
Rafael Espindola	4674a876cd	Small cleanup: Don't duplicate default behavior. std::unique_ptr is null initialized and reset default to null. Thanks to David Blaikie for noticing. llvm-svn: 215560	2014-08-13 17:08:22 +00:00
Rafael Espindola	fa49c0bf10	Use std::unique_ptr to simplify memory management a bit. llvm-svn: 215559	2014-08-13 16:47:00 +00:00
Benjamin Kramer	a7c40ef022	Canonicalize header guards into a common format. Add header guards to files that were missing guards. Remove #endif comments as they don't seem common in LLVM (we can easily add them back if we decide they're useful) Changes made by clang-tidy with minor tweaks. llvm-svn: 215558	2014-08-13 16:26:38 +00:00
Benjamin Kramer	2f5db8b3db	Header guard canonicalization, clang part. Modifications made by clang-tidy with minor tweaks. llvm-svn: 215557	2014-08-13 16:25:19 +00:00
Benjamin Foster	bdefab961a	Test commit, remove trailing whitespace llvm-svn: 215556	2014-08-13 16:11:50 +00:00
Andrea Di Biagio	ace8e1e3d4	[DAGCombiner] Improved target independent vector shuffle combine rule. This patch improves the existing algorithm in DAGCombiner that attempts to fold shuffles according to rule: shuffle(shuffle(x, y, M1), undef, M2) -> shuffle(y, undef, M3) Before this change, there were cases where the DAGCombiner conservatively avoided folding shuffles even if the resulting mask would have been legal. That is because the algorithm wrongly assumed that commuting an illegal shuffle mask would always produce an illegal mask. With this change, we now correctly compute the commuted shuffle mask before calling method 'isShuffleMaskLegal' on it. On X86, this improves for example the codegen for the following function: define <4 x i32> @test(<4 x i32> %A, <4 x i32> %B) { %1 = shufflevector <4 x i32> %B, <4 x i32> %A, <4 x i32> <i32 1, i32 2, i32 6, i32 7> %2 = shufflevector <4 x i32> %1, <4 x i32> undef, <4 x i32> <i32 2, i32 3, i32 2, i32 3> ret <4 x i32> %2 } Before this change the X86 backend (-mcpu=corei7) generated the following assembly code for function @test: shufps $-23, %xmm0, %xmm1 # xmm1 = xmm1[1,2],xmm0[2,3] movhlps %xmm1, %xmm1 # xmm1 = xmm1[1,1] movaps %xmm1, %xmm0 Now we produce: movhlps %xmm0, %xmm0 # xmm0 = xmm0[1,1] Added extra test cases in combine-vec-shuffle-2.ll to verify that we correctly fold according to the above-mentioned rule. llvm-svn: 215555	2014-08-13 16:09:40 +00:00

1 2 3 4 5 ...

180677 Commits All Branches Search

180677 Commits

All Branches