llvm-project

Commit Graph

Author	SHA1	Message	Date
Chad Rosier	51afe6397b	Whitespace. llvm-svn: 159300	2012-06-27 22:34:28 +00:00
Jack Carter	8ad0c272af	The ELF relocation record format is different for N64 which many Mips 64 ABIs use than for O64 which many if not all other target ABIs use. Most architectures have the following 64 bit relocation record format: typedef struct { Elf64_Addr r_offset; /* Address of reference / Elf64_Xword r_info; / Symbol index and type of relocation / } Elf64_Rel; typedef struct { Elf64_Addr r_offset; Elf64_Xword r_info; Elf64_Sxword r_addend; } Elf64_Rela; Whereas N64 has the following format: typedef struct { Elf64_Addr r_offset;/ Address of reference / Elf64_Word r_sym; / Symbol index / Elf64_Byte r_ssym; / Special symbol / Elf64_Byte r_type3; / Relocation type / Elf64_Byte r_type2; / Relocation type / Elf64_Byte r_type; / Relocation type / } Elf64_Rel; typedef struct { Elf64_Addr r_offset;/ Address of reference / Elf64_Word r_sym; / Symbol index / Elf64_Byte r_ssym; / Special symbol / Elf64_Byte r_type3; / Relocation type / Elf64_Byte r_type2; / Relocation type / Elf64_Byte r_type; / Relocation type */ Elf64_Sxword r_addend; } Elf64_Rela; The structure is the same size, but the r_info data element is now 5 separate elements. Besides the content aspects, endian byte reordering will be different for the area with each element being endianized separately. I treat this as generic and continue to pass r_type as an integer masking and unmasking the byte sized N64 values for N64 mode. I've implemented this and it causes no affect on other current targets. This passes make check. Jack llvm-svn: 159299	2012-06-27 22:28:30 +00:00
Anshuman Dasgupta	20013f1368	Refactor and speed up DFA generator. Patch by Ivan Llopard! llvm-svn: 159281	2012-06-27 19:38:29 +00:00
Matt Beaumont-Gay	a58862310c	Revert r159136 due to PR13124. Original commit message: If a constant or a function has linkonce_odr linkage and unnamed_addr, mark it hidden. Being linkonce_odr guarantees that it is available in every dso that needs it. Being a constant/function with unnamed_addr guarantees that the copies don't have to be merged. llvm-svn: 159272	2012-06-27 17:10:33 +00:00
Duncan Sands	206fc30ab1	When users ask for -mcpu=help or -mattr=help, just output the help without requiring a module. Original patch by Sunay Ismail, simplified by Arnaud de Grandmaison, then complicated by me (if a triple was specified on the command line, output help for that triple, not for the default). llvm-svn: 159268	2012-06-27 16:23:48 +00:00
Duncan Sands	514db117bd	Some reassociate optimizations create new instructions, which they insert just before the expression root. Any existing operators that are changed to use one of them needs to be moved between it and the expression root, and recursively for the operators using that one. When I rewrote RewriteExprTree I accidentally inverted the logic, resulting in the compacting going down from operators to operands rather than up from operands to the operators using them, oops. Fix this, resolving PR12963. llvm-svn: 159265	2012-06-27 14:19:00 +00:00
Richard Barton	57b7d16e34	Teach assembler to handle capitalised operation values for DSB instructions llvm-svn: 159259	2012-06-27 09:48:23 +00:00
Chandler Carruth	aa324c9078	Clean up the 'check' CMake build rule a bit, notable renaming it to 'check-llvm'. Don't worry! 'check' still works! =] To rationalize the names of targets used to run tests, the vague plan is the following: make check-llvm # run LLVM reg/unit tests (currently 'check') make check-clang # run Clang reg/unit tests (currently 'clang-test') make check-rt # run CompilerRT reg/unit tests make check-asan # run ASan reg/unit tests (subset of -rt) make check-tsan # run TSan reg/unit tests (subset of -rt) make check-all # run as much of the above as is available The last one respects what projects are checked out and built for a given tree. Personally, I would like to eventually make 'check' be an alias for 'check-all'. For now however, it is an alias for 'check-llvm', and thus no behavior has changed. While this patch and my plan only really apply to CMake, I think it might be good to similarly rationalize the naming scheme for the Make builds. llvm-svn: 159258	2012-06-27 09:44:16 +00:00
Richard Barton	4b7558ef9a	Prevent ARM Assembler crashing on unrecognised assembly format for DSB instruction llvm-svn: 159257	2012-06-27 09:36:19 +00:00
Bill Wendling	c66b152e07	Sphinxify the exception handling doc. llvm-svn: 159254	2012-06-27 07:20:57 +00:00
Akira Hatanaka	d030738b8f	Silence uninitialized variable warning in MipsISelDAGToDAG.cpp. llvm-svn: 159243	2012-06-27 00:49:46 +00:00
Akira Hatanaka	ad31cd9a01	Test case for r159240. llvm-svn: 159242	2012-06-27 00:40:34 +00:00
Chandler Carruth	276abc5d4a	Exclude both libcxx and compiler-rt until we get their CMake builds suitable for building as a whole-project. llvm-svn: 159241	2012-06-27 00:30:08 +00:00
Akira Hatanaka	62871a3460	Fix bug in computation of stack size in MipsFrameLowering.cpp. llvm-svn: 159240	2012-06-27 00:20:39 +00:00
Bill Wendling	3b70d784a2	Reduce indentation in function. Rearrange some methods. No functionality change. llvm-svn: 159239	2012-06-26 23:22:18 +00:00
Jim Grosbach	8ccdbd19c8	TableGen: AsmMatcher diagnostics preference detail. Don't override a custom diagnostic w/ a generic InvalidOperand, all else being equal. llvm-svn: 159238	2012-06-26 22:58:01 +00:00
Bill Wendling	e02a1f8cf2	Revamp how debugging information is emitted for debug info objects. It's not necessary for each DI class to have its own copy of `print' and `dump'. Instead, just give DIDescriptor those methods and have it call the appropriate debugging printing routine based on the type of the debug information. llvm-svn: 159237	2012-06-26 22:57:33 +00:00
Evan Cheng	a75127871c	Add a missing check to avoid dereference null. No sensible test case possible. Sorry. rdar://11745134 llvm-svn: 159236	2012-06-26 22:54:59 +00:00
Evan Cheng	319be53a1f	Remove a instcombine transform that (no longer?) makes sense: // C - zext(bool) -> bool ? C - 1 : C if (ZExtInst *ZI = dyn_cast<ZExtInst>(Op1)) if (ZI->getSrcTy()->isIntegerTy(1)) return SelectInst::Create(ZI->getOperand(0), SubOne(C), C); This ends up forming sext i1 instructions that codegen to terrible code. e.g. int blah(_Bool x, _Bool y) { return (x - y) + 1; } => movzbl %dil, %eax movzbl %sil, %ecx shll $31, %ecx sarl $31, %ecx leal 1(%rax,%rcx), %eax ret Without the rule, llvm now generates: movzbl %sil, %ecx movzbl %dil, %eax incl %eax subl %ecx, %eax ret It also helps with ARM (and pretty much any target that doesn't have a sext i1 :-). The transformation was done as part of Eli's r75531. He has given the ok to remove it. rdar://11748024 llvm-svn: 159230	2012-06-26 22:03:13 +00:00
Benjamin Kramer	efe4028693	Implement getHostCPUName for ARM/linux. This will be used to implement -march=native in clang. The cpuid registers are only available in privileged mode so we don't have an OS-independent way of implementing this. ARM doesn't provide a list of processor IDs so the list is somewhat incomplete. llvm-svn: 159228	2012-06-26 21:36:32 +00:00
Rafael Espindola	e0eaa043eb	Fix llc's -print-before=pass and -print-after=pass. llvm-svn: 159227	2012-06-26 21:33:36 +00:00
Manman Ren	a09820414a	X86: add GATHER intrinsics (AVX2) in LLVM Support the following intrinsics: llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256 llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256 Modified Disassembler to handle VSIB addressing mode. llvm-svn: 159221	2012-06-26 19:47:59 +00:00
Tim Northover	e5629966df	Teach TableGen to put chains on more instructions When generating selection tables for Pat instances, TableGen relied on an output Instruction's Pattern field being set to infer whether a chain should be added. This patch adds additional logic to check various flag fields so that correct code can be generated even if Pattern is unset. llvm-svn: 159217	2012-06-26 18:46:28 +00:00
Argyrios Kyrtzidis	46785f9461	Fix ThreadLocalImpl::getInstance for --disable-threads. PR13114. llvm-svn: 159210	2012-06-26 17:13:58 +00:00
Jakob Stoklund Olesen	59a0d3243b	Allow targets to inject passes before the virtual register rewriter. Such passes can be used to tweak the register assignments in a target-dependent way, for example to avoid write-after-write dependencies. llvm-svn: 159209	2012-06-26 17:09:29 +00:00
Stepan Dyatkovskiy	593d358cf3	IntegersSubsetTest: Due to compilation failure with -std=c11, replaced -1UL with NOT_A_NUMBER constant (0xffff). llvm-svn: 159207	2012-06-26 16:08:20 +00:00
Jack Carter	5e69cffed5	There are a number of generic inline asm operand modifiers that up to r158925 were handled as processor specific. Making them generic and putting tests for these modifiers in the CodeGen/Generic directory caused a number of targets to fail. This commit addresses that problem by having the targets call the generic routine for generic modifiers that they don't currently have explicit code for. For now only generic print operands 'c' and 'n' are supported.vi Affected files: test/CodeGen/Generic/asm-large-immediate.ll lib/Target/PowerPC/PPCAsmPrinter.cpp lib/Target/NVPTX/NVPTXAsmPrinter.cpp lib/Target/ARM/ARMAsmPrinter.cpp lib/Target/XCore/XCoreAsmPrinter.cpp lib/Target/X86/X86AsmPrinter.cpp lib/Target/Hexagon/HexagonAsmPrinter.cpp lib/Target/CellSPU/SPUAsmPrinter.cpp lib/Target/Sparc/SparcAsmPrinter.cpp lib/Target/MBlaze/MBlazeAsmPrinter.cpp lib/Target/Mips/MipsAsmPrinter.cpp MSP430 isn't represented because it did not even run with the long existing 'c' modifier and it was not apparent what needs to be done to get it inline asm ready. Contributer: Jack Carter llvm-svn: 159203	2012-06-26 13:49:27 +00:00
Duncan Sands	8bc764aeca	Replacing zero-sized alloca's with a null pointer is too aggressive, instead merge all zero-sized alloca's into one, fixing c43204g from the Ada ACATS conformance testsuite. What happened there was that a variable sized object was being allocated on the stack, "alloca i8, i32 %size". It was then being passed to another function, which tested that the address was not null (raising an exception if it was) then manipulated %size bytes in it (load and/or store). The optimizers cleverly managed to deduce that %size was zero (congratulations to them, as it isn't at all obvious), which made the alloca zero size, causing the optimizers to replace it with null, which then caused the check mentioned above to fail, and the exception to be raised, wrongly. Note that no loads and stores were actually being done to the alloca (the loop that does them is executed %size times, i.e. is not executed), only the not-null address check. llvm-svn: 159202	2012-06-26 13:39:21 +00:00
Stepan Dyatkovskiy	e481e0daf4	IntegersSubsetMapping: implemented "diff" operation. Operation allows at the same time perform up to three operations: - LHS exclude RHS - LHS intersect RHS (LHS successors will keeped) - RHS exclude LHS The complexity is N+M, where N is size of LHS M is size of RHS. llvm-svn: 159201	2012-06-26 11:57:43 +00:00
Stepan Dyatkovskiy	883850c4d2	IntegersSubsetMapping: removed exclude operation, it will replaced with more universal "diff" operation in next commit. Changes was separated onto two commits for better readability. llvm-svn: 159200	2012-06-26 11:41:47 +00:00
Bill Wendling	b4e01abdb4	Sphyinxify the Bugpoint document. llvm-svn: 159199	2012-06-26 11:37:00 +00:00
Elena Demikhovsky	863d2d3235	Removed unused variable llvm-svn: 159197	2012-06-26 10:50:07 +00:00
Bill Wendling	8ed44466c2	Rename to match other X86_64* names. llvm-svn: 159196	2012-06-26 10:05:06 +00:00
Elena Demikhovsky	26088d2e24	Shuffle optimization for AVX/AVX2. The current patch optimizes frequently used shuffle patterns and gives these instruction sequence reduction. Before: vshufps $-35, %xmm1, %xmm0, %xmm2 ## xmm2 = xmm0[1,3],xmm1[1,3] vpermilps $-40, %xmm2, %xmm2 ## xmm2 = xmm2[0,2,1,3] vextractf128 $1, %ymm1, %xmm1 vextractf128 $1, %ymm0, %xmm0 vshufps $-35, %xmm1, %xmm0, %xmm0 ## xmm0 = xmm0[1,3],xmm1[1,3] vpermilps $-40, %xmm0, %xmm0 ## xmm0 = xmm0[0,2,1,3] vinsertf128 $1, %xmm0, %ymm2, %ymm0 After: vshufps $13, %ymm0, %ymm1, %ymm1 ## ymm1 = ymm1[1,3],ymm0[0,0],ymm1[5,7],ymm0[4,4] vshufps $13, %ymm0, %ymm0, %ymm0 ## ymm0 = ymm0[1,3,0,0,5,7,4,4] vunpcklps %ymm1, %ymm0, %ymm0 ## ymm0 = ymm0[0],ymm1[0],ymm0[1],ymm1[1],ymm0[4],ymm1[4],ymm0[5],ymm1[5] llvm-svn: 159188	2012-06-26 08:04:10 +00:00
Chandler Carruth	9139f44d23	Update a bunch of stale comments that dated from when this folled the very first (and worst) placement algorithm. These should now more accurately reflect the reality of the pass. llvm-svn: 159185	2012-06-26 05:16:37 +00:00
Craig Topper	94bf0f3855	Remove some duplicate instructions that exist only to given different mnemonics for the assembler. Use InstAlias instead. llvm-svn: 159184	2012-06-26 04:12:49 +00:00
Andrew Trick	fb2ba3e1cb	Enable the new LoopInfo algorithm by default. The primary advantage is that loop optimizations will be applied in a stable order. This helps debugging and unit test creation. It is also a better overall implementation without pathologically bad performance on deep functions. On large functions (llvm-stress --size=200000 \| opt -loops) Before: 0.1263s After: 0.0225s On deep functions (after tweaking llvm-stress, thanks Nadav): Before: 0.2281s After: 0.0227s See r158790 for more comments. The loop tree is now consistently generated in forward order, but loop passes are applied in reverse order over the program. If we have a loop optimization that prefers forward order, that can easily be achieved by adding a different type of LoopPassManager. llvm-svn: 159183	2012-06-26 04:11:38 +00:00
Andrew Trick	fecf937938	Remove unnecessary FIXME llvm-svn: 159182	2012-06-26 04:11:34 +00:00
Evan Cheng	4c6f917d34	Make sure type is not extended or untyped before create a constant of the type. No test case. Found by inspection. llvm-svn: 159179	2012-06-26 01:19:33 +00:00
Eric Christopher	d6d1f18945	Typo. llvm-svn: 159178	2012-06-26 00:28:15 +00:00
Eli Friedman	bbcd09cc00	Make some ugly hacks for inline asm operands which name a specific register a bit more thorough. PR13196. llvm-svn: 159176	2012-06-25 23:42:33 +00:00
Nuno Lopes	31b54a5379	revert my previous commit (r159173), since as Eli pointed out, it's perfectly ok to mark realloc as noalias llvm-svn: 159175	2012-06-25 23:26:10 +00:00
Nuno Lopes	75eaa72de9	do not set realloc() as NotAlias, since it can return the same pointer. This whole thing should be upgraded to use the MemoryBuiltin interface anyway.. llvm-svn: 159173	2012-06-25 22:55:50 +00:00
Manman Ren	606953fbe7	ARM: update peephole optimization. More condition codes are included when deciding whether to remove cmp after a sub instruction. Specifically, we extend from GE\|LT\|GT\|LE to GE\|LT\|GT\|LE\|HS\|LS\|HI\|LO\|EQ\|NE. If we have "sub a, b; cmp b, a; movhs", we should be able to replace with "sub a, b; movls". rdar: 11725965 llvm-svn: 159166	2012-06-25 21:49:38 +00:00
Owen Anderson	c272bab5a5	Define DAGOperand, an empty base class for RegisterClass and Operand. This allows one to write multiclasses that are polymorphic over both registers and non-register operands. llvm-svn: 159162	2012-06-25 21:25:16 +00:00
Dan Gohman	5f725cd196	Fix the objc_autoreleasedReturnValue optimization code to locate the call correctly even in the case where it is an invoke. This fixes rdar://11714057. llvm-svn: 159157	2012-06-25 19:47:37 +00:00
Jakob Stoklund Olesen	a57fc12ec9	Enforce stricter liveness rules for PHIs. Verify that all paths from the entry block to a virtual register read pass through a def. Enable this check even when MRI->isSSA() is false. Verify that the live range of a virtual register is live out of all predecessor blocks, even for PHI-values. This requires that PHIElimination sometimes inserts IMPLICIT_DEF instruction in predecessor blocks. llvm-svn: 159150	2012-06-25 18:18:27 +00:00
Jakob Stoklund Olesen	eb49566447	Run ProcessImplicitDefs on SSA form where it can be much simpler. Implicitly defined virtual registers can simply have the <undef> bit set on all uses, and copies can be turned into implicit defs recursively. Physical registers are a bit trickier. We handle the common case where a physreg def is used by a nearby instruction in the same basic block. For more complicated cases, just leave the IMPLICIT_DEF instruction in. llvm-svn: 159149	2012-06-25 18:12:18 +00:00
Nuno Lopes	07594cba7c	improve optimization of invoke instructions: - simplifycfg: invoke undef/null -> unreachable - instcombine: invoke new -> invoke expect(0, 0) (an arbitrary NOOP intrinsic; only done if the allocated memory is unused, of course) - verifier: allow invoke of intrinsics (to make the previous step work) llvm-svn: 159146	2012-06-25 17:11:47 +00:00
Nuno Lopes	9ecc8761bc	check for the NoAlias attribute through CallSite llvm-svn: 159145	2012-06-25 16:17:54 +00:00

1 2 3 4 5 ...

83269 Commits