llvm-project

Commit Graph

Author	SHA1	Message	Date
David Majnemer	31d868b618	X86: Use 'mov' instead of 'lea' in Win64 SEH prologues when possible 'mov' and 'lea' are equivalent when the displacement applied with 'lea' is zero. However, 'mov' should encode smaller. llvm-svn: 230269	2015-02-23 21:50:27 +00:00
David Majnemer	b85e023b8b	X86: Explain why we cannot use a 'mov' in a Win64 epilogue llvm-svn: 230268	2015-02-23 21:50:25 +00:00
David Majnemer	086f6a7e6e	X86: Consistently use 'epilogue' instead of 'epilog' llvm-svn: 230267	2015-02-23 21:50:18 +00:00
Sanjay Patel	27aa1423d2	add newline for easier reading; NFC llvm-svn: 230265	2015-02-23 21:32:09 +00:00
Bruno Cardoso Lopes	24492b057e	[AsmPrinter] Access pointers to globals via pcrel GOT entries Front-ends could use global unnamed_addr to hold pointers to other symbols, like @gotequivalent below: @foo = global i32 42 @gotequivalent = private unnamed_addr constant i32* @foo @delta = global i32 trunc (i64 sub (i64 ptrtoint (i32** @gotequivalent to i64), i64 ptrtoint (i32* @delta to i64)) to i32) The global @delta holds a data "PC"-relative offset to @gotequivalent, an unnamed pointer to @foo. The darwin/x86-64 assembly output for this follows: .globl _foo _foo: .long 42 .globl _gotequivalent _gotequivalent: .quad _foo .globl _delta _delta: .long _gotequivalent-_delta Since unnamed_addr indicates that the address is not significant, only the content, we can optimize the case above by replacing pc-relative accesses to "GOT equivalent" globals, by a PC relative access to the GOT entry of the final symbol instead. Therefore, "delta" can contain a pc relative relocation to foo's GOT entry and we avoid the emission of "gotequivalent", yielding the assembly code below: .globl _foo _foo: .long 42 .globl _delta _delta: .long _foo@GOTPCREL+4 There are a couple of advantages of doing this: (1) Front-ends that need to emit a great deal of data to store pointers to external symbols could save space by not emitting such "got equivalent" globals and (2) IR constructs combined with this opt opens a way to represent GOT pcrel relocations by using the LLVM IR, which is something we previously had no way to express. Differential Revision: http://reviews.llvm.org/D6922 rdar://problem/18534217 llvm-svn: 230264	2015-02-23 21:26:18 +00:00
Andrew Kaylor	982ea13c79	Removing unused private field. llvm-svn: 230259	2015-02-23 21:03:30 +00:00
Andrew Kaylor	322236eed6	Second attempt to fix WinEHCatchDirector build failures. llvm-svn: 230257	2015-02-23 20:44:34 +00:00
Andrew Kaylor	2e30b459ec	Attempting to fix WinEHCatchDirector destructor related build failures. llvm-svn: 230252	2015-02-23 20:19:15 +00:00
Andrew Kaylor	f22fe4ae18	Remap frame variables for native Windows exception handling. Differential Revision: http://reviews.llvm.org/D7770 llvm-svn: 230249	2015-02-23 20:01:56 +00:00
Bruno Cardoso Lopes	32173cdf06	Revert "[X86][MMX] Add MMX instructions to foldable tables" This reverts commit r230226 since it breaks win buildbots. llvm-svn: 230248	2015-02-23 19:53:37 +00:00
Chad Rosier	1df9124289	Revert "Revert "Raising minimum required CMake version to 2.8.12.2."" This reverts commit r230240, which was an accidental commit. llvm-svn: 230246	2015-02-23 19:34:04 +00:00
Eric Christopher	ed47b22951	Rewrite the global merge pass to be subprogram agnostic for now. It was previously using the subtarget to get values for the global offset without actually checking each function as it was generating code. Go ahead and solidify the current behavior and make the existing FIXMEs more prominent. As a note the ARM backend previously had a thumb1 and non-thumb1 set of defaults. Only the former was tested so I've changed the behavior to only use that for now. llvm-svn: 230245	2015-02-23 19:28:45 +00:00
Chad Rosier	543900539f	Prevent hoisting fmul from THEN/ELSE to IF if there is fmsub/fmadd opportunity. This patch adds the isProfitableToHoist API. For AArch64, we want to prevent a fmul from being hoisted in cases where it is more profitable to form a fmsub/fmadd. Phabricator Review: http://reviews.llvm.org/D7299 Patch by Lawrence Hu <lawrence@codeaurora.org> llvm-svn: 230241	2015-02-23 19:15:16 +00:00
Chad Rosier	7c3310694c	Revert "Raising minimum required CMake version to 2.8.12.2." This reverts commit 247aed4710e8befde76da42b27313661dea7cf66. llvm-svn: 230240	2015-02-23 19:15:08 +00:00
Mehdi Amini	cd3ca6f7dd	InstSimplify: simplify 0 / X if nnan and nsz From: Fiona Glaser <fglaser@apple.com> llvm-svn: 230238	2015-02-23 18:30:25 +00:00
Daniel Sanders	afe27c7d27	[mips] Honour -mno-odd-spreg for vector insert/extract when MSA is enabled. Summary: -mno-odd-spreg prohibits the use of odd-numbered single-precision floating point registers. However, vector insert/extract was still using them when manipulating the subregisters of an MSA register. Fixed this by ensuring that insertion/extraction is only performed on even-numbered vector registers when -mno-odd-spreg is given. Reviewers: vmedic, sstankovic Reviewed By: sstankovic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7672 llvm-svn: 230235	2015-02-23 17:22:16 +00:00
Bob Wilson	89e94fc3ad	Fix incorrect immediate size for AddrModeT2_i8s4 in rewriteT2FrameIndex. The natural way to handle this addressing mode would be to say that it has 8 bits and gets scaled by 4, but since the MC layer is expecting the scaling to be already reflected in the immediate value, we have been setting the Scale to 1. That's fine, but then NumBits needs to be adjusted to reflect the effective increase in the range of the immediate. That adjustment was missing. The consequence is that the register scavenger can fail. The estimateRSStackSizeLimit() function in ARMFrameLowering.cpp correctly assumes that the AddrModeT2_i8s4 address mode can handle scaled offsets up to 1020. Under just the right circumstances, we fail to reserve space for the scavenger because it thinks that nothing will be needed. However, the overly pessimistic behavior in rewriteT2FrameIndex causes some frame indexes to be out of range and require scavenged registers, and so the scavenger asserts. Unfortunately I have not been able to come up with a testcase for this. I can only reproduce it on an internal branch where the frame layout and register allocation is slightly different than trunk. We really need a way to serialize MachineInstr-level IR to write reasonable tests for things like this. rdar://problem/19909005 llvm-svn: 230233	2015-02-23 16:57:19 +00:00
Benjamin Kramer	654a85e2ee	Sync the __builtin_expects for our 3 quadratically probed hash table implementations. This assumes that a) finding the bucket containing the value is LIKELY b) finding an empty bucket is LIKELY c) growing the table is UNLIKELY I also switched the a) and b) cases for SmallPtrSet as we seem to use the set mostly more for insertion than for checking existence. In a simple benchmark consisting of 2^21 insertions of 2^20 unique pointers into a DenseMap or SmallPtrSet a few percent speedup on average, but nothing statistically significant. llvm-svn: 230232	2015-02-23 16:41:36 +00:00
Bruno Cardoso Lopes	f488e2ae69	[X86][MMX] Add MMX instructions to foldable tables Teach the peephole optimizer to work with MMX instructions by adding entries into the foldable tables. This covers folding opportunities not handled during isel. llvm-svn: 230226	2015-02-23 15:23:22 +00:00
Bruno Cardoso Lopes	9e1c4c17d9	[X86][MMX] Support folding loads in psll, psrl and psra intrinsics llvm-svn: 230225	2015-02-23 15:23:14 +00:00
Elena Demikhovsky	52e81bc499	AVX-512: recommitted 229837 + bugfix + test llvm-svn: 230223	2015-02-23 15:12:31 +00:00
Elena Demikhovsky	145e5b4409	restructured X86 scalar unary operation templates I made the templates general, no need to define pattern separately for each instruction/intrinsic. Now only need to add r_Int pattern for AVX. llvm-svn: 230221	2015-02-23 14:14:02 +00:00
David Majnemer	eba692dd28	AsmParser: Check ConstantExpr insertvalue operands for type correctness llvm-svn: 230206	2015-02-23 07:13:52 +00:00
Zachary Turner	bc42da0326	[llvm-pdbdump] Very minor code cleanup. This just removes some dead enums as well as some debug flushes of stdout. llvm-svn: 230204	2015-02-23 05:59:14 +00:00
Zachary Turner	29c69105fb	[llvm-pdbdump] Add an option to dump full class definitions. This adds the --class-definitions flag. If specified, when dumping types, instead of "class Foo" you will see the full class definition, with member functions, constructors, access specifiers. NOTE: Using this option can be very slow, as generating a full class definition requires accessing many different parts of the PDB. llvm-svn: 230203	2015-02-23 05:58:34 +00:00
David Majnemer	8d22abdd59	AsmParser: Call instructions can't have an alignment llvm-svn: 230193	2015-02-23 00:01:32 +00:00
David Majnemer	00303b6861	AsmParser: Check ConstantExpr GEP operands for validity llvm-svn: 230188	2015-02-22 23:14:52 +00:00
Zachary Turner	9a818ad193	[llvm-pdbdump] Rewrite dumper using visitor pattern. This increases the flexibility of how to dump different symbol types -- necessary for context-sensitive formatting of symbol types -- and also improves the modularity by allowing the dumping to be implemented in the actual dumper, as opposed to in the PDB library. llvm-svn: 230184	2015-02-22 22:03:38 +00:00
Zachary Turner	fc4ecedb75	[llvm-pdbdump] Simplify options and output. This removes a wealth of options, and instead now only provides three options. -symbols, -types, and -compilands. This greatly simplifies use of the tool, and makes it easier to understand what you're going to see when you run the tool. llvm-svn: 230182	2015-02-22 21:45:38 +00:00
David Blaikie	5e5d7840fb	Roll condition into an assert then wrap it 'ifndef NDEBUG' to protect from the inevitable "unused variable" warning in a non-asserts build. llvm-svn: 230181	2015-02-22 20:58:38 +00:00
JF Bastien	30bf96bfe7	Use common parse routine to read alignment values from bitcode While fuzzing LLVM bitcode files, I discovered that (1) the bitcode reader doesn't check that alignments are no larger than 2**29; (2) downstream code doesn't check the range; and (3) for values out of range, corresponding large memory requests (based on alignment size) will fail. This code fixes the bitcode reader to check for valid alignments, fixing this problem. This CL fixes alignment value on global variables, functions, and instructions: alloca, load, load atomic, store, store atomic. Patch by Karl Schimpf (kschimpf@google.com). llvm-svn: 230180	2015-02-22 19:32:03 +00:00
Hal Finkel	3d4269ab05	[LICM] Refactor to expose functionality as utility functions This refactors the core functionality of LICM: HoistRegion, SinkRegion and PromoteAliasSet (renamed to promoteLoopAccessesToScalars) as utility functions in LoopUtils. This will enable other transformations to make use of them directly. Patch by Ashutosh Nema. llvm-svn: 230178	2015-02-22 18:35:32 +00:00
Simon Pilgrim	4e30d9b6d8	[DagCombiner] Generalized BuildVector Vector Concatenation The CONCAT_VECTORS combiner pass can transform the concat of two BUILD_VECTOR nodes into a single BUILD_VECTOR node. This patch generalises this to support any number of BUILD_VECTOR nodes, and also permits UNDEF nodes to be included as well. This was noticed as AVX vec128 -> vec256 canonicalization sometimes creates a CONCAT_VECTOR with a real vec128 lower and an vec128 UNDEF upper. Differential Revision: http://reviews.llvm.org/D7816 llvm-svn: 230177	2015-02-22 18:17:28 +00:00
Hal Finkel	e2dd84e42f	[DAGCombine] Don't assume integer-type legailty in reduceBuildVecConvertToConvertBuildVec DAGCombine will rewrite an BUILD_VECTOR where all non-undef inputs some from [US]INT_TO_FP, as a BUILD_VECTOR of integers with the conversion applied as a vector operation. We check operation legality of the conversion, but fail to check legality of the integer vector type itself. Because targets don't normally override operation legality defaults for illegal types, we need to check this also. This came up in the context of the QPX vector entensions for PowerPC (which can have legal floating-point vector types without corresponding legal integer vector types). No in-tree test case for this yes, but one can be added once the QPX support has been committed. llvm-svn: 230176	2015-02-22 16:10:22 +00:00
Hal Finkel	f5b957060b	[SDAG] Use correct alignments on expanded vector trunc-store/ext-loads When expanding a truncating store or extending load using vector extracts or inserts and scalar stores and loads, we were giving each of these scalar stores or loads the same alignment as the original vector operation. While this will often be right (most vector operations, especially those produced by autovectorization, have the alignment of the underlying scalar type), the vector operation could certainly have a larger alignment. No test case (yet); noticed by inspection. llvm-svn: 230175	2015-02-22 15:58:04 +00:00
NAKAMURA Takumi	3d61760bd6	Fix a warning on HexagonMCCodeEmitter::MCII. [-Wunused-private-field] llvm-svn: 230170	2015-02-22 09:58:29 +00:00
NAKAMURA Takumi	f7d08f6dcc	RewriteStatepointsForGC.cpp: Fix for -Asserts to mark isNullConstant() as LLVM_ATTRIBUTE_UNUSED. [-Wunused-function] llvm-svn: 230169	2015-02-22 09:58:19 +00:00
NAKAMURA Takumi	02aa295a00	RewriteStatepointsForGC.cpp: Fix for -Asserts. [-Wunused-variable] llvm-svn: 230168	2015-02-22 09:58:13 +00:00
NAKAMURA Takumi	6c24684c95	LowerBitSets.cpp: Prune incorrect \param(s). [-Wdocumentation] \param should be used as itemized. llvm-svn: 230167	2015-02-22 09:51:42 +00:00
Craig Topper	8659344d93	[X86] Add some missing redundant MMX and SSE encodings for disassembler. llvm-svn: 230165	2015-02-22 07:50:41 +00:00
David Majnemer	429fa1220d	COFF: Add 'IMAGE_SCN_CNT_INITIALIZED_DATA' to all DWARF sections The CodeView debug info section, .debug$S, also has this set. MinGW sets this bit for their DWARF sections as well. llvm-svn: 230156	2015-02-22 02:35:27 +00:00
David Majnemer	8c8b6d6b39	COFF: Consistently format the DWARF sections llvm-svn: 230155	2015-02-22 02:35:22 +00:00
Lang Hames	d1c2082c39	[Orc] Remove redundant using directive. llvm-svn: 230154	2015-02-22 01:48:23 +00:00
Lang Hames	53ccf88893	[Orc] Add header comment to IndirectionUtils.cpp. llvm-svn: 230153	2015-02-22 01:45:31 +00:00
Sanjoy Das	95c476db94	IRCE: generalize InductiveRangeCheck::computeSafeIterationSpace to work with a non-canonical induction variable. This is currently a non-functional change because we only ever call computeSafeIterationSpace on a canonical induction variable; but the generalization will be useful in a later commit. llvm-svn: 230151	2015-02-21 22:20:22 +00:00
Sanjoy Das	7fc60da2f5	IRCE: use SCEVs instead of llvm::Value's for intermediate calculations. Semantically non-functional change. This gets rid of some of the SCEV -> Value -> SCEV round tripping and the Construct(SMin\|SMax)Of and MaybeSimplify helper routines. llvm-svn: 230150	2015-02-21 22:07:32 +00:00
Matt Arsenault	f07833057c	R600/SI: Use v_madmk_f32 llvm-svn: 230149	2015-02-21 21:29:10 +00:00
Matt Arsenault	0325d3d27f	R600/SI: Try to use v_madak_f32 This is a code size optimization when the constant only has one use. llvm-svn: 230148	2015-02-21 21:29:07 +00:00
Matt Arsenault	657b1cb739	R600/SI: Don't crash when getting immediate operand size llvm-svn: 230147	2015-02-21 21:29:04 +00:00
Matt Arsenault	70120fa813	R600/SI: Fix mad*k definitions llvm-svn: 230146	2015-02-21 21:29:00 +00:00

1 2 3 4 5 ...

77256 Commits