llvm-project

Commit Graph

Author	SHA1	Message	Date
Kostya Serebryany	a1849af4a9	[fuzzer] add FAQ section to the README.txt llvm-svn: 227466	2015-01-29 17:11:30 +00:00
Aaron Ballman	ef11698cac	Reverting r227452, which adds back the fuzzer library. Now excluding the fuzzer library based on LLVM_USE_SANITIZE_COVERAGE being set or unset. llvm-svn: 227464	2015-01-29 16:58:29 +00:00
Colin LeMahieu	860210bc49	[Hexagon] Adding CR intrinsic tests. llvm-svn: 227463	2015-01-29 16:55:37 +00:00
Tom Stellard	b14ead55f4	R600/SI: Remove stray debug statements llvm-svn: 227462	2015-01-29 16:55:28 +00:00
Tom Stellard	83f0bcef7a	R600/SI: Define a schedule model and enable the generic machine scheduler The schedule model is not complete yet, and could be improved. llvm-svn: 227461	2015-01-29 16:55:25 +00:00
Colin LeMahieu	e75aa4983c	[Hexagon] Deleting unused classes. llvm-svn: 227460	2015-01-29 16:35:38 +00:00
Robert Lougher	c69cfeeafa	[X86] Use single add/sub for large stack offsets For large stack offsets the compiler generates multiple immediate mode sub/add instructions in the prologue/epilogue. This patch makes the compiler place the final amount to be added/subtracted into a register, which is then added/subtracted with a single operation. Differential Revision: http://reviews.llvm.org/D7226 llvm-svn: 227458	2015-01-29 16:18:29 +00:00
Colin LeMahieu	a749b3ee6a	[Hexagon] Adding XTYPE/PRED intrinsic tests. Converting predicate types to i32 instead of i1. llvm-svn: 227457	2015-01-29 16:08:43 +00:00
Bill Schmidt	8cf15ced8c	[PowerPC] Complete setting the baseline for ppc64le Patch by Nemanja Ivanovic. As was uncovered by the failing test case (when run on non-PPC platforms), the feature set when compiling with -march=ppc64le was not being picked up. This change ensures that if the -mcpu option is not specified, the correct feature set is picked up regardless of whether we are on PPC or not. llvm-svn: 227455	2015-01-29 15:59:09 +00:00
Aaron Ballman	7b54ed221a	Temporarily reverting the fuzzer library as it causes too many build issues for MSVC users. This reverts: 227445, 227395, 227389, 227357, 227254, 227252 llvm-svn: 227452	2015-01-29 15:49:22 +00:00
Aaron Ballman	d39df1e24d	Adding missing #includes to try to get this to compile on Windows with Visual Studio. llvm-svn: 227445	2015-01-29 15:19:13 +00:00
Sean Silva	ba516a11a8	Remove unused tokens in the ll lexer. Patch by Robin Eklind! llvm-svn: 227442	2015-01-29 14:45:09 +00:00
Rafael Espindola	093dcc43f7	Use isMergeableConst now that it is sane. llvm-svn: 227441	2015-01-29 14:23:28 +00:00
Rafael Espindola	33804cac96	Remove MergeableConst. Only the specific ones (MergeableConst4, MergeableConst8, MergeableConst16) are handled specially. llvm-svn: 227440	2015-01-29 14:12:41 +00:00
James Molloy	5f255eb48f	[LoopReroll] Refactor most of reroll() into a helper class reroll() was slightly monolithic and a pain to modify. Refactor a bunch of its state from local variables to member variables of a helper class, and do some trivial simplification while we're there. llvm-svn: 227439	2015-01-29 13:48:05 +00:00
Benjamin Kramer	eb63e4d6f8	EHPrepare: Remove leftover initialization code for DomTrees. While there modernize some loops. NFC. llvm-svn: 227436	2015-01-29 13:26:50 +00:00
Rafael Espindola	e2d4b2df39	Use enum values. NFC. llvm-svn: 227435	2015-01-29 13:25:44 +00:00
Rafael Espindola	fad3901095	Don't create multiple mergeable sections with -fdata-sections. ELF has support for sections that can be split into fixed size or null terminated entities. Since these sections can be split by the linker, it is not necessary to split them in codegen. This reduces the combined .o size in a llvm+clang build from 202,394,570 to 173,819,098 bytes. The time for linking clang with gold (on a VM, on a laptop) goes from 2.250089985 to 1.383001792 seconds. The flip side is the size of rodata in clang goes from 10,926,785 to 10,929,345 bytes. The increase seems to be because of http://sourceware.org/bugzilla/show_bug.cgi?id=17902. llvm-svn: 227431	2015-01-29 12:43:28 +00:00
Vladimir Medic	df464ae224	[Mips][Disassembler] When disassembler meets cache/pref instructions for r6 it crashes as the access to operands array is out of range. This patch adds dedicated decoder method for R6 CACHE_HINT_DESC class that properly handles decoding of these instructions. llvm-svn: 227430	2015-01-29 11:33:41 +00:00
NAKAMURA Takumi	8182818a53	CommandLineParser: Avoid non-static member initializer(s). llvm-svn: 227428	2015-01-29 11:06:59 +00:00
Owen Anderson	c4d245c391	Fix the preprocessor checks used to determine if backtraces have been enabled. llvm-svn: 227424	2015-01-29 07:53:13 +00:00
Owen Anderson	9253bb933a	Use the existing build configuration parameter ENABLE_BACKTRACE to compile out all pretty stack trace support when backtraces are disabled. This has the nice secondary effect of allowing LLVM to continue to build for targets without __thread or thread_local support, so long as they build without support for backtraces. llvm-svn: 227423	2015-01-29 07:35:31 +00:00
Simon Atanasyan	99cd1fb012	[ELFYAML] Provide default value 0 for YAML relocation addendum field Follow up to r227318. llvm-svn: 227422	2015-01-29 06:56:24 +00:00
Chandler Carruth	a1a622746e	Remove an unused private field added in r227405 to fix a Clang warning. llvm-svn: 227415	2015-01-29 02:44:53 +00:00
Chandler Carruth	fb3139ad9e	[LPM] Clean up the use of TLS in pretty stack trace and disable it entirely when threads are not enabled. This should allow anyone who needs to bootstrap or cope with a host loader without TLS support to limp along without threading support. There is still some bug in the PPC TLS stuff that is not worked around. I'm getting access to a machine to reproduce and debug this further. There is some chance that I'll have to add a terrible workaround for PPC. There is also some problem with iOS, but I have no ability to really evaluate what the issue is there. I'm leaving it to folks maintaining that platform to suggest a path forward -- personally I don't see any useful path forward that supports threading in LLVM but does so without support for very basic TLS. Note that we don't need more than some pointers, and we don't need constructors, destructors, or any of the other fanciness which remains widely unimplemented. llvm-svn: 227411	2015-01-29 01:23:04 +00:00
Reid Kleckner	8fcb0ad8a6	Remove unused variable llvm-svn: 227408	2015-01-29 00:55:44 +00:00
Reid Kleckner	1185fced3d	Add a Windows EH preparation pass that zaps resumes If the personality is not a recognized MSVC personality function, this pass delegates to the dwarf EH preparation pass. This chaining supports people on -windows-itanium or -windows-gnu targets. Currently this recognizes some personalities used by MSVC and turns resume instructions into traps to avoid link errors. Even if cleanups are not used in the source program, LLVM requires the frontend to emit a code path that resumes unwinding after an exception. Clang does this, and we get unreachable resume instructions. PR20300 covers cleaning up these unreachable calls to resume. Reviewers: majnemer Differential Revision: http://reviews.llvm.org/D7216 llvm-svn: 227405	2015-01-29 00:41:44 +00:00
Eric Christopher	905f12d96d	Remove getSubtargetImpl from AArch64ISelLowering and cache the correct subtarget by passing it in during the constructor as TargetLowering is Subtarget specific. llvm-svn: 227402	2015-01-29 00:19:42 +00:00
Eric Christopher	1889fdc142	Remove getSubtargetImpl from ARMISelLowering and cache the correct subtarget by passing it in during the constructor as TargetLowering is Subtarget specific. llvm-svn: 227401	2015-01-29 00:19:39 +00:00
Eric Christopher	c125e12261	Small cleanup in ARMFastISel initialization. llvm-svn: 227400	2015-01-29 00:19:37 +00:00
Eric Christopher	1b21f00904	Migrate ARM except for TTI, AsmPrinter, and frame lowering away from getSubtargetImpl. llvm-svn: 227399	2015-01-29 00:19:33 +00:00
Manuel Jacob	6f508c578b	Add nullptr checks for TargetSelectionDAGInfo in SelectionDAG. TSI is not guaranteed to be non-null in SelectionDAG. llvm-svn: 227397	2015-01-28 23:50:40 +00:00
Kostya Serebryany	265cf04f9c	[fuzzer] add option -save_minimized_corpus llvm-svn: 227395	2015-01-28 23:48:39 +00:00
Chandler Carruth	b2fe3e5c35	[LPM] Fix the PPC attribute to be spelled 'global-dynamic'. This should let the build bot finish compiling stage2. llvm-svn: 227391	2015-01-28 23:10:57 +00:00
Philip Reames	9198b33b48	Teach SplitBlockPredecessors how to handle landingpad blocks. Patch by: Igor Laevsky <igor@azulsystems.com> "Currently SplitBlockPredecessors generates incorrect code if the basic block we are going to split has a landingpad. It also seems to be fairly common among its users to conditionally call either SplitBlockPredecessors or SplitLandingPadPredecessors. Because of this I think it is reasonable to add this condition directly into SplitBlockPredecessors." Differential Revision: http://reviews.llvm.org/D7157 llvm-svn: 227390	2015-01-28 23:06:47 +00:00
Kostya Serebryany	a8fbcf0c1f	Add lit-style tests for the Fuzzer library Summary: Add test targets and the lit-style runner. Test Plan: Run the tests on bot. Reviewers: samsonov Reviewed By: samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7217 llvm-svn: 227389	2015-01-28 22:49:25 +00:00
Sanjay Patel	08efcd9039	fix typos; NFC llvm-svn: 227386	2015-01-28 22:37:32 +00:00
Chris Bieneman	b6866425f3	Build fix for Visual Studio. NFC. llvm-svn: 227385	2015-01-28 22:25:00 +00:00
Colin LeMahieu	4379d10273	[Hexagon] Updating several V5 intrinsics and adding FP tests. llvm-svn: 227379	2015-01-28 22:08:16 +00:00
Simon Pilgrim	80bd3c9e5f	Spelling fixes. NFC. llvm-svn: 227376	2015-01-28 22:03:52 +00:00
Simon Pilgrim	b55bd1e2ac	Line endings fix. NFC. llvm-svn: 227374	2015-01-28 21:56:52 +00:00
Zoran Jovanovic	14c567be90	[mips][microMIPS] Implement SWM and LWM aliases Differential Revision: http://reviews.llvm.org/D5820 llvm-svn: 227373	2015-01-28 21:52:27 +00:00
Kostya Serebryany	e6972a029f	[fuzzer] instructions for building/running clang-format-fuzzer llvm-svn: 227357	2015-01-28 19:51:58 +00:00
Sanjay Patel	4058dd9f3f	invert check for less indentation; use local vars to reduce duplication; NFC llvm-svn: 227355	2015-01-28 19:44:21 +00:00
Colin LeMahieu	1de7e0d923	[Hexagon] Updating many V4 intrinsic patterns. Adding missing instruction and deleting unused classes. llvm-svn: 227353	2015-01-28 19:39:09 +00:00
Chandler Carruth	be09eb75aa	[LPM] Try to work around a bug with local-dynamic TLS on PowerPC 64. Sadly, this precludes optimizing it down to initial-exec or local-exec when statically linking, and in general makes the code slower on PPC 64, but there's nothing else for it until we can arrange to produce the correct bits for the linker. Lots of thanks to Ulrich for tracking this down and Bill for working on the long-term fix to LLVM so that we can relegate this to old host clang versions. I'll be watching the PPC build bots to make sure this effectively revives them. llvm-svn: 227352	2015-01-28 19:29:22 +00:00
Philip Reames	23cf2e2f97	Remove gc.root's performCustomLowering This is a refactoring to restructure the single user of performCustomLowering as a specific lowering pass and remove the custom lowering hook entirely. Before this change, the LowerIntrinsics pass (note to self: rename!) was essentially acting as a pass manager, but without being structured in terms of passes. Instead, it proxied calls to a set of GCStrategies internally. This adds a lot of conceptual complexity (i.e. GCStrategies are stateful!) for very little benefit. Since there's been interest in keeping the ShadowStackGC working, I extracted its custom lowering pass into a dedicated pass and just added that to the pass order. It will only run for functions which opt in to that gc. I wasn't able to find an easy way to preserve the runtime registration of custom lowering functionality. Given that no user of this exists that I'm aware of, I made the choice to just remove that. If someone really cares, we can look at restoring it via dynamic pass registration in the future. Note that despite the large diff, none of the lowering code actually changes. I added the framing needed to make it a pass and renamed the class, but that's it. Differential Revision: http://reviews.llvm.org/D7218 llvm-svn: 227351	2015-01-28 19:28:03 +00:00
Colin LeMahieu	94c33218e3	[Hexagon] Adding XTYPE/MPY intrinsic tests and some missing multiply instructions. llvm-svn: 227347	2015-01-28 19:16:17 +00:00
Chris Bieneman	d1d9430a05	Refactoring llvm command line parsing and option registration. Summary: The primary goal of this patch is to remove the need for MarkOptionsChanged(). That goal is accomplished by having addOption and removeOption properly sort the options. This patch puts the new add and remove functionality on a CommandLineParser class that is a placeholder. Some of the functionality in this class will need to be merged into the OptionRegistry, and other bits can hopefully be in a better abstraction. This patch also removes the RegisteredOptionList global, and the need for cl::Option objects to be linked list nodes. The changes in CommandLineTest.cpp are required because these changes shift when we validate that options are not duplicated. Before this change duplicate options were only found during certain cl API calls (like cl::ParseCommandLine). With this change duplicate options are found during option construction. Reviewers: dexonsmith, chandlerc, pete Reviewed By: pete Subscribers: pete, majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D7132 llvm-svn: 227345	2015-01-28 19:00:25 +00:00
Colin LeMahieu	19ed07c75a	[Hexagon] Deleting a lot of old variants of intrinsics and updating references. llvm-svn: 227338	2015-01-28 18:29:11 +00:00
Colin LeMahieu	39b846ce0f	[Hexagon] Converting XTYPE/BIT intrinsic patterns and adding tests. llvm-svn: 227335	2015-01-28 18:06:23 +00:00
Sanjay Patel	9bb601856e	use SDValue methods directly instead of getNode()->* ; NFCI llvm-svn: 227334	2015-01-28 18:01:31 +00:00
Rafael Espindola	a05b3b73a4	Simplify code. NFC. llvm-svn: 227333	2015-01-28 17:54:19 +00:00
Colin LeMahieu	fe03c9a678	[Hexagon] Replacing XTYPE/SHIFT intrinsic patterns. Adding tests and missing instructions with tests. llvm-svn: 227330	2015-01-28 17:37:59 +00:00
Jozef Kolek	e10a02ecf0	[mips][microMIPS] Implement LWGP instruction Differential Revision: http://reviews.llvm.org/D6650 llvm-svn: 227325	2015-01-28 17:27:26 +00:00
Colin LeMahieu	fdbc5adbb6	[Hexagon] Replacing intrinsics for halfword adds and max/min word/dword. llvm-svn: 227322	2015-01-28 17:06:40 +00:00
Bjorn Steinbrink	a09ac0085d	Fix LLVMSetMetadata and LLVMAddNamedMetadataOperand for single value MDNodes Summary: MetadataAsValue uses a canonical format that strips the MDNode if it contains only a single constant value. This triggers an assertion when trying to cast the value to a MDNode. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7165 llvm-svn: 227319	2015-01-28 16:35:59 +00:00
Michael Kuperstein	90e08320c9	[x32] Change the condition from bitness to LP64 for TCRETURNdi64. TCRETURNmi64, which was mistakenly changed in r227307 will wait for another day. llvm-svn: 227317	2015-01-28 16:11:35 +00:00
Tom Stellard	40ce8af4a5	R600: Move DataLayout to AMDGPUTargetMachine This is a follow up to r227113. It is now required to use the amdgcn target for SI and newer GPUs. llvm-svn: 227316	2015-01-28 16:04:26 +00:00
Tom Stellard	eba5648ad2	R600: Use a Southern Islands GPU as the default for the amdgcn target llvm-svn: 227314	2015-01-28 15:38:42 +00:00
Hal Finkel	34c94d5caa	Correct the AggressiveAntiDepBreaker's handling of subregisters defining super registers As the AggressiveAntiDepBreaker iterated backward through a scheduling region, we must leave super registers live through subregister definitions so that all relevant subregister definitions are renamed together. The problem was that we were also discarding sub-register use locations as the sub-registers are redefined. The result is that we'd rename the super register along with some, but not all, subregister definitions. R0_D = {R0_L, R1_L} R0_L = {R0_S, R1_S} %R0_L<def> = TRLi9 16, pred:8, pred:%noreg %R1_L<def> = LSRLrr %R1_L<kill>, %R0_S, pred:8, pred:%noreg %R0_L<def> = LSRLrr %R2_L, %R0_S, pred:8, pred:%noreg, %R0_L<imp-use,kill> %R1_L<def> = ANDLri %R1_L<kill>, 2047, pred:8, pred:%noreg %R0_L<def> = ANDLri %R0_L<kill>, 2047, pred:8, pred:%noreg %R4_D<def> = ASRDrr %R0_D<kill>, %R6_S Anti: %R4_D<def> = ASRDrr %R0_D<kill>, %R6_S Def Groups: R4_D=g213->g215(via R4_S)->g214(via R4_L)->g216(via R5_S)->g216(via R4_L)->g217(via R5_L) Use Groups: R0_D=g0->g218(last-use) R1_L->g219(last-use) R6_S=g204->g220(last-use) Anti: %R0_L<def> = ANDLri %R0_L<kill>, 2047, pred:8, pred:%noreg Def Groups: R0_L=g208->g209(via R0_S)->g218(via R0_D)->g210(via R1_S)->g210(via R0_D) Antidep reg: R0_L (real dependency) Use Groups: R0_L=g210->g224(last-use) R0_S->g225(last-use) R1_S->g226(last-use) Anti: %R1_L<def> = ANDLri %R1_L<kill>, 2047, pred:8, pred:%noreg Def Groups: R1_L=g219->g210(via R0_D) Antidep reg: R1_L (real dependency) Use Groups: R1_L=g210->g229(last-use) Anti: %R0_L<def> = LSRLrr %R2_L, %R0_S, pred:8, pred:%noreg, %R0_L<imp-use,kill> Def Groups: R0_L=g224->g225(via R0_S)->g210(via R0_D)->g226(via R1_S)->g226(via R0_D) Antidep reg: R0_L Use Groups: R2_L=g192 R0_S=g226->g230(last-use) R0_L=g226->g231(last-use) R1_S->g232(last-use) Anti: %R1_L<def> = LSRLrr %R1_L<kill>, %R0_S, pred:8, pred:%noreg Def Groups: R1_L=g229->g226(via R0_D) Antidep reg: R1_L Use Groups: 
R1_L=g226->g233(last-use) R0_S=g230 Anti: %R0_L<def> = TRLi9 16, pred:8, pred:%noreg Def Groups: R0_L=g231->g230(via R0_S)->g226(via R0_D)->g232(via R1_S)->g232(via R0_D) Antidep reg: R0_L Rename Candidates for Group g232: R0_D: elcInt64Regs :: R0_D R1_D R2_D R3_D R4_D R5_D R8_D R9_D R10_D R11_D R12_D R13_D R14_D R15_D R16_D R17_D R18_D R19_D R20_D R21_D R22_D R23_D R24_D R25_D R0_L: elcIntRegs :: R0_L R1_L R2_L R3_L R4_L R5_L R8_L R9_L R10_L R11_L R12_L R13_L R14_L R15_L R16_L R17_L R18_L R19_L R20_L R21_L R22_L R23_L R24_L R25_L R0_S: elcShrtRegs elcShrtRegs :: R0_S R1_S R2_S R3_S R4_S R5_S R8_S R9_S R10_S R11_S R12_S R13_S R14_S R15_S R16_S R17_S R18_S R19_S R20_S R21_S R22_S R23_S R24_S R25_S Find Registers: [R12_D: R12_D R12_L R12_S] Breaking anti-dependence edge on R0_L: R0_D->R12_D(1 refs) R0_L->R12_L(2 refs) R0_S->R12_S(2 refs) Use Groups: ... %R12_L<def> = TRLi9 16, pred:8, pred:%noreg %R1_L<def> = LSRLrr %R1_L<kill>, %R12_S, pred:8, pred:%noreg %R0_L<def> = LSRLrr %R2_L<kill>, %R12_S, pred:8, pred:%noreg, %R12_L<imp-use> %R1_L<def> = ANDLri %R1_L<kill>, 2047, pred:8, pred:%noreg %R0_L<def> = ANDLri %R0_L<kill>, 2047, pred:8, pred:%noreg %R4_D<def> = ASRDrr %R12_D<kill>, %R6_S With this change, we now produce: Anti: %R4_D<def> = ASRDrr %R0_D<kill>, %R6_S Def Groups: R4_D=g213->g215(via R4_S)->g214(via R4_L)->g216(via R5_S)->g216(via R4_L)->g217(via R5_L) Use Groups: R0_D=g0->g218(last-use) R1_L->g219(last-use) R6_S=g204->g220(last-use) Anti: %R0_L<def> = ANDLri %R0_L<kill>, 2047, pred:8, pred:%noreg Def Groups: R0_L=g208->g209(via R0_S)->g218(via R0_D)->g210(via R1_S)->g210(via R0_D) Antidep reg: R0_L (real dependency) Use Groups: R0_L=g210 Anti: %R1_L<def> = ANDLri %R1_L<kill>, 2047, pred:8, pred:%noreg Def Groups: R1_L=g219->g210(via R0_D) Antidep reg: R1_L (real dependency) Use Groups: R1_L=g210 Anti: %R0_L<def> = LSRLrr %R2_L, %R0_S, pred:8, pred:%noreg, %R0_L<imp-use,kill> Def Groups: R0_L=g210->g210(via R0_D)->g210(via R0_D) Antidep reg: R0_L Use 
Groups: R2_L=g192 R0_S=g210 R0_L=g210 Anti: %R1_L<def> = LSRLrr %R1_L<kill>, %R0_S, pred:8, pred:%noreg Def Groups: R1_L=g210->g210(via R0_D) Antidep reg: R1_L Use Groups: R1_L=g210 R0_S=g210 Anti: %R0_L<def> = TRLi9 16, pred:8, pred:%noreg Def Groups: R0_L=g210->g210(via R0_D)->g210(via R0_D) Antidep reg: R0_L Rename Candidates for Group g210: R0_D: elcInt64Regs :: R0_D R1_D R2_D R3_D R4_D R5_D R8_D R9_D R10_D R11_D R12_D R13_D R14_D R15_D R16_D R17_D R18_D R19_D R20_D R21_D R22_D R23_D R24_D R25_D R0_L: elcIntRegs elcIntAIRegs elcIntRegs elcIntRegs elcIntRegs elcIntRegs :: R0_L R1_L R2_L R3_L R4_L R5_L R8_L R9_L R10_L R11_L R12_L R13_L R14_L R15_L R16_L R17_L R18_L R19_L R20_L R21_L R22_L R23_L R24_L R25_L R1_L: elcIntRegs elcIntRegs elcIntRegs elcIntRegs elcIntRegs :: R0_L R1_L R2_L R3_L R4_L R5_L R8_L R9_L R10_L R11_L R12_L R13_L R14_L R15_L R16_L R17_L R18_L R19_L R20_L R21_L R22_L R23_L R24_L R25_L R0_S: elcShrtRegs elcShrtRegs :: R0_S R1_S R2_S R3_S R4_S R5_S R8_S R9_S R10_S R11_S R12_S R13_S R14_S R15_S R16_S R17_S R18_S R19_S R20_S R21_S R22_S R23_S R24_S R25_S Find Registers: [R12_D: R12_D R12_L R13_L R12_S] Breaking anti-dependence edge on R0_L: R0_D->R12_D(1 refs) R0_L->R12_L(7 refs) R1_L->R13_L(5 refs) R0_S->R12_S(2 refs) Use Groups: ... %R12_L<def> = TRLi9 16, pred:8, pred:%noreg %R13_L<def> = LSRLrr %R13_L<kill>, %R12_S, pred:8, pred:%noreg %R12_L<def> = LSRLrr %R2_L<kill>, %R12_S<kill>, pred:8, pred:%noreg, %R12_L<imp-use,kill> %R13_L<def> = ANDLri %R13_L<kill>, 2047, pred:8, pred:%noreg %R12_L<def> = ANDLri %R12_L<kill>, 2047, pred:8, pred:%noreg %R4_D<def> = ASRDrr %R12_D, %R6_S, %R12_L<imp-def>, %R12_S<imp-def>, %R13_S<imp-def> As demonstrated by this example, this is also somewhat unfortunate, because there is actually no need to rename the super register in this case (it is fully covered by later subregister definitions), but we don't seem to track enough information here to exploit that either. 
Thanks to Daniil Troshkov for reporting the issue. The debug outputs in this commit message are from Daniil. llvm-svn: 227311	2015-01-28 14:44:14 +00:00
Michael Kuperstein	951995821a	[X86] Reduce some 32-bit imuls into lea + shl Reduce integer multiplication by a constant of the form k*2^c, where k is in {3,5,9} into a lea + shl. Previously it was only done for imulq on 64-bit platforms, but it makes sense for imull and 32-bit as well. Differential Revision: http://reviews.llvm.org/D7196 llvm-svn: 227308	2015-01-28 14:08:22 +00:00
Michael Kuperstein	f387611ac2	[x32] Enable sibcall optimization on x32. This includes two things: 1) Fix TCRETURNdi and TCRETURN64di patterns to check the right thing (LP64 as opposed to target bitness). 2) Allow LEA64_32 in MatchingStackOffset. llvm-svn: 227307	2015-01-28 13:38:48 +00:00
Elena Demikhovsky	7b0dd39db6	AVX-512: Added FMA intrinsics with rounding mode By Asaf Badouh and Elena Demikhovsky Added special nodes for rounding: FMADD_RND, FMSUB_RND, etc. This prevents merging nodes that carry a rounding mode with other standard nodes. llvm-svn: 227303	2015-01-28 10:21:27 +00:00
Craig Topper	7d3c6d307a	[X86] Teach disassembler to handle illegal immediates on AVX512 integer compare instructions. llvm-svn: 227302	2015-01-28 10:09:56 +00:00
Craig Topper	6772eac490	[X86] Merge printSSECC and printAVXCC. They only differed by an assertion. llvm-svn: 227301	2015-01-28 10:09:52 +00:00
Chandler Carruth	16b670ec20	[LPM] Rip all of ManagedStatic and ThreadLocal out of the pretty stack tracing code. Managed static was just insane overhead for this. We took memory fences and external function calls in every path that pushed a pretty stack frame. This includes a multitude of layers setting up and tearing down passes, the parser in Clang, everywhere. For the regression test suite or low-overhead JITs, this was contributing to really significant overhead. Even the LLVM ThreadLocal is really overkill here because it uses pthread_{set,get}_specific logic, and has careful code to both allocate and delete the thread local data. We don't actually want any of that, and this code in particular has problems coping with deallocation. What we want is a single TLS pointer that is valid to use during global construction and during global destruction, any time we want. That is exactly what every host compiler and OS we use has implemented for a long time, and what was standardized in C++11. Even though not all of our host compilers support the thread_local keyword, we can directly use the platform-specific keywords to get the minimal functionality needed. Provided this limited trial survives the build bots, I will move this to Compiler.h so it is more widely available as a light weight if limited alternative to the ThreadLocal class. Many thanks to David Majnemer for helping me think through the implications across platforms and craft the MSVC-compatible syntax. The end result is substantially faster. When running llc in a tight loop over a small IR file targeting the aarch64 backend, this improves its performance by over 10% for me. It also seems likely to fix the remaining regressions seen by JIT users with threading enabled. This may actually have more impact on real-world compile times due to the use of the pretty stack tracing utility throughout the rest of Clang or LLVM, but I've not collected any detailed measurements. 
llvm-svn: 227300	2015-01-28 09:52:14 +00:00
Chandler Carruth	5b0d3e3f3a	[LPM] A targeted but somewhat horrible fix to the legacy pass manager's querying of the pass registry. The pass manager relies on the static registry of PassInfo objects to perform all manner of its functionality. I don't understand why it does much of this. My very vague understanding is that this registry is touched both during static initialization and while each pass is being constructed. As a consequence it is hard to make accessing it not require acquiring some lock. This lock ends up in the hot path of setting up, tearing down, and invalidating analyses in the legacy pass manager. On most systems you can observe this as a non-trivial % of the time spent in 'ninja check-llvm'. However, I haven't really seen it be more than 1% in extreme cases of compiling more real-world software, including LTO. Unfortunately, some of the GPU JITs are seeing this taking essentially all of their time because they have very small IR running through a small pass pipeline very many times (at least, this is the vague understanding I have of it). This patch tries to minimize the cost of looking up PassInfo objects by leveraging the fact that the objects themselves are immutable and they are allocated separately on the heap and so don't have their address change. It also requires a change I made the last time I tried to debug this problem which removed the ability to de-register a pass from the registry. This patch creates a single access path to these objects inside the PMTopLevelManager which memoizes the result of querying the registry. This is somewhat gross as I don't really know if PMTopLevelManager is the right place to put it, and I dislike using a mutable member to memoize things, but it seems to work. For long-lived pass managers this should completely eliminate the cost of acquiring locks to look into the pass registry once the memoized cache is warm. 
For 'ninja check' I measured about 1.5% reduction in CPU time and in total time on a machine with 32 hardware threads. For normal compilation, I don't know how much this will help, sadly. We will still pay the cost while we populate the memoized cache. I don't think it will hurt though, and for LTO or compiles with many small functions it should still be a win. However, for tight loops around a pass manager with many passes and small modules, this will help tremendously. On the AArch64 backend I saw nearly 50% reductions in time to complete 2000 cycles of spinning up and tearing down the pipeline. Measurements from Owen of an actual long-lived pass manager show more along the lines of 10% improvements. Differential Revision: http://reviews.llvm.org/D7213 llvm-svn: 227299	2015-01-28 09:47:21 +00:00
Elena Demikhovsky	45f0448081	Fold fcmp in cases where value is provably non-negative. By Arch Robison. This patch folds fcmp in some cases of interest in Julia. The patch adds a function CannotBeOrderedLessThanZero that returns true if a value is provably not less than zero. I.e. the function returns true if the value is provably -0, +0, positive, or a NaN. The patch extends InstructionSimplify.cpp to fold instances of fcmp where: - the predicate is olt or uge - the first operand is provably not less than zero - the second operand is zero The motivation for handling these cases is optimizing away domain checks for sqrt in Julia for common idioms such as sqrt(x*x+y*y). http://reviews.llvm.org/D6972 llvm-svn: 227298	2015-01-28 08:03:58 +00:00
Chandler Carruth	b81dfa6378	[LPM] Stop using the string based preservation API. It is an abomination. For starters, this API is incredibly slow. In order to look up the name of a pass it must take a memory fence to acquire a pointer to the managed static pass registry, and then potentially acquire locks while it consults this registry for information about what passes exist by that name. This stops the world of LLVMs in your process no matter how little they cared about the result. To make this more joyful, you'll note that we are preserving many passes which do not exist any more, or are not even analyses which one might wish to have be preserved. This means we do all the work only to say "nope" with no error to the user. String-based APIs are a bad idea. String-based APIs that cannot produce any meaningful error are an even worse idea. =/ I have a patch that simply removes this API completely, but I'm hesitant to commit it as I don't really want to perniciously break out-of-tree users of the old pass manager. I'd rather they just have to migrate to the new one at some point. If others disagree and would like me to kill it with fire, just say the word. =] llvm-svn: 227294	2015-01-28 04:57:56 +00:00
Eric Christopher	6c901623c0	Migrate AArch64 except for TTI and AsmPrinter away from getSubtargetImpl. llvm-svn: 227293	2015-01-28 03:51:33 +00:00
David Blaikie	e245228903	Add description to assert llvm-svn: 227291	2015-01-28 02:43:15 +00:00
David Blaikie	fa1a3c7cf5	PR22356: DebugInfo: Handle the size of a member where the type of that member is a typedef (or other sugar) of a declaration. llvm-svn: 227290	2015-01-28 02:34:53 +00:00
Lang Hames	33c9433ed4	Revert r227247 and r227228: "Add weak symbol support to RuntimeDyld". This has wider implications than I expected when I reviewed the patch: It can cause JIT crashes where clients have used the default value for AbortOnFailure during symbol lookup. I'm currently investigating alternative approaches and I hope to have this back in tree soon. llvm-svn: 227287	2015-01-28 01:30:37 +00:00
Reid Kleckner	4af6415237	Move EH personality type classification to Analysis/LibCallSemantics.h Summary: Also add enum types for __C_specific_handler and _CxxFrameHandler3 for which we know a few things. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7214 llvm-svn: 227284	2015-01-28 01:17:38 +00:00
Quentin Colombet	308b171318	Revert r227242 - Merge vector stores into wider vector stores (PR21711). This commit creates an infinite loop in DAG combine in the LLVM test-suite for aarch64 with mcpu=cyclone (just having neon may be enough to expose this). llvm-svn: 227272	2015-01-27 23:58:01 +00:00
Petar Jovanovic	4a11849034	[mips] Use __clear_cache builtin instead of cacheflush() Use __clear_cache builtin instead of cacheflush() in Unix Memory::InvalidateInstructionCache(). Differential Revision: http://reviews.llvm.org/D7198 llvm-svn: 227269	2015-01-27 23:30:18 +00:00
Saleem Abdulrasool	c44d71b8df	SymbolRewriter: allow rewriting with comdats COMDATs must be identically named to the symbol. When support for COMDATs was introduced, the symbol rewriter was not updated, resulting in rewriting failing for symbols which were placed into COMDATs. This corrects the behaviour and adds test cases for this. llvm-svn: 227261	2015-01-27 22:57:39 +00:00
Saleem Abdulrasool	9769b18cba	SymbolRewriter: prevent unnecessary rewrite The rewrite for the pattern based rewrite is unnecessary if the existing name matches the pattern. llvm-svn: 227260	2015-01-27 22:57:35 +00:00
Sanjay Patel	b1ca4e48d4	remove function names from comments; NFC llvm-svn: 227256	2015-01-27 22:26:56 +00:00
Chris Bieneman	6816936287	Re-landing changes to use ArrayRef instead of SmallVectorImpl, and new API test. This contains the changes from r227148 & r227154, and also fixes to the test case to properly clean up the stack options. llvm-svn: 227255	2015-01-27 22:21:06 +00:00
Kostya Serebryany	bfa3f9d82f	[fuzzer] properly enable asan's coverage feedback llvm-svn: 227254	2015-01-27 22:19:55 +00:00
Sanjay Patel	6b280777b7	fix typos; NFC llvm-svn: 227253	2015-01-27 22:16:52 +00:00
Kostya Serebryany	d53b43fe11	Add a Fuzzer library Summary: A simple genetic in-process coverage-guided fuzz testing library. I've used this fuzzer to test clang-format (it found 12+ bugs, thanks djasper@ for the fixes!) and it may also help us test other parts of LLVM. So why not keep it in the LLVM repository? I plan to add the cmake build rules later (in a separate patch, if that's ok) and also add a clang-format-fuzzer target. See README.txt for details. Test Plan: Tests will follow separately. Reviewers: djasper, chandlerc, rnk Reviewed By: rnk Subscribers: majnemer, ygribov, dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D7184 llvm-svn: 227252	2015-01-27 22:08:41 +00:00
Ahmed Bougacha	1ac9356524	[SimplifyLibCalls] Don't confuse strcpy_chk for stpcpy_chk. This was introduced in a faulty refactoring (r225640, mea culpa): the tests weren't testing the return values, so, for both __strcpy_chk and __stpcpy_chk, we would return the end of the buffer (matching stpcpy) instead of the beginning (for strcpy). The root cause was the prefix "__" being ignored when comparing, which made us always pick LibFunc::stpcpy_chk. Pass the LibFunc::Func directly to avoid this kind of error. Also, make the testcases as explicit as possible to prevent this. The now-useful testcases expose another, entangled, stpcpy problem, with the further simplification. This was introduced in a refactoring (r225640) to match the original behavior. However, this leads to problems when successive simplifications generate several similar instructions, none of which are removed by the custom replaceAllUsesWith. For instance, InstCombine (the main user) doesn't erase the instruction in its custom RAUW. When trying to simplify say __stpcpy_chk: - first, an stpcpy is created (fortified simplifier), - second, a memcpy is created (normal simplifier), but the stpcpy call isn't removed. - third, InstCombine later revisits the instructions, and simplifies the first stpcpy to a memcpy. We now have two memcpys. llvm-svn: 227250	2015-01-27 21:52:16 +00:00
Sanjoy Das	dcf2651043	Teach IRCE to look at branch weights when recognizing range checks Splitting a loop to make range checks redundant is profitable only if the range check "never" fails. Make this fact a part of recognizing a range check -- a branch is a range check only if it is expected to pass (via branch_weights metadata). Differential Revision: http://reviews.llvm.org/D7192 llvm-svn: 227249	2015-01-27 21:38:12 +00:00
Alexey Samsonov	533948088e	Revert "[x86] Combine x86mmx/i64 to v2i64 conversion to use scalar_to_vector" This reverts commits r226953 and r226974. llvm-svn: 227248	2015-01-27 21:34:11 +00:00
Kevin Enderby	9a50944ca0	Add the option -link-opt-hints to llvm-objdump, used with -macho, to print the Mach-O AArch64 linker optimization hints for ADRP code optimization. llvm-svn: 227246	2015-01-27 21:28:24 +00:00
Sanjay Patel	bcf62f2fa2	Merge vector stores into wider vector stores (PR21711) This patch resolves part of PR21711 ( http://llvm.org/bugs/show_bug.cgi?id=21711 ). The 'f3' test case in that report presents a situation where we have two 128-bit stores extracted from a 256-bit source vector. Instead of producing this: vmovaps %xmm0, (%rdi) vextractf128 $1, %ymm0, 16(%rdi) This patch merges the 128-bit stores into a single 256-bit store: vmovups %ymm0, (%rdi) Differential Revision: http://reviews.llvm.org/D7208 llvm-svn: 227242	2015-01-27 20:50:27 +00:00
Dmitry Vyukov	91ffdec3ec	tsan: properly instrument unaligned accesses If a memory access is unaligned, emit __tsan_unaligned_read/write callbacks instead of __tsan_read/write. Required to change semantics of __tsan_unaligned_read/write to not do the user memory. But since they were unused (other than through __sanitizer_unaligned_load/store) this is fine. Fixes long standing issue 17: https://code.google.com/p/thread-sanitizer/issues/detail?id=17 llvm-svn: 227231	2015-01-27 20:19:17 +00:00
Keno Fischer	88cc26811b	[ExecutionEngine] Add weak symbol support to RuntimeDyld Support weak symbols by first looking up if there is an externally visible symbol we can find, and only if that fails using the one in the object file we're loading. Reviewed By: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6950 llvm-svn: 227228	2015-01-27 20:02:31 +00:00
Keno Fischer	5f92a08fc0	[ExecutionEngine] FindFunctionNamed: Skip declarations Summary: Basically all other methods that look up functions by name skip them if they are mere declarations. Do the same in FindFunctionNamed. Reviewers: lhames Reviewed By: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7068 llvm-svn: 227227	2015-01-27 19:29:00 +00:00
Kai Nacke	e024539ea0	[mips] Add range checks and transformation to octeon instructions in AsmParser. This patch adds range checks to the immediate operands of octeon instructions in the AsmParser. Like gas, it applies the following transformations if the immediate is too large: bbit0 $8, 42, foo => bbit032 $8, 10, foo bbit1 $8, 46, foo => bbit132 $8, 14, foo cins $8, $31, 32, 31 => cins32 $8, $31, 0, 31 exts $7, $4, 54, 9 => exts32 $7, $4, 22, 9 Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D7080 llvm-svn: 227225	2015-01-27 19:11:28 +00:00
Marek Olsak	794ff8392e	R600/SI: Fix MIN3/MAX3 on VI, define MED3 llvm-svn: 227213	2015-01-27 17:25:15 +00:00
Marek Olsak	367447c255	R600/SI: Don't set patterns for chip-specific instructions while having pseudos Only pseudos have patterns on them. Also don't set the asm string for VINTRP_Pseudo. All pseudos should have empty asm. This matches what all other multiclasses do. llvm-svn: 227212	2015-01-27 17:25:11 +00:00
Marek Olsak	0c1f8812f5	R600/SI: Add VI versions of LDS atomics Each class is split into two: one adds let statements around non-pseudos, and the other one specifies the parameters. llvm-svn: 227211	2015-01-27 17:25:07 +00:00
Marek Olsak	19d9e1f459	R600/SI: Add VI versions of MUBUF atomics llvm-svn: 227210	2015-01-27 17:25:02 +00:00
Marek Olsak	ee98b1177c	R600/SI: Add VI versions of MUBUF loads and stores This enables a lot of existing patterns for VI. llvm-svn: 227209	2015-01-27 17:24:58 +00:00
Marek Olsak	7ef6db49ac	R600/SI: Add pseudos for MUBUF loads and stores This defines the SI versions only, so it shouldn't change anything. There are no changes other than using the new multiclasses, adding missing mayLoad/mayStore, and formatting fixes. llvm-svn: 227208	2015-01-27 17:24:54 +00:00
Andrea Di Biagio	086cbc37ad	[InstCombine] Teach how to fold a select into a cttz/ctlz with the 'is_zero_undef' flag. This patch teaches the Instruction Combiner how to fold a cttz/ctlz followed by a icmp plus select into a single cttz/ctlz with flag 'is_zero_undef' cleared. Added test InstCombine/select-cmp-cttz-ctlz.ll. llvm-svn: 227197	2015-01-27 15:58:14 +00:00
Evgeniy Stepanov	3fdfc7b1b3	[sancov] Fix unspecified constructor order between sancov and asan. Sanitizer coverage constructor must run after asan constructor (for each DSO). Bump constructor priority to guarantee that. llvm-svn: 227195	2015-01-27 15:01:22 +00:00
Manuel Jacob	8220addad7	Add a FIXME in SelectionDAGBuilder before an assert that is valid only on X86. When lowering memcpy, memset or memmove, this assert checks whether the pointer operands are in an address space < 256 which means "user defined address space" on X86. However, this notion of "user defined address space" does not exist for other targets. llvm-svn: 227191	2015-01-27 13:14:35 +00:00
Eric Christopher	337262068f	Replace some uses of getSubtargetImpl with the cached version off of the MachineFunction or with the version that takes a Function reference as an argument. llvm-svn: 227185	2015-01-27 08:48:42 +00:00
Eric Christopher	7592b0c5e6	Have the PBQP register allocator use the subtarget on the MachineFunction. (and remove an extraneous private). llvm-svn: 227181	2015-01-27 08:27:06 +00:00
Eric Christopher	e005826526	Remove some extraneous includes. llvm-svn: 227180	2015-01-27 08:27:03 +00:00
Eric Christopher	ad6bedb43e	Fix build failure with pointer vs reference. NB: Saving files after editing helps. llvm-svn: 227178	2015-01-27 08:00:42 +00:00
Eric Christopher	2c63549386	Update a few calls to getSubtarget<> to either be getSubtargetImpl when we didn't need the cast to the base class or the cached version off of the subtarget. llvm-svn: 227176	2015-01-27 07:54:39 +00:00
Eric Christopher	36d9273128	Clean up the AArch64 store pair suppression pass initialization and remove an unnecessary class variable. llvm-svn: 227175	2015-01-27 07:54:36 +00:00
Eric Christopher	3d4276f053	The subtarget is cached on the MachineFunction. Access it directly. llvm-svn: 227173	2015-01-27 07:31:29 +00:00
Eric Christopher	e38c8d4aa9	Migrate SeparateConstOffsetFromGEP to use a Function with getSubtarget. llvm-svn: 227172	2015-01-27 07:16:37 +00:00
David Majnemer	4c82daea60	LoopRotate: Don't walk the uses of a Constant LoopRotate wanted to avoid live range interference by looking at the uses of a Value in the loop latch and seeing if any lay outside of the loop. We would wrongly perform this operation on Constants. This fixes PR22337. llvm-svn: 227171	2015-01-27 06:21:43 +00:00
Eric Christopher	b9f60c17dc	Remove unused include. llvm-svn: 227170	2015-01-27 05:58:44 +00:00
Richard Trieu	15ac9363a7	Revert r227148 & r227154 which added a test which infinitely loops. r227148 added test CommandLineTest.HideUnrelatedOptionsMulti which repeatedly outputs two following lines: -tool: CommandLine Error: Option 'test-option-1' registered more than once! -tool: CommandLine Error: Option 'test-option-2' registered more than once! r227154 depends on changes from r227148 llvm-svn: 227167	2015-01-27 03:03:47 +00:00
Chandler Carruth	d649c0ad56	[PM] Refactor the core logic to run EarlyCSE over a function into an object that manages a single run of this pass. This was already essentially how it worked. Within the run function, it would point members at stack local allocations that were only live for a single run. Instead, it seems much cleaner to have a utility object whose lifetime is clearly bounded by the run of the pass over the function and can use member variables in a more direct way. This also makes it easy to plumb the analyses used into it from the pass and will make it re-usable with the new pass manager. No functionality changed here, it's just a refactoring. llvm-svn: 227162	2015-01-27 01:34:14 +00:00
Eric Christopher	349d5886e5	MachineRegisterInfo can access TII off of the MachineFunction's subtarget and so doesn't need the TargetMachine or to access via getSubtargetImpl. Update all callers. llvm-svn: 227160	2015-01-27 01:15:16 +00:00
Eric Christopher	4e048eeb2a	Migrate AtomicExpandPass and DwarfEHPrepare to using a Function-ized getSubtargetImpl. llvm-svn: 227159	2015-01-27 01:04:42 +00:00
Eric Christopher	fccff37b53	Migrate CodeGenPrepare to use the Function based getSubtarget code. llvm-svn: 227157	2015-01-27 01:01:38 +00:00
Eric Christopher	2b214e7ad3	Grab the TargetLowering info from the DAG rather than querying for a subtarget. llvm-svn: 227156	2015-01-27 01:01:36 +00:00
Chris Bieneman	fd3dbd9403	One more fix to the new API to fix const-correctness. llvm-svn: 227154	2015-01-27 00:42:00 +00:00
Chad Rosier	f9327d6fe9	Commoning of target specific load/store intrinsics in Early CSE. Phabricator revision: http://reviews.llvm.org/D7121 Patch by Sanjin Sijaric <ssijaric@codeaurora.org>! llvm-svn: 227149	2015-01-26 22:51:15 +00:00
Chris Bieneman	c333e577fe	Pete Cooper suggested the new API should use ArrayRef instead of SmallVectorImpl. Also adding a test case. llvm-svn: 227148	2015-01-26 22:50:47 +00:00
Simon Pilgrim	0629ba1ad9	[X86][SSE] Float comparisons can sometimes be safely commuted For ordered, unordered, equal and not-equal tests, packed float and double comparison instructions can be safely commuted without affecting the results. This patch checks the comparison mode of the (v)cmpps + (v)cmppd instructions and commutes the result if it can. Differential Revision: http://reviews.llvm.org/D7178 llvm-svn: 227145	2015-01-26 22:29:24 +00:00
Zachary Turner	02991af057	Have the UTF conversion wrappers append a null terminator. This is especially useful for the UTF8 -> UTF16 direction, since there is no equivalent of llvm::SmallString<> for wide characters. This means that anyone who wants a null terminated string is forced to manually push and pop their own null terminator. Reviewed by: Reid Kleckner. llvm-svn: 227143	2015-01-26 22:05:50 +00:00
Simon Pilgrim	9b7c00352d	[X86][PCLMUL] Enable commutation for PCLMUL instructions Patch to allow (v)pclmulqdq to be commuted - swaps the src registers and inverts the immediate (low/high) src mask. Differential Revision: http://reviews.llvm.org/D7180 llvm-svn: 227141	2015-01-26 22:00:18 +00:00
Chris Bieneman	0104325776	Add new HideUnrelatedOptions API that takes a SmallVectorImpl. Need a new API for clang-modernize that allows specifying a list of option categories to remain visible. This will allow clang-modernize to move off getRegisteredOptions. llvm-svn: 227140	2015-01-26 21:57:29 +00:00
Alexei Starovoitov	3c8465acb2	bpf: fix build due to 'Move DataLayout back to the TargetMachine' commit r227113 moved DataLayout llvm-svn: 227133	2015-01-26 20:43:15 +00:00
Hans Wennborg	b64cb271dc	SimplifyCFG: Omit range checks for switch lookup tables when default is unreachable The range check would get optimized away later, but we might as well not emit them in the first place. http://reviews.llvm.org/D6471 llvm-svn: 227126	2015-01-26 19:52:34 +00:00
Hans Wennborg	6800008f04	SimplifyCFG: don't remove unreachable default switch destinations An unreachable default destination can be exploited by other optimizations and allows for more efficient lowering. Both the SDag switch lowering and LowerSwitch can exploit unreachable defaults. Also make TurnSwitchRangeICmp handle switches with unreachable default. This is kind of separate change, but it cannot be tested without the change above, and I don't want to land the change above without this since that would regress other tests. Differential Revision: http://reviews.llvm.org/D6471 llvm-svn: 227125	2015-01-26 19:52:32 +00:00
Hans Wennborg	90b827cae2	Make ConstantFoldTerminator() handle switches with unreachable default. Tested by Transforms/SimplifyCFG/switch-to-br.ll's @unreachable function. Differential Revision: http://reviews.llvm.org/D6471 llvm-svn: 227124	2015-01-26 19:52:24 +00:00
Justin Holewinski	d4d2e9bd0e	[NVPTX] Generate a more optimal sequence for select of i1 Instead of creating a pattern like "(p && a) \|\| ((!p) && b)", just expand the i8 operands to i32 and perform the selp on them. Fixes PR22246 llvm-svn: 227123	2015-01-26 19:52:20 +00:00
Reid Kleckner	d8cb6b00c5	Add a UTF8 to UTF16 conversion wrapper for use in the pdb dumper This can also be used instead of the WindowsSupport.h ConvertUTF8ToUTF16 helpers, but that will require massaging some character types. The Windows support routines want wchar_t output, but wchar_t is often 32 bits on non-Windows OSs. llvm-svn: 227122	2015-01-26 19:51:00 +00:00
Eric Christopher	b11a1b7b2c	Cache the lookup of TargetLowering in the atomic expand pass. llvm-svn: 227121	2015-01-26 19:45:40 +00:00
Ahmed Bougacha	9a9e1a59ce	[SelectionDAG] Fix assert message copypasta. NFC. llvm-svn: 227119	2015-01-26 19:31:42 +00:00
Justin Holewinski	23df659e6d	[NVPTX] Handle floating-point conversion patterns that are not explicitly ordered or unordered Fixes PR22322 llvm-svn: 227117	2015-01-26 19:11:20 +00:00
Alex Rosenberg	b9fefdd215	Use a different encoding for debugtrap on PS4. llvm-svn: 227116	2015-01-26 19:09:27 +00:00
Eric Christopher	8b7706517c	Move DataLayout back to the TargetMachine from TargetSubtargetInfo derived classes. Since global data alignment, layout, and mangling is often based on the DataLayout, move it to the TargetMachine. This ensures that global data is going to be laid out and mangled consistently if the subtarget changes on a per function basis. Prior to this all targets() have had subtarget dependent code moved out and onto the TargetMachine. One target hasn't been migrated as part of this change: R600. The R600 port has, as a subtarget feature, the size of pointers and this affects global data layout. I've currently hacked in a FIXME to enable progress, but the port needs to be updated to either pass the 64-bitness to the TargetMachine, or fix the DataLayout to avoid subtarget dependent features. llvm-svn: 227113	2015-01-26 19:03:15 +00:00
Philip Reames	a7ad6a589c	Refine memory dependence's notion of volatile semantics According to my reading of the LangRef, volatiles are only ordered with respect to other volatiles. It is entirely legal and profitable to forward unrelated loads over the volatile load. This patch implements this for GVN by refining the transition rules MemoryDependenceAnalysis uses when encountering a volatile. The added test cases show where the extra flexibility is profitable for local dependence optimizations. I have a related change (227110) which will extend this to non-local dependence (i.e. PRE), but that's essentially orthogonal to the semantic change in this patch. I have tested the two together and can confirm that PRE works over a volatile load with both changes. I will be submitting a PRE w/volatiles test case separately in the near future. Differential Revision: http://reviews.llvm.org/D6901 llvm-svn: 227112	2015-01-26 18:54:27 +00:00
Sanjay Patel	805bc02c2b	Model sqrtsd as a binary operation with one source operand tied to the destination (PR14221) This patch fixes the following miscompile: define void @sqrtsd(<2 x double> %a) nounwind uwtable ssp { %0 = tail call <2 x double> @llvm.x86.sse2.sqrt.sd(<2 x double> %a) nounwind %a0 = extractelement <2 x double> %0, i32 0 %conv = fptrunc double %a0 to float %a1 = extractelement <2 x double> %0, i32 1 %conv3 = fptrunc double %a1 to float tail call void @callee2(float %conv, float %conv3) nounwind ret void } Current codegen: sqrtsd %xmm0, %xmm1 ## high element of %xmm1 is undef here xorps %xmm0, %xmm0 cvtsd2ss %xmm1, %xmm0 shufpd $1, %xmm1, %xmm1 cvtsd2ss %xmm1, %xmm1 ## operating on undef value jmp _callee This is a continuation of http://llvm.org/viewvc/llvm-project?view=revision&revision=224624 ( http://reviews.llvm.org/D6330 ) which was itself a continuation of r167064 ( http://llvm.org/viewvc/llvm-project?view=revision&revision=167064 ). All of these patches are partial fixes for PR14221 ( http://llvm.org/bugs/show_bug.cgi?id=14221 ); this should be the final patch needed to resolve that bug. Differential Revision: http://reviews.llvm.org/D6885 llvm-svn: 227111	2015-01-26 18:42:16 +00:00
Philip Reames	32351455f6	Pass QueryInst down through non-local dependency calculation This change is mostly motivated by exposing information about the original query instruction to the actual scanning work in getPointerDependencyFrom when used by GVN PRE. In a follow up change, I will use this to be more precise with regards to the semantics of volatile instructions encountered in the scan of a basic block. Worth noting is that this change (despite appearing quite simple) is not semantically preserving. By providing more information to the helper routine, we allow some optimizations to kick in that weren't previously able to (when called from this code path.) In particular, we see that treatment of !invariant.load becomes more precise. In theory, we might see a difference with an ordered/atomic instruction as well, but I'm having a hard time actually finding a test case which shows that. Test wise, I've included new tests for !invariant.load which illustrate this difference. I've also included some updated TBAA tests which highlight that this change isn't needed for that optimization to kick in - it's handled inside alias analysis itself. Eventually, it would be nice to factor the !invariant.load handling inside alias analysis as well. Differential Revision: http://reviews.llvm.org/D6895 llvm-svn: 227110	2015-01-26 18:39:52 +00:00
Philip Reames	56a03938f7	Revert GCStrategy ownership changes This change reverts the interesting parts of 226311 (and 227046). This change introduced two problems, and I've been convinced that an alternate approach is preferable anyways. The bugs were: - Registry appears to require all users be within the same linkage unit. After this change, asking for "statepoint-example" in Transform/ would sometimes get you nullptr, whereas asking the same question in CodeGen would return the right GCStrategy. The correct long term fix is to get rid of the utter hack which is Registry, but I don't have time for that right now. 227046 appears to have been an attempt to fix this, but I don't believe it does so completely. - GCMetadataPrinter::finishAssembly was being called more than once per GCStrategy. Each Strategy was being added to the GCModuleInfo multiple times. Once I get time again, I'm going to split GCModuleInfo into the gc.root specific part and a GCStrategy owning Analysis pass. I'm probably also going to kill off the Registry. Once that's done, I'll move the new GCStrategyAnalysis and all built in GCStrategies into Analysis. (As originally suggested by Chandler.) This will accomplish my original goal of being able to access GCStrategy from Transform/ without adding all of the builtin GCs to IR/. llvm-svn: 227109	2015-01-26 18:26:35 +00:00
Zachary Turner	39571b37a3	Teach raw_ostream to support hex formatting without a prefix '0x'. Previously using format_hex() would always print a 0x prior to the hex characters. This allows this to be optional, so that one can choose to print (e.g.) 255 as either 0xFF or just FF. Differential Revision: http://reviews.llvm.org/D7151 llvm-svn: 227108	2015-01-26 18:21:33 +00:00
Alex Rosenberg	f298f16ccf	Remove trailing whitespace. NFC ® llvm-svn: 227105	2015-01-26 18:02:18 +00:00
Eric Christopher	a576281694	Move the Mips target to storing the ABI in the TargetMachine rather than on MipsSubtargetInfo. This required a bit of massaging in the MC level to handle this since MC is a) largely a collection of disparate classes with no hierarchy, and b) there's no overarching equivalent to the TargetMachine, instead only the subtarget via MCSubtargetInfo (which is the base class of TargetSubtargetInfo). We're now storing the ABI in both the TargetMachine level and in the MC level because the AsmParser and the TargetStreamer both need to know what ABI we have to parse assembly and emit objects. The target streamer has a pointer to the one in the asm parser and is updated when the asm parser is created. This is fragile as the FIXME comment notes, but shouldn't be a problem in practice since we always create an asm parser before attempting to emit object code via the assembler. The TargetMachine now contains the ABI so that the DataLayout can be constructed dependent upon ABI. All testcases have been updated to use the -target-abi command line flag so that we can set the ABI without using a subtarget feature. Should be no change visible externally here. llvm-svn: 227102	2015-01-26 17:33:46 +00:00
Eric Christopher	6e4ed49d79	Store the passed in CPU name string so that it can be accessed later. llvm-svn: 227101	2015-01-26 17:33:30 +00:00
Daniel Berlin	16f7a52628	Fix incorrect partial aliasing Update testcases llvm-svn: 227099	2015-01-26 17:31:17 +00:00
Daniel Berlin	8f10e387bb	Fix delegation llvm-svn: 227098	2015-01-26 17:30:39 +00:00
Michael J. Spencer	f5215652d5	[Support][Windows] Disable error dialog boxes when stack trace printing is enabled. llvm-svn: 227094	2015-01-26 17:05:02 +00:00
Chris Bieneman	831fc5e87d	Putting all the standard tool options into a "Generic" category. Summary: This puts all the options that CommandLine.cpp implements into a category so that the APIs to hide options can now hide based on the generic category instead of string matching a partial list of argument strings. This patch is pretty simple and straightforward but it does impact the -help output of all tools using cl::opt. Specifically the options implemented in CommandLine.cpp (help, help-list, help-hidden, help-list-hidden, print-options, print-all-options, version) are all grouped together into an Option category, and these options are never hidden by the cl::HideUnrelatedOptions API. Reviewers: dexonsmith, chandlerc, majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7150 llvm-svn: 227093	2015-01-26 16:56:00 +00:00
Vasileios Kalintiris	ef96a8ecd6	[mips] Enable arithmetic and binary operations for the i128 data type. Summary: This patch adds support for some operations that were missing from 128-bit integer types (add/sub/mul/sdiv/udiv... etc.). With these changes we can support the __int128_t and __uint128_t data types from C/C++. Depends on D7125 Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7143 llvm-svn: 227089	2015-01-26 12:33:22 +00:00
Joerg Sonnenberger	429edc1780	The canonical CPU variant for ARM according to config.guess uses a suffix it seems: # ./config.guess earmv7hfeb-unknown-netbsd7.99.4 Extend the triple parsing to support this. Avoid running the ARM parser multiple times because StringSwitch is not lazy. Reviewers: Renato Golin, Tim Northover Differential Revision: http://reviews.llvm.org/D7166 llvm-svn: 227085	2015-01-26 11:41:48 +00:00

76272 Commits