llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Zolotukhin	9f06ef76d3	[Unroll] Handle SwitchInst properly. Previously successor selection was simply wrong. llvm-svn: 243545	2015-07-29 18:10:33 +00:00
Michael Zolotukhin	3a7d55b623	[Unroll] Don't crash when simplified branch condition is undef. llvm-svn: 243544	2015-07-29 18:10:29 +00:00
Michael Zolotukhin	a2069d36ce	Rename test full-unroll-bad-geps.ll to full-unroll-crashers.ll. No reason to limit it only to GEP-related crashes. More tests are to come here. llvm-svn: 243543	2015-07-29 18:10:23 +00:00
Lang Hames	585d5b9f2d	Fix typos in comments. NFC. llvm-svn: 243542	2015-07-29 18:07:48 +00:00
Bruno Cardoso Lopes	38c0250679	Revert "[PeepholeOptimizer] Look through PHIs to find additional register sources" Reported to Broke some internal tests: PR24303 This reverts commit r243486. llvm-svn: 243540	2015-07-29 17:46:47 +00:00
Douglas Katzman	f2b960886e	Add an ArgList::AddAllArgs that accepts a vector of OptSpecifier. This lifts the somewhat arbitrary restriction on 3 OptSpecifiers. Differential Revision: http://reviews.llvm.org/D11597 llvm-svn: 243539	2015-07-29 17:34:41 +00:00
Tim Northover	cf739b8c3d	AArch64: use AddressingModes.h accessors for compare shifts No functional change because "lsl #12" is actually encoded as 12, but one less bug if someone ever decides to change that for the giggles. llvm-svn: 243536	2015-07-29 16:39:56 +00:00
Hans Wennborg	0742a3e574	test-release.sh: Add option for building the OpenMP run-time This isn't part of the official release process, but provides a convenient way to build binaries for those who want to experiment with it. Hopefully the run- time can be part of the regular build and release process for 3.8. Differential Revision: http://reviews.llvm.org/D11494 llvm-svn: 243531	2015-07-29 16:29:06 +00:00
Aaron Ballman	9f154f601d	Reverting r243386 because it has serious post-commit concerns that have not been addressed. Also reverts r243389, which relied on this commit. llvm-svn: 243527	2015-07-29 15:57:49 +00:00
Colin LeMahieu	77804bed85	[llvm-objdump] Added -j flag to filter sections that are operated on. llvm-svn: 243526	2015-07-29 15:45:39 +00:00
Jingyue Wu	7ec38530a5	Temporarily revert r242871 PR24299 llvm-svn: 243522	2015-07-29 15:26:11 +00:00
Bill Schmidt	42ddd71120	[PPC] Fix PR24216: Don't generate splat for misaligned shuffle mask Given certain shuffle-vector masks, LLVM emits splat instructions which splat the wrong bytes from the source register. The issue is that the function PPC::isSplatShuffleMask() in PPCISelLowering.cpp does not ensure that the splat pattern found is requesting bytes that are aligned on an EltSize boundary. This patch detects this situation as not a valid splat mask, resulting in a permute being generated instead of a splat. Patch and test case by Tyler Kenney, cleaned up a bit by me. This is a simple bug fix that would be good to incorporate into 3.7. llvm-svn: 243519	2015-07-29 14:31:57 +00:00
Akira Hatanaka	f53b0403f8	[AArch64] Define subtarget feature strict-align. This commit defines subtarget feature strict-align and uses it instead of cl::opt -aarch64-strict-align to decide whether strict alignment should be forced. rdar://problem/21529937 llvm-svn: 243516	2015-07-29 14:17:26 +00:00
Bjarke Hammersholt Roune	3747a6dbb8	Make function comments consistently imperative. (tiny edit, mostly a test that my new commit access works) llvm-svn: 243505	2015-07-29 00:29:08 +00:00
Sanjoy Das	cfe41f050c	[Statepoints] Let patchable statepoints have a symbolic call target. Summary: As added initially, statepoints required their call targets to be a constant pointer null if ``numPatchBytes`` was non-zero. This turns out to be a problem ergonomically, since there is no way to mark patchable statepoints as calling a (readable) symbolic value. This change remove the restriction of requiring ``null`` call targets for patchable statepoints, and changes PlaceSafepoints to maintain the symbolic call target through its transformation. Reviewers: reames, swaroop.sridhar Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11550 llvm-svn: 243502	2015-07-28 23:50:30 +00:00
Alex Lorenz	d8a1e542ab	Fix broken ArrayRef conversion from r243497. llvm-svn: 243501	2015-07-28 23:34:27 +00:00
Sanjay Patel	133e68b45c	ignore duplicate divisor uses when transforming into reciprocal multiplies (PR24141) PR24141: https://llvm.org/bugs/show_bug.cgi?id=24141 contains a test case where we have duplicate entries in a node's uses() list. After r241826, we use CombineTo() to delete dead nodes when combining the uses into reciprocal multiplies, but this fails if we encounter the just-deleted node again in the list. The solution in this patch is to not add duplicate entries to the list of users that we will subsequently iterate over. For the test case, this avoids triggering the combine divisors logic entirely because there really is only one user of the divisor. Differential Revision: http://reviews.llvm.org/D11345 llvm-svn: 243500	2015-07-28 23:28:22 +00:00
Sanjay Patel	1dd15598cf	fix TLI's combineRepeatedFPDivisors interface to return the minimum user threshold This fix was suggested as part of D11345 and is part of fixing PR24141. With this change, we can avoid walking the uses of a divisor node if the target doesn't want the combineRepeatedFPDivisors transform in the first place. There is no NFC-intended other than that. Differential Revision: http://reviews.llvm.org/D11531 llvm-svn: 243498	2015-07-28 23:05:48 +00:00
Alex Lorenz	ef5c196fb0	MIR Serialization: Serialize the target index machine operands. Reviewers: Duncan P. N. Exon Smith llvm-svn: 243497	2015-07-28 23:02:45 +00:00
Akira Hatanaka	2670f4a550	[ARM] Define subtarget feature strict-align. This commit defines subtarget feature strict-align and uses it instead of cl::opt -arm-strict-align to decide whether strict alignment should be forced. Also, remove the logic that was checking the OS and architecture as clang is now responsible for setting strict-align based on the command line options specified and the target architecute and OS. rdar://problem/21529937 http://reviews.llvm.org/D11470 llvm-svn: 243493	2015-07-28 22:44:28 +00:00
Tim Northover	17ae83a25f	AArch64: be careful of large immediates when optimising cmps. llvm-svn: 243492	2015-07-28 22:42:32 +00:00
Davide Italiano	f75bf454e4	[tests] Use llvm-readobj instead of macho-dump. llvm-svn: 243487	2015-07-28 21:58:08 +00:00
Bruno Cardoso Lopes	3c235763e5	[PeepholeOptimizer] Look through PHIs to find additional register sources Reapply 243271 with more fixes; although we are not handling multiple sources with coalescable copies, we were not properly skipping this case. - Teaches the ValueTracker in the PeepholeOptimizer to look through PHI instructions. - Add findNextSourceAndRewritePHI method to lookup into multiple sources returnted by the ValueTracker and rewrite PHIs with new sources. With these changes we can find more register sources and rewrite more copies to allow coaslescing of bitcast instructions. Hence, we eliminate unnecessary VR64 <-> GR64 copies in x86, but it could be extended to other archs by marking "isBitcast" on target specific instructions. The x86 example follows: A: psllq %mm1, %mm0 movd %mm0, %r9 jmp C B: por %mm1, %mm0 movd %mm0, %r9 jmp C C: movd %r9, %mm0 pshufw $238, %mm0, %mm0 Becomes: A: psllq %mm1, %mm0 jmp C B: por %mm1, %mm0 jmp C C: pshufw $238, %mm0, %mm0 Differential Revision: http://reviews.llvm.org/D11197 rdar://problem/20404526 llvm-svn: 243486	2015-07-28 21:45:50 +00:00
Vasileios Kalintiris	9876946aee	[mips][FastISel] Fix call lowering by bailing out on "fastcc" calls. Summary: Currently, we support only the MIPS O32 ABI calling convention for call lowering. With this change we avoid using the O32 calling convetion for lowering calls marked as using the fast calling convention. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11515 llvm-svn: 243485	2015-07-28 21:43:31 +00:00
Lang Hames	5c969333b8	[RuntimeDyld] Remove a memory-leak that was introduced in r243456. Thanks to Ben Kramer for catching this. llvm-svn: 243476	2015-07-28 20:51:53 +00:00
Chih-Hung Hsieh	41169c5487	Fix typo. llvm-svn: 243475	2015-07-28 20:38:29 +00:00
Chih-Hung Hsieh	c5e53ca1b7	Limit this test only on linux. Differential Revision: http://reviews.llvm.org/D10522 llvm-svn: 243474	2015-07-28 20:31:10 +00:00
Michael Zolotukhin	80d13bac02	[Unroll] Add debug dumps to loop-unroll analyzer. llvm-svn: 243471	2015-07-28 20:07:29 +00:00
Vasileios Kalintiris	9ec6114860	[mips][FastISel] Fix generated code for IR's select instruction. Summary: Generate correct code for the select instruction by zero-extending it's boolean/condition operand to GPR-width. This is necessary because the conditional-move instructions operate on the whole register. Reviewers: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11506 llvm-svn: 243469	2015-07-28 19:57:25 +00:00
Michael Zolotukhin	a425c9d0e3	[Unroll] Don't analyze blocks outside the loop. llvm-svn: 243466	2015-07-28 19:21:21 +00:00
Matt Arsenault	7227cc1a48	AMDGPU: Don't try to use LDS/vector for private if pointer value stored If the pointer is the store's value operand, this would produce a broken module. Make sure the use is actually for the pointer operand. llvm-svn: 243462	2015-07-28 18:47:00 +00:00
Matt Arsenault	fdcd39a8ad	AMDGPU: Fix crash if called function is a bitcast getCalledFunction() is null, so this would crash. Replace crash with an error on unsupported call. llvm-svn: 243461	2015-07-28 18:29:14 +00:00
Jingyue Wu	42f1d67a45	[SCEV] Apply NSW and NUW flags via poison value analysis Summary: Make Scalar Evolution able to propagate NSW and NUW flags from instructions to SCEVs in some cases. This is based on reasoning about when poison from instructions with these flags would trigger undefined behavior. This gives a 13% speed-up on some Eigen3-based Google-internal microbenchmarks for NVPTX. There does not seem to be clear agreement about when poison should be considered to propagate through instructions. In this analysis, poison propagates only in cases where that should be uncontroversial. This change makes LSR able to create induction variables for expressions like &ptr[i + offset] for loops like this: for (int i = 0; i < limit; ++i) { sum += ptr[i + offset]; } Here ptr is a 64 bit pointer and offset is a 32 bit integer. For NVPTX, LSR currently creates an induction variable for i + offset instead, which is not as fast. Improving this situation is what brings the 13% speed-up on some Eigen3-based Google-internal microbenchmarks for NVPTX. There are more details in this discussion on llvmdev. June: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-June/thread.html#87234 July: http://lists.cs.uiuc.edu/pipermail/llvmdev/2015-July/thread.html#87392 Patch by Bjarke Roune Reviewers: eliben, atrick, sanjoy Subscribers: majnemer, hfinkel, jingyue, meheff, llvm-commits Differential Revision: http://reviews.llvm.org/D11212 llvm-svn: 243460	2015-07-28 18:22:40 +00:00
Matt Arsenault	916cea5682	AMDGPU: Fix return type of getImplicitParameterOffset. Patch by Zoltan Gilian <zoltan.gilian@gmail.com> llvm-svn: 243459	2015-07-28 18:09:55 +00:00
Alex Lorenz	305e8f6312	Add a test case for r242191 ([MMX] Use the appropriate instructions for GR64 <-> VR64 copies). This commit adds a MIR test case for the commit r242191, which was committed without one. This test case verifies that the ExpandPostRA pass expands the GR64 <-> VR64 copies into the appropriate MMX_MOV instructions. llvm-svn: 243457	2015-07-28 17:52:59 +00:00
Lang Hames	2e88f4fc5f	[RuntimeDyld] Make LoadedObjectInfo::getLoadedSectionAddress take a SectionRef rather than a string section name. llvm-svn: 243456	2015-07-28 17:52:11 +00:00
Chih-Hung Hsieh	9843f406ec	Move unit tests to target specific directories. Differential Revision: http://reviews.llvm.org/D10522 llvm-svn: 243454	2015-07-28 17:32:49 +00:00
Alex Lorenz	deb534907e	MIR Serialization: Serialize the block address machine operands. llvm-svn: 243453	2015-07-28 17:28:03 +00:00
JF Bastien	ae7eebd429	WebAssembly: MCAsmInfo only has one syntax variant for now. Summary: MCAsmInfo is set up with the default AssemblerDialect, which is zero. Subscribers: llvm-commits, sunfish, jfb Differential Revision: http://reviews.llvm.org/D11567 llvm-svn: 243452	2015-07-28 17:23:07 +00:00
Sanjay Patel	94a7433cde	add tests to show broken current behavior of minsize attribute llvm-svn: 243451	2015-07-28 17:18:25 +00:00
Alex Lorenz	41df7d3d10	MIR Parser: Extract the method 'parseGlobalValue'. NFC. This commit extracts the code that parses a global value from the method 'parseGlobalAddressOperand' into a new method 'parseGlobalValue', so that this code can be reused by the method which will parse the block address machine operands. llvm-svn: 243450	2015-07-28 17:09:52 +00:00
Alex Lorenz	82a1cfdca2	MIR Parser: Move the function 'lexName'. NFC. This commit moves the function 'lexName' to the start of the file so it can be reused by the function which will lex the named LLVM IR block references. llvm-svn: 243449	2015-07-28 17:03:40 +00:00
Alex Lorenz	e8ce3e616b	MIR Printer: Remove an outdated TODO comment and assertion. NFC. This commit removes an outdated TODO comment and a corresponding assertion which asserts that the mir printer can't the print machine basic blocks that aren't sequentially numbered. This comment and assertion were correct when I was working on the patch which serialized the machine basic blocks, but then I decided to add an 'ID' attribute to the machine basic block's YAML mapping based on the patch review. This comment and assertion then became invalid as with the 'ID' attribute we can serialize the non sequential machine basic blocks and their references without any problems. llvm-svn: 243447	2015-07-28 16:56:45 +00:00
Alex Lorenz	db07c40943	MIR Parser: Remove redundant parameters. NFC. This commit removes the redundant parameters from the two methods 'initializeRegisterInfo' and 'initializeFrameInfo'. The removed parameters are redundant as we are already passing in the 'MachineFunction' to those methods, and those parameters can be derived from the machine function parameter. llvm-svn: 243445	2015-07-28 16:48:37 +00:00
Chih-Hung Hsieh	1e859582d6	Implement target independent TLS compatible with glibc's emutls.c. The 'common' section TLS is not implemented. Current C/C++ TLS variables are not placed in common section. DWARF debug info to get the address of TLS variables is not generated yet. clang and driver changes in http://reviews.llvm.org/D10524 Added -femulated-tls flag to select the emulated TLS model, which will be used for old targets like Android that do not support ELF TLS models. Added TargetLowering::LowerToTLSEmulatedModel as a target-independent function to convert a SDNode of TLS variable address to a function call to __emutls_get_address. Added into lib/Target//ISelLowering.cpp to call LowerToTLSEmulatedModel for TLSModel::Emulated. Although all targets supporting ELF TLS models are enhanced, emulated TLS model has been tested only for Android ELF targets. Modified AsmPrinter.cpp to print the emutls_v.* and emutls_t.* variables for emulated TLS variables. Modified DwarfCompileUnit.cpp to skip some DIE for emulated TLS variabls. TODO: Add proper DIE for emulated TLS variables. Added new unit tests with emulated TLS. Differential Revision: http://reviews.llvm.org/D10522 llvm-svn: 243438	2015-07-28 16:24:05 +00:00
Martell Malone	1eff5c9c09	Summary: Object: add IMAGE_FILE_MACHINE_ARM64 The official specifications state that the value of IMAGE_FILE_MACHINE_ARM64 is 0xAA64 (as per the Microsoft Portable Executable and Common Object Format Specification v8.3). Reviewers: rnk Subscribers: llvm-commits, compnerd, ruiu Differential Revision: http://reviews.llvm.org/D11511 llvm-svn: 243434	2015-07-28 16:18:17 +00:00
Bruno Cardoso Lopes	51fd242cfc	[LVI] Cleanup whitespaces. NFC llvm-svn: 243430	2015-07-28 15:53:21 +00:00
Sanjay Patel	d411114e77	fix formatting; NFC llvm-svn: 243424	2015-07-28 15:38:43 +00:00
Geoff Berry	c573bf7a5f	[AArch64] Match float round and convert to int instructions. Summary: Add patterns for doing floating point round with various rounding modes followed by conversion to int as a single FCVT* instruction. Reviewers: t.p.northover, jmolloy Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D11424 llvm-svn: 243422	2015-07-28 15:24:10 +00:00
Douglas Katzman	280ee917d7	Use a specified list of languages in cmake project() command. This allows asm files and Cxx files to be compiled with different flags rather than treating them identically. LLVM itself has no asm files other than tests, but this setting is inherited by the compiler-rt project (unless compiled standalone), which does have asm files. Differential Revision: http://reviews.llvm.org/D10707 llvm-svn: 243419	2015-07-28 14:43:53 +00:00
Silviu Baranga	4825060059	[LAA] Add clarifying comments for the checking pointer grouping algorithm. NFC llvm-svn: 243416	2015-07-28 13:44:08 +00:00
Adhemerval Zanella	7bc3319d84	Implement __builtin_thread_pointer This path add the aarch64 lowering of __builtin_thread_pointer. It uses the already implemented AArch64ISD::THREAD_POINTER used in TLS generation. llvm-svn: 243412	2015-07-28 13:03:31 +00:00
Martell Malone	182b4bbc2a	docs: update arcanist links Summary: I need a test commit for using arc. This seems like an appropriate commit to use as a test We may want to port this commit back to 3.7 also Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11527 llvm-svn: 243408	2015-07-28 11:43:37 +00:00
Chandler Carruth	99ad7bb49c	[GMR] Teach GlobalsModRef to distinguish an important and safe case of no-alias with non-addr-taken globals: they cannot alias a captured pointer. If the non-global underlying object would have been a capture were it to alias the global, we can firmly conclude no-alias. It isn't reasonable for a transformation to introduce a capture in a way observable by an alias analysis. Consider, even if it were to temporarily capture one globals address into another global and then restore the other global afterward, there would be no way for the load in the alias query to observe that capture event correctly. If it observes it then the temporary capturing would have changed the meaning of the program, making it an invalid transformation. Even instrumentation passes or a pass which is synthesizing stores to global variables to expose race conditions in programs could not trigger this unless it queried the alias analysis infrastructure mid-transform, in which case it seems reasonable to return results from before the transform started. See the comments in the change for a more detailed outlining of the theory here. This should address the primary performance regression found when the non-conservatively-correct path of the alias query was disabled. Differential Revision: http://reviews.llvm.org/D11410 llvm-svn: 243405	2015-07-28 11:11:11 +00:00
Renato Golin	e51c1ce1db	Improving lli documentation Too many people hope lli would act as an emulator when it's actually just a tool to help prototype IR code and test the JIT compiler. This commit makes that fact explicit in the documentation It also migrates the old style bold/italic doc tags to the preferred meta tags (.. option::, :program:, etc). No errors when generating the documents, visual inspection in the HTML result doesn't show any major difference, apart from the slight style change. llvm-svn: 243401	2015-07-28 10:24:11 +00:00
Michael Kuperstein	cba308cf96	[X86] Remove mergeSPUpdatesUp() X86FrameLowering has both a mergeSPUpdates() that accepts a direction, and an mergeSPUpdatesUp(), which seem to do the same thing, except for a slightly different interface. Removed the less general function. NFC. Differential Revision: http://reviews.llvm.org/D11510 llvm-svn: 243396	2015-07-28 08:56:13 +00:00
Simon Pilgrim	df984f58ad	[X86][SSE] Use bitmasks instead of shuffles where possible. VPAND is a lot faster than VPSHUFB and VPBLENDVB - this patch ensures we attempt to lower to a basic bitmask before lowering to the slower byte shuffle/blend instructions. Split off from D11518. Differential Revision: http://reviews.llvm.org/D11541 llvm-svn: 243395	2015-07-28 08:54:41 +00:00
Igor Breger	47a7b95b1d	AVX512: Add encoding tests to vptestnm instructions Differential Revision: http://reviews.llvm.org/D11521 llvm-svn: 243391	2015-07-28 07:00:00 +00:00
Igor Breger	8352a0ddf2	AVX512: Implemented encoding and intrinsics for VGETEXPSS/D instructions Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D11528 llvm-svn: 243390	2015-07-28 06:53:28 +00:00
Puyan Lotfi	567001c281	Changes for MachineBasicBlock to use SortedVector for LiveIns. llvm-svn: 243389	2015-07-28 06:38:41 +00:00
Mehdi Amini	b58f8137c1	Move the Target way of overriding DAG Scheduler to a target hook Summary: The previous way of overriding it was relying on calling "setDefault" on the global registry, which implies global mutable state. Reviewers: echristo, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11538 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 243388	2015-07-28 06:18:04 +00:00
Puyan Lotfi	36b7f1d1c2	Adding ADT SortedVector; client patch will follow. llvm-svn: 243386	2015-07-28 06:04:00 +00:00
Chandler Carruth	786e6db187	[GMR] Fix a long-standing bug in GlobalsModRef where it failed to clear out the per-function modref data structures when functions were deleted or when globals were deleted. I don't actually know how the global deletion side of this bug hasn't been hit before, but for the other it just-so-happens that functions aren't likely to be deleted in the particular part of the LTO pipeline where we currently enable GMR, so we got lucky. With this patch, I can self-host with GMR enabled in the normal pass pipeline! I was a bit concerned about the compile-time impact of this chang, which is part of what motivated my prior string of patches to make the per-function datastructure very dense and fast to walk. With those changes in place, I can't measure a significant compile time difference (the difference is around 0.1% which is way below the noise) before and after this patch when building a linked bitcode for all of Clang. Differential Revision: http://reviews.llvm.org/D11453 llvm-svn: 243385	2015-07-28 06:01:57 +00:00
Adam Nemet	0a674401bf	[LDist][LVer] Explicitly pass the set of memchecks to LoopVersioning, NFC Before the patch, the checks were generated internally in addRuntimeCheck. Now, we use the new overloaded version of addRuntimeCheck that takes the ready-made set of checks as a parameter. The checks are now generated by the client (LoopDistribution) with the new RuntimePointerChecking::generateChecks API. Also the new printChecks API is used to print out the checks for debugging. This is to continue the transition over to the new model whereby clients will get the full set of checks from LAA, filter it and then pass it to LoopVersioning and in turn to addRuntimeCheck. llvm-svn: 243382	2015-07-28 05:01:53 +00:00
Craig Topper	7554da2ca3	Remove unnecessary const_casts. NFC llvm-svn: 243380	2015-07-28 04:28:46 +00:00
Bob Wilson	043ee65ef3	Reserve some constant values for the Swift calling convention. Swift has a custom calling convention that also requires some new flags on arguments and one new attribute on alloca instructions. This patch does not include the implementation of that calling convention - that will be provided as part of the open-source release of Swift; this only reserves the bitcode constant values so that they are not used for other purposes. llvm-svn: 243379	2015-07-28 04:05:45 +00:00
Sanjoy Das	6c7a186599	FileCheck'ify some wc/grep based tests; NFCI. llvm-svn: 243378	2015-07-28 03:50:09 +00:00
Kostya Serebryany	ae7df1ca4d	[libFuzzer] ensure that the dfsan tracing hooks actually run (using -verbosity=3 in tests) llvm-svn: 243365	2015-07-28 01:25:00 +00:00
Kostya Serebryany	35959592a3	[libFuzzer] when using cmp traces, first check that the CMP is evaluated to one value much more frequently than to the other value (heuristic) llvm-svn: 243363	2015-07-28 00:59:53 +00:00
Sanjay Patel	8c13e3680d	fix invalid load folding with SSE/AVX FP logical instructions (PR22371) This is a follow-up to the FIXME that was added with D7474 ( http://reviews.llvm.org/rL229531 ). I thought this load folding bug had been made hard-to-hit, but it turns out to be very easy when targeting 32-bit x86 and causes a miscompile/crash in Wine: https://bugs.winehq.org/show_bug.cgi?id=38826 https://llvm.org/bugs/show_bug.cgi?id=22371#c25 The quick fix is to simply remove the scalar FP logical instructions from the load folding table in X86InstrInfo, but that causes us to miss load folds that should be possible when lowering fabs, fneg, fcopysign. So the majority of this patch is altering those lowerings to use vector FP logical instructions (because that's all x86 gives us anyway). That lets us do the load folding legally. Differential Revision: http://reviews.llvm.org/D11477 llvm-svn: 243361	2015-07-28 00:48:32 +00:00
Sanjoy Das	3895a57b32	[LSR] Move X86 specific test case to X86/ rL243348 added the test case in the wrong directory. llvm-svn: 243357	2015-07-28 00:13:42 +00:00
David Blaikie	71c9c9ce31	[opaque pointer type] Avoid using pointee types to retrieve InlineAsm's function type As a stop-gap, retrieving the InlineAsm's function type was done via the pointee type of its (pointer) Value type. Instead, pass down and store the FunctionType in the InlineAsm object. The only wrinkle with this is the ConstantUniqueMap, which then needs to ferry the FunctionType down through the InlineAsmKeyType. This could be done a bit differently if the ConstantInfo trait were broadened a bit to provide an extension point for access to the TypeClass object from the ValType objects, so that the ConstantUniqueMap<InlineAsm> would then be keyed on FunctionTypes instead of PointerTypes that point to FunctionTypes. This drops the number of IR tests that don't roundtrip through bitcode* without calling PointerType::getElementType from 416 to 8 (out of 10733). 3 of those crash when roundtripping at ToT anyway. * modulo various unavoidable uses of pointer types when validating IR (for now) and in the way globals are parsed, unfortunately. These cases will either go away (because such validation will no longer be necessary or possible when pointee types are opaque), or have to be made simultaneously with the removal of pointee types. llvm-svn: 243356	2015-07-28 00:06:38 +00:00
Adam Nemet	54f0b83ee2	[LAA] Split out a helper to print a collection of memchecks This is effectively an NFC but we can no longer print the index of the pointer group so instead I print its address. This still lets us cross-check the section that list the checks against the section that list the groups (see how I modified the test). E.g. before we printed this: Run-time memory checks: Check 0: Comparing group 0: %arrayidxC = getelementptr inbounds i16, i16* %c, i64 %store_ind %arrayidxC1 = getelementptr inbounds i16, i16* %c, i64 %store_ind_inc Against group 1: %arrayidxA = getelementptr i16, i16* %a, i64 %ind %arrayidxA1 = getelementptr i16, i16* %a, i64 %add ... Grouped accesses: Group 0: (Low: %c High: (78 + %c)) Member: {%c,+,4}<%for.body> Member: {(2 + %c),+,4}<%for.body> Now we print this (changes are underlined): Run-time memory checks: Check 0: Comparing group (0x7f9c6040c320): ~~~~~~~~~~~~~~ %arrayidxC1 = getelementptr inbounds i16, i16* %c, i64 %store_ind_inc %arrayidxC = getelementptr inbounds i16, i16* %c, i64 %store_ind Against group (0x7f9c6040c358): ~~~~~~~~~~~~~~ %arrayidxA1 = getelementptr i16, i16* %a, i64 %add %arrayidxA = getelementptr i16, i16* %a, i64 %ind ... Grouped accesses: Group 0x7f9c6040c320: ~~~~~~~~~~~~~~ (Low: %c High: (78 + %c)) Member: {(2 + %c),+,4}<%for.body> Member: {%c,+,4}<%for.body> llvm-svn: 243354	2015-07-27 23:54:41 +00:00
Sanjay Patel	1d74cadcbc	fix typo; NFC llvm-svn: 243351	2015-07-27 23:43:09 +00:00
David Blaikie	41ba2b47da	[opaque pointers] Avoid the use of pointee types when parsing inline asm in IR When parsing calls to inline asm the pointee type (of the pointer type representing the value type of the InlineAsm value) was used. To avoid using it, use the ValID structure to ferry the FunctionType directly through to the InlineAsm construction. This is a bit of a workaround - alternatively the inline asm could explicitly describe the type but that'd be verbose/redundant in the IR and so long as the inline asm calls directly in the context of a call or invoke, this should suffice. llvm-svn: 243349	2015-07-27 23:32:19 +00:00
Sanjoy Das	93b3504aa8	[LSR] Generate and use zero extends Summary: If a scale or a base register can be rewritten as "Zext({A,+,1})" then LSR will now consider a formula of that form in its normal cost computation. Depends on D9180 Reviewers: qcolombet, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9181 llvm-svn: 243348	2015-07-27 23:27:51 +00:00
Sanjoy Das	c3182d8c43	[TargetTransformInfo][NFCI] Add TargetTransformInfo::isZExtFree. Summary: This function is not used in this change but will be used in a subsequent change. Reviewers: mcrosier, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9180 llvm-svn: 243347	2015-07-27 23:27:43 +00:00
JF Bastien	088c47ee5b	WebAssembly: add a generic CPU Summary: WebAssemblySubtarget.cpp expects a default 'generic' CPU to exist, and this seems to be prevalent with other targets. It makes sense to have something between MVP and bleeding-edge, even though for now it's the same as MVP. This removes a warning that's currently generated. Subscribers: jfb, llvm-commits, sunfish Differential Revision: http://reviews.llvm.org/D11546 llvm-svn: 243345	2015-07-27 23:25:54 +00:00
NAKAMURA Takumi	69ed7170dc	Tweak llvm/test/CodeGen/X86/virtual-registers-cleared-in-machine-functions-liveins.ll not to fail for targeting win32. llvm-svn: 243341	2015-07-27 23:01:41 +00:00
Alex Lorenz	8a1915b04e	MIR Serialization: Serialize the unnamed basic block references. This commit serializes the references from the machine basic blocks to the unnamed basic blocks. This commit adds a new attribute to the machine basic block's YAML mapping called 'ir-block'. This attribute contains the actual reference to the basic block. Reviewers: Duncan P. N. Exon Smith llvm-svn: 243340	2015-07-27 22:42:41 +00:00
JF Bastien	6c6efa1786	WebAssembly: more MCAsmInfo nits. Summary: As suggested by sunfish. Subscribers: jfb, llvm-commits, sunfish Differential Revision: http://reviews.llvm.org/D11544 llvm-svn: 243339	2015-07-27 22:40:31 +00:00
Colin LeMahieu	fe36f83b11	[llvm-mc] Add --no-warn flag with -W alias to disable outputting warnings while assembling. llvm-svn: 243338	2015-07-27 22:39:14 +00:00
Reid Kleckner	7bdf4f2eb2	Fix -Wmicrosoft-enum warning llvm-svn: 243337	2015-07-27 22:35:50 +00:00
Alex Lorenz	991a6241d3	IR: Expose the method 'getLocalSlot' in the module slot tracker. This commit publicly exposes the method 'getLocalSlot' in the 'ModuleSlotTracker' class. This change is useful for MIR serialization, to serialize the unnamed basic block and unnamed alloca references. Reviewers: Duncan P. N. Exon Smith llvm-svn: 243336	2015-07-27 22:31:04 +00:00
Alexandros Lamprineas	4ea707555a	- Added support for parsing HWDiv features using Target Parser. - Architecture extensions are represented as a bitmap. Phabricator: http://reviews.llvm.org/D11457 llvm-svn: 243335	2015-07-27 22:26:59 +00:00
Colin LeMahieu	fe2c8b8015	[llvm-mc] Pushing plumbing through for --fatal-warnings flag. llvm-svn: 243334	2015-07-27 21:56:53 +00:00
Sanjoy Das	5dab205ced	[IndVars] Make loop varying predicates loop invariant. Summary: Was D9784: "Remove loop variant range check when induction variable is strictly increasing" This change re-implements D9784 with the two differences: 1. It does not use SCEVExpander and does not generate new instructions. Instead, it does a quick local search for existing `llvm::Value`s that it needs when modifying the `icmp` instruction. 2. It is more general -- it deals with both increasing and decreasing induction variables. I've added all of the tests included with D9784, and two more. As an example on what this change does (copied from D9784): Given C code: ``` for (int i = M; i < N; i++) // i is known not to overflow if (i < 0) break; a[i] = 0; } ``` This transformation produces: ``` for (int i = M; i < N; i++) if (M < 0) break; a[i] = 0; } ``` Which can be unswitched into: ``` if (!(M < 0)) for (int i = M; i < N; i++) a[i] = 0; } ``` I went back and forth on whether the top level logic should live in `SimplifyIndvar::eliminateIVComparison` or be put into its own routine. Right now I've put it under `eliminateIVComparison` because even though the `icmp` is not eliminated, it no longer is an IV comparison. I'm open to putting it in its own helper routine if you think that is better. Reviewers: reames, nicholas, atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11278 llvm-svn: 243331	2015-07-27 21:42:49 +00:00
Sanjay Patel	1cf245fd96	remove unnecessary forward declaration; NFC llvm-svn: 243328	2015-07-27 21:11:55 +00:00
Sanjay Patel	aa99a2304d	don't repeat function names in comments; NFC llvm-svn: 243327	2015-07-27 21:03:03 +00:00
JF Bastien	1a12bf1aa2	WebAssembly: minor MCAsmInfo fixes Summary: Fix pointer / callee-save stack sto size. Update comment character to be LISP-ish. Subscribers: llvm-commits, sunfish, jfb Differential Revision: http://reviews.llvm.org/D11537 llvm-svn: 243326	2015-07-27 20:46:51 +00:00
Simon Pilgrim	c363e7d8e0	[X86][SSE] Added shuffle tests to demonstrate missed bitmask. llvm-svn: 243324	2015-07-27 20:41:57 +00:00
Alex Lorenz	5b0d5f6f26	MIR Serialization: Serialize the '.cfi_def_cfa_register' CFI instruction. llvm-svn: 243322	2015-07-27 20:39:03 +00:00
Alex Lorenz	1ea608986d	MIR Parser: Rename the standalone parsing methods. NFC. This commit renames the methods 'parseMBB' and 'parseNamedRegister' to 'parseStandaloneMBB' and 'parseStandaloneNamedRegister' in order for their names to be consistent with the method 'parseStandaloneVirtualRegister'. llvm-svn: 243319	2015-07-27 20:29:27 +00:00
Bruno Cardoso Lopes	b20841df44	Revert "[PeepholeOptimizer] Look through PHIs to find additional register sources" Still breaks some ARM buildbots. This reverts r243271. llvm-svn: 243318	2015-07-27 20:26:04 +00:00
Adam Nemet	7c52e0527d	[LAA] Upper-case variable names, NFC llvm-svn: 243313	2015-07-27 19:38:50 +00:00
Adam Nemet	bbe1f1de16	[LAA] Split out a helper from addRuntimeCheck to generate the check, NFC llvm-svn: 243312	2015-07-27 19:38:48 +00:00
Akira Hatanaka	2541e0241c	[AArch64] Remove check for Darwin that was needed to decide if x18 should be reserved. The decision to reserve x18 is going to be made solely by the front-end, so it isn't necessary to check if the OS is Darwin in the backend. llvm-svn: 243308	2015-07-27 19:18:47 +00:00
Simon Pilgrim	074c0d97dc	Fixed signed/unsigned comparison warning. llvm-svn: 243306	2015-07-27 19:07:15 +00:00
Juergen Ributzka	93d67463a3	[AArch64][FastISel] Add more truncation tests. This is a follow-up to r243198 and adds more truncation tests. llvm-svn: 243304	2015-07-27 19:00:23 +00:00
Simon Pilgrim	15c0a59463	[InstCombine][X86][SSE] Replace sign/zero extension intrinsics with native IR Now that we are generating sane codegen for vector sext/zext nodes on SSE targets, this patch uses instcombine to replace the SSE41/AVX2 pmovsx and pmovzx intrinsics with the equivalent native IR code. Differential Revision: http://reviews.llvm.org/D11503 llvm-svn: 243303	2015-07-27 18:52:15 +00:00
Pete Cooper	11bd958cb6	Revert "Remove unnecessary null check. NFC." This reverts commit r243167. Duncan pointed out that dyn_cast can return null in these cases, so this was an unsafe commit to make. Sorry for the noise. Worryingly there were no tests which fail... llvm-svn: 243302	2015-07-27 18:37:58 +00:00
Matt Arsenault	95365ca482	Fix assert when inlining a constantexpr addrspacecast The pointer size of the addrspacecasted pointer might not have matched, so this would have hit an assert in accumulateConstantOffset. I think this was here to allow constant folding of a load of an addrspacecasted constant. Accumulating the offset through the addrspacecast doesn't make much sense, so something else is necessary to allow folding the load through this cast. llvm-svn: 243300	2015-07-27 18:31:03 +00:00
Diego Novillo	cd973c4f77	Fix ODR violation. NFC. There is an ODR conflict between lib/ExecutionEngine/ExecutionEngineBindings.cpp and lib/Target/TargetMachineC.cpp. The inline definitions should simply be marked static (thanks dblaikie for the hint). llvm-svn: 243298	2015-07-27 18:27:23 +00:00
JF Bastien	ba70e9e1e6	Fix `llvm-config` to emit the linker flag for the combined shared object built by autoconfig/make instead of the individual components. Summary: When LLVM is configured to build shared libraries, CMake builds each component as it's own shared object, while autoconfig/make builds them statically and then links them all together to create a single shared object. This change adds compile time config flags to `llvm-config` so it can know whether LLVM's components are separated or not and act accordingly. This fixes `llvm-config` instead of fixing the makefiles to behave like CMake because, AIUI, LLVM's autoconfig/make build system is on the way out anyway. This change only affects `llvm-config` from builds that use autoconfig/make. Reviewers: jfb Subscribers: echristo, dschuff, llvm-commits Differential Revision: http://reviews.llvm.org/D11392 llvm-svn: 243297	2015-07-27 18:26:30 +00:00
Marek Olsak	93df060871	AMDGPU: don't match vgpr loads for constant loads Author: Dave Airlie <airlied@redhat.com> In order to implement indirect sampler loads, we don't want to match on a VGPR load but an SGPR one for constants, as we cannot feed VGPRs to the sampler only SGPRs. this should be applicable for llvm 3.7 as well. llvm-svn: 243294	2015-07-27 18:16:08 +00:00
Sanjay Patel	c1c2b87001	move combineRepeatedFPDivisors logic into a helper function; NFCI llvm-svn: 243293	2015-07-27 17:58:49 +00:00
Alex Lorenz	10b23525cc	Reset the virtual registers in liveins when clearing the virtual registers. This commit zeroes out the virtual register references in the machine function's liveins in the class 'MachineRegisterInfo' when the virtual register definitions are cleared. Reviewers: Matthias Braun llvm-svn: 243290	2015-07-27 17:51:59 +00:00
Alex Lorenz	12045a4b59	MIR Serialization: Serialize the machine function's liveins. Reviewers: Duncan P. N. Exon Smith llvm-svn: 243288	2015-07-27 17:42:45 +00:00
Sanjay Patel	beb4cffb43	fix typo and spacing; NFC llvm-svn: 243287	2015-07-27 17:39:20 +00:00
Davide Italiano	fa04402e24	[TableGen] Emit the correct error message. llvm-svn: 243284	2015-07-27 17:22:19 +00:00
Pete Cooper	0ae7393027	Revert "Add const to a bunch of Type* in DataLayout. NFC." This reverts commit r243135. Feedback from Craig Topper and David Blaikie was that we don't put const on Type as it has no mutable state. llvm-svn: 243283	2015-07-27 17:15:28 +00:00
Pete Cooper	2e20147403	Revert "Add const to some Type* parameters which didn't need to be mutable. NFC." This reverts commit r243146. Feedback from Craig Topper and David Blaikie was that we don't put const on Type as it has no mutable state. llvm-svn: 243282	2015-07-27 17:15:24 +00:00
Silviu Baranga	de38070587	The tests added in r243270 require asserts to be enabled llvm-svn: 243274	2015-07-27 15:22:49 +00:00
Silviu Baranga	65bdb6788b	Fix the tests added in r243270. Use 2>&1 instead of \|& llvm-svn: 243273	2015-07-27 15:08:55 +00:00
Bruno Cardoso Lopes	669c921bfd	[PeepholeOptimizer] Look through PHIs to find additional register sources Reapply r242295 with fixes in the implementation. - Teaches the ValueTracker in the PeepholeOptimizer to look through PHI instructions. - Add findNextSourceAndRewritePHI method to lookup into multiple sources returnted by the ValueTracker and rewrite PHIs with new sources. With these changes we can find more register sources and rewrite more copies to allow coaslescing of bitcast instructions. Hence, we eliminate unnecessary VR64 <-> GR64 copies in x86, but it could be extended to other archs by marking "isBitcast" on target specific instructions. The x86 example follows: A: psllq %mm1, %mm0 movd %mm0, %r9 jmp C B: por %mm1, %mm0 movd %mm0, %r9 jmp C C: movd %r9, %mm0 pshufw $238, %mm0, %mm0 Becomes: A: psllq %mm1, %mm0 jmp C B: por %mm1, %mm0 jmp C C: pshufw $238, %mm0, %mm0 Differential Revision: http://reviews.llvm.org/D11197 rdar://problem/20404526 llvm-svn: 243271	2015-07-27 14:39:46 +00:00
Silviu Baranga	7581d22512	[ARM/AArch64] Fix cost model for interleaved accesses Summary: Fix the cost of interleaved accesses for ARM/AArch64. We were calling getTypeAllocSize and using it to check the number of bits, when we should have called getTypeAllocSizeInBits instead. This would pottentially cause the vectorizer to generate loads/stores and shuffles which cannot be matched with an interleaved access instruction. No performance changes are expected for now since matching/generating interleaved accesses is still disabled by default. Reviewers: rengolin Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D11524 llvm-svn: 243270	2015-07-27 14:39:34 +00:00
Simon Pilgrim	81accb7b27	[X86] Reordered lowerVectorShuffleAsBitMask before lowerVectorShuffleAsBlend. NFCI. Allows us to show diffs for D11518 more clearly llvm-svn: 243264	2015-07-27 12:37:19 +00:00
Marek Olsak	1354b87695	AMDGPU/SI: Fix the V_FRACT_F64 SI bug workaround This is a candidate for 3.7. llvm-svn: 243263	2015-07-27 11:37:42 +00:00
NAKAMURA Takumi	94abbbd6ab	LoopAccessAnalysis.cpp: Tweak r243239 to avoid side effects. It caused different emissions between gcc and clang. llvm-svn: 243258	2015-07-27 01:35:30 +00:00
Sean Silva	e1c6b549ef	Avoid using uncommon acronym "MSROM". llvm-svn: 243256	2015-07-27 00:46:59 +00:00
Jingyue Wu	bfefff555e	Roll forward r243250 r243250 appeared to break clang/test/Analysis/dead-store.c on one of the build slaves, but I couldn't reproduce this failure locally. Probably a false positive as I saw this test was broken by r243246 or r243247 too but passed later without people fixing anything. llvm-svn: 243253	2015-07-26 19:10:03 +00:00
Jingyue Wu	84879b71a9	Revert r243250 breaks tests llvm-svn: 243251	2015-07-26 18:30:13 +00:00
Jingyue Wu	bf485f059c	[TTI/CostModel] improve TTI::getGEPCost and use it in CostModel::getInstructionCost Summary: This patch updates TargetTransformInfoImplCRTPBase::getGEPCost to consider addressing modes. It now returns TCC_Free when the GEP can be completely folded to an addresing mode. I started this patch as I refactored SLSR. Function isGEPFoldable looks common and is indeed used by some WIP of mine. So I extracted that logic to getGEPCost. Furthermore, I noticed getGEPCost wasn't directly tested anywhere. The best testing bed seems CostModel, but its getInstructionCost method invokes getAddressComputationCost for GEPs which provides very coarse estimation. So this patch also makes getInstructionCost call the updated getGEPCost for GEPs. This change inevitably breaks some tests because the cost model changes, but nothing looks seriously wrong -- if we believe the new cost model is the right way to go, these tests should be updated. This patch is not perfect yet -- the comments in some tests need to be updated. I want to know whether this is a right approach before fixing those details. Reviewers: chandlerc, hfinkel Subscribers: aschwaighofer, llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D9819 llvm-svn: 243250	2015-07-26 17:28:13 +00:00
Simon Pilgrim	65d35a14b7	[X86][SSE] Refreshed vector bit count tests. llvm-svn: 243249	2015-07-26 17:02:25 +00:00
Simon Pilgrim	8a9c1d7d88	[X86][AVX2] Refreshed avx2 conversion tests llvm-svn: 243248	2015-07-26 17:01:16 +00:00
Tobias Grosser	56eab3603a	bugpoint: make the number of trim iterations a compile-time constant Around 10 year ago Chris limited this code to a single iteration by just dropping a break into the loop body. We now make the number of trim iterations a compile time constant to be able to play with it and see if this can improve the bugpoint results. We currently use with '3' still a small and conservative value, but this can be adjusted in the future, if needed. I tried to look for a trivial test case, but did not succeed yet. llvm-svn: 243247	2015-07-26 15:18:45 +00:00
Igor Breger	f2460112ad	Implemented encoding and intrinsics of the following instructions vunpckhps/pd, vunpcklps/pd, vpunpcklbw, vpunpckhbw, vpunpcklwd, vpunpckhwd, vpunpckldq, vpunpckhdq, vpunpcklqdq, vpunpckhqdq Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D11509 llvm-svn: 243246	2015-07-26 14:41:44 +00:00
Tobias Grosser	e692669759	Fix typo in comment llvm-svn: 243244	2015-07-26 11:37:05 +00:00
Davide Italiano	4376ddb88e	[llvm-dwarfump] Don't rely on global state, part 3. Some tools used to rely on a global static variable to keep track of the return value for main(). I changed llvm-cxxdump to use exit(1) and Rafael shortly after did the same with llvm-readobj. This is (yet) another step towards the goal. llvm-svn: 243240	2015-07-26 05:35:59 +00:00
Adam Nemet	1da7df3700	[LAA] Begin moving the logic of generating checks out of addRuntimeCheck Summary: The goal is to start moving us closer to the model where RuntimePointerChecking will compute and store the checks. Then a client can filter the check according to its requirements and then use the filtered list of checks with addRuntimeCheck. Before the patch, this is all done in addRuntimeCheck. So the patch starts to split up addRuntimeCheck while providing the old API under what's more or less a wrapper now. The new underlying addRuntimeCheck takes a collection of checks now, expands the code for the bounds then generates the code for the checks. I am not completely happy with making expandBounds static because now it needs so many explicit arguments but I don't want to make the type PointerBounds part of LAI. This should get fixed when addRuntimeCheck is moved to LoopVersioning where it really belongs, IMO. Audited the assembly diff of the testsuite (including externals). There is a tiny bit of assembly churn that is due to the different order the code for the bounds is expanded now (MultiSource/Benchmarks/Prolangs-C/bison/conflicts.s and with LoopDist on 456.hmmer/fast_algorithms.s). Reviewers: hfinkel Subscribers: klimek, llvm-commits Differential Revision: http://reviews.llvm.org/D11205 llvm-svn: 243239	2015-07-26 05:32:14 +00:00
Simon Pilgrim	54fcd62c6f	[InstCombine][SSE4A] Standardized references to Length/Width and Index/Start to match AMD docs. NFCI. llvm-svn: 243226	2015-07-25 20:41:00 +00:00
Simon Pilgrim	357b85c926	[InstCombine] Split off SSE4a tests. These aren't vector demanded bits tests. More tests to follow. llvm-svn: 243223	2015-07-25 17:14:01 +00:00
Simon Pilgrim	944a5777bb	[X86][SSE] Added additional vector sign/zero load extension tests. llvm-svn: 243216	2015-07-25 14:07:20 +00:00
Simon Pilgrim	20dc35aff6	[X86][SSE] Added additional vector sign/zero extension tests. llvm-svn: 243212	2015-07-25 11:17:35 +00:00
Chen Li	145c2f57ae	[LoopUnswitch] Improve loop unswitch pass to find trivial unswitch conditions more effectively Summary: This patch improves trivial loop unswitch. The current trivial loop unswitch only checks if loop header's terminator contains a trivial unswitch condition. But if the loop header only has one reachable successor (due to intentionally or unintentionally missed code simplification), we should consider the successor as part of the loop header. Therefore, instead of stopping at loop header's terminator, we should keep traversing its successors within loop until reach a real conditional branch or switch (whose condition can not be constant folded). This change will enable a single -loop-unswitch pass to unswitch multiple trivial conditions (unswitch one trivial condition could open opportunity to unswitch another one in the same loop), while the old implementation can unswitch only one per pass. Reviewers: reames, broune Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D11481 llvm-svn: 243203	2015-07-25 03:21:06 +00:00
Juergen Ributzka	6364985b58	[AArch64][FastISel] Always use an AND instruction when truncating to non-legal types. When truncating to non-legal types (such as i16, i8 and i1) always use an AND instruction to mask out the upper bits. This was only done when the source type was an i64, but not when the source type was an i32. This commit fixes this and adds the missing i32 truncate tests. This fixes rdar://problem/21990703. llvm-svn: 243198	2015-07-25 02:16:53 +00:00
Eric Christopher	f0024d14f1	Fix PPCMaterializeInt to check the size of the integer based on the extension property we're requesting - zero or sign extended. This fixes cases where we want to return a zero extended 32-bit -1 and not be sign extended for the entire register. Also updated the already out of date comment with the current behavior. llvm-svn: 243192	2015-07-25 00:48:08 +00:00
Eric Christopher	03df7ac8a9	PPCMaterializeInt should only take a ConstantInt so represent this in the prototype and fix up all uses. llvm-svn: 243191	2015-07-25 00:48:06 +00:00
Akira Hatanaka	0d4c9ea6e0	[AArch64] Define subtarget feature "reserve-x18", which is used to decide whether register x18 should be reserved. This change is needed because we cannot use a backend option to set cl::opt "aarch64-reserve-x18" when doing LTO. Out-of-tree projects currently using cl::opt option "-aarch64-reserve-x18" to reserve x18 should make changes to add subtarget feature "reserve-x18" to the IR. rdar://problem/21529937 Differential Revision: http://reviews.llvm.org/D11463 llvm-svn: 243186	2015-07-25 00:18:31 +00:00
Duncan P. N. Exon Smith	56b893b364	DI/Verifier: Fix argument bitrot in DILocalVariable Add a verifier check that `DILocalVariable`s of tag `DW_TAG_arg_variable` always have a non-zero 'arg:' field, and those of tag `DW_TAG_auto_variable` always have a zero 'arg:' field. These are the only configurations that are properly understood by the backend. (Also, fix the bad examples in LangRef and test/Assembler, and fix the bug in Kaleidoscope Ch8.) A large number of testcases seem to have bitrotted their way forward from some ancient version of the debug info hierarchy that didn't have `arg:` parameters. If you have out-of-tree testcases that start failing in the verifier and you don't care enough to get the `arg:` right, you may have some luck just calling: sed -e 's/, arg: 0/, arg: 1/' or some such, but I hand-updated the ones in tree. llvm-svn: 243183	2015-07-24 23:59:25 +00:00
Alex Lorenz	1bb48de1f9	MIR Serialization: Serialize MachineFrameInfo's callee saved information. This commit serializes the callee saved information from the class 'MachineFrameInfo'. This commit extends the YAML mappings for the fixed and the ordinary stack objects and adds an optional 'callee-saved-register' attribute. This attribute is used to serialize the callee save information. llvm-svn: 243173	2015-07-24 22:22:50 +00:00
Lawrence Hu	dc8a83b53b	Handle loop with negtive induction variable increment This patch extend LoopReroll pass to hand the loops which is similar to the following: while (len > 1) { sum4 += buf[len]; sum4 += buf[len-1]; len -= 2; } llvm-svn: 243171	2015-07-24 22:01:49 +00:00
Pete Cooper	3191697138	Remove unnecessary null check. NFC. Since both places which set this variable do so with dyn_cast, and not dyn_cast_or_null, its impossible to get a nullptr here, so we can remove the check. llvm-svn: 243167	2015-07-24 21:38:01 +00:00
Pete Cooper	7679afda82	Use make_range(rbegin(), rend()) to allow foreach loops. NFC. Instead of the pattern for (auto I = x.rbegin(), E = x.end(); I != E; ++I) we can use make_range to construct the reverse range and iterate using that instead. llvm-svn: 243163	2015-07-24 21:13:43 +00:00
Duncan P. N. Exon Smith	16bc6e1727	DI: Fix unit tests after r243160 These always empty fields are gone, so don't test that they're empty. llvm-svn: 243162	2015-07-24 21:11:06 +00:00
Duncan P. N. Exon Smith	b9e045af44	DI: Remove unnecessary DICompositeTypeBase Remove unnecessary and confusing common base class for `DICompositeType` and `DISubroutineType`. While at a high-level `DISubroutineType` is a sort of composite of other types, it has no shared code paths, and its fields are completely disjoint. This relationship was left over from the old debug info hierarchy. llvm-svn: 243160	2015-07-24 20:56:36 +00:00
Duncan P. N. Exon Smith	260fa8a75b	DI: Simplify DebugInfoFinder::processType(), NFC Handle `DISubroutineType` up-front rather than as part of a branch for `DICompositeTypeBase`. The only shared code path was looking through the base type, but `DISubroutineType` can never have a base type. This also removes the last use of `DICompositeTypeBase`, since we can strengthen the cast to `DICompositeType`. llvm-svn: 243159	2015-07-24 20:56:10 +00:00
Duncan P. N. Exon Smith	3c5a56b13c	DI: Remove dead code: getDICompositeType() llvm-svn: 243158	2015-07-24 20:46:46 +00:00
Duncan P. N. Exon Smith	acd8cf8582	AsmPrinter: Use DICompositeType in updateAcceleratorTables(), NFC `DISubroutineType` is impossible at this `dyn_cast` site, since we're only dealing with named types and `DISubroutineType` cannot be named. Strengthen the `dyn_cast` to `DICompositeType`. llvm-svn: 243157	2015-07-24 20:45:26 +00:00
Alex Lorenz	ab4cbcfda7	MIR Serialization: Serialize the simple virtual register allocation hints. This commit serializes the virtual register allocations hints of type 0. These hints specify the preferred physical registers for allocations. llvm-svn: 243156	2015-07-24 20:35:40 +00:00
Duncan P. N. Exon Smith	338aef0a07	DI: Remove DIDerivedTypeBase Remove an unnecessary (and confusing) common subclass for `DIDerivedType` and `DICompositeType`. These classes aren't really related, and even in the old debug info hierarchy, there was a long-standing FIXME to separate them. llvm-svn: 243152	2015-07-24 20:16:36 +00:00
Duncan P. N. Exon Smith	dbfc010691	Verifier: Sink filename check into visitMDCompositeType(), NFC We really only want to check this for unions and classes (all the other tags have been ruled out), so simplify the check and move it to the right place. llvm-svn: 243150	2015-07-24 19:57:19 +00:00
Duncan P. N. Exon Smith	df9c9ff43b	Verifier: Remove unnecessary references to DW_TAG_subroutine_type, NFC Remove unnecessary references to `DW_TAG_subroutine_type` in `visitDICompositeType()` and `visitDIDerivedTypeBase()`, since `visitDISubroutineType()` doesn't call either of those (and shouldn't, since subroutine types are really quite special). llvm-svn: 243149	2015-07-24 19:52:18 +00:00
Duncan P. N. Exon Smith	89c5e6ff49	DI: Clarify isUnsignedDIType(), NFC Refactor `isUnsignedDIType()` to deal with `DICompositeType` explicitly. Since `DW_TAG_subroutine_type` isn't handled here (the assertions about tags rule it out), this allows strengthening the `dyn_cast` to `DIDerivedType`. Besides making the code clearer, this it removes a use of `DIDerivedTypeBase`. llvm-svn: 243148	2015-07-24 19:42:12 +00:00
Pete Cooper	098f7c1fcb	Add const to some Type* parameters which didn't need to be mutable. NFC. We were only getting the size of the type which doesn't need to modify the type. llvm-svn: 243146	2015-07-24 19:19:26 +00:00
Diego Novillo	b9bf447d90	Remove unused variable. NFC. llvm-svn: 243145	2015-07-24 19:18:32 +00:00
Duncan P. N. Exon Smith	2ecfb1e269	DI: Strengthen some dyn_casts to DIDerivedType, NFC The surrounding code proves in both cases that these must be `DIDerivedType` if they're `DIDerivedTypeBase`, so strengthen the `dyn_cast`s to the more specific type. llvm-svn: 243143	2015-07-24 19:17:20 +00:00
Jingyue Wu	abb05aa3c6	Remove the user-count threshold when analyzing read attributes Summary: This threshold limited FunctionAttrs ability to prove arguments to be read-only. In NVPTX, a specialized instruction ld.global.nc can be used to load memory with non-coherent texture cache. We notice that in SHOC [1] benchmark, some function arguments are not marked with readonly because FunctionAttrs reaches a hardcoded threshold when analysis uses. Removing this threshold won't cause significant regression in compilation time, because the worst-case time complexity of the algorithm is still O(# of instructions) for each parameter. Patched by Xuetian Weng. [1] https://github.com/vetter/shoc Reviewers: nlewycky, jingyue, nicholas Subscribers: nicholas, test, llvm-commits Differential Revision: http://reviews.llvm.org/D11311 llvm-svn: 243141	2015-07-24 19:05:53 +00:00
Philip Reames	fa2c630f79	[RewriteStatepointsForGC] Adjust naming scheme to be more stable The names for instructions inserted were previous dependent on iteration order. By deriving the names from the original instructions, we can avoid instability in tests without resorting to ordered traversals. It also makes the IR mildly easier to read at large scale. llvm-svn: 243140	2015-07-24 19:01:39 +00:00
Duncan P. N. Exon Smith	099ea1c9ae	DI: Strengthen block-byref cast to DIDerivedType, NFC This code is visiting the members of a block-byref, and we know those are all `DIDerivedType`. Strengthen the cast. llvm-svn: 243138	2015-07-24 18:58:32 +00:00
Pete Cooper	0debbdc872	Use foreach loops for StructType::elements(). NFC. We had a few places where we did for (unsigned i = 0, e = STy->getNumElements(); i != e; ++i) { but those could instead do for (auto *EltTy : STy->elements()) { llvm-svn: 243136	2015-07-24 18:55:49 +00:00
Pete Cooper	6aaad8d88d	Add const to a bunch of Type* in DataLayout. NFC. Almost all methods in DataLayout took mutable pointers but didn't need to. These were only accessing constant methods of the types, or using the Type* to key a map. Neither of these needs a mutable pointer. llvm-svn: 243135	2015-07-24 18:29:09 +00:00
Duncan P. N. Exon Smith	6ac940db19	DI: Only DICompositeType has getElements(), NFC There is an assertion inside `DICompositeTypeBase::getElements()` that `this` is not a `DISubroutineType`, leaving only `DICompositeType`. Make that clear at the call sites. llvm-svn: 243134	2015-07-24 18:17:17 +00:00
Alex Lorenz	c7bf20403b	MIR Parser: Run the machine verifier after initializing machine functions. llvm-svn: 243128	2015-07-24 17:44:49 +00:00
Lang Hames	a8183e5c40	[RuntimeDyld] MachO: Add support for ARM scattered vanilla relocations. llvm-svn: 243126	2015-07-24 17:40:04 +00:00
Alex Lorenz	3905d9db97	MIR Tests: Add liveins and successors to make tests pass with machine verifier. This commit adds the liveins and successors properties to machine basic blocks in some of the MIR tests to ensure that the tests will pass when the MIR parser will run the machine verifier after initializing a machine function. llvm-svn: 243124	2015-07-24 17:36:55 +00:00
Alex Lorenz	55f95127bf	MIR Tests: Make the basic block successor test an X86 specific test. This commit moves and transforms the generic test 'CodeGen/MIR/successor-basic-blocks.mir' into an X86 specific test 'CodeGen/MIR/X86/successor-basic-blocks.mir'. This change is required in order to enable the machine verifier for the MIR parser, as the machine verifier verifies that the machine basic blocks contain instructions that actually determine the machine basic block successors. llvm-svn: 243123	2015-07-24 17:31:55 +00:00
Igor Breger	074a64e72c	AVX-512: Implemented encoding , DAG lowering and intrinsics for Integer Truncate with/without saturation Added tests for DAG lowering ,encoding and intrinsic Differential Revision: http://reviews.llvm.org/D11218 llvm-svn: 243122	2015-07-24 17:24:15 +00:00
Chandler Carruth	df47bb9a06	Update for r243115 which changed the DataLayout API on TargetMachine but didn't update the gold-plugin. llvm-svn: 243121	2015-07-24 17:23:09 +00:00
Hans Wennborg	fa8e3a551f	test-release.sh: Defer test errors until the end This makes the script run to the end and produce tarballs even on test failures, and then highlights any errors afterwards. (I first tried just storing the errors in a global variable, but that didn't work as the "test_llvmCore" function invocation is actually running as a sub-shell.) Differential Revision: http://reviews.llvm.org/D11478 llvm-svn: 243116	2015-07-24 16:16:09 +00:00
Mehdi Amini	26d481311a	Remove access to the DataLayout in the TargetMachine Summary: Replace getDataLayout() with a createDataLayout() method to make explicit that it is intended to create a DataLayout only and not accessing it for other purpose. This change is the last of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11103 (cherry picked from commit 5609fc56bca971e5a7efeaa6ca4676638eaec5ea) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 243114	2015-07-24 16:04:22 +00:00
Sanjay Patel	0495dbf1e1	fix wrong comment; NFC llvm-svn: 243113	2015-07-24 16:02:14 +00:00
NAKAMURA Takumi	c9bc0b1e14	llvm/test/tools/dsymutil/ARM/lit.local.cfg: Fix possibly typo, s/X86/ARM/. llvm-svn: 243106	2015-07-24 11:55:11 +00:00
Luke Cheeseman	4d45ff2b87	[ARM] - Fix lowering of shufflevectors in AArch32 Some shufflevectors are currently being incorrectly lowered in the AArch32 backend as the existing checks for detecting the NEON operations from the shufflevector instruction expects the shuffle mask and the vector operands to be of the same length. This is not always the case as the mask may be twice as long as the operand; here only the lower half of the shufflemask gets checked, so provided the lower half of the shufflemask looks like a vector transpose (or even is just all -1 for undef) then the intrinsics may get incorrectly lowered into a vector transpose (VTRN) instruction. This patch fixes this by accommodating for both cases and adds regression tests. Differential Revision: http://reviews.llvm.org/D11407 llvm-svn: 243103	2015-07-24 09:57:05 +00:00
Luke Cheeseman	b5c627aba8	When lowering vector shifts a check is performed to see if the value to shift by is an immediate, in this check the value is negated and stored in and int64_t. The value can be -2^63 yet the result cannot be stored in an int64_t and this gives some undefined behaviour causing failures. The negation is only necessary when the values is within a certain range and so it should not need to negate -2^63, this patch introduces this and also a regression test. Differential Revision: http://reviews.llvm.org/D11408 llvm-svn: 243100	2015-07-24 09:31:48 +00:00
Frederic Riss	eb85c8fb09	[dsymutil] Implement support for universal mach-o object files. This patch allows llvm-dsymutil to read universal (aka fat) macho object files and archives. The patch touches nearly everything in the BinaryHolder, but it is fairly mechinical: the methods that returned MemoryBufferRefs or ObjectFiles now return a vector of those, and the high-level access function takes a triple argument to select the architecture. There is no support yet for handling fat executables and thus no support for writing fat object files. llvm-svn: 243096	2015-07-24 06:41:11 +00:00
Frederic Riss	65f0abf275	[dsymutil] Make the triple detection more strict. MachOObjectFile offers a method for detecting the correct triple, use it instead of the previous approximation. This doesn't matter right now, but it will become important for mach-o universal (fat) binaries. llvm-svn: 243095	2015-07-24 06:41:04 +00:00
Frederic Riss	9388406c21	[dsymutil] Refactor BinaryHolder internals. NFC Call a helper that resets all the internal state of the BinaryHolder when we change the underlying memory buffer. Makes a followup patch a tiny bit smaller. llvm-svn: 243094	2015-07-24 06:40:59 +00:00
Mehdi Amini	5d8e569926	Revert "Remove access to the DataLayout in the TargetMachine" This reverts commit 0f720d984f419c747709462f7476dff962c0bc41. It breaks clang too badly, I need to prepare a proper patch for clang first. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 243089	2015-07-24 03:36:55 +00:00
Alexei Starovoitov	01886a05b8	[bpf] initial support for debug_info llvm-svn: 243087	2015-07-24 03:17:08 +00:00
Davide Italiano	cd1b6dbcad	[llvm-reaobj] Display COFF-specific sections/tables only if the object is COFF. Just skip them otherwise. llvm-svn: 243086	2015-07-24 02:14:20 +00:00
Michael Zolotukhin	57776b8159	Handle resolvable branches in complete loop unroll heuristic. Summary: Resolving a branch allows us to ignore blocks that won't be executed, and thus make our estimate more accurate. This patch is intended to be applied after D10205 (though it could be applied independently). Reviewers: chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10206 llvm-svn: 243084	2015-07-24 01:53:04 +00:00
Mehdi Amini	b4bc424c9a	Remove access to the DataLayout in the TargetMachine Summary: Replace getDataLayout() with a createDataLayout() method to make explicit that it is intended to create a DataLayout only and not accessing it for other purpose. This change is the last of a series of commits dedicated to have a single DataLayout during compilation by using always the one owned by the module. Reviewers: echristo Subscribers: jholewinski, llvm-commits, rafael, yaron.keren Differential Revision: http://reviews.llvm.org/D11103 (cherry picked from commit 5609fc56bca971e5a7efeaa6ca4676638eaec5ea) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 243083	2015-07-24 01:44:39 +00:00
Saleem Abdulrasool	f95da49f25	build: fix small typo in cmake doxygen build A search word spelled as "searhc" in the LLVM_DOXYGEN_SEARCHENGINE_URL cmake variable docstring. Patch by Daniel Otero! llvm-svn: 243082	2015-07-24 01:14:25 +00:00
NAKAMURA Takumi	a6ccd6cd15	MIRParser/LLVMBuild.txt: Add MC for MCRegisterInfo::getDwarfRegNum(). llvm-svn: 243081	2015-07-24 01:12:36 +00:00
NAKAMURA Takumi	d12ebaf9a4	Reorder alphabetically. llvm-svn: 243080	2015-07-24 01:12:28 +00:00
Eric Christopher	1fb23395c3	Clean up function attributes on PPC fast-isel tests. llvm-svn: 243079	2015-07-24 01:07:50 +00:00
Kostya Serebryany	404c69f2c8	[libFuzzer] allow users to supply their own implementation of rand llvm-svn: 243078	2015-07-24 01:06:40 +00:00
Philip Reames	29e9ae7891	[RewriteStatepointsForGC] Fix release build warning llvm-svn: 243076	2015-07-24 00:42:55 +00:00
Jonathan Roelofs	b032e04efd	Add missing underlines for a docs section. NFC llvm-svn: 243075	2015-07-24 00:29:50 +00:00
Philip Reames	88958b2df3	[RewriteStatepointsForGC] Use a worklist algorithm for first part of base pointer algorithm [NFC] The new code should hopefully be equivalent to the old code; it just uses a worklist to track instructions which need to visited rather than iterating over all instructions visited each time. This should be faster, but the primary benefit is that the purpose should be more clear and the diff of adding another instruction type (forthcoming) much more obvious. Differential Revision: http://reviews.llvm.org/D11480 llvm-svn: 243071	2015-07-24 00:02:11 +00:00
Lawrence Hu	687097a0a9	test commit, only added one space llvm-svn: 243070	2015-07-23 23:55:28 +00:00
Jingyue Wu	2e424da39b	[NaryReassociate] remove redundant code This check is already done by findClosestMatchingDominator. llvm-svn: 243065	2015-07-23 23:13:37 +00:00
Alex Lorenz	8cfc68677c	MIR Serialization: Serialize the '.cfi_offset' CFI instruction. Reviewers: Duncan P. N. Exon Smith llvm-svn: 243062	2015-07-23 23:09:07 +00:00
JF Bastien	8969666450	WebAssembly: test that valid -mcpu flags are accepted. Summary: AArch64 has a similar test. Subscribers: sunfish, aemerson, llvm-commits, jfb Differential Revision: http://reviews.llvm.org/D11479 llvm-svn: 243058	2015-07-23 23:00:04 +00:00
Sanjay Patel	f2fa58e744	fix crash in machine trace metrics due to processing dbg_value instructions (PR24199) The test in PR24199 ( https://llvm.org/bugs/show_bug.cgi?id=24199 ) crashes because machine trace metrics was not ignoring dbg_value instructions when calculating data dependencies. The machine-combiner pass asks machine trace metrics to calculate an instruction trace, does some reassociations, and calls MachineInstr::eraseFromParentAndMarkDBGValuesForRemoval() along with MachineTraceMetrics::invalidate(). The dbg_value instructions have their operands invalidated, but the instructions are not expected to be deleted. On a subsequent loop iteration of the machine-combiner pass, machine trace metrics would be called again and die while accessing the invalid debug instructions. Differential Revision: http://reviews.llvm.org/D11423 llvm-svn: 243057	2015-07-23 22:56:53 +00:00
Philip Reames	9b141ed48e	[RewriteStatepointsForGC] Rename PhiState to reflect that it's associated w/more than just PHIs Today, Select instructions also have associated PhiStates. In the near future, so will ExtractElement and SuffleVector. llvm-svn: 243056	2015-07-23 22:49:14 +00:00
Philip Reames	2a892a630b	[RewriteStatepointsForGC] Use idomatic mechanisms for debug tracing [NFC] Deleting much of the code using trace-rewrite-statepoints and use idiomatic DEBUG statements instead. This includes adding operator<< to a helper class. llvm-svn: 243054	2015-07-23 22:25:26 +00:00
David Gross	d9c1bc9955	[ARM] Register (existing) ARMLoadStoreOpt pass with LLVM pass manager. Summary: Among other things, this allows -print-after-all/-print-before-all to dump IR around this pass. Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D11373 llvm-svn: 243052	2015-07-23 22:12:46 +00:00
Colin LeMahieu	333a19f6c3	Moving tests in to X86 directory. llvm-svn: 243049	2015-07-23 21:55:26 +00:00

... 2 3 4 5 6 ...

120028 Commits