llvm-project

Commit Graph

Author	SHA1	Message	Date
Kostya Serebryany	5b266a8a23	[fuzzer] make multi-process execution more verbose; fix mutation to actually respect mutation depth and to never produce empty units llvm-svn: 228170	2015-02-04 19:10:20 +00:00
Colin LeMahieu	86abe35ceb	[Hexagon] Replacing some load patterns with cleaner versions. llvm-svn: 228169	2015-02-04 19:05:32 +00:00
Michael Kuperstein	cd63c5fa73	Fixes a bug in vector load legalization that confused bits and bytes. Differential Revision: http://reviews.llvm.org/D7400 llvm-svn: 228168	2015-02-04 18:54:01 +00:00
Ismail Donmez	9559232f1d	Revert test commit llvm-svn: 228167	2015-02-04 18:46:00 +00:00
Ismail Donmez	3b25ef3831	Test commit llvm-svn: 228166	2015-02-04 18:45:43 +00:00
Juergen Ributzka	719615f6dd	Add missing include. llvm-svn: 228161	2015-02-04 18:16:53 +00:00
Colin LeMahieu	f856dcb75e	[Hexagon] Adding missing isCodeGenOnly = 0 llvm-svn: 228160	2015-02-04 18:11:32 +00:00
Colin LeMahieu	c0434466e4	[Hexagon] Adding encoding information for absolute-reg mode stores. Xfailing a test until constant extenders are correctly put in the same packet. llvm-svn: 228158	2015-02-04 17:52:06 +00:00
Alexey Samsonov	b9b8027cee	SpecialCaseList: Add support for parsing multiple input files. Summary: This change allows users to create SpecialCaseList objects from multiple local files. This is needed to implement a proper support for -fsanitize-blacklist flag (allow users to specify multiple blacklists, in addition to default blacklist, see PR22431). DFSan can also benefit from this change, as DFSan instrumentation pass now accepts ABI-lists both from -fsanitize-blacklist= and -mllvm -dfsan-abilist flags. Go bindings are fixed accordingly. Test Plan: regression test suite Reviewers: pcc Subscribers: llvm-commits, axw, kcc Differential Revision: http://reviews.llvm.org/D7367 llvm-svn: 228155	2015-02-04 17:39:48 +00:00
Colin LeMahieu	7d971056ed	[Hexagon] Adding encoding information for absolute-set stores. llvm-svn: 228154	2015-02-04 17:24:04 +00:00
Colin LeMahieu	0eb9727d42	[Hexagon] Adding encoding bits for indirect long load instructions. llvm-svn: 228152	2015-02-04 16:56:46 +00:00
Bradley Smith	9f4cd59e80	[ARM] Fix subtarget feature set truncation when using .cpu directive This is a bug that was caused due to storing the feature bitset in a 32-bit variable when it is a 64-bit mask, discarding the top half of the feature set. llvm-svn: 228151	2015-02-04 16:23:24 +00:00
Zoran Jovanovic	5a1a780c2a	[mips][microMIPS] Implement CodeGen support for SW16 and LW16 instructions Differential Revision: http://reviews.llvm.org/D6581 llvm-svn: 228149	2015-02-04 15:43:17 +00:00
Daniel Sanders	e67d27f5cc	[mips] Make MipsSubtarget::hasMips*() functions consistent. NFC. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7377 llvm-svn: 228147	2015-02-04 15:18:11 +00:00
Daniel Sanders	a9aab74304	[mips] Remove unused check prefix from tests. NFC. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7376 llvm-svn: 228145	2015-02-04 14:48:39 +00:00
Aaron Ballman	34c325e749	Fixing a -Wsign-compare warning; NFC llvm-svn: 228142	2015-02-04 14:01:08 +00:00
Renato Golin	6088504499	Adding support to LLVM for targeting Cortex-A72 Currently, Cortex-A72 is modelled as an Cortex-A57 except the fp load balancing pass isn't enabled for Cortex-A72 as it's not profitable to have it enabled for this core. Patch by Ranjeet Singh. llvm-svn: 228140	2015-02-04 13:31:29 +00:00
Rafael Espindola	a5eb775c4d	Fix warning: "function declaration isn’t a prototype" llvm-svn: 228139	2015-02-04 13:30:28 +00:00
Justin Bogner	1f92ce67c5	InstrProf: std::to_string needs to #include <string> llvm-svn: 228136	2015-02-04 11:19:16 +00:00
Chandler Carruth	4d31f58c88	[x86] Give movss and movsd execution domains in the x86 backend. This associates movss and movsd with the packed single and packed double execution domains (resp.). While this is largely cosmetic, as we now don't have weird ping-pong-ing between single and double precision, it is also useful because it avoids the domain fixing algorithm from seeing domain breaks that don't actually exist. It will also be much more important if we have an execution domain default other than packed single, as that would cause us to mix movss and movsd with integer vector code on a regular basis, a very bad mixture. llvm-svn: 228135	2015-02-04 10:58:53 +00:00
Chandler Carruth	78c8dcd9d3	[x86] Remove a low-value test that was just checking how we cleared a register. We have lots of tests covering this. llvm-svn: 228133	2015-02-04 10:47:34 +00:00
Chandler Carruth	bb525e336b	[x86] Mechanically update a bunch of tests' check lines using the latest version of the script. Changes include: - Using the VEX prefix - Skipping more detail when we have useful shuffle comments to match - Matching more shuffle comments that have been added to the printer (yay!) - Matching the destination registers of some AVX instructions - Stripping trailing whitespace that crept in - Fixing indentation issues Nothing interesting going on here. I'm just trying really hard to ensure these changes don't show up in the diffs with actual changes to the backend. llvm-svn: 228132	2015-02-04 10:46:53 +00:00
Chandler Carruth	e375095392	[x86] Teach the test update script to strip trailing whitespace. This is done in a bit of a strange way to use a multiline RE instead of looping over the lines. Suggestions welcome here for a more pythonic way of doing this as long as its reasonably fast. llvm-svn: 228131	2015-02-04 10:46:48 +00:00
Renato Golin	2a5c0a51ce	Reverting VLD1/VST1 base-updating/post-incrementing combining This reverts patches 223862, 224198, 224203, and 224754, which were all related to the vector load/store combining and were reverted/reaplied a few times due to the same alignment problems we're seeing now. Further tests, mainly self-hosting Clang, will be needed to reapply this patch in the future. llvm-svn: 228129	2015-02-04 10:11:59 +00:00
Chandler Carruth	22b1525ae8	[x86] Include the destination register in the check-lines for AVX instructions. No actual change here. llvm-svn: 228127	2015-02-04 09:18:27 +00:00
Chandler Carruth	18ba596609	[x86] Add some tests I missed in the prior commit to cover blends with zero for v8i16 as well. These exhibit the same domain badness, but also exhibit other weaknesses in our blend lowering. More fixes to come. llvm-svn: 228126	2015-02-04 09:15:46 +00:00
Chandler Carruth	024cf8efd7	[x86] Start to introduce bit-masking based blend lowering. This is the simplest form of bit-math based blending which only fires when we are blending with zero and is relatively profitable. I've only enabled this path on very specific lowering strategies. I'm planning to widen its applicability in subsequent patches, but so far you'll notice that even though we get fewer shufps instructions, we still do the bit math in the FP execution port. I'm looking into why this is still happening. llvm-svn: 228124	2015-02-04 09:06:05 +00:00
Chandler Carruth	f4a1c33c7c	[x86] Add missing patterns for andps, orps, xorps, and andnps. Specifically, the existing patterns were scalar-only. These cover the packed vector bitwise operations when specifically requested with pseudo instructions. This is particularly important in SSE1 where we can't actually emit a logical operation on a v2i64 as that isn't a legal type. This will be tested in subsequent patches which form the floating point and patterns in more places. llvm-svn: 228123	2015-02-04 09:06:01 +00:00
Chandler Carruth	872d80e7a4	[x86] Add tests for blends-with-zero on 4-element vectors. llvm-svn: 228122	2015-02-04 09:05:58 +00:00
Bill Schmidt	81638eabe6	Replace tabs with spaces from r228116. Oops. llvm-svn: 228117	2015-02-04 06:14:38 +00:00
Bill Schmidt	1354f7c5fa	[PowerPC] Handle 32-bit targets properly in PPCTLSDynamicCall.cpp llvm-svn: 228116	2015-02-04 05:51:56 +00:00
Philip Reames	72634d6af0	Fix a warning in non-asserts builds llvm-svn: 228114	2015-02-04 05:11:20 +00:00
Frederic Riss	b61f01f1c2	Fix some unnoticed/unwanted behavior change from r222319. The ARM assembler allows register alias redefinitions as long as it targets the same register. r222319 broke that. In the AArch64 case it would just produce a new warning, but in the ARM case it would error out on previously accepted assembler. llvm-svn: 228109	2015-02-04 03:10:03 +00:00
Kostya Serebryany	fe43aa8d19	[fuzzer]: fix exit code, add more diagnostics llvm-svn: 228103	2015-02-04 01:22:57 +00:00
Kostya Serebryany	77cc729ad7	[sanitizer] add another workaround for PR 17409: when over a threshold emit coverage instrumentation as calls. llvm-svn: 228102	2015-02-04 01:21:45 +00:00
Kevin Enderby	95df54c819	Add code to llvm-objdump so the -section option with -macho will disassemble sections that have attributes indicating they contain instructions. llvm-svn: 228101	2015-02-04 01:01:38 +00:00
Chandler Carruth	abd09a1f35	[x86] Refresh the checks of a number of tests using update_llc_test_checks.py. The exact format of the checks has changed over time. This includes different indenting rules, new shuffle comments that have been added, and more operand hiding behind regular expressions. No functional change to the tests are expected here, but this will make subsequent patches have a clean diff as they change shuffle lowering. llvm-svn: 228097	2015-02-04 00:58:42 +00:00
Chandler Carruth	abde67eb1c	[x86] Switch to using the long '--check-prefix' form which the update_llc_test_checks.py script uses, and refresh the checks in this test. No functionality changed here, just bringing this test up to work with automated updates using the python script. llvm-svn: 228096	2015-02-04 00:58:40 +00:00
Chandler Carruth	52332dc620	[x86] Port this test to use utils/update_llc_test_checks.py. This will make it easy to update as I change some parts of the X86 backend, makes it more clear what instruction differences are introduced, and I find it makes it a bit easier to read as well. llvm-svn: 228095	2015-02-04 00:58:37 +00:00
Peter Collingbourne	69ba0167b3	Misc documentation/comment fixes. llvm-svn: 228093	2015-02-04 00:42:45 +00:00
Philip Reames	5a9685dba6	Clang format of a file introduced in 228090 (NFC) llvm-svn: 228091	2015-02-04 00:39:57 +00:00
Philip Reames	47cc673e1f	Add a pass for inserting safepoints into (nearly) arbitrary IR This pass is responsible for figuring out where to place call safepoints and safepoint polls. It doesn't actually make the relocations explicit; that's the job of the RewriteStatepointsForGC pass (http://reviews.llvm.org/D6975). Note that this code is not yet finalized. Its moving in tree for incremental development, but further cleanup is needed and will happen over the next few days. It is not yet part of the standard pass order. Planned changes in the near future: - I plan on restructuring the statepoint rewrite to use the functions add to the IRBuilder a while back. - In the current pass, the function "gc.safepoint_poll" is treated specially but is not an intrinsic. I plan to make identifying the poll function a property of the GCStrategy at some point in the near future. - As follow on patches, I will be separating a collection of test cases we have out of tree and submitting them upstream. - It's not explicit in the code, but these two patches are introducing a new state for a statepoint which looks a lot like a patchpoint. There's no a transient form which doesn't yet have the relocations explicitly represented, but does prevent reordering of memory operations. Once this is in, I need to update actually make this explicit by reserving the 'unused' argument of the statepoint as a flag, updating the docs, and making the code explicitly check for such a thing. This wasn't really planned, but once I split the two passes - which was done for other reasons - the intermediate state fell out. Just reminds us once again that we need to merge statepoints and patchpoints at some point in the not that distant future. Future directions planned: - Identifying more cases where a backedge safepoint isn't required to ensure timely execution of a safepoint poll. - Tweaking the insertion process to generate easier to optimize IR. (For example, investigating making SplitBackedge) the default. - Adding opt-in flags for a GCStrategy to use this pass. Once done, add this pass to the actual pass ordering. Differential Revision: http://reviews.llvm.org/D6981 llvm-svn: 228090	2015-02-04 00:37:33 +00:00
Sanjay Patel	b82b8d6b84	improved CHECK llvm-svn: 228086	2015-02-04 00:24:06 +00:00
Galina Kistanova	d9b46a187f	Added missing header for the explicit dependency on MDNode. llvm-svn: 228085	2015-02-04 00:20:52 +00:00
Justin Bogner	0cca70a6e5	InstrProf: Add some unit tests for CoverageMapping The llvm-level tests for coverage mapping need a binary input file, which means they're hard to understand, hard to update, and it's difficult to add new ones. By adding some unit tests that build up the coverage data structures in C++, we can write more meaningful and targeted tests. llvm-svn: 228084	2015-02-04 00:15:12 +00:00
Justin Bogner	70e0c09e6c	InstrProf: Use a stable sort when reading coverage regions Keeping regions that start at the same location in insertion order makes this logic easier to test / more deterministic. llvm-svn: 228083	2015-02-04 00:12:18 +00:00
Colin LeMahieu	585316cb41	[Hexagon] Revert change to isCodeGenOnly = 1 in r228080 llvm-svn: 228082	2015-02-04 00:09:23 +00:00
Colin LeMahieu	510ba0c661	[Hexagon] Changing some isCodeGenOnly to isAsmParserOnly since we want them to asm parse but not cause decode conflicts. llvm-svn: 228080	2015-02-04 00:07:26 +00:00
Owen Anderson	21b1788ad0	Remove a gross usage of environment variables in MachineVerifier, replacing it with support for setting the -verify-machineinstrs flag via an environment variable in LIT. This preserves the handy functionality of force-enabling the MachineVerifier, without the need to embed usage of environment variables in LLVM client applications. llvm-svn: 228079	2015-02-04 00:02:59 +00:00
Justin Bogner	26b3142d34	InstrProf: Make CounterMappingRegions less confusing to construct Creating empty and expansion regions is awkward with the current API. Expose static methods to make this simpler. llvm-svn: 228075	2015-02-03 23:59:33 +00:00
Arnaud A. de Grandmaison	10797c5707	[PBQP] Provide more information in the debug prints Based on a patch by Jonas Paulsson llvm-svn: 228068	2015-02-03 23:40:24 +00:00
Philip Reames	0285c74261	Use ImmutableCallSite for statepoint verification. Patch by: Igor Laevsky "This change generalizes statepoint verification to use ImmutableCallSite instead of CallInst. This will allow to easily implement invoke statepoint verification (in a following change)." Differential Revision: http://reviews.llvm.org/D7308 llvm-svn: 228064	2015-02-03 23:18:47 +00:00
Adam Nemet	5add5d9d85	[LV] Split off memcheck block really at the first check I've noticed this while trying to move addRuntimeCheck to LoopAccessAnalysis. I think that the intention was to early exit from the overflow checking before the code for the memchecks. This is the entire reason why we compute FirstCheckInst but then we don't use that as the splitting instruction but the final check. Looks like an oversight. llvm-svn: 228056	2015-02-03 22:45:39 +00:00
Chandler Carruth	68fcb38328	[x86] Fix signed vs. unsigned comparison. llvm-svn: 228055	2015-02-03 22:43:30 +00:00
Simon Pilgrim	9eecb14bd9	Fixed unused variable warning. llvm-svn: 228054	2015-02-03 22:39:28 +00:00
Colin LeMahieu	e4101e2c9e	[Hexagon] Marking a bunch of non-encoded instructions with isCodeGenOnly = 1. llvm-svn: 228050	2015-02-03 22:09:51 +00:00
Hans Wennborg	6d12f69363	[CMake] add_llvm_library: don't use .imp suffix for import libraries on Windows (PR22334) This was added in r188351 to fix a naming conflict between the profile_rt-static and profile_rt-shared who both ended up in lib/profile_rt.lib. The change also affected other libraries (like libclang), and users are reporting that they find it surprising that there's no longer a libclang.lib. Since the profile_rt naming conflict doesn't seem to exist any more, I think we can remove this. Differential Revision: http://reviews.llvm.org/D7391 llvm-svn: 228049	2015-02-03 22:08:20 +00:00
Arnaud A. de Grandmaison	1f4448ad51	[PBQP] Constify Graph::getEdgeNode1Id and Graph::getEdgeNode2Id llvm-svn: 228048	2015-02-03 22:02:45 +00:00
Simon Pilgrim	46cd4f7400	[X86][SSE] psrl(w/d/q) and psll(w/d/q) bit shifts for SSE2 Patch to match cases where shuffle masks can be reduced to bit shifts. Similar to byte shift shuffle matching from D5699. Differential Revision: http://reviews.llvm.org/D6649 llvm-svn: 228047	2015-02-03 21:58:29 +00:00
Bill Schmidt	fe88b18990	[PowerPC] Implement the vpopcnt instructions for POWER8 Patch by Kit Barton. Add the vector population count instructions for byte, halfword, word, and doubleword sizes. There are two major changes here: PPCISelLowering.cpp: Make CTPOP legal for vector types. PPCRegisterInfo.td: Added v2i64 to the VRRC register definition. This is needed for the doubleword variations of the integer ops that were added in P8. Test Plan Test the instruction vpcnt* encoding/decoding in ppc64-encoding-vmx.s Test the generation of the vpopcnt instructions for various vector data types. When adding the v2i64 type to the Vector Register set, I also needed to add the appropriate bit conversion patterns between v2i64 and the existing vector types. Testing for these conversions were also added in the test case by passing a different vector type as a parameter into the test functions. There is also a run step that will ensure the vpopcnt instructions are generated when the vsx feature is disabled. llvm-svn: 228046	2015-02-03 21:58:23 +00:00
Kostya Serebryany	cf9fdd5876	[fuzzer] Add proper dependensices to the fuzzer tests Summary: Make sure that FileCheck is built when running check-fuzzer Test Plan: run on bot: lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fuzzer Reviewers: samsonov Reviewed By: samsonov Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7387 llvm-svn: 228045	2015-02-03 21:57:32 +00:00
Chandler Carruth	1fff318a41	[x86] Add two truly horrific test cases for the new vector shuffle lowering. I'm prepping patches to improve these, and this will let the delta of those patches show the improvement. =] llvm-svn: 228044	2015-02-03 21:56:28 +00:00
Chandler Carruth	4ce669d91c	[x86] Update the indent and layout of some tests in this file. NFC This is just to remove voise from using the update_llc_test_checks script. llvm-svn: 228043	2015-02-03 21:56:24 +00:00
Duncan P. N. Exon Smith	974860774e	AsmParser: Recognize DW_TAG_* constants Recognize `DW_TAG_` constants in assembly, and output it by default for `GenericDebugNode`. llvm-svn: 228042	2015-02-03 21:56:01 +00:00
Duncan P. N. Exon Smith	4e4aa70535	IR: Assembly and bitcode for GenericDebugNode llvm-svn: 228041	2015-02-03 21:54:14 +00:00
Marek Olsak	37cd4d0f42	R600/SI: Remove the -CHECK suffix from all FileCheck prefixes in LIT tests llvm-svn: 228040	2015-02-03 21:53:27 +00:00
Marek Olsak	24ae2cda7c	R600/SI: Remove useless patterns in VALU which are already covered by SALU Also remove hasPostISelHook=1 from V_LSHL_B32. It's defined by InstSI already. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 228039	2015-02-03 21:53:08 +00:00
Marek Olsak	3ecf508734	R600/SI: Rewrite VOP1InstSI to contain a pseudo and _si opcode What this does is that if you accidentally select these instructions on VI, the code generation will fail, because the pseudo -> _vi mapping will be undefined. The idea is to be able to catch possible future bugs easily. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 228038	2015-02-03 21:53:05 +00:00
Marek Olsak	707a6d0c20	R600/SI: Fix B64 VALU shifts on VI SI only has standard versions. VI only has REV versions. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 228037	2015-02-03 21:53:01 +00:00
Justin Bogner	de15817ea2	InstrProf: Remove CoverageMapping::HasCodeBefore, it isn't used It's not entirely clear to me what this field was meant for, but it's always false. Remove it. llvm-svn: 228034	2015-02-03 21:35:36 +00:00
Chandler Carruth	a4a77ed59e	[x86] Tweak my update script to use test case function names starting with 'stress' to indicate that the specific output isn't interesting and relax them to only check the last instruction (a ret). I've updated the one test case that really uses this to name the one 'stress_test' which was actually producing output we can directly check. With this, the script doesn't introduce noise when run over the v16 test file. llvm-svn: 228033	2015-02-03 21:26:45 +00:00
Duncan P. N. Exon Smith	6f5546cdee	Support: Add string => unsigned mapping for DW_TAG Add `dwarf::getTag()` to translate from `StringRef` to `unsigned`. llvm-svn: 228031	2015-02-03 21:16:49 +00:00
Duncan P. N. Exon Smith	981811efc8	Support: Re-implement dwarf::TagString() using a .def file, NFC Also re-implements the `dwarf::Tag` enumerator. I've moved the mock tags into the enumerator since there's no other way to do this. Really they shouldn't be used at all (they're just a hack to identify `MDNode`s, but we have a class hierarchy for that now). llvm-svn: 228030	2015-02-03 21:13:16 +00:00
Duncan P. N. Exon Smith	b036f1c98c	Support: Stop stringifying DW_TAG_{lo,hi}_user `dwarf::TagString()` shouldn't stringify `DW_TAG_lo_user` or `DW_TAG_hi_user`. These aren't actual tags; they're markers for the edge of vendor-specific tag regions. llvm-svn: 228029	2015-02-03 21:08:33 +00:00
Simon Pilgrim	c4e5f1e192	Fixed signed/unsigned comparison warning. llvm-svn: 228027	2015-02-03 20:54:01 +00:00
Colin LeMahieu	cd9cb023d7	[Hexagon] Converting XTYPE/SHIFT intrinsics. Cleaning out old intrinsic patterns and updating tests. llvm-svn: 228026	2015-02-03 20:40:52 +00:00
Simon Pilgrim	03c379a0fa	Fixed unused variable warning. llvm-svn: 228025	2015-02-03 20:38:52 +00:00
Daniel Berlin	487aed0d77	Allow PRE to insert no-cost phi nodes llvm-svn: 228024	2015-02-03 20:37:08 +00:00
Simon Pilgrim	d9885856e6	[X86][SSE] Added general integer shuffle matching for MOVQ instruction This patch adds general shuffle pattern matching for the MOVQ zero-extend instruction (copy lower 64bits, zero upper) for all 128-bit integer vectors, it is added as a fallback test in lowerVectorShuffleAsZeroOrAnyExtend. llvm-svn: 228022	2015-02-03 20:09:18 +00:00
Colin LeMahieu	cf7248bcaf	[Hexagon] Updating XTYPE/PRED intrinsics. llvm-svn: 228019	2015-02-03 19:43:59 +00:00
Kostya Serebryany	4b96ce96c6	[fuzzer] update the include line to use the new header name llvm-svn: 228018	2015-02-03 19:42:05 +00:00
Jingyue Wu	d7966ff3b9	Add straight-line strength reduction to LLVM Summary: Straight-line strength reduction (SLSR) is implemented in GCC but not yet in LLVM. It has proven to effectively simplify statements derived from an unrolled loop, and can potentially benefit many other cases too. For example, LLVM unrolls #pragma unroll foo (int i = 0; i < 3; ++i) { sum += foo((b + i) * s); } into sum += foo(b * s); sum += foo((b + 1) * s); sum += foo((b + 2) * s); However, no optimizations yet reduce the internal redundancy of the three expressions: b * s (b + 1) * s (b + 2) * s With SLSR, LLVM can optimize these three expressions into: t1 = b * s t2 = t1 + s t3 = t2 + s This commit is only an initial step towards implementing a series of such optimizations. I will implement more (see TODO in the file commentary) in the near future. This optimization is enabled for the NVPTX backend for now. However, I am more than happy to push it to the standard optimization pipeline after more thorough performance tests. Test Plan: test/StraightLineStrengthReduce/slsr.ll Reviewers: eliben, HaoLiu, meheff, hfinkel, jholewinski, atrick Reviewed By: jholewinski, atrick Subscribers: karthikthecool, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7310 llvm-svn: 228016	2015-02-03 19:37:06 +00:00
Colin LeMahieu	e5daf3abfe	[Hexagon] Updating XTYPE/PERM intrinsics. llvm-svn: 228015	2015-02-03 19:36:59 +00:00
Simon Pilgrim	6544f815b3	[X86][AVX2] Enabled shuffle matching for the AVX2 zero extension (128bit -> 256bit) vpmovzx* instructions. Differential Revision: http://reviews.llvm.org/D7251 llvm-svn: 228014	2015-02-03 19:34:09 +00:00
Rafael Espindola	a5ef4905a5	Fix duplicated symbol error. llvm-svn: 228012	2015-02-03 19:25:53 +00:00
Rafael Espindola	8d911b4419	Fix typo in test/CodeGen/X86/sibcall.ll (pr22331). llvm-svn: 228011	2015-02-03 19:20:26 +00:00
Colin LeMahieu	99cc7c1070	[Hexagon] Adding missing vector multiply instruction encodings. Converting multiply intrinsics and updating tests. llvm-svn: 228010	2015-02-03 19:15:11 +00:00
Sanjay Patel	b7d5628784	Merge consecutive 16-byte loads into one 32-byte load (PR22329) This patch detects consecutive vector loads using the existing EltsFromConsecutiveLoads() logic. This fixes: http://llvm.org/bugs/show_bug.cgi?id=22329 This patch effectively reverts the tablegen additions of D6492 / http://reviews.llvm.org/rL224344 ...which in hindsight were a horrible hack. The test cases that were added with that patch are simply modified to load from varying offsets of a base pointer. These loads did not match the existing tablegen patterns. A happy side effect of doing this optimization earlier is that we can now fold the load into a math op where possible; this is shown in some of the updated checks in the test file. Differential Revision: http://reviews.llvm.org/D7303 llvm-svn: 228006	2015-02-03 18:54:00 +00:00
Sanjay Patel	e63abfe70e	remove variable names from comments; NFC I didn't bother to fix the self-referential definitions and grammar because my eyes started to bleed. llvm-svn: 228004	2015-02-03 18:47:32 +00:00
Manman Ren	8121e1db91	[LTO API] split lto_codegen_compile to lto_codegen_optimize and lto_codegen_compile_optimized. Also add lto_api_version. Before this commit, we can only dump the optimized bitcode after running lto_codegen_compile, but it includes some impacts of running codegen passes, one example is StackProtector pass. We will get assertion failure when running llc on the optimized bitcode, because StackProtector is effectively run twice. After splitting lto_codegen_compile, the linker can choose to dump the bitcode before running lto_codegen_compile_optimized. lto_api_version is added so ld64 can check for runtime-availability of the new API. rdar://19565500 llvm-svn: 228000	2015-02-03 18:39:15 +00:00
Hans Wennborg	9148e1721c	Fix ProgramFiles path for 64-bit Windows installer If we are building an 64bit installer on Windows we have to adjust the Program Files path otherwise it uses the wrong Program Files (x86) directory. Related CMake bug report http://public.kitware.com/Bug/view.php?id=14211 Patch by Ismail Dönmez! llvm-svn: 227999	2015-02-03 18:31:29 +00:00
Colin LeMahieu	a6632452be	[Hexagon] Converting complex number intrinsics and adding tests. llvm-svn: 227995	2015-02-03 18:16:28 +00:00
Colin LeMahieu	cdba4e1bcc	[Hexagon] Adding vector intrinsics for alu32/alu and xtype/alu. llvm-svn: 227993	2015-02-03 18:01:45 +00:00
Adam Nemet	b60295a525	[LoopVectorize] Fix rebase glitch in r227751 LoopVectorizationLegality::{getNumLoads,getNumStores} should forward to LoopAccessAnalysis now. Thanks to Takumi for noticing this! llvm-svn: 227992	2015-02-03 17:59:53 +00:00
Jingyue Wu	5bbcdaa8d9	Remove usernames from TODOs, NFC making the style consistent with the rest llvm-svn: 227991	2015-02-03 17:57:38 +00:00
Marek Olsak	191507e0b7	R600/SI: Don't generate non-existent LSHL, LSHR, ASHR B32 variants on VI This can happen when a REV instruction is commuted. The trick is not to define the _vi versions of instructions, which has these consequences: - code generation will always fail if a pseudo cannot be lowered (very useful to catch bugs where an unsupported instruction somehow makes it to the printer) - ability to query if a pseudo can be lowered, which is done in commuteOpcode to prevent REV from commuting to non-REV on VI Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 227990	2015-02-03 17:38:12 +00:00
Marek Olsak	7585a29bd4	R600/SI: Remove VOP2_REV definitions from target-specific instructions The getCommute* functions are only used with pseudos, so this commit doesn't change anything. The issue with missing non-rev versions of shift instructions on VI will fixed separately. Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 227989	2015-02-03 17:38:05 +00:00
Marek Olsak	11057ee022	R600/SI: Trivial instruction definition corrections for VI (v2) - V_MAC_LEGACY_F32 exists on VI, but it's VOP3-only. - Define CVT_PK opcodes which are different between SI and VI. These are unused. The idea is to define all chip differences. v2: keep V_MUL_LO_U32 Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 227988	2015-02-03 17:38:01 +00:00
Marek Olsak	3db6ba8cfa	R600/SI: Determine target-specific encoding of READLANE and WRITELANE early v2 These are VOP2 on SI and VOP3 on VI, and their pseudos are neither, which can be a problem. In order to make isVOP2 and isVOP3 queries behave as expected, the encoding must be determined first. This doesn't fix any known issue, but better safe than sorry. v2: add and use getMCOpcodeFromPseudo Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 227987	2015-02-03 17:37:57 +00:00
Marek Olsak	1bd2463548	R600/SI: Fix dependency between instruction writing M0 and S_SENDMSG on VI (v2) This fixes a hang when using an empty geometry shader. v2: - don't add s_nop when followed by s_waitcnt - comestic changes Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 227986	2015-02-03 17:37:52 +00:00
Sanjay Patel	ffd039bde1	Fix program crashes due to alignment exceptions generated for SSE memop instructions (PR22371). r224330 introduced a bug by misinterpreting the "FeatureVectorUAMem" bit. The commit log says that change did not affect anything, but that's not correct. That change allowed SSE instructions to have unaligned mem operands folded into math ops, and that's not allowed in the default specification for any SSE variant. The bug is exposed when compiling for an AVX-capable CPU that had this feature flag but without enabling AVX codegen. Another mistake in r224330 was not adding the feature flag to all AVX CPUs; the AMD chips were excluded. This is part of the fix for PR22371 ( http://llvm.org/bugs/show_bug.cgi?id=22371 ). This feature bit is SSE-specific, so I've renamed it to "FeatureSSEUnalignedMem". Changed the existing test case for the feature bit to reflect the new name and renamed the test file itself to better reflect the feature. Added runs to fold-vex.ll to check for the failing codegen. Note that the feature bit is not set by default on any CPU because it may require a configuration register setting to enable the enhanced unaligned behavior. llvm-svn: 227983	2015-02-03 17:13:04 +00:00
Bill Schmidt	e2062dbb29	Disable 32-bit tests in tls-pic.ll until they can be repaired llvm-svn: 227981	2015-02-03 16:57:38 +00:00
Bill Schmidt	a5908c74e6	Further revise too-restrictive test CodeGen/PowerPC/tls-pic.ll llvm-svn: 227980	2015-02-03 16:33:55 +00:00
Bill Schmidt	c2208acfbe	Further revise too-restrictive test CodeGen/PowerPC/tls-pic.ll llvm-svn: 227978	2015-02-03 16:29:52 +00:00
Bill Schmidt	50f486b447	Revise too-restrictive test CodeGen/PowerPC/tls-pic.ll llvm-svn: 227977	2015-02-03 16:24:05 +00:00
Bill Schmidt	685aa8b0c5	[PowerPC] Yet another approach to __tls_get_addr This patch is a third attempt to properly handle the local-dynamic and global-dynamic TLS models. In my original implementation, calls to __tls_get_addr were hidden from view until the asm-printer phase, at which point the underlying branch-and-link instruction was created with proper relocations. This mostly worked well, but I used some repellent techniques to ensure that the TLS_GET_ADDR nodes at the SD and MI levels correctly received input from GPR3 and produced output into GPR3. This proved to work badly in the presence of multiple TLS variable accesses, with the copies to and from GPR3 being scheduled incorrectly and generally creating havoc. In r221703, I addressed that problem by representing the calls to __tls_get_addr as true calls during instruction lowering. This had the advantage of removing all of the bad hacks and relying on the existing call machinery to properly glue the copies in place. It looked like this was going to be the right way to go. However, as a side effect of the recent discovery of problems with linker optimizations for TLS, we discovered cases of suboptimal code generation with this strategy. The problem comes when tls_get_addr is called for the same address, and there is a resulting CSE opportunity. It turns out that in such cases MachineCSE will common the addis/addi instructions that set up the input value to tls_get_addr, but will not common the calls themselves. MachineCSE does not have any machinery to common idempotent calls. This is perfectly sensible, since presumably this would be done at the IR level, and introducing calls in the back end isn't commonplace. In any case, we end up with two calls to __tls_get_addr when one would suffice, and that isn't good. I presumed that the original design would have allowed commoning of the machine-specific nodes that hid the __tls_get_addr calls, so as suggested by Ulrich Weigand, I went back to that design and cleaned it up so that the copies were properly held together by glue nodes. However, it turned out that this didn't work either...the presence of copies to physical registers kept the machine-specific nodes from being commoned also. All of which leads to the design presented here. This is a return to the original design, except that no attempt is made to introduce copies to and from GPR3 during instruction lowering. Virtual registers are used until prior to register allocation. At that point, a special pass is run that identifies the machine-specific nodes that hide the tls_get_addr calls and introduces the copies to and from GPR3 around them. The register allocator then coalesces these copies away. With this design, MachineCSE succeeds in commoning tls_get_addr calls where possible, and we get nice optimal code generation (better than GCC at the moment, which does not common these calls). One additional problem must be dealt with: After introducing the mentions of the physical register GPR3, the aggressive anti-dependence breaker sees opportunities to improve scheduling by selecting a different register instead. Flags must be used on the instruction descriptions to tell the anti-dependence breaker to keep its hands in its pockets. One thing missing from the original design was recording a definition of the link register on the GET_TLS_ADDR nodes. Doing this was found to be insufficient to force a stack frame to be created, which led to looping behavior because two different LR values were stored at the same address. This appears to have been an oversight in PPCFrameLowering::determineFrameLayout(), which is repaired here. Because MustSaveLR() returns true for calls to builtin_return_address, this changed the expected behavior of test/CodeGen/PowerPC/retaddr2.ll, which now stacks a frame but formerly did not. I've fixed the test case to reflect this. There are existing TLS tests to catch regressions; the checks in test/CodeGen/PowerPC/tls-store2.ll proved to be too restrictive in the face of instruction scheduling with these changes, so I fixed that up. I've added a new test case based on the PrettyStackTrace module that demonstrated the original problem. This checks that we get correct code generation and that CSE of the calls to __get_tls_addr has taken place. llvm-svn: 227976	2015-02-03 16:16:01 +00:00
Sanjay Patel	a4276fb294	Improve test to actually check for a folded load. This test was checking for lack of a "movaps" (an aligned load) rather than a "movups" (an unaligned load). It also included a store which complicated the checking. Add specific CPU runs to prevent subtarget feature flag overrides from inhibiting this optimization. llvm-svn: 227972	2015-02-03 15:37:18 +00:00
Bruno Cardoso Lopes	077774b820	[X86][MMX] Improve transfer from mmx to i32 Improve EXTRACT_VECTOR_ELT DAG combine to catch conversion patterns between x86mmx and i32 with more layers of indirection. Before: movq2dq %mm0, %xmm0 movd %xmm0, %eax After: movd %mm0, %eax llvm-svn: 227969	2015-02-03 14:46:49 +00:00
Renato Golin	af213728cc	Adding AArch64 support to ASan instrumentation For the time being, it is still hardcoded to support only the 39 VA bits variant, I plan to work on supporting 42 and 48 VA bits variants, but I don't have access to such hardware at the moment. Patch by Chrystophe Lyon. llvm-svn: 227965	2015-02-03 11:20:45 +00:00
Craig Topper	6b4499a393	[X86] Make fxsave64/fxrstor64/xsave64/xsrstor64/xsaveopt64 parseable in AT&T syntax. Also make them the default output. llvm-svn: 227963	2015-02-03 11:03:57 +00:00
Craig Topper	ce25047b83	[X86] Add Requires[In64BitMode] around MOVSX64rr32/MOVSX64rm32. This makes it more strictly mutexed with the ARPL instruction 32-bit mode. Helps with some disassembler changes I'm experimenting with. Should be NFC. llvm-svn: 227962	2015-02-03 11:03:43 +00:00
Eric Christopher	36fe028a2a	Only access TLOF via the TargetMachine, not TargetLowering. llvm-svn: 227949	2015-02-03 07:22:52 +00:00
Eric Christopher	8f276db622	Define a runOnMachineFunction for the Hexagon AsmPrinter and use it to initialize the subtarget. llvm-svn: 227948	2015-02-03 06:40:22 +00:00
Eric Christopher	bb1ae666fe	Migrate away from using a Subtarget except for the one place we want to use it. Use the triple to determine OS format bits at the module level. llvm-svn: 227947	2015-02-03 06:40:19 +00:00
Lang Hames	d48bf3f912	[PBQP Regalloc] Pre-spill vregs that have no legal physregs. The PBQP::RegAlloc::MatrixMetadata class assumes that matrices have at least two rows/columns (for the spill option plus at least one physreg). This patch ensures that that invariant is met by pre-spilling vregs that have no physreg options so that no node (and no corresponding edges) need be added to the PBQP graph. This fixes a bug in an out-of-tree target that was identified by Jonas Paulsson. Thanks for tracking this down Jonas! llvm-svn: 227942	2015-02-03 06:14:06 +00:00
NAKAMURA Takumi	c7f8bfc5e5	Resurrect initializers for NumLoads and NumStores in LoopVectorizationLegality to suppress undefined behavior. FIXME: Shall they be managed in LAA? llvm-svn: 227940	2015-02-03 03:55:06 +00:00
Andrew Kaylor	d4b80b8e68	Really, really, really don't build llvm-pdbdump on MSVC < 2013. There was a typo in the last attempt. llvm-svn: 227937	2015-02-03 03:08:25 +00:00
Rafael Espindola	dcfd6ed183	Propagate a better error message to the C api. llvm-svn: 227934	2015-02-03 01:53:03 +00:00
Rafael Espindola	3ee23a9ec8	Use a non-fatal diag handler in the C API. FIxes PR22368. llvm-svn: 227903	2015-02-03 00:49:57 +00:00
Justin Bogner	195a4f08ea	InstrProf: Simplify RawCoverageMappingReader's API slightly This is still kind of a weird API, but dropping the (partial) update of the passed in CoverageMappingRecord makes it a little easier to understand and use. llvm-svn: 227900	2015-02-03 00:20:11 +00:00
Justin Bogner	346359daac	InstrProf: Simplify some logic by using ArrayRef::slice (NFC) llvm-svn: 227898	2015-02-03 00:00:00 +00:00
Alex Rosenberg	bacd479a5d	Revert part of r227437 as it was unnecessary. Thanks to echristo for pointing this out. llvm-svn: 227897	2015-02-02 23:58:54 +00:00
Eric Christopher	a1c535b5e8	Migrate to using the subtarget on the machine function and update all uses. llvm-svn: 227891	2015-02-02 23:03:45 +00:00
Eric Christopher	6b6db77824	Use the function template getSubtarget off of the machine function, and use it in all locations. llvm-svn: 227890	2015-02-02 23:03:43 +00:00
Eric Christopher	d5c235dab8	Use the cached subtarget on the MachineFunction. llvm-svn: 227885	2015-02-02 22:40:56 +00:00
Eric Christopher	6905059e80	Remove dead header. llvm-svn: 227884	2015-02-02 22:40:54 +00:00
Eric Christopher	57931fca07	Remove dead code in the HexagonMCInst classes. This also fixes a layering violation in the port and removes calls to getSubtargetImpl. llvm-svn: 227883	2015-02-02 22:40:53 +00:00
Eric Christopher	d21486dfe0	80-col fixup. llvm-svn: 227882	2015-02-02 22:40:51 +00:00
Justin Bogner	94695c4b87	InstrProf: Remove an unused header (NFC) llvm-svn: 227881	2015-02-02 22:38:39 +00:00
Eric Christopher	2b7707c07e	Remove dead code in the HexagonMCInst classes. This also fixes a layering violation in the port and removes calls to getSubtargetImpl. llvm-svn: 227880	2015-02-02 22:28:48 +00:00
Eric Christopher	97a2a39695	80-col fixup. llvm-svn: 227879	2015-02-02 22:28:46 +00:00
Eric Christopher	6098f150a1	Remove unused class variables and update all callers/uses from the HexagonSplitTFRCondSet pass. Use the subtarget off the machine function at the same time. llvm-svn: 227878	2015-02-02 22:28:44 +00:00
Eric Christopher	01f875e859	Migrate the HexagonSplitConst32AndConst64 pass from TargetMachine based getSubtarget to the one cached on the MachineFunction. Remove unused class variables and update all callers/uses. llvm-svn: 227874	2015-02-02 22:11:43 +00:00
Eric Christopher	0fef34e3fc	Remove #if'd code and update comment. llvm-svn: 227873	2015-02-02 22:11:42 +00:00
Eric Christopher	f8b8e4a3fb	Move HexagonMachineScheduler to use the subtarget off of the MachineFunction and update all uses accordingly including VLIWResourceModel. llvm-svn: 227872	2015-02-02 22:11:40 +00:00
Eric Christopher	d737b76b63	Cache and use the subtarget that owns the target lowering. llvm-svn: 227871	2015-02-02 22:11:36 +00:00
Bruno Cardoso Lopes	f7410e5292	[X86][MMX] Add tests for MMX extract element LLVM ToT produces poor MMX code compared to 3.5. However, part of the previous functionality can be achieved by using -x86-experimental-vector-widening-legalization. Add tests to be sure we don't regress again. llvm-svn: 227869	2015-02-02 22:00:48 +00:00
Bruno Cardoso Lopes	e4716e65b3	[X86][MMX] Cleanup shuffle, bitcast and insert element tests - Merge MMX arg passing test files - Merge MMX bitcast, insert elt and shuffle tests llvm-svn: 227867	2015-02-02 21:56:11 +00:00
Alexei Starovoitov	3e7f0e84d8	bpf: Use the getSubtarget call off of the MachineFunction rather than the TargetMachine Summary: Hi Eric, this patch cleans up the layering violation that you're fixing across backends. Anything else I need to fix on bpf backend side? Thanks Reviewers: echristo Reviewed By: echristo Differential Revision: http://reviews.llvm.org/D7355 llvm-svn: 227865	2015-02-02 21:24:27 +00:00
Jingyue Wu	49a766e468	Resurrect the assertion removed by r227717 Summary: MSVC can compile "LoopID->getOperand(0) == LoopID" when LoopID is MDNode*. Test Plan: no regression Reviewers: mkuper Subscribers: jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D7327 llvm-svn: 227853	2015-02-02 20:41:11 +00:00
Duncan P. N. Exon Smith	c7e0813dbd	Fix the -Werror build, NFC llvm-svn: 227849	2015-02-02 20:20:56 +00:00
Duncan P. N. Exon Smith	9146fc8fd6	IR: Allow GenericDebugNode construction from MDString Allow `GenericDebugNode` construction directly from `MDString`, rather than requiring `StringRef`s. I've refactored the `StringRef` constructors to use these. There's no real functionality change here, except for exposing the lower-level API. The purpose of this is to simplify construction of string operands when reading bitcode. It's unnecessarily indirect to parse an `MDString` ID, lookup the `MDString` in the bitcode reader list, get the `StringRef` out of that, and then have `GenericDebugNode::getImpl()` use `MDString::get()` to acquire the original `MDString`. Instead, this allows the bitcode reader to directly pass in the `MDString`. llvm-svn: 227848	2015-02-02 20:01:03 +00:00
Duncan P. N. Exon Smith	61e62a5b04	IR: Extract DEFINE_MDNODE_GET(), NFC llvm-svn: 227847	2015-02-02 19:55:21 +00:00
Duncan P. N. Exon Smith	442ec0223b	IR: Separate helpers for string operands, NFC llvm-svn: 227846	2015-02-02 19:54:05 +00:00
Lang Hames	35a514d200	[Orc] Make OrcMCJITReplacement::addObject calls transfer buffer ownership to the ObjectLinkingLayer. There are a two of overloads for addObject, one of which transfers ownership of the underlying buffer to OrcMCJITReplacement. This commit makes the ownership transfering version pass ownership down to the ObjectLinkingLayer in order to prevent the issue described in r227778. I think this commit will fix the sanitizer bot failures that necessitated the removal of the load-object-a.ll regression test in r227785, so I'm reinstating that test. llvm-svn: 227845	2015-02-02 19:51:18 +00:00
Rafael Espindola	6ffb1d7e3c	Move simple case earlier and use a continue. llvm-svn: 227841	2015-02-02 19:22:51 +00:00
Eric Christopher	202f22bbda	Migrate HexagonISelDAGToDAG to setting a subtarget pointer during runOnMachineFunction. Update all uses of the Subtarget accordingly. llvm-svn: 227840	2015-02-02 19:22:03 +00:00
Eric Christopher	90295c9c63	Use the getSubtarget call off of the MachineFunction rather than the TargetMachine. llvm-svn: 227839	2015-02-02 19:22:01 +00:00
Eric Christopher	2c44f43ebe	Remove unused class variables and update calls to get the subtarget off of the machine function. llvm-svn: 227837	2015-02-02 19:05:28 +00:00
Eric Christopher	d55c7c6670	Sink queries into asserts since the variable is unused otherwise. llvm-svn: 227836	2015-02-02 18:58:24 +00:00

1 2 3 4 5 ...

112854 Commits