llvm-project

Commit Graph

Author	SHA1	Message	Date
Nicola Zaghen	9438b15946	[ReleaseNotes] Add release note for the new LLVM_DEBUG macro. This is to provide a way to migrate from the old DEBUG macro to the new one. Differential Revision: https://reviews.llvm.org/D47528 llvm-svn: 333898	2018-06-04 13:55:09 +00:00
Simon Dardis	fb4dde1142	[mips] Restore the availablity of trap for microMIPS Reviewers: smaksimovic, atanasyan, abeserminji Differential Revision: https://reviews.llvm.org/D47584 llvm-svn: 333895	2018-06-04 12:50:32 +00:00
Greg Bedwell	96f51f09d4	[llvm-mca][UpdateTestChecks] Prevent an IndexError being raised when given empty input llvm-svn: 333894	2018-06-04 12:30:10 +00:00
Greg Bedwell	bbe64af0a0	[llvm-mca] Regenerate a test to remove a double newline Command used: py update_mca_test_checks.py ..\test\tools\llvm-mca\\.s ..\test\tools\llvm-mca\\\*.s llvm-svn: 333893	2018-06-04 12:30:03 +00:00
Andrea Di Biagio	2008c7c8fd	[llvm-mca] Track cycles contributed by resources that are in a 'Super' relationship. This is required if we want to correctly match the behavior of method SubtargetEmitter::ExpandProcResource() in Tablegen. When computing the set of "consumed" processor resources and resource cycles, the logic in ExpandProcResource() doesn't update the number of resource cycles contributed by a "Super" resource to a group. We need to take this into account when a model declares a processor resource which is part of a 'processor resource group', and it is also used as the "Super" of other resources. llvm-svn: 333892	2018-06-04 12:23:07 +00:00
Roman Lebedev	7b53d1454f	[llvm-mca] Make sure not to end the test files with an empty line. Summary: It's super irritating. [properly configured] git client then complains about that double-newline, and you have to use `--force` to ignore the warning, since even if you fix it manually, it will be reintroduced the very next runtime :/ Reviewers: RKSimon, andreadb, courbet, craig.topper, javed.absar, gbedwell Reviewed By: gbedwell Subscribers: javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47697 llvm-svn: 333887	2018-06-04 11:48:46 +00:00
Clement Courbet	2cb97b95a2	[llvm-exegesis][NFC] Use an enum instead of a string for benchmark mode. Summary: YAML encoding is backwards-compatible. Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D47705 llvm-svn: 333886	2018-06-04 11:43:40 +00:00
Clement Courbet	7228721b30	[llvm-exegesis] Analysis: Show inconsistencies between checked-in and measured data. Summary: We now highlight any sched classes whose measurements do not match the LLVM SchedModel. "bad" clusters are marked in red. Screenshot in phabricator diff. Reviewers: gchatelet Subscribers: tschuett, mgrang, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D47639 llvm-svn: 333884	2018-06-04 11:11:55 +00:00
Luke Geeson	43e4367961	[AArch64] Audit on rL333634 to fix FP16 Disasm BitPatterns llvm-svn: 333879	2018-06-04 09:41:32 +00:00
Sander de Smalen	d0a6f6a502	[AArch64][SVE] Fix range for DUP immediates (16bit elts) For immediates used in DUP instructions that have the range -128 to 127, or a multiple of 256 in the range -32768 to 32512, one could argue that when the result element size is 16bits (.h), the value can be considered both signed and unsigned. Reviewers: rengolin, fhahn, SjoerdMeijer, samparker, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47619 llvm-svn: 333873	2018-06-04 07:24:23 +00:00
Sander de Smalen	fd54a781f6	[AArch64][SVE] Asm: Print indexed element 0 as FPR. Print the first indexed element as a FP register, for example: mov z0.d, z1.d[0] Is now printed as: mov z0.d, d1 Next to printing, this patch also adds aliases to parse 'mov z0.d, d1'. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47571 llvm-svn: 333872	2018-06-04 07:07:35 +00:00
Sander de Smalen	c33d668ab7	[AArch64][SVE] Asm: Support for indexed DUP instructions. Unpredicated copy of indexed SVE element to SVE vector, along with MOV-aliases. For example: dup z0.h, z1.h[0] duplicates the first 16-bit element from z1 to all elements in the result vector z0. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D47570 llvm-svn: 333871	2018-06-04 06:40:55 +00:00
Sander de Smalen	367a53b059	[AArch64][SVE] Asm: Support for FCPY immediate instructions. Predicated copy of floating-point immediate value to SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: javed.absar Differential Revision: https://reviews.llvm.org/D47518 llvm-svn: 333869	2018-06-04 05:58:06 +00:00
Sander de Smalen	512d57f1a5	[AArch64][SVE] Asm: Support for CPY immediate instructions Predicated copy of possibly shifted immediate value into SVE vector, along with MOV-aliases. Reviewers: rengolin, fhahn, samparker, SjoerdMeijer, javed.absar Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D47517 llvm-svn: 333868	2018-06-04 05:40:46 +00:00
Vedant Kumar	7dda22115e	[Debugify] Add debug intrinsics before terminating musttail calls After r333856, opt -debugify would just stop emitting debug value intrinsics after encountering a musttail call. This wasn't sufficient to avoid verifier failures. Debug value intrinicss for all instructions preceding a musttail call must also be emitted before the musttail call. llvm-svn: 333866	2018-06-04 03:33:01 +00:00
Serguei Katkov	d894fb4288	[InstCombine] Fix div handling When we optimize select basing on fact that div by 0 is undef we should not traverse the instruction which are not guaranteed to transfer execution to next instruction. Guard intrinsic is an example. Reviewers: spatel, craig.topper Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47576 llvm-svn: 333864	2018-06-04 02:52:36 +00:00
Vedant Kumar	adbd27a599	[Debugify] Don't apply DI before the bitcode writer pass Applying synthetic debug info before the bitcode writer pass has no testing-related purpose. This commit prevents that from happening. It also adds tests which check that IR produced with/without -debugify-each enabled is identical after stripping. This makes it possible to check that individual passes (or full pipelines) are invariant to debug info. llvm-svn: 333861	2018-06-04 00:11:49 +00:00
Vedant Kumar	f0e4321c30	[opt] Add a -strip-module-flags option The -strip-module-flags option strips llvm.module.flags metadata from a module at the beginning of the opt pipeline. This will be used to test whether the output of a pass is debug info (DI) invariant. E.g, after applying synthetic debug info to a test case, we'd like to strip out all DI-related metadata and check that the final IR is identical to a baseline file without any DI applied, to check that optimizations aren't inhibited by debug info. llvm-svn: 333860	2018-06-04 00:11:48 +00:00
Vedant Kumar	36b89d4456	Reformat overflowing lines, NFC llvm-svn: 333859	2018-06-04 00:11:47 +00:00
Craig Topper	9923eac358	[X86] Remove and autoupgrade masked avx512vnni intrinsics using the unmasked intrinsics and select instructions. llvm-svn: 333857	2018-06-03 23:24:17 +00:00
Vedant Kumar	77f4d4d8aa	[Debugify] Skip dbg.value placement for EH pads, musttail Placing meta-instructions into EH pads breaks certain IR invariants, as does placing instructions after a musttail call. llvm-svn: 333856	2018-06-03 22:50:22 +00:00
Chris Bieneman	00d8c1abf0	Re-land: [MachO] Fixing ub in MachO BinaryFormat This re-lands r333797 with a fix for big endian systems. Original commit message: This isn't encountered anywhere inside LLVM, so I wrote a test case to expose the issue and verify that it is fixed. The basic problem is that the macho_load_command union contains all load comamnd structs. Load command structs in 32-bit macho files can be 32-bit aligned instead of 64-bit aligned. There are some strange circumstances in which this can be exposed in a 64-bit macho if the load commands are invalid or if a 32-bit aligned load command is used. In the past we've worked around this type of problem with changes like r264232. llvm-svn: 333854	2018-06-03 20:33:42 +00:00
Lang Hames	d6155ff002	[ORC] Add a constructor to create an IRMaterializationUnit from a module and pre-existing SymbolFlags and SymbolToDefinition maps. This constructor is useful when delegating work from an existing IRMaterialiaztionUnit to a new one, as it avoids the cost of re-computing these maps. llvm-svn: 333852	2018-06-03 19:22:48 +00:00
Simon Pilgrim	7c4446ce0c	[X86][TBM] Use realistic BEXTR control bits Avoid constant values that are guaranteed to give zero Found while investigating BEXTR optimizations for PR34042. llvm-svn: 333849	2018-06-03 18:15:06 +00:00
Sanjay Patel	3bd957b7ae	[InstCombine] improve sub with bool folds There's a patchwork of existing transforms trying to handle these cases, but as seen in the changed test, we weren't catching them all. llvm-svn: 333845	2018-06-03 16:35:26 +00:00
Simon Pilgrim	1f60e2b41b	[X86][AVX512] Cleanup intrinsics tests Ensure we test on 32-bit and 64-bit targets, and strip -mcpu usage. Part of ongoing work to ensure we test all intrinsic style tests on 32 and 64 bit targets where possible. llvm-svn: 333843	2018-06-03 14:56:04 +00:00
Simon Pilgrim	7d717fed0b	[X86][AVX512BW] Regenerate arithmetic tests using update_llc_test_checks.py script Require manual stripping of existing CHECKs as update_llc_test_checks doesn't remove them if they're outside the function llvm-svn: 333842	2018-06-03 14:31:30 +00:00
Simon Pilgrim	e370ade180	[X86][BMI1] Test i32 intrinsics on 32/64 bits + branch off i64 tests Further refactoring will wait until D47452 has landed. Part of ongoing work to ensure we test all intrinsic style tests on 32 and 64 bit targets where possible. llvm-svn: 333841	2018-06-03 14:11:34 +00:00
Simon Pilgrim	8dc43621ec	[X86][BMI] Remove CTTZ tests - this is fully covered in clz.ll llvm-svn: 333840	2018-06-03 13:55:17 +00:00
Simon Pilgrim	d4ef869e28	[X86][TBM] Branch off i32 intrinsics and test on 32/64 bits Part of ongoing work to ensure we test all intrinsic style tests on 32 and 64 bit targets where possible. llvm-svn: 333839	2018-06-03 13:38:52 +00:00
Amaury Sechet	99909e9308	Remove SETCCE use from Lanai's backend Summary: This creates a small perf regression, but after talking with Jacques Pienaar, he was good with it to get things moving toward removng SETCCE. Reviewers: jpienaar, bryant Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47626 llvm-svn: 333838	2018-06-03 12:56:24 +00:00
Lang Hames	353499fc95	[ORC] Use JITEvaluatedSymbol for IndirectStubsManager findStub and findPointer. Existing implementations of these methods do not require lazy materialization, and switching to JITEvaluatedSymbol allows us to remove error checking on the client side. llvm-svn: 333835	2018-06-03 02:18:03 +00:00
Simon Pilgrim	2b55e751ce	[X86][SSE] Cleanup AVX1 intrinsics tests Ensure we cover 32/64-bit targets for SSE/AVX/AVX512 cases as necessary, strip -mcpu usage. llvm-svn: 333834	2018-06-02 21:35:48 +00:00
Simon Pilgrim	58ff2ecc4b	[X86][SSE] Cleanup SSE1 intrinsics tests Ensure we cover 32/64-bit targets for SSE/AVX/AVX512 cases as necessary llvm-svn: 333833	2018-06-02 20:25:56 +00:00
Simon Pilgrim	8790844848	[X86][SSE] Cleanup SSE2 intrinsics tests Ensure we cover 32/64-bit targets for SSE/AVX/AVX512 cases as necessary llvm-svn: 333832	2018-06-02 19:43:14 +00:00
Simon Pilgrim	8c5b33a085	[X86][SSE] Cleanup SSE3/SSSE3 intrinsics tests Ensure we cover 32/64-bit targets for SSE/AVX/AVX512 cases as necessary llvm-svn: 333831	2018-06-02 18:41:46 +00:00
Simon Pilgrim	1c0fa05397	[X86][SSE4] Tweak rL333828 sse41/sse42 cleanup to recover SKX/EVEX2VEX testing Just testing for avx512f was missing the tests for EVEX TO VEX Compression encoding etc. llvm-svn: 333830	2018-06-02 18:01:09 +00:00
Simon Pilgrim	dda8daec73	[X86][SSE] Cleanup SSE4A/SSE41/SSE42 intrinsics tests Ensure we cover 32/64-bit targets for SSE/AVX/AVX512 cases as necessary Added some missing encoding checks to SSE4A tests llvm-svn: 333828	2018-06-02 17:33:26 +00:00
Simon Pilgrim	d93157c1b3	[X86][BMI2] Test i32 intrinsics on 32/64 bits + branch off i64 tests I had to tweak the i32 tests so we check both reg-reg and reg-mem cases. I also added i64 load tests. Part of ongoing work to ensure we test all intrinsic style tests on 32 and 64 bit targets where possible. llvm-svn: 333827	2018-06-02 17:22:13 +00:00
Simon Pilgrim	6028dc451a	[X86][BMI1] Remove test for non-existent andn i16 instruction llvm-svn: 333826	2018-06-02 17:02:27 +00:00
Ivan A. Kosarev	60a991ed1a	[NEON] Support VLD1xN intrinsics in AArch32 mode (LLVM part) We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47120 llvm-svn: 333825	2018-06-02 16:40:03 +00:00
Ivan A. Kosarev	73c5337a64	Revert r333819 "[NEON] Support VLD1xN intrinsics in AArch32 mode (Clang part)" The LLVM part was committed instead of the Clang part. Differential Revision: https://reviews.llvm.org/D47121 llvm-svn: 333824	2018-06-02 16:38:38 +00:00
Michael J. Spencer	ae6eeaea92	[MC] Add assembler support for .cg_profile. Object FIle Representation At codegen time this is emitted into the ELF file a pair of symbol indices and a weight. In assembly it looks like: .cg_profile a, b, 32 .cg_profile freq, a, 11 .cg_profile freq, b, 20 When writing an ELF file these are put into a SHT_LLVM_CALL_GRAPH_PROFILE (0x6fff4c02) section as (uint32_t, uint32_t, uint64_t) tuples as (from symbol index, to symbol index, weight). Differential Revision: https://reviews.llvm.org/D44965 llvm-svn: 333823	2018-06-02 16:33:01 +00:00
Craig Topper	93d8fbd8f2	[X86] Add tied source operand to AVX5124FMAPS and AVX5124VNNIW instructions. This doesn't affect the assembly or disassembly, but is more accurate. llvm-svn: 333822	2018-06-02 16:30:39 +00:00
Craig Topper	27234f1d8f	[X86] Fix warning message for AVX5124FMAPS and AVX5124VNNIW instructions in the assembly parser. The caret was positioned on the wrong operand. It's too hard to get right so just put the caret at the beginning of the instruction. llvm-svn: 333821	2018-06-02 16:30:36 +00:00
Sanjay Patel	bbc6d60677	[InstCombine] call simplify before trying vector folds As noted in the review thread for rL333782, we could have made a bug harder to hit if we were simplifying instructions before trying other folds. The shuffle transform in question isn't ever a simplification; it's just a canonicalization. So I've renamed that to make that clearer. This is NFCI at this point, but I've regenerated the test file to show the cosmetic value naming difference of using instcombine's RAUW vs. the builder. Possible follow-ups: 1. Move reassociation folds after simplifies too. 2. Refactor common code; we shouldn't have so much repetition. llvm-svn: 333820	2018-06-02 16:27:44 +00:00
Ivan A. Kosarev	51f19b9ee1	[NEON] Support VLD1xN intrinsics in AArch32 mode (Clang part) We currently support them only in AArch64. The NEON Reference, however, says they are 'ARMv7, ARMv8' intrinsics. Differential Revision: https://reviews.llvm.org/D47121 llvm-svn: 333819	2018-06-02 16:26:42 +00:00
Sanjay Patel	b6333486f4	[InstCombine] add more tests for shuffle-binop; NFC As noted in the review thread for rL333782, we're lacking coverage for this transform, so add tests for each binop opcode with constant operand. llvm-svn: 333818	2018-06-02 16:16:42 +00:00
Fangrui Song	8ca769d204	[Support] Remove unused raw_ostream::handle whose anchor role was superseded by anchor() llvm-svn: 333817	2018-06-02 06:00:35 +00:00
Craig Topper	1534929623	[X86] Add encoding information for the AVX5124FMAPS and AVX5124VNNIW instructions so they can be assembled and disassembled. These instructions are unusual in that they operate on 4 consecutive registers so supporting them in codegen will be more difficult than normal. Includes an assembler check to warn if the source register is not the first register of a 4 register group. llvm-svn: 333812	2018-06-02 02:15:10 +00:00

1 2 3 4 5 ...

164904 Commits