This reverts commit 3f572c7b84.
The MSan buildbot was broken by rL367901. This commit
(rL368079) depends on the broken commit, which needs to be reverted,
and thus is itself being reverted.
See https://reviews.llvm.org/rL367901 for more information.
llvm-svn: 368106
Globals are instrumented by adding a pointer tag to their symbol values
and emitting metadata into a special section that allows the runtime to tag
their memory when the library is loaded.
Due to order-of-initialization issues explained in more detail in the comments,
shadow initialization cannot happen during regular global initialization.
Instead, the location of the global section is marked using an ELF note,
and we require libc support for calling a function provided by the HWASAN
runtime when libraries are loaded and unloaded.
Based on ideas discussed with @evgeny777 in D56672.
Differential Revision: https://reviews.llvm.org/D65770
llvm-svn: 368102
This check is only meaningful for COFF, and it is perfectly valid to create
such a GlobalValue in ELF.
Differential Revision: https://reviews.llvm.org/D65686
llvm-svn: 368094
If we're after type legalization, we should only be trying to turn
v2i64 into v2i32. So bitcast to v4i32 and shuffle the even elements
together, then use X86ISD::CVTSI2P. The alternative is to leave
the v2i64 type alone and let it be scalarized. Hopefully keeping
it packed is better.
Fixes PR42905.
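For illustration, a minimal IR sketch of the kind of pattern involved (assumed from the description; the actual reproducer in PR42905 may differ):
```
; Hypothetical example: a v2i64 -> v2f32 conversion. After type
; legalization, rather than scalarizing, we can bitcast the source to
; v4i32, shuffle the even elements together, and emit X86ISD::CVTSI2P.
define <2 x float> @cvt(<2 x i64> %x) {
  %r = sitofp <2 x i64> %x to <2 x float>
  ret <2 x float> %r
}
```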
llvm-svn: 368091
This reverts r368059 (git commit 0f95710976).
This caused Clang to assert while self-hosting and compiling
SystemZInstrInfo.cpp. Reduction is running.
llvm-svn: 368084
This flag is now the default behavior, so we no longer need to
set it in tests.
Some redundant tests have been removed after verifying we have
an equivalent test that didn't use the flag.
llvm-svn: 368079
https://reviews.llvm.org/D65698
This adds a KnownBits analysis pass for GISel. This was done as a
pass (as opposed to static functions) so that we can add other features
such as caching queries (within a pass and across passes) in the future.
This patch only adds the basic pass boilerplate and implements a lazy,
non-caching known-bits implementation (ported from SelectionDAG). I've
also hooked up the AArch64PreLegalizerCombiner pass to use this - there
should be no compile-time regression as the analysis is lazy.
llvm-svn: 368065
Summary:
Currently `reassociateShiftAmtsOfTwoSameDirectionShifts()` only handles
two shifts one after another. If the shifts are `shl`, we can still
easily perform the fold, with no extra legality checks:
https://rise4fun.com/Alive/OQbM
If we have a right-shift, however, we won't be able to make it
any simpler than it already is.
After this, the only thing missing here is constant folding for the
case `NewShAmt >= bitwidth(X)`:
* If it's a logical shift, then constant-fold to `0` (not `undef`)
* If it's an `ashr`, then a splat of the original signbit
https://rise4fun.com/Alive/E1K
https://rise4fun.com/Alive/i0V
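As a minimal IR sketch of the same-direction `shl` case (illustrative, not taken from the patch):
```
; Two left-shifts in a row can be reassociated into a single shift by
; the sum of the amounts (given the sum stays within the bit width):
define i32 @two_shl(i32 %x, i32 %q, i32 %k) {
  %s1 = shl i32 %x, %q
  %s2 = shl i32 %s1, %k
  ; -> %sum = add i32 %q, %k
  ;    %s2  = shl i32 %x, %sum
  ret i32 %s2
}
```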
Reviewers: spatel, nikic, xbolva00
Reviewed By: spatel
Subscribers: hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D65380
llvm-svn: 368059
D62198 introduced an option to relax the checks for
hasOnlyUniformBranches. This commit turns the option on by default, for
better code generation in some cases in AMDGPU.
Differential Revision: https://reviews.llvm.org/D63198
Change-Id: I9cbff002a1e74d3b7eb96b4192dc8129936d537d
llvm-svn: 368042
Remove redundant `yaml2obj-elf-file-headers-with-e_flags.yaml` test
case. The same functionality is checked by `Mips/elf-flags.yaml`.
llvm-svn: 368023
This patch changes the DAG legalizer to respect the operation actions
set by the target for strict floating-point operations. (Currently, the
legalizer will usually fall back to mutate to the non-strict action
(which is assumed to be legal), and only skip mutation if the strict
operation is marked legal.)
With this patch, whenever a strict operation is marked as Legal or
Custom, it is passed to the target as usual. Only if it is marked as
Expand will the legalizer attempt to mutate to the non-strict operation.
Note that this will now fail if the non-strict operation is itself
marked as Custom -- the target will have to provide a Custom definition
for the strict operation then as well.
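For reference, strict floating-point operations correspond to the constrained intrinsics in IR; a minimal sketch:
```
; A strict FP addition. Whether the target marks the corresponding
; strict operation Legal, Custom, or Expand now determines how the
; legalizer treats it.
declare double @llvm.experimental.constrained.fadd.f64(double, double, metadata, metadata)

define double @strict_add(double %a, double %b) {
  %r = call double @llvm.experimental.constrained.fadd.f64(double %a, double %b, metadata !"round.dynamic", metadata !"fpexcept.strict")
  ret double %r
}
```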
Reviewed By: hfinkel
Differential Revision: https://reviews.llvm.org/D65226
llvm-svn: 368012
There are multiple yaml2obj-* tests in the llvm/test/Object
folder. This is not the correct place to have them, and my intention
was to move them out to the test\tools\yaml2obj folder. I reviewed
them, made some changes, and my comments are below.
For all tests I:
Added comments when needed.
Moved them from llvm/test/Object to the yaml2obj tests.
Other changes performed:
1) yaml2obj-invalid.yaml. It was a test for an invalid YAML input.
I just moved it.
2) yaml2obj-coff-multi-doc.test/yaml2obj-elf-multi-doc.test:
these were tests for the --docnum=x functionality,
one for COFF and one for ELF. I merged them into one.
3) yaml2obj-elf-bits-endian.test:
I removed its 4 YAML inputs (merged into the main test).
4) yaml2obj-readobj.test:
This file has a long history. It was initially added to check the
parsing of header characteristics. Then it was used to test
how yaml2obj writes the relocations. Then it was upgraded to check how
yaml2obj handles the "-o" option. I think it should be heavily split up
and refactored in a separate patch. For now I left it as is, but restyled it
to reduce the changes in follow-ups.
5) yaml2obj-elf-alignment.yaml: its intention was to check that we
can set the sh_addralign field. I moved, renamed (to elf-sh-addralign.yaml)
and updated this test.
6) yaml2obj-elf-file-headers.yaml: I removed it.
Its intention was to check that
yaml2obj handles OS/ABI and ELF type (e.g. Relocatable).
We are testing this already, for example in D64800. We might want
to add a better (more complete) test, but keeping the existing test
does not make much sense, I think.
7) yaml2obj-elf-file-headers-with-e_flags.yaml: I would describe its intention
as "testing MIPS e_flags". It is far from complete and tests only
a few flags. I left it alone for now.
8) yaml2obj-elf-rel.yaml: its intention is to check the MIPS32 relocations.
We have a version for MIPS64 here: test\Object\Mips\elf-mips64-rel.yaml
It seems they are both incomplete. I left them alone for now.
9) yaml2obj-elf-rel-noref.yaml: was introduced to check the support of the arm32
R_ARM_V4BX relocation. I left it alone for now.
10) yaml2obj-elf-section-basic.yaml: it just checked that we are able to recognise
trivial fields like the section 'Name', 'Type', 'Flags' and others. All of our yaml2obj
tests already exercise this heavily. I just removed this test.
11) yaml2obj-elf-section-invalid-size.yaml: its intention was to check the
"Section size must be greater than or equal to the content size" error.
I moved this test to `tools\yaml2obj\section-size-content.yaml`.
12) yaml2obj-elf-symbol-basic.yaml: its intention seems to have been to test declarations
of symbols in yaml2obj. I removed it. We use this in almost every test we already have.
13) yaml2obj-elf-symbol-LocalGlobalWeak.yaml: its intention was to check that we can
declare different symbol bindings. I moved it to tools\yaml2obj\elf-symbol-binding.yaml.
14) yaml2obj-coff-invalid-alignment.test: checks that an error is reported for a too-large COFF
section alignment. Moved it to tools\yaml2obj\coff-invalid-alignment.test
15) yaml2obj-elf-symbol-visibility.yaml: tests ELF symbol visibility. I improved it and
moved it to tools\yaml2obj\elf-symbol-visibility.yaml and tools\obj2yaml\elf-symbol-visibility.yaml
Differential revision: https://reviews.llvm.org/D65652
llvm-svn: 367988
A function is "no-return" if we never reach a return instruction, either
because there are none or the ones that exist are dead.
Tests have been adjusted:
- either noreturn was added, or
- noreturn was avoided by modifying the code.
The new noreturn_{sync,async} tests make sure we handle invoke
instructions with a noreturn (and potentially nounwind) callee
correctly, even in the presence of potential asynchronous exceptions.
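A minimal IR sketch (illustrative, not taken from the tests) of a function the deduction can handle:
```
declare void @abort() noreturn

; No return instruction is reachable here, so @wrapper itself can be
; deduced to be noreturn.
define void @wrapper() {
entry:
  call void @abort()
  unreachable
}
```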
llvm-svn: 367948
Summary:
When the WebAssembly backend encounters a return type that doesn't
fit within i32, SelectionDAG performs sret demotion, adding an
additional argument to the start of the function that contains
a pointer to an sret buffer to use instead. However, this conflicts
with the emscripten sjlj lowering pass. There we translate calls like:
```
call {i32, i32} @foo()
```
into (in pseudo-llvm)
```
%addr = @foo
call {i32, i32} @__invoke_{i32,i32}(%addr)
```
i.e. we perform an indirect call through an extra function.
However, the sret transform now transforms this into
the equivalent of
```
%addr = @foo
%sret = alloca {i32, i32}
call {i32, i32} @__invoke_{i32,i32}(%sret, %addr)
```
(while simultaneously translating the implementation of @foo as well).
Unfortunately, this doesn't work out. The __invoke_ ABI expects
the function address to be the first argument, causing crashes.
There are several possible ways to fix this:
1. Implementing the sret rewrite at the IR level as well and performing
it as part of lowering to __invoke
2. Fixing the wasm backend to recognize that __invoke has a special ABI
3. A change to the binaryen/emscripten ABI to recognize this situation
This revision implements the middle option, teaching the backend to
treat __invoke_ functions specially in sret lowering. This is achieved
by
1) Introducing a new CallingConv ID for invoke functions
2) When this CallingConv ID is seen in the backend and the first argument
is marked as sret (a function pointer would never be marked as sret),
swapping the first two arguments.
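With the swap applied, the earlier example effectively becomes (pseudo-llvm, same notation as above):
```
%addr = @foo
%sret = alloca {i32, i32}
call {i32, i32} @__invoke_{i32,i32}(%addr, %sret)
```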
Reviewed By: tlively, aheejin
Differential Revision: https://reviews.llvm.org/D65463
llvm-svn: 367935
When we remove instructions, cached references could still be live. This
patch avoids removing invoke instructions that are replaced by calls and
instead keeps them around but in a dead block.
llvm-svn: 367933
Any addresses that we pass to llvm-symbolizer are going to be untagged,
while any HWASAN-instrumented globals are going to be tagged in the
symbol table. Therefore we need to untag the addresses before using them.
Differential Revision: https://reviews.llvm.org/D65769
llvm-svn: 367926
FastISel has already done this since the initial arm64 port was upstreamed, so
it seems there are no issues with doing this at -O0 for very small memcpys.
Gives a 0.2% geomean code size improvement on CTMark.
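A hedged sketch of the kind of IR affected (the size here is an assumption for illustration; the exact inlining threshold is not specified):
```
declare void @llvm.memcpy.p0i8.p0i8.i64(i8*, i8*, i64, i1)

; A very small fixed-size memcpy like this can be selected as a
; load/store pair at -O0 instead of a libcall.
define void @copy8(i8* %dst, i8* %src) {
  call void @llvm.memcpy.p0i8.p0i8.i64(i8* %dst, i8* %src, i64 8, i1 false)
  ret void
}
```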
Differential Revision: https://reviews.llvm.org/D65758
llvm-svn: 367919
Sets the section alignments of the specified architecture slices to the
given alignment values.
Alignment values are hexadecimal numbers that are powers of 2.
Differential Revision: https://reviews.llvm.org/D65420
llvm-svn: 367908
This patch changes our default legalization behavior for 16-, 32-, and
64-bit vectors with i8/i16/i32/i64 scalar types from promotion to
widening. For example, v8i8 will now be widened to v16i8 instead of
promoted to v8i16. This keeps the element widths the same and pads
with undef elements. We believe this is a better legalization strategy.
But it carries some issues due to the fragmented vector ISA. For
example, i8 shifts and multiplies get widened and then later have
to be promoted/split into vXi16 vectors.
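An illustrative sketch of the change for a 64-bit vector type:
```
; v8i8 is a 64-bit vector; under the new default its operations are
; widened to v16i8 (padding the extra lanes with undef) instead of
; being promoted to v8i16.
define <8 x i8> @add_v8i8(<8 x i8> %a, <8 x i8> %b) {
  %r = add <8 x i8> %a, %b
  ret <8 x i8> %r
}
```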
This has the potential to cause regressions so we wanted to get
it in early in the 10.0 cycle so we have plenty of time to
address them.
Next steps will be to merge tests that explicitly test the command
line option. And then we can remove the option and its associated
code.
llvm-svn: 367901
Patch D56593 by @courbet results in calls to `bcmp()` in some cases, should
the target support it, unless `TTI::MemCmpExpansionOptions()`
is overridden by the target.
In a proprietary benchmark we see a performance drop of about 12% on PNG
compression before this patch, though it passes all tests.
This patch mirrors X86 for AArch64 and initializes
`TTI::MemCmpExpansionOptions()` to then expand calls to `bcmp()` when
appropriate. No tuning of the parameters was performed, but, at this point,
it's enough to recover the performance drop above.
This problem also exists on ARM. Once a consensus is reached for AArch64, we
can work to fix ARM as well.
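As a hedged sketch (illustrative IR, not taken from the patch) of the pattern affected:
```
declare i32 @bcmp(i8*, i8*, i64)

; Only equality of the result is tested, so with expansion enabled the
; call can be replaced by inline loads and an integer compare.
define i1 @eq8(i8* %p, i8* %q) {
  %res = call i32 @bcmp(i8* %p, i8* %q, i64 8)
  %eq = icmp eq i32 %res, 0
  ret i1 %eq
}
```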
Authors:
- Evandro Menezes (@evandro) <e.menezes@samsung.com>
- Brian Rzycki (@brzycki) <b.rzycki@samsung.com>
Differential revision: https://reviews.llvm.org/D64805
llvm-svn: 367898
Summary:
The Arm Neoverse N1 Software Optimization Guide [1], Section "4.8 Branch
instruction alignment" states:
"Consider aligning subroutine entry points and branch targets to 32B
boundaries, within the bounds of the code-density requirements of the
program."
This patch sets the preferred function alignment on Neoverse N1 to 2^4=16B.
This was already the case in some of the latest Cortex-A CPUs. Benchmarking
in previous Cortex-A CPUs suggested that 16B alignment is already better
than the default. See commit d04ee305.
The reason we don't set it to 32B right now (as the optimisation guide
suggests) is that this will impact code size and perhaps the instruction
cache performance. Therefore we need benchmark numbers first.
I have also added testing for A75 and A76 that we were missing.
[1] https://developer.arm.com/docs/swog309707/latest
Reviewers: fhahn, greened, samparker, dmgreen
Reviewed By: dmgreen
Subscribers: dmgreen, javed.absar, kristof.beyls, hiraditya, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D65654
llvm-svn: 367894
This appears to slightly help patterns similar to what's
shown in PR42874:
https://bugs.llvm.org/show_bug.cgi?id=42874
...but not in the way requested.
That fix will require some later IR and/or backend pass to
decompose multiply/shifts into something more optimal per
target. Those transforms already exist in some basic forms,
but probably need enhancing to catch more cases.
https://rise4fun.com/Alive/Qzv2
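As an illustrative sketch of the kind of decomposition meant (not part of this patch):
```
; A target might prefer to decompose a multiply by a constant such as
; 40 = 32 + 8 into shifts and an add:
define i32 @mul40(i32 %x) {
  %s5 = shl i32 %x, 5   ; x * 32
  %s3 = shl i32 %x, 3   ; x * 8
  %m  = add i32 %s5, %s3
  ret i32 %m
}
```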
llvm-svn: 367891