llvm-project

Commit Graph

Author	SHA1	Message	Date
NAKAMURA Takumi	1ecfc12b0c	[Bazel] Make `builtin_headers_gen` as subset of CMake's `clang-resource-headers` At the moment, two files are not installed by CMake. - `lib/Headers/openmp_wrappers/time.h` - `lib/Headers/ppc_wrappers/nmmintrin.h` `builtin_headers_gen` is available as the source of rules_pkg. The difference of the layout of installed headers makes cache hit harder.	2022-07-03 15:46:38 +09:00
Craig Topper	7e4ab9d5b8	[RISCV] Add more SHXADD isel patterns. This handles the code we get for int foo(int* x, unsigned y) { return x[y >> 1]; } The shift right and the shl will get DAG combined into (shl (and X, 0xfffffffe), 1). We have custom isel to match the shl+and, but with Zba the (add (shl X, 1), Y) part will get matched and leave the and to be iseled by itself. This commit adds a larger pattern that includes the and.	2022-07-02 23:11:22 -07:00
Vitaly Buka	7788b0c097	[lsan] malloc_usable_size returns 0 for nullptr	2022-07-02 20:16:30 -07:00
lewuathe	5148c685e3	[mlir][complex] Inverse canonicalization between exp and log We can canonicalize consecutive complex.exp and complex.log which are inverse functions each other. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D128966	2022-07-03 09:26:55 +09:00
Craig Topper	5d787689b1	[RISCV] Match RISCVISD::ADD_LO in SelectAddrRegImm. This allows us to fold global and constant pool addresses into load/store during isel instead of in the post-isel peephole. I did not copy the alignment check for ConsantPoolSDNode because it wasn't tested. This is a step towards being able to remove the post-isel peephole. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D128738	2022-07-02 09:51:06 -07:00
Florian Hahn	b0da3c6fa4	[VPlan] Move setDebugLocFromInst to VPTransformState (NFC). The moved helpers are only used for codegen. It will allow moving the remaining ::execute implementations out of LoopVectorize.cpp. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D128657	2022-07-02 15:18:17 +01:00
Luo, Yuanke	23ee84f432	[globalisel] Add test case for regbank selection.	2022-07-02 17:06:00 +08:00
lorenzo chelini	7fc4518f4a	[MLIR] Rename FusePadOpWithLinalgConsumer -> FusePadOpWithLinalgProducer (NFC) Follow up after D128978, where I mistakenly rename the file. The linalg op is fused with its producer, not the consumer.	2022-07-02 11:01:50 +02:00
Craig Topper	b2e9684fe4	[RISCV] isel (shl (and X, C2), C) -> (slli (srliw X, C3), C3+C). where C2 has 32 leading zeros and C3 trailing zeros. When the shl is used by an add C is 1,2 or 3, we end up matching (add (shl X, C), Y) first. This leaves an and with a constant that is harder to materialize.	2022-07-02 01:04:44 -07:00
Craig Topper	9ac548e118	[RISCV] isel (add (and X, 0xFFFFFFFE), Y) as (SH1ADD (SRLIW X, 1), Y). Similar for SH2ADD and SH3ADD. This is what we get from int foo(int* x, unsigned y) { return x[y >> 1]; } This allows us to avoid materializing 0xFFFFFFFE into a register.	2022-07-01 23:52:29 -07:00
Petr Hosek	68050c2354	Revert "[CMake][Fuchsia] Use libunwind as the default unwinder" This reverts commit `6213dba19f` since this broke Fuchsia builders.	2022-07-02 04:51:16 +00:00
owenca	40d2ef841b	[clang-format][NFC] Replace an EXPECT_EQ with a verifyFormat	2022-07-01 21:20:54 -07:00
Joseph Huber	ccf7dd5e81	[llvm-objdump] Ensure offloading sections have proper alignment Summary: A previous patch added support for dumping offloading sections. The tests for this feature added dummy input to the required section using `llvm-objcopy`. This binary format has a required alignment of `8` which was not being respected by the file copied with llvm-objcopy and would cause failures on architectures sensitive to alignment problems or with sanitizers. This patch adds the proper alignemnt and adds an error check at least for the binary format so it's not completely opaque. This should be improvbed so users actually get a helpful message.	2022-07-01 23:26:44 -04:00
Yeting Kuo	5744b9cb79	[RISCV] Restore "Enable shrink wrap by default" This reverts commit `7af3d4ab3d`. RISC-V reverted the shrink wrap patch for bug 53662. Since the bug is fixed by D123679, the commit re-enable it. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D128965	2022-07-02 11:13:13 +08:00
Johannes Doerfert	07766f4070	[Attributor] Move heap2stack allocas to the entry block if possible If we are certainly not in a loop we can directly emit the heap2stack allocas in the function entry block. This will help to get rid of them (SROA) and avoid stacksave/restore intrinsics when the function is inlined.	2022-07-01 21:34:12 -05:00
Johannes Doerfert	b52d33e6de	[OpenMP][NFC] Reuse check lines for Clang/OpenMP tests I used a script to reuse existing check lines rather than creating new ones. There are more opportunities to reduce the line count but the "check generated functions" logic makes that somewhat tricky. FWIW, we really should redo the update script with all these use cases in mind... Differential Revision: https://reviews.llvm.org/D128686	2022-07-01 21:34:11 -05:00
owenca	cc55d97ceb	[clang-format] Run dump_format_style.py for LK_Verilog	2022-07-01 19:01:09 -07:00
wren romano	537db49596	[mlir][sparse] Silencing some -Wunused-function in unittests This is a followup to D128058. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D129027	2022-07-01 18:47:44 -07:00
Yeting Kuo	8590a35ef9	[RISCV][NFC] Simplify condition of IsTU. Just simplify code. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D128972	2022-07-02 09:22:38 +08:00
LLVM GN Syncbot	0dbf0ba033	[gn build] Port `d2d8b0aa4f`	2022-07-02 01:13:41 +00:00
LLVM GN Syncbot	9c4d301ddd	[gn build] Port `228c8f9cc0`	2022-07-02 01:13:40 +00:00
Joseph Huber	d2d8b0aa4f	[llvm-objdump] Add support for dumping embedded offloading data In Clang/LLVM we are moving towards a new binary format to store many embedded object files to create a fatbinary. This patch adds support for dumping these embedded images in the `llvm-objdump` tool. This will allow users to query information about what is stored inside the binary. This has very similar functionality to the `cuobjdump` tool for thoe familiar with the Nvidia utilities. The proposed use is as follows: ``` $ clang input.c -fopenmp --offload-arch=sm_70 --offload-arch=sm_52 -c $ llvm-objdump -O input.o input.o: file format elf64-x86-64 OFFLOADIND IMAGE [0]: kind cubin arch sm_52 triple nvptx64-nvidia-cuda producer openmp OFFLOADIND IMAGE [1]: kind cubin arch sm_70 triple nvptx64-nvidia-cuda producer openmp ``` This will be expanded further once we start embedding more information into these offloading images. Right now we are planning on adding flags and entries for debug level, optimization, LTO usage, target features, among others. This patch only supports printing these sections, later we will want to support dumping files the user may be interested in via another flag. I am unsure if this should go here in `llvm-objdump` or `llvm-objcopy`. Reviewed By: MaskRay, tra, jhenderson, JonChesterfield Differential Revision: https://reviews.llvm.org/D126904	2022-07-01 21:13:28 -04:00
Joseph Huber	228c8f9cc0	[ObjectYAML] Add offloading binary implementations for obj2yaml and yaml2obj This patchs adds the necessary code for inspecting or creating offloading binaries using the standing `obj2yaml` and `yaml2obj` features in LLVM. Depends on D127774 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D127776	2022-07-01 21:13:18 -04:00
Jennifer Yu	927156a674	Generate the capture for the field when the field is used in openmp region with implicit default inside the member function. This is to fix assert when field is referenced in OpenMP region with default (first\|private) clause inside member function. The problem of assert is that the capture is not generated for the field. This patch is to generate capture when the field is used with implicit default, use it in the code, and save the capture off to make sure it is considered from that point and add first/private clauses. 1> Add new field ImplicitDefaultFirstprivateFDs in SharingMapTy, used to store generated capture fields info. 2> In function isOpenMPCaptureDecl: the caputer is generated and saved in ImplicitDefaultFirstprivateFDs. 3> Add new help functions: getImplicitFDCapExprDecl isImplicitDefaultFirstprivateFD addImplicitDefaultFirstprivateFD 4> Add addition argument in hasDSA to check default attribute for default(first\|private). 5> The isImplicitDefaultFirstprivateFD is used in VisitDeclRefExpr to build the implicit clause. 6> Add new parameter "Context" for buildCaptureDecl, due to when capture field, the parent context is needed to be used. 7> Change in isOpenMPPrivateDecl where stop propagate the capture from the enclosing region for private variable. 8> In ActOnOpenMPFirstprivate/ActOnOpenMPPrivate, using captured info to generate first\|private clause. 9> Add new function isOpenMPRebuildMemberExpr: use to determine if field needs to be rebuild during template instantiation. Differential Revision: https://reviews.llvm.org/D127803	2022-07-01 17:09:01 -07:00
LLVM GN Syncbot	17c8119564	[gn build] Port `94c7b89fe5`	2022-07-01 23:35:58 +00:00
Konstantin Varlamov	94c7b89fe5	[libc++][ranges] Implement `ranges::stable_sort`. Differential Revision: https://reviews.llvm.org/D127834	2022-07-01 16:34:26 -07:00
Vitaly Buka	f2fa4f9759	[sanitizer] Update dn_expand interceptor for glibc 2.34 Symbol changed with 640bbdf71c6f10ac26252ac67a22902e26657bd8	2022-07-01 16:26:58 -07:00
Nuno Lopes	7c4f45f87a	Revert [LowerMatrixMultiplication] Switch dummy values from undef to poison [NFC] This reverts commits `47e6f98f84` and `3e701bcd2a`	2022-07-01 23:53:41 +01:00
Nuno Lopes	3e701bcd2a	attempt to fix aarch64 build bot	2022-07-01 23:43:48 +01:00
Nuno Lopes	47e6f98f84	[LowerMatrixMultiplication] Switch dummy values from undef to poison [NFC]	2022-07-01 23:31:31 +01:00
Maksim Panchenko	3a47037fcc	[BOLT] Fix instrumentation problem with floating point If BOLT instrumentation runtime uses XMM registers, it can interfere with the user program causing crashes and unexpected behavior. This happens as the instrumentation code preserves general purpose registers only. Build BOLT instrumentation runtime with "-mno-sse". Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D128960	2022-07-01 15:29:36 -07:00
Fangrui Song	fd25a0aa41	[llvm-lto2] Remove unneeded cl::init(false). NFC	2022-07-01 14:35:36 -07:00
Argyrios Kyrtzidis	0d3a2b4c66	[Lex] Introduce `PPCallbacks::LexedFileChanged()` preprocessor callback This is a preprocessor callback focused on the lexed file changing, without conflating effects of line number directives and other pragmas. A client that only cares about what files the lexer processes, like dependency generation, can use this more straightforward callback instead of `PPCallbacks::FileChanged()`. Clients that want the pragma directive effects as well can keep using `FileChanged()`. A use case where `PPCallbacks::LexedFileChanged()` is particularly simpler to use than `FileChanged()` is in a situation where a client wants to keep track of lexed file changes that include changes from/to the predefines buffer, where it becomes unnecessary complicated trying to use `FileChanged()` while filtering out the pragma directives effects callbacks. Also take the opportunity to provide information about the prior `FileID` the `Lexer` moved from, even when entering a new file. Differential Revision: https://reviews.llvm.org/D128947	2022-07-01 14:22:31 -07:00
Arthur Eubanks	bcd153485e	[bazel] Fix invalid characters	2022-07-01 13:47:56 -07:00
Arthur Eubanks	5a65c5180e	[bazel] Port `43dc3190`, adding rules to generate dxil intrinsics	2022-07-01 13:38:43 -07:00
Sanjay Patel	9c8a39c67b	[InstCombine] restrict select of bit-tests to constant shift amounts This transform is responsible for a long-standing miscompile as discussed in issue #47012 (was bugzilla #47668). There was a proposal to correct it in D88432, but that was abandoned and there hasn't been any recent activity to fix it AFAICT. The original patch D45108 started with a constant-shift-only restriction and only expanded during review, so I don't think there's much risk of perf regression on the motivating code.	2022-07-01 16:24:34 -04:00
Sanjay Patel	feb4b628ac	[InstCombine] avoid 'tmp' usage in test files; NFC The update script ( utils/update_test_checks.py ) warns against this.	2022-07-01 16:18:41 -04:00
wren romano	875ee0ed1c	[mlir][sparse] Reducing computational complexity This is a followup to D128847. The `AffineMap::getPermutedPosition` method performs a linear scan of the map, thus the previous implementation had asymptotic complexity of `O(\|topSort\| * \|m\|)`. This change reduces that to `O(\|topSort\| + \|m\|)`. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D129011	2022-07-01 12:55:09 -07:00
Valentin Clement	b19cbda45a	[flang][NFC] Add embox test with character This test is added to check for multidimensional descriptor of array substring/derived type component array. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: jeanPerier Differential Revision: https://reviews.llvm.org/D128990 Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-07-01 21:49:20 +02:00
Alexey Bataev	34073b5538	[SLP][NFC]Rework the test for logical and freeze, need some extra nodes, NFC.	2022-07-01 12:43:10 -07:00
Haojian Wu	bbcd8e5271	[pseudo] NFC, polish the fix of `c998273499`	2022-07-01 21:25:46 +02:00
Eric Kunze	b7f4335d6a	[mlir][tosa] Update TOSA transpose_conv2d to match spec The TOSA Specification doesn't have a dilation attribute for transpose_conv2d, and the padding array is of size 4. (top,bottom,left,right). This change updates the dialect to match the specification, and updates the lit tests to match the dialect changes. Differential Revision: https://reviews.llvm.org/D127332	2022-07-01 19:10:28 +00:00
Alexey Bataev	48aa787ab3	[SLP][NFC]Add a test for logical and operands, requiring extra freezextra freeze, NFC.e.	2022-07-01 11:53:50 -07:00
Fangrui Song	ab3630dd41	[UpdateTestChecks][test] Remove stray ; before/after non-RUN-non-CHECK comments	2022-07-01 11:42:47 -07:00
Arthur Eubanks	3d7aeb3c73	[gn build] Manually port `43dc3190`	2022-07-01 11:39:04 -07:00
rdzhabarov	f59c279b72	[mlir] Fix usages of `run-reproducer`. There is no need to specify `run-reproducer` explicitly anymore. Differential Revision: https://reviews.llvm.org/D129010	2022-07-01 18:36:07 +00:00
Erich Keane	258c3aee54	Revert "Re-apply "Deferred Concept Instantiation Implementation""" This reverts commit `befa8cf087`. Apparently this breaks some libc++ builds with an apparent assertion, so I'm looking into that .	2022-07-01 11:20:16 -07:00
Craig Topper	188582b7e0	[RISCV] Considering existing offset in the alignment when folding ADDIs into load/store. getPointerAlignment and ConstantPoolSDNode::getAlign only consider the alignment of the object. If we already have a non-zero offset into the offset that may have reduced the alignment. Since the base pointer will become an LUI with the old offset, we need to be sure the new offset fits in the alignment of the address that will be used to create the LUI immediate. I'm not sure it is possible to have a non-zero offset in the GlobalAddressSDNode or ConstantPoolSDNode at this point today so this may only be a theoretical bug. Differential Revision: https://reviews.llvm.org/D129006	2022-07-01 11:18:40 -07:00
Haojian Wu	c998273499	[pseudo] Fix an out-of-bound issue in getReduceRules.	2022-07-01 20:16:06 +02:00
Fangrui Song	6e8ec13d3f	[MC][RISCV] Suppress R_RISCV_{ADD,SUB}32 in .apple_names .apple_types after D127549 This fixes test/DebugInfo/Generic/accel-table-hash-collisions.ll and cross-cu-inlining.ll when the default triple is riscv. llvm-dwarfdump --apple-names does not resolve R_RISCV_{ADD,SUB}32 in .apple_names .apple_types and having ADD/SUB will cause decoding failure `Atom[0]: Error extracting the value`.	2022-07-01 11:15:04 -07:00

1 2 3 4 5 ...

428711 Commits All Branches Search

428711 Commits

All Branches