llvm-project

Commit Graph

Author	SHA1	Message	Date
Michał Górny	f8537895b9	[lldb] [Host] Sync TerminalState::Data to struct type	2021-10-01 20:34:11 +02:00
Sanjay Patel	3fabd98e5b	[InstCombine] fold (trunc (X>>C1)) << C to shift+mask directly This is no-externally-visible-functional-difference-intended. That is, the test diffs show identical instructions other than name changes (those are included specifically to verify the logic). The existing transforms created extra instructions and relied on subsequent folds to get to the final result, but that could conflict with other transforms like the proposed D110170 (and caused that patch to be reverted twice so far because of infinite combine loops).	2021-10-01 14:22:44 -04:00
LLVM GN Syncbot	06cea95a5a	[gn build] Port `33dd98e9e4`	2021-10-01 18:18:21 +00:00
Lang Hames	33dd98e9e4	[ORC] Remove ORC RPC. With the removal of OrcRPCExecutorProcessControl and OrcRPCTPCServer in `6aeed7b19c` the ORC RPC library no longer has any in-tree users. Clients needing serialization for ORC should move to Simple Packed Serialization (usually by adopting SimpleRemoteEPC for remote JITing).	2021-10-01 11:17:33 -07:00
Lei Zhang	a3f425946d	[mlir][linalg] Include InitTensorOp in tiling canonicalization Tiling can create dim ops and those dim ops can take `InitTensorOp` as input. Including it in the tiling canonicalization patterns allows us to fold those dim ops away. Also sorted the existing ops along the way. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D110876	2021-10-01 14:13:19 -04:00
Arthur Eubanks	a7b4ce9cfd	[NFC][AttributeList] Replace index_begin/end with an iterator We expose the fact that we rely on unsigned wrapping to iterate through all indexes. This can be confusing. Rather, keeping it as an implementation detail through an iterator is less confusing and is less code. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D110885	2021-10-01 10:17:41 -07:00
Jay Foad	31c92d515d	[MachineLoopInfo] Enable machine verification after this pass Enabling this does not show any problems in check-llvm in an LLVM_ENABLE_EXPENSIVE_CHECKS build. Differential Revision: https://reviews.llvm.org/D110703	2021-10-01 18:15:57 +01:00
Jay Foad	04787239c9	[LiveVariables] Skip verification of kills inside bundles LiveVariables does not examine the contents of bundles, so MachineVerifier should not expect it to know about kill flags on operands of instructions inside a bundle. With this fix we can enable machine verification after running the LiveVariables analysis. Doing this does not show any problems in check-llvm in an LLVM_ENABLE_EXPENSIVE_CHECKS build. Differential Revision: https://reviews.llvm.org/D110700	2021-10-01 18:15:57 +01:00
Jay Foad	08d41f75d9	[UnreachableMachineBlockElim] Enable machine verification after this pass Enabling this does not show any problems in check-llvm in an LLVM_ENABLE_EXPENSIVE_CHECKS build. Differential Revision: https://reviews.llvm.org/D110697	2021-10-01 18:15:57 +01:00
Jay Foad	2bfe777a45	[ProcessImplicitDefs] Enable machine verification after this pass Enabling this does not show any problems in check-llvm in an LLVM_ENABLE_EXPENSIVE_CHECKS build. Differential Revision: https://reviews.llvm.org/D110695	2021-10-01 18:15:56 +01:00
Jay Foad	fd8e99700d	[DetectDeadLanes] Enable machine verification after this pass Machine verification after DetectDeadLanes has been disabled since the pass was first added in D18427, but I guess this was just due to copy- and-paste. Enabling it does not show any problems in check-llvm in an LLVM_ENABLE_EXPENSIVE_CHECKS build. Differential Revision: https://reviews.llvm.org/D110689	2021-10-01 18:15:56 +01:00
Arthur O'Dwyer	2a6b99d5f8	[libc++] Revert the part of my `b82683b` that affected <version>. This reverts part of commit `b82683b2eb`. I hadn't intended to remove the `// -- C++ --` comment line from `libcxx/include/version`, only from the generated tests. Thanks to Raul Tambre for the catch.	2021-10-01 13:01:51 -04:00
Lang Hames	6aeed7b19c	[ORC] Remove OrcRPCExecutorProcessControl ad OrcRPCTPCServer. All in-tree tools have moved to SimpleRemoteEPC.	2021-10-01 10:00:20 -07:00
Kazu Hirata	4f0225f6d2	[Transforms] Migrate from getNumArgOperands to arg_size (NFC) Note that getNumArgOperands is considered a legacy name. See llvm/include/llvm/IR/InstrTypes.h for details.	2021-10-01 09:57:40 -07:00
zhijian	5b44c716ee	[AIX]implement the --syms and using "symbol index and qualname" for --sym --symbol--description for llvm-objdump for xcoff Summary: for xcoff : implement the getSymbolFlag and getSymbolType() for option --syms. llvm-objdump --sym , if the symbol is label, print the containing section for the symbol too. when using llvm-objdump --sym --symbol--description, print the symbol index and qualname for symbol. for example: --symbol-description 00000000000000c0 l .text (csect: (idx: 2) .foov[PR]) (idx: 3) .foov and without --symbol-description 00000000000000c0 l .text (csect: .foov) .foov Reviewers: James Henderson,Esme Yi Differential Revision: https://reviews.llvm.org/D109452	2021-10-01 12:37:51 -04:00
Roman Lebedev	907d5da3da	[NFC][Codegen][X86] Drop unused check prefixes in newly added tests	2021-10-01 19:35:14 +03:00
Michał Górny	bd21257bf5	[lldb] [Host] Fix flipped logic in TerminalState::Save()	2021-10-01 18:23:54 +02:00
Arthur O'Dwyer	c333505fa5	[libc++] [test] Remove filenames from copyright headers. NFCI. Discussed in D110794.	2021-10-01 12:14:00 -04:00
Anna Thomas	f98a918d34	[TrivialDeadness] Update function comment isInstructionTriviallyDead also works for certain side-effecting instructions. Update incorrect comment (as suggested in D109917).	2021-10-01 12:09:26 -04:00
Peyton, Jonathan L	343b9e8590	[OpenMP][host runtime] Introduce kmp_cpuinfo_flags_t to replace integer flags Store CPUID support flags as bits instead of using entire integers. Differential Revision: https://reviews.llvm.org/D110091	2021-10-01 11:08:39 -05:00
Peyton, Jonathan L	957b4c5750	[OpenMP][testing] increase threshold for omp_get_wtime test	2021-10-01 11:07:41 -05:00
Arthur O'Dwyer	d4b59a05fc	[libc++] Remove "// -- C++ --" comments from all .cpp files. NFCI. Even if these comments have a benefit in .h files (for editors that care about language but can't be configured to treat .h as C++ code), they certainly have no benefit for files with the .cpp extension. Discussed in D110794.	2021-10-01 12:06:59 -04:00
Arthur O'Dwyer	b82683b2eb	[libc++] [test] Remove "// -- C++ --" comments from generated .cpp files. Even if these comments have a benefit in .h files (for editors that care about language but can't be configured to treat .h as C++ code), they certainly have no benefit for files with the .cpp extension. Discussed in D110794.	2021-10-01 12:04:19 -04:00
Lang Hames	d908118b8a	[llvm-jitlink] Fix a FIXME. ORC errors preserve the SymbolStringPool since `6fe2e9a9cc`, so we can stop bailing out early.	2021-10-01 08:49:51 -07:00
Roman Lebedev	657e1e316b	[NFC][X86][Codegen] Add test coverage for interleaved i8 load/store stride=3	2021-10-01 18:47:25 +03:00
Roman Lebedev	727a359979	[NFC][X86][LV] Improve costmodel test coverage for interleaved i8 load/store stride=3	2021-10-01 18:47:25 +03:00
Manas	a3d0b5805e	[analyzer] Fix deprecated plistlib functions It replaces the usage of readPlist,writePlist functions with load,dump in plistlib package. This fixes deprecation issues when analyzer reports are being generated outside of docker. Patch by Manas! Reviewed By: steakhal Differential Revision: https://reviews.llvm.org/D107312	2021-10-01 17:07:24 +02:00
LLVM GN Syncbot	9c49587cc1	[gn build] Port `47d6274d4c`	2021-10-01 14:57:31 +00:00
Sanjay Patel	baac82b4cf	[InstCombine] add tests for icmp of gep; NFC	2021-10-01 10:53:23 -04:00
Daniel Kiss	532783f9e1	[libunwind] Fix cfi_register for float registers. Fixes D110144. registers.getFloatRegister is not const in ARM therefor can't be called here. Reviewed By: mstorsjo, #libunwind Differential Revision: https://reviews.llvm.org/D110731	2021-10-01 16:51:51 +02:00
Daniil Fukalov	47d6274d4c	[NFC][AMDGPU] Reduce includes dependencies, part 2 1. Splitted out some parts of R600 target to separate modules/headers. 2. Reduced some include lists in headers. 3. Minor forward declarations, redundant includes and flags in GCNSubtarget cleanup. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D109351	2021-10-01 17:50:20 +03:00
Roman Lebedev	3e260efdfc	[X86][Costmodel] Load/store i64/f64 Stride=2 VF=16 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/1WMTojvfW - for intels `Block RThroughput: =16.0`; for ryzens, `Block RThroughput: <=8.0` So pick cost of `16`. For store we have: https://godbolt.org/z/1WMTojvfW - for intels `Block RThroughput: =16.0`; for ryzens, `Block RThroughput: <=16.0` So pick cost of `16`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110840	2021-10-01 17:48:14 +03:00
Roman Lebedev	abd37de63e	[X86][Costmodel] Load/store i64/f64 Stride=2 VF=8 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/PGYbYKPq8 - for intels `Block RThroughput: =8.0`; for ryzens, `Block RThroughput: <=4.0` So pick cost of `8`. For store we have: https://godbolt.org/z/PGYbYKPq8 - for intels `Block RThroughput: =8.0`; for ryzens, `Block RThroughput: <=8.0` So pick cost of `8`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110838	2021-10-01 17:48:14 +03:00
Roman Lebedev	71bc31b907	[X86][Costmodel] Load/store i64/f64 Stride=2 VF=4 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/j5co1qWEW - for intels `Block RThroughput: =4.0`; for ryzens, `Block RThroughput: <=2.0` So pick cost of `4`. For store we have: https://godbolt.org/z/j5co1qWEW - for intels `Block RThroughput: =4.0`; for ryzens, `Block RThroughput: <=4.0` So pick cost of `4`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110837	2021-10-01 17:48:14 +03:00
Roman Lebedev	612e5b05a2	[X86][Costmodel] Load/store i64/f64 Stride=2 VF=2 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/8a1cfGeMn - for intels `Block RThroughput: =2.0`; for ryzens, `Block RThroughput: =1.0` So pick cost of `2`. For store we have: https://godbolt.org/z/jMdcM47bx - for intels `Block RThroughput: =2.0`; for ryzens, `Block RThroughput: <=2.0` So pick cost of `2`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110835	2021-10-01 17:48:14 +03:00
Roman Lebedev	ea76cb87ee	[X86][Costmodel] Load/store i32/f32 Stride=2 VF=32 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 Here for `store` pattern we are starting to have spilling, so accurate modelling may be problematic, although if i drop the spilling, the measurements don't change. For load we have: https://godbolt.org/z/1oTTnncbx - for intels `Block RThroughput: =16.0`; for ryzens, `Block RThroughput: <=8.0` So pick cost of `16`. For store we have: https://godbolt.org/z/1oTTnncbx - for intels `Block RThroughput: =16.0`; for ryzens, `Block RThroughput: =8.0` So pick cost of `16`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110761	2021-10-01 17:48:14 +03:00
Roman Lebedev	80cd8da78d	[X86][Costmodel] Load/store i32/f32 Stride=2 VF=16 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/M9eev3xe8 - for intels `Block RThroughput: =8.0`; for ryzens, `Block RThroughput: <=4.0` So pick cost of `8`. For store we have: https://godbolt.org/z/M9eev3xe8 - for intels `Block RThroughput: =8.0`; for ryzens, `Block RThroughput: =4.0` So pick cost of `8`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110756	2021-10-01 17:48:14 +03:00
Roman Lebedev	3a0643e9c2	[X86][Costmodel] Load/store i32/f32 Stride=2 VF=8 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/n8aMKeo4E - for intels `Block RThroughput: =4.0`; for ryzens, `Block RThroughput: <=2.0` So pick cost of `4`. For store we have: https://godbolt.org/z/n8aMKeo4E - for intels `Block RThroughput: =4.0`; for ryzens, `Block RThroughput: =2.0` So pick cost of `4`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110755	2021-10-01 17:48:13 +03:00
Roman Lebedev	b12aeaec9a	[X86][Costmodel] Load/store i32/f32 Stride=2 VF=4 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/EM5Ean7bd - for intels `Block RThroughput: =2.0`; for ryzens, `Block RThroughput: =1.0` So pick cost of `2`. For store we have: https://godbolt.org/z/EM5Ean7bd - for intels `Block RThroughput: =2.0`; for ryzens, `Block RThroughput: <=2.0` So pick cost of `2`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110754	2021-10-01 17:48:13 +03:00
Roman Lebedev	f44d9009c2	[X86][Costmodel] Load/store i32/f32 Stride=2 VF=2 interleaving costs The only sched models that for cpu's that support avx2 but not avx512 are: haswell, broadwell, skylake, zen1-3 For load we have: https://godbolt.org/z/4rY96hnGT - for intels `Block RThroughput: =2.0`; for ryzens, `Block RThroughput: =1.0` So pick cost of `2`. For store we have: https://godbolt.org/z/vbo37Y3r9 - for intels `Block RThroughput: =1.0`; for ryzens, `Block RThroughput: =0.5` So pick cost of `1`. I'm directly using the shuffling asm the llc produced, without any manual fixups that may be needed to ensure sequential execution. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D110753	2021-10-01 17:48:13 +03:00
LLVM GN Syncbot	4f0a39b9b4	[gn build] Port `fb99424a6f`	2021-10-01 14:29:37 +00:00
Anirudh Prasad	fb99424a6f	[SystemZ][z/OS] Introduce initial support for GOFF asm parser - Introduce a skeleton outline for the GOFFAsmParser - Before instantiating AsmParser/HLASMAsmParser, target specific asm parsers are attempted to be initialized first before proceeding. If it doesn't exist for a particular file type, we report a fatal error. - This patch allows to properly instantiate the HLASMAsmParser on z/OS, and ensures we can write lit tests and unit tests which will involve the instantiation of asm parsers, without an assert / fatal error. Reviewed By: uweigand, Kai Differential Revision: https://reviews.llvm.org/D110730	2021-10-01 10:29:14 -04:00
Eric Schweitz	2372249d86	[fir] Remove obsolete fir.negf and fir.modf ops fir.negf op is replaced by mlir.negf and fir.modf is just deleted. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D110932 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: Valentin Clement <clementval@gmail.com>	2021-10-01 16:11:36 +02:00
Jon Chesterfield	05ba9ff6a6	[libomptarget][amdgpu] Refactor memory pool collection	2021-10-01 14:58:01 +01:00
Tobias Gysi	bf28849745	[mlir][linalg] Retire PoolingMaxOp/PoolingMinOp/PoolingSumOp. The pooling ops are among the last remaining hard coded Linalg operations that have no region attached. They got obsolete due to the OpDSL pooling operations. Removing them allows us to delete specialized code and tests that are not needed for the OpDSL counterparts that rely on the standard code paths. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110909	2021-10-01 13:51:56 +00:00
Pavel Labath	633ac51709	[lldb] Simplify TestCompletion.py	2021-10-01 15:49:23 +02:00
Fraser Cormack	52c60459f5	[RISCV][NFC] Reformat a line of frame lowering code	2021-10-01 14:25:47 +01:00
Nico Weber	ec4a822866	[clang] Try to unbreak crash-report.cpp on PS4 bot after `8dfbe9b0a` Looks like exceptions are off-by-default with the PS4 triple. Since adding -fexceptions defeats the purpose of the test change in `8dfbe9b0a`, pass an explicit triple instead.	2021-10-01 09:33:13 -04:00
Nemanja Ivanovic	369d785574	[PowerPC] Optimal sequence for doubleword vec_all_{eq\|ne} on Power7 These builtins produce inefficient code for CPU's prior to Power8 due to vcmpequd being unavailable. The predicate forms can actually leverage the available vcmpequw along with xxlxor to produce a better sequence.	2021-10-01 08:27:15 -05:00
Michał Górny	12ee4c9ad8	[lldb] [test] Delay pty/tty imports to fix Windows builds Delay pty/tty module imports until they are actually used, in order to prevent their import failures on Windows.	2021-10-01 15:25:35 +02:00

1 2 3 4 5 ...

400553 Commits All Branches Search

400553 Commits

All Branches