Commit Graph

26449 Commits

Author SHA1 Message Date
Trevor Clinkenbeard 4ec3eac0e8
Merge pull request #10572 from sfc-gh-tclinkenbeard/main-decouple-token-bucket-knobs
Decouple token bucket knobs for different types of throttlers
2023-06-28 12:29:31 -07:00
Hui Liu 66a7acd960
Fix blob restore stuck issue (#10574) 2023-06-28 10:23:11 -07:00
sfc-gh-tclinkenbeard 36542c139d Update test parameters for /GrvProxyTagThrottler/LimitedIdleBudget test
This is to match the updated default value of
TAG_THROTTLE_MAX_EMPTY_QUEUE_BUDGET.
2023-06-28 09:24:32 -07:00
sfc-gh-tclinkenbeard 4f711b15cd Fix comments for proxy knobs 2023-06-28 01:27:35 -07:00
sfc-gh-tclinkenbeard 86fae8b7b5 Increase default value of TAG_THROTTLE_MAX_EMPTY_QUEUE_BUDGET 2023-06-28 01:27:28 -07:00
sfc-gh-tclinkenbeard 6e86f94cb9 Decouple token bucket knobs for different types of throttlers 2023-06-28 01:27:19 -07:00
Jingyu Zhou cf3b5d8c57
Merge pull request #10568 from jzhou77/main 2023-06-27 17:25:51 -07:00
Jingyu Zhou bbf9632964 Fix joshua_logtool.py to allow . in ensemble ID
e.g., 20230627-102446-nightly-release-7.3-clang-45609535-729-20a8c
2023-06-27 15:09:47 -07:00
Jingyu Zhou 178cddbffd
Merge pull request #10563 from sfc-gh-etschannen/fix-durable-change-feeds
cancel durable change feed actors in DatabaseContext destructor
2023-06-27 13:39:20 -07:00
Hui Liu 788f3420c8
Fix tenant map update race when applying mutations (#10557) 2023-06-27 10:31:58 -07:00
Jingyu Zhou 321b77d809 Add 7.3 protocol version to transaction_profiling_analyzer.py
Manually tested the tool to be working.
2023-06-27 10:19:10 -07:00
Evan Tschannen b247f565b7 cancel durable change feed actors in DatabaseContext destructor 2023-06-27 09:22:47 -07:00
Yi Wu 92fc089eb0
FDBCORE-5211: Fix worker server handleIOErrors heap-use-after-free (#380) (#10555) 2023-06-26 16:35:33 -07:00
Jingyu Zhou e5728fcb84
Fix the version for deprecated keys (#10553) 2023-06-26 13:49:41 -07:00
A.J. Beamon da382372f0
Merge pull request #10550 from sfc-gh-ajbeamon/speedup-backup-correctness-clean
Speed up BackupCorrectnessClean
2023-06-26 09:48:18 -07:00
A.J. Beamon ecf8a5b3f6
Merge pull request #10542 from sfc-gh-ajbeamon/reduce-mutation-log-reader-correctness-duration
Decrease the number of records in MutationLogReaderCorrectness
2023-06-26 08:55:34 -07:00
He Liu 6337125712
Several minor improvements for ShardedRocksDB (#10520)
* Terminate DD if SHARD_ENCODE_LOCATION_METADATA is not enabled and storage_engine_type is ShardedRocksDB.

* Fixed Error in non-main thread.

* Minor improvements.
2023-06-24 16:07:14 -07:00
Zhe Wang 37689af3f2
Detect inconsistency of KeyServers and ServerKeys in real time (#10484)
* add framework

* add audit logic

* refactor audit loc metadata

* address comments

* add realtime audit timeout, add post validation logic

* fix input empty range to compareKeyServersAndServerKeys

* add context for auditKeyServersAndServerKeysInRealTime

* focus on moveShard

* remove space

* address comments

* cleanup

* add audit cleanup

* make validateRangeAssignment simple

* change trace name

* add shardAssigned

* stop DD when inconsistency detected

* fix ci

* small fix

* revert ss and auditUtl and simplify rt audit

* cleanup ss

* tiny change

* address comments and refactor code

* make auditLocationMetadataPreCheck retriable

* handle actor cancel in auditLocationMetadataPreCheck

* rm timeout and add new protection for failure of audit

* fix bugs

* import dataMoveId to validation

* improve trace event

* carefully propagate error and stop DD

* tiny fix

* small change

* remove a state var

* nit

* clean comments

* fmt
2023-06-23 17:40:21 -07:00
A.J. Beamon 155c03f6fe Decrease transaction rate of backup correctness clean to speed it up 2023-06-23 16:14:03 -07:00
Lukas Joswiak 8b8da372d5 Fix issue with inconsistent coordinator disk queue
An inconsistent coordinator disk queue could cause repeated reboots of
the coordinator process. In simulation, this could cause stuck runs if
another role was running on the coordinator process, as the other role
would also repeatedly reboot (specifically stateful roles like tlogs).

See the comment included in this commit for much more detail.
2023-06-23 15:37:31 -07:00
A.J. Beamon 09a0a60c07
Merge pull request #10546 from sfc-gh-ajbeamon/speedup-dr-upgrade
Speed up DR upgrade tests
2023-06-23 12:00:42 -07:00
A.J. Beamon 1af1346cde Change a buggify and decrease the number of transactions per second so that the dr upgrade test runs faster 2023-06-23 09:03:34 -07:00
A.J. Beamon daf20abeee Decrease the number of records in MutationLogReaderCorrectness to decrease its duration 2023-06-22 13:37:35 -07:00
Xiaoxi Wang 7d3cc86860
check serverList before update storage metadata (#10540) 2023-06-22 12:02:09 -07:00
Josh Slocum 356d0030a4
fixed consistency scan corruption check when worker is hard-killed in the middle, and improved debugging for stuck consistency scan (#10516) 2023-06-22 11:54:08 -05:00
A.J. Beamon b5497bdc3e
Merge pull request #10529 from sfc-gh-ajbeamon/test-metacluster-use-after-restore
Test use of the metacluster after a restore
2023-06-21 18:50:18 -07:00
Ata E Husain Bohra 370e39fbdb
Update SimKmsVault unit test assert checking max encryption keys (#10535)
Description

SimKmsVault unit test when run as part of simulation Random test,
based on the test order, SimKmsVaultKeyCtx can be initialized as
part of some other test (FlowSingleton).
Update the test to handle the scenario.

Testing

devRunCorrectness - 100K
2023-06-21 17:17:21 -07:00
Zhe Wu 8c796e6ff9
Merge pull request #10534 from halfprice/zhewu/gray-failure-recovery-cc
Cluster controller remove recovered peer in gray failure
2023-06-21 14:54:26 -07:00
Zhe Wu b89a50a37d Cluster controller remove recovered peer in gray failure 2023-06-21 13:02:59 -07:00
A.J. Beamon a2302e45ec
Merge pull request #10522 from sfc-gh-ajbeamon/fix-mapped-range-assert
Fix check in getExactRange that determines whether we can return early
2023-06-21 10:24:52 -07:00
Zhe Wu d1bafa7514
Merge pull request #10528 from halfprice/zhewu/nit-fixing
Nit fixing
2023-06-21 10:21:47 -07:00
A.J. Beamon 728b847bd6
Merge pull request #10526 from sfc-gh-yajin/mark-cp-rare-1
SNOW-XXXXXX Mark a rare code probe as rare
2023-06-21 09:41:29 -07:00
Zhe Wu 36befca2ba Nit fixing 2023-06-21 09:25:51 -07:00
A.J. Beamon cc68320333 Add testing that a metacluster can be used after a restore. Fix some bugs found by this related to the restore ID and tenant tombstones. 2023-06-21 08:59:22 -07:00
Zhe Wu aac6d95182
Merge pull request #10525 from halfprice/zhewu/turn-on-batch-empty-commit-by-default
Turn on batching empty peek by default
2023-06-21 07:57:17 -07:00
Zhe Wu f3e95f6698
Merge pull request #10521 from halfprice/zhewu/gray-failure-recovery
Worker health monitor tracks recovered peers
2023-06-21 07:56:55 -07:00
Yanqin Jin 75ab773819 SNOW-XXXXXX Mark a rare code probe as rare (#415) 2023-06-20 21:56:54 -07:00
Zhe Wu 514561e167 Turn on batching empty peek by default 2023-06-20 20:24:20 -07:00
Zhe Wu 8eb526684a
Merge pull request #10437 from halfprice/zhewu/one-shard-per-physical-shard
Add an option to limit the number of shard per team
2023-06-20 19:43:31 -07:00
Xiaoge Su c5f09b0921 fixup! Reformat source 2023-06-20 18:07:05 -07:00
Xiaoge Su de7db7ba14 fixup! reformat source 2023-06-20 18:07:05 -07:00
Xiaoge Su 63dde188dd Report missing attribute in StorageServer status 2023-06-20 18:07:05 -07:00
A.J. Beamon 75ec56bffb When redoing a key location request, wait until after we've checked whether we've satisfied our min rows 2023-06-20 16:02:12 -07:00
Zhe Wu 31de83ccc1 cleanup gray failure recovery documentation 2023-06-20 13:30:03 -07:00
Zhe Wu 4741d7032f Revert logic to test testing logic 2023-06-20 12:40:22 -07:00
Zhe Wu ef638cd6a6 Worker health monitor keeps track of recovered peer 2023-06-20 12:32:29 -07:00
Evan Tschannen 902d87a72c
Merge pull request #10515 from sfc-gh-etschannen/fix-ikeyvaluestore
fix IKeyValueStore include
2023-06-20 06:28:23 -07:00
Hui Liu af20493ad0
Move lastFlushTs to BlobGranuleBackupConfig (#10505) 2023-06-16 16:12:10 -07:00
Jefferson Zhong 13853c9f89 Move stepSize knob from ClientKnobs to ServerKnobs 2023-06-16 14:48:11 -07:00
Evan Tschannen 60cbc7ec8d fix formatting 2023-06-16 13:43:25 -07:00