Commit Graph

25355 Commits

Author SHA1 Message Date
A.J. Beamon acf1dfb245 Fix formatting 2023-02-24 13:00:57 -08:00
A.J. Beamon 4a38bb4c3f Allow performing assert comparisons (e.g. ASSERT_EQ) with any traceable type 2023-02-24 12:53:01 -08:00
Dan Adkins e3a61b9b22
Add metrics to understand tail commit latency (#9435)
* Add server-side latency metrics for Resolver requests.

* Add separate resolver latency metrics for queue wait and compute time.

* Add histogram for queue depth observed on resolver (during metrics interval).

* Fix tlog latency measurement to use timer() instead of now().
2023-02-24 14:13:12 -05:00
Xiaoxi Wang 998a5b7c0e enable read-aware DD by default and write release notes/doc 2023-02-24 11:11:25 -08:00
Markus Pilman 01fab190b3 add arrayGenerator for w|rw option 2023-02-24 12:01:43 -07:00
Evan Tschannen f3673d808b Replaced the fetchKeysParallelismFullLock with a lock specifically for change feeds to avoid blocking fetches on idle clusters 2023-02-24 10:59:35 -08:00
Jingyu Zhou 9a257a60a4 Address review comments 2023-02-24 10:47:32 -08:00
Markus Pilman e0df60bd92
Apply suggestions from code review
Co-authored-by: A.J. Beamon <aj.beamon@snowflake.com>
2023-02-24 11:33:29 -07:00
Markus Pilman 6c15506c36 Fixed tests 2023-02-24 11:32:37 -07:00
A.J. Beamon 2d936acbdd Fix memory error from using invalidated iterator 2023-02-24 10:25:38 -08:00
A.J. Beamon 03fbc59bb1
Merge pull request #9461 from sfc-gh-ajbeamon/metacluster-concurrent-restore-testing
Metacluster concurrent restore testing
2023-02-24 09:13:51 -08:00
Markus Pilman ee9d511d16
Merge pull request #9463 from sfc-gh-mpilman/buildcop/2023-02-23/bugfixes/arm-awssdk
Fix build issue with awssdk_target
2023-02-24 09:20:08 -07:00
Josh Slocum 910965a5a6
Adding additional blob granule authz tests (#9443)
* added granule location authz tests

* added authz test for blob worker endpoint

* addressing comments

* fixing ide build
2023-02-24 09:32:05 -06:00
A.J. Beamon 344f6977c9 Restores can now throw a cluster_already_exists error in the metacluster management workload if we timeout a restore and have to retry 2023-02-23 20:32:18 -08:00
Nim Wijetunga 29819b0645
Change Feed Bug Fix + Encryption Asserts (#9457)
* add encryption asserts

* modify function name

* address pr comments

* address pr comments

* Trigger Build
2023-02-23 19:33:25 -08:00
A.J. Beamon 2b25cfef8b Merge branch 'main' into metacluster-concurrent-restore-testing 2023-02-23 16:06:47 -08:00
Jingyu Zhou 82bd5796f5 Fix merge conflicts 2023-02-23 16:04:37 -08:00
Jingyu Zhou 67f84cc802
Merge pull request #9448 from xis19/main
Provide a tool that allows downloading logs when simulation RocksDB f…
2023-02-23 15:55:22 -08:00
Jingyu Zhou 0b2e02c402 Fix rare test failures
Unclog after DB is recovered, otherwise another recovery may become stuck again.
2023-02-23 15:42:33 -08:00
Jon Fu 33f8e90f9f
Split tenant group metadata (#9446)
* initial commit to split tenant group metadata

* attempt to fix merge errors

* fix compile errors and adjust existing tests

* fix infinite loop and extra ACTOR tag

* direct assignment instead of store

* direct assign instead of store (missed a few)
2023-02-23 18:11:49 -05:00
A.J. Beamon 3ac7e17b79 Fix create tenant usage in tenant management workload 2023-02-23 15:08:52 -08:00
Jingyu Zhou 65443b6541 Fix compiling errors 2023-02-23 15:02:44 -08:00
Jingyu Zhou 40c7cfec0c Move ClogTlog.toml to rare 2023-02-23 14:31:47 -08:00
Jingyu Zhou ecae81882c Change to only clog once for a particular tlog
If we repeat clogging, different tlogs may be excluded, which can cause the
recovery to stuck.
2023-02-23 14:31:39 -08:00
Jingyu Zhou 6055f752c2 Exclude failed tlog if recovery stuck more than 30s
Because the tlog is clogged, recovery can stuck in initializing_transaction_servers.
This exclude allows the recovery to complete.
2023-02-23 14:31:32 -08:00
Jingyu Zhou c4773b7cc8 Update clogTlog workload to be single region 2023-02-23 14:31:24 -08:00
Jingyu Zhou 955826f2fe Add ClogTlog workload 2023-02-23 14:31:12 -08:00
Jingyu Zhou 1f1dc5e768 Allow a comma separated list of excluded addresses 2023-02-23 14:29:08 -08:00
Jingyu Zhou 6ac8720364 Add exclude to fdbcli's configure command
Right now this only allows one server address being excluded. This is useful
when the database is unavailable but we want the recruitment to skip some
particular processes.

Manually tested the concept works with a loopback cluster.
2023-02-23 14:28:20 -08:00
Markus Pilman c1f80fe471 Merge remote-tracking branch 'origin/main' into features/tenant-lock-fdbcli 2023-02-23 15:16:14 -07:00
Jingyu Zhou 792950dbdc
Merge pull request #9434 from sfc-gh-huliu/splitmetrics
Implement SplitMetric pagination in blob migrator
2023-02-23 14:10:27 -08:00
Markus Pilman 1862e65415 Fix build issue with awssdk_target 2023-02-23 15:05:17 -07:00
Markus Pilman 8759fd8f12 Fix refactoring mistake 2023-02-23 14:41:27 -07:00
A.J. Beamon b828f3f257 Add missing change to explicit TenantMapEntry conversion 2023-02-23 13:38:04 -08:00
A.J. Beamon 54955d54f2 Don't allow repopulating from a management cluster if there is another ID registered for the same cluster. Instead, the cluster must be unregistered first before repopulating from it. Also improves a trace event. 2023-02-23 13:28:10 -08:00
A.J. Beamon c2d28377af Set the restore ID in the data cluster after marking the cluster restoring in the management cluster 2023-02-23 13:28:10 -08:00
A.J. Beamon 6adccdafa9 Add a conflict range on the active restore ID when setting it 2023-02-23 13:28:10 -08:00
A.J. Beamon 537834ef00 Properly initialize API version of simulated MVC clusters when calling openDatabase 2023-02-23 13:28:10 -08:00
A.J. Beamon 06fe00544a Remove TenantMapEntry <-> MetaclusterTenantMapEntry conversion constructors and use named functions instead 2023-02-23 13:28:10 -08:00
A.J. Beamon dcae48cbbd Add concurrent restore testing to the metacluster restore workload 2023-02-23 13:27:20 -08:00
Markus Pilman 193e517cc4 Address review comments and move lock ID into TenantMapEntry 2023-02-23 14:25:36 -07:00
A.J. Beamon e151a2d363
Merge pull request #9451 from sfc-gh-ajbeamon/metacluster-management-workload-restore-support
Improve restore support in the metacluster management workload
2023-02-23 13:10:31 -08:00
Xiaoge Su 4c9c357d2c Change the storage directory to joshua/ensembles/results/applications/simulation_logs 2023-02-23 13:05:20 -08:00
Markus Pilman efc5bf9ee8
Merge pull request #9456 from sfc-gh-ajbeamon/smaller-tenant-in-txn-state-store
Store a smaller tenant object in the txn state store
2023-02-23 14:00:12 -07:00
A.J. Beamon 9e9a31c0f1 Use error variable consistently 2023-02-23 11:27:53 -08:00
Xiaoge Su 6408208c38
Update contrib/joshua_logtool.py
Co-authored-by: Jingyu Zhou <jingyuzhou@gmail.com>
2023-02-23 10:49:11 -08:00
Evan Tschannen cf3a4e6161 Merge branch 'main' into feature-change-feed-cache 2023-02-23 10:16:13 -08:00
Jingyu Zhou 3d8b8a2a05
Merge pull request #9450 from sfc-gh-ahusain/ahusain-misc-fixes
EaR: RESTClient and EKP changes to handle unreachable external KMS
2023-02-23 10:04:12 -08:00
Evan Tschannen a581a55452 ensure a worker cannot run multiple blob worker roles 2023-02-23 09:51:26 -08:00
A.J. Beamon dd650215d4 Store a smaller tenant object in the txn state store 2023-02-23 09:29:33 -08:00