Stephen Atherton
|
7c1dc305cb
|
Merge commit 'a72c8f5cb2e79a673abc0ed3d27ef1c51028fb13' into feature-redwood
|
2018-10-05 10:15:10 -07:00 |
Stephen Atherton
|
5d9cd9acdc
|
Correctness test now has additional random reader which doesn't do verification but isn't stopped when the btree is closed. Fixed bug exposed by this where pager snapshots will still try to read pages after the pager has been shut down or even destroyed. Added new error type, shutdown_in_progress.
|
2018-10-04 23:46:37 -07:00 |
A.J. Beamon
|
a72c8f5cb2
|
Merge pull request #811 from apple/release-6.0
Merge release-6.0 into master
|
2018-10-04 13:30:49 -07:00 |
A.J. Beamon
|
d02b6bf211
|
Merge pull request #810 from apple/release-5.2
Merge release-5.2 into release-6.0
|
2018-10-04 13:30:26 -07:00 |
A.J. Beamon
|
b3912b3918
|
Merge pull request #809 from apple/release-5.1
Merge release-5.1 into release-5.2
|
2018-10-04 13:29:58 -07:00 |
A.J. Beamon
|
e7718724fb
|
Merge pull request #808 from ajbeamon/release-5.1
Fix typo
|
2018-10-04 13:29:27 -07:00 |
A.J. Beamon
|
abaefbe294
|
Fix typo
|
2018-10-04 13:28:35 -07:00 |
A.J. Beamon
|
fc0dbe7c66
|
Merge pull request #807 from apple/release-6.0
Merge release-6.0 into master
|
2018-10-04 13:26:32 -07:00 |
A.J. Beamon
|
d7cf6a7759
|
Merge pull request #806 from apple/release-5.2
Merge release-5.2 into release-6.0
|
2018-10-04 13:25:13 -07:00 |
A.J. Beamon
|
1b7aac6f5a
|
Merge pull request #805 from apple/release-5.1
Merge release-5.1 into release-5.2
|
2018-10-04 13:24:32 -07:00 |
A.J. Beamon
|
f215ae037a
|
Merge pull request #804 from apple/release-5.1
Merge release-5.1 into release-5.2
|
2018-10-04 13:23:16 -07:00 |
Evan Tschannen
|
fc098586a1
|
Merge pull request #802 from etschannen/master
Merge 6.0 into master
|
2018-10-03 16:58:14 -07:00 |
Evan Tschannen
|
3922e477a5
|
Merge branch 'release-6.0'
# Conflicts:
# documentation/sphinx/source/release-notes.rst
# fdbclient/ManagementAPI.actor.cpp
# fdbserver/ClusterController.actor.cpp
# fdbserver/DataDistribution.actor.cpp
# fdbserver/LogSystemDiskQueueAdapter.actor.cpp
# fdbserver/SimulatedCluster.actor.cpp
# fdbserver/TLogServer.actor.cpp
|
2018-10-03 16:57:18 -07:00 |
Evan Tschannen
|
9de55f362b
|
Merge pull request #793 from ajbeamon/add-new-storage-status-metrics
Add new metrics for bytes queried, keys queried, mutation bytes, muta…
|
2018-10-03 16:34:26 -07:00 |
Evan Tschannen
|
5b7ca7e4b1
|
Merge pull request #796 from brownleej/monitor-conf-system-key-range
Adds a key range for storing changes to monitor conf files.
|
2018-10-03 16:33:05 -07:00 |
Evan Tschannen
|
a91f59209b
|
Merge pull request #799 from ajbeamon/release-5.1
Add release notes for 5.0.8 to old release notes
|
2018-10-03 16:32:28 -07:00 |
Evan Tschannen
|
598788f60b
|
Merge pull request #801 from etschannen/feature-fix-forced-recovery
Fixed a number of problems with forced recoveries
|
2018-10-03 16:32:03 -07:00 |
Evan Tschannen
|
4bec4927aa
|
updated release notes for 6.0.12
|
2018-10-03 14:32:28 -07:00 |
Evan Tschannen
|
636420abee
|
fix: if the disk queue adapter peek hangs for a while, switch to a peek from a different locality
|
2018-10-03 13:58:55 -07:00 |
Evan Tschannen
|
28545e0f8d
|
multi cursors start a get more for the first 10 cursors to hide latency
|
2018-10-03 13:57:45 -07:00 |
Evan Tschannen
|
aa51d69b2d
|
fix: set peekLocality for upgraded tags
|
2018-10-03 13:54:59 -07:00 |
A.J. Beamon
|
87ff41006b
|
Merge pull request #800 from ajbeamon/master
Fix API version in old release notes link
|
2018-10-03 12:35:43 -07:00 |
A.J. Beamon
|
efb081ad86
|
Fix API version in old release notes link
|
2018-10-03 12:33:35 -07:00 |
Stephen Atherton
|
c5420218d0
|
Missed changes from previous checkin which stops freeing physical pages after a latest version logical page rewrite.
|
2018-10-03 12:10:07 -07:00 |
A.J. Beamon
|
54c240cbed
|
Add release notes for 5.0.8 to old release notes
|
2018-10-03 08:51:47 -07:00 |
A.J. Beamon
|
02f409b235
|
Merge pull request #798 from ajbeamon/master
Preliminary setup for 6.1.0 release notes
|
2018-10-03 08:34:13 -07:00 |
A.J. Beamon
|
14a0427462
|
Preliminary setup for 6.1.0 release notes
|
2018-10-03 08:32:19 -07:00 |
Evan Tschannen
|
c9f4109539
|
fix: add some additional time in the kill region workload to detect if we recovered successfully
|
2018-10-02 17:47:15 -07:00 |
Evan Tschannen
|
cdaf5e1192
|
fix: forced recovery does not recover tags from any DC besides the surviving one
|
2018-10-02 17:46:22 -07:00 |
Evan Tschannen
|
69711a107b
|
fix: because of forced recovery, 0 log router tags does not mean we are a special tlog set
|
2018-10-02 17:45:11 -07:00 |
Evan Tschannen
|
e7e1c634e0
|
fix: we need to restart the peek cursor when the known committed version becomes available
|
2018-10-02 17:44:14 -07:00 |
Evan Tschannen
|
a92fc911ac
|
do not spin on a failed storage server recruitment
|
2018-10-02 17:31:07 -07:00 |
Evan Tschannen
|
dd9c6856e1
|
fix: forced recovery is not safe to send multiple times
|
2018-10-02 17:29:39 -07:00 |
Evan Tschannen
|
15ce215c1b
|
fix: parallel peek requests leaked memory
|
2018-10-02 17:28:39 -07:00 |
Stephen Atherton
|
4ad44e72dd
|
Workaround for issue where calling sync on a cached file that doesn't exist yet causes an io_error after a simulated machine kill. Temporarily disabled freeing physical pages on logical page replacement because it is a bug as currently implemented: it will reuse physical pages before the remap of its old logical page to a new physical page is durable.
|
2018-10-02 17:00:35 -07:00 |
Stephen Atherton
|
15c1f87681
|
Rewrite superpage writes to fix bugs involving logical ID reuse and to make the logic cleaner. Correctness test is stronger, with pipelined writes/commits during verification.
|
2018-10-01 16:51:57 -07:00 |
A.J. Beamon
|
84c2e3567f
|
Fix keys queried to use the RowsQueried metric instead of BytesQueried.
|
2018-10-01 11:19:28 -07:00 |
John Brownlee
|
2beeadf8be
|
Adds a key range for storing changes to monitor conf files.
|
2018-10-01 10:49:02 -07:00 |
A.J. Beamon
|
a98fcf5972
|
Rename durable_lag to durability_lag
|
2018-10-01 09:58:49 -07:00 |
Evan Tschannen
|
bd6b743a81
|
fix: the storage server must always keep MAX_READ_TRANSACTION_LIFE_VERSIONS of history in memory, because forced recovery could roll back an epoch end.
fix: rollbacks were triggered unnecessarily
|
2018-09-28 16:04:59 -07:00 |
Evan Tschannen
|
3fdf72c626
|
fix: we need to force recovery if the master is still attempting to read the txs tag
|
2018-09-28 13:33:33 -07:00 |
Evan Tschannen
|
59335aa757
|
fix: the latest generation of remote transaction logs might have less data than a previous generation, because they take over at known committed version. Detect this case and end at the version that has the most data
|
2018-09-28 12:25:27 -07:00 |
Evan Tschannen
|
c577840020
|
fix: forced recovery should remove all references to the old primary tlogs in all generations of logs to help the peek logic avoid attempting to read from them
|
2018-09-28 12:23:09 -07:00 |
Evan Tschannen
|
05e7f08b26
|
added a peek method which will attempt to read the txsTag from the local region as much as possible
|
2018-09-28 12:21:08 -07:00 |
Evan Tschannen
|
a24eadd73a
|
fix: for remote logs, their known committed version cannot be set to 1, because they can be used when their durable version is 0, leading to a known committed version being greater than a queue committed version
|
2018-09-28 12:17:21 -07:00 |
Evan Tschannen
|
e64c55dce0
|
fix: data distribution would sometimes use the wrong priority when fixing an incomplete movement; this led to the cluster thinking the data was replicated in all regions before it actually was
|
2018-09-28 12:15:23 -07:00 |
Evan Tschannen
|
b1fe069165
|
fix: during forced recovery logs can be removed from the logSystemConfig. We need to avoid killing the removed logs as unneeded until we actually complete the recovery
|
2018-09-28 12:13:46 -07:00 |
Evan Tschannen
|
22e6afbb18
|
fix: the cluster controller did not pass in its own locality when creating its database object, therefore it was not using locality aware load balancing
|
2018-09-28 12:12:06 -07:00 |
Evan Tschannen
|
b560b94ebc
|
fix: do not force a recovery if the master was already in the other region (and therefore already recovered)
fix: reboot the remaining DC, because any storage server rejoins that were rolled back will cause that server to be unusable
|
2018-09-28 12:10:04 -07:00 |
Stephen Atherton
|
a9f467c502
|
Btree correctness verification of contents is now asynchronous.
|
2018-09-28 00:35:03 -07:00 |