Evan Tschannen
|
eb9114fd52
|
fixed a few problems with the consistency check
|
2018-07-09 15:46:22 -07:00 |
Evan Tschannen
|
0e1c32e1d2
|
fix: consistency check was broken when the key server key space is sharded
|
2018-07-09 15:46:17 -07:00 |
Evan Tschannen
|
ef6ccc7b9e
|
fix: consistency check was not checking for data inconsistencies
|
2018-07-09 15:46:12 -07:00 |
Evan Tschannen
|
6b40f2764d
|
fix: off-by-one error on popping missing tags
|
2018-07-09 15:43:22 -07:00 |
Alex Miller
|
c6133ba711
|
Merge pull request #579 from satherton/release-5.2-leaks
Destroy peers that are inactive
|
2018-07-09 15:21:24 -07:00 |
Evan Tschannen
|
e503dc975c
|
fix: destroy peers that are inactive
do not open new connections to send replies
|
2018-07-09 13:37:06 -07:00 |
Bhaskar Muppana
|
4a9feece79
|
Merge pull request #566 from alecgrieser/bump-api-version
Bump API version from 520 to 600
|
2018-07-09 13:35:31 -07:00 |
Alec Grieser
|
46b8612566
|
bump API version from 520 to 600
|
2018-07-09 11:00:57 -07:00 |
Evan Tschannen
|
f49569075c
|
Merge pull request #577 from etschannen/release-6.0
Added a trace event related to max_notifications being reached
|
2018-07-08 20:19:17 -07:00 |
Evan Tschannen
|
ef62c98ef5
|
Added a trace event related to max_notifications being reached
|
2018-07-08 20:18:24 -07:00 |
Evan Tschannen
|
1a1d0c36da
|
Merge pull request #576 from etschannen/release-6.0
Merge 5.2 into 6.0
|
2018-07-08 20:15:18 -07:00 |
Evan Tschannen
|
5a2cb3037b
|
merge 5.2 into 6.0
|
2018-07-08 20:14:06 -07:00 |
Evan Tschannen
|
d894f4cd52
|
Merge branch 'release-5.2' of github.com:apple/foundationdb into release-5.2
|
2018-07-08 20:12:36 -07:00 |
Evan Tschannen
|
3b62fa5b89
|
Merge pull request #574 from satherton/release-5.2-leaks
Old memory leak fixes
|
2018-07-08 20:11:43 -07:00 |
Evan Tschannen
|
2ac65fc708
|
Merge pull request #575 from etschannen/feature-remote-logs
Allow peers to be deleted when no longer used
|
2018-07-08 20:10:14 -07:00 |
Evan Tschannen
|
c5b5d02858
|
Merge branch 'release-5.2' of github.com:apple/foundationdb into release-5.2
|
2018-07-08 20:04:31 -07:00 |
Evan Tschannen
|
d42c9914d2
|
fix: future quiet databases need to be able to continue the reconfigure if the first one completes the repopulate but is cancelled before changing usable_regions
|
2018-07-08 19:56:55 -07:00 |
Evan Tschannen
|
ce6b0d4952
|
fix: consistency check must also configure usable regions to 1, because the remote log set might not be able to copy data
|
2018-07-08 18:25:01 -07:00 |
Evan Tschannen
|
380b2895f7
|
fix: we need to wait for the yield in the team tracker not just after the initial failure reaction delay, but also after zeroHealthyTeams changes
|
2018-07-08 17:44:19 -07:00 |
Evan Tschannen
|
0e97ce79b4
|
fix: destroy peers that are inactive
do not open new connections to send replies
|
2018-07-08 10:26:41 -07:00 |
Evan Tschannen
|
1c90e948f8
|
Merge pull request #573 from etschannen/feature-remote-logs
Data distribution bug fixes
|
2018-07-07 17:43:11 -07:00 |
Evan Tschannen
|
4dd18afb84
|
fix: we cannot make MAX_RECOVERY_VERSIONS lower than MAX_VERSIONS_IN_FLIGHT because we can mark a recovery as stalled before finishing the recovery, leading to an infinite loop of recoveries
|
2018-07-07 17:41:20 -07:00 |
Evan Tschannen
|
d6c6e7d306
|
fix: do not attempt data movement to an unhealthy destination team
allow building more teams than desired if all teams are unhealthy
bestTeamStuck is an error in simulation again
|
2018-07-07 16:51:16 -07:00 |
Stephen Atherton
|
a2f16e217e
|
Memory waste fix: when a Peer disconnects, an extra packet buffer block is allocated to copy unsent reliable bytes into, even if there aren't any.
|
2018-07-06 19:44:30 -07:00 |
Stephen Atherton
|
fee4234e6b
|
Bug fixes in memory activity logging.
|
2018-07-06 19:02:42 -07:00 |
Stephen Atherton
|
acee7eef1b
|
Merge branch 'release-5.2' of github.com:etschannen/foundationdb into release-5.2-leaks
|
2018-07-06 18:32:21 -07:00 |
Balachandar Namasivayam
|
3892aa7263
|
Merge pull request #572 from etschannen/feature-remote-logs
the cluster controller needs to update its priority immediately
|
2018-07-06 18:31:27 -07:00 |
Evan Tschannen
|
13fb59cf11
|
fix: the cluster controller needs to update its priority immediately
|
2018-07-06 18:29:54 -07:00 |
Evan Tschannen
|
cdafd542ee
|
fix: fixed a memory leak where leaderInfo notifications are not cleared out
|
2018-07-06 17:40:29 -07:00 |
Evan Tschannen
|
72ffdf7204
|
Merge pull request #571 from etschannen/feature-remote-logs
added a trace event for forced recovery
|
2018-07-06 17:11:05 -07:00 |
Evan Tschannen
|
f0494f18b1
|
added a trace event for forced recovery
|
2018-07-06 17:09:29 -07:00 |
Balachandar Namasivayam
|
d58ccab22a
|
Merge pull request #570 from etschannen/feature-remote-logs
properly handle zero logRouterTags
|
2018-07-06 16:58:51 -07:00 |
Evan Tschannen
|
43b5cb28ba
|
fix: properly handle zero logRouterTags, this is important for forced recovery
|
2018-07-06 16:52:25 -07:00 |
Evan Tschannen
|
6dbddab530
|
Merge pull request #568 from etschannen/feature-remote-logs
Many bug fixes
|
2018-07-06 16:12:56 -07:00 |
Evan Tschannen
|
2718176927
|
fix: remote logs did not pop all of the data for removed logs on recovery because data for the missing tag was not recorded yet at the time of recovery
|
2018-07-06 16:10:41 -07:00 |
Evan Tschannen
|
28c0d96c90
|
fix: treat the local region as best when version difference is too large
re-check requests when the version difference becomes small
|
2018-07-06 14:44:11 -07:00 |
Evan Tschannen
|
9015b8038f
|
io_error should cause the process to die and restart, to prevent repeated recruitment of a bad disk
|
2018-07-06 14:42:36 -07:00 |
Evan Tschannen
|
7d54ca4dc2
|
fix: errors from disk should trump errors from workers
|
2018-07-06 14:41:36 -07:00 |
Evan Tschannen
|
6d7172ef7e
|
fix: canKillProcesses did not take into account the remoteTLogPolicy when checking notEnoughLeft
|
2018-07-05 21:36:09 -07:00 |
Evan Tschannen
|
6f4ca2eba2
|
fix: get all processes did not include rebooting processes
|
2018-07-05 21:13:56 -07:00 |
Alex Miller
|
bb2eb2fe53
|
Merge pull request #565 from etschannen/feature-remote-logs
Simulation did not permanently kill machines in most tests
|
2018-07-05 15:07:28 -07:00 |
Alex Miller
|
5b12414b74
|
Merge pull request #564 from alexmiller-apple/tlsplugin
Fix dependencies for TLS library stuff
|
2018-07-05 14:32:45 -07:00 |
Evan Tschannen
|
cd4fb9285a
|
waitForExlusion requires both regions to be healthy, which is only possible if we do not kill all logs in a region
|
2018-07-05 14:04:42 -07:00 |
Alex Miller
|
6c98aa8aac
|
Fix not depending on FDBLibTLS.a and default TLS_LIBDIR to a sensible place.
|
2018-07-05 13:23:20 -07:00 |
Evan Tschannen
|
6cf5354425
|
checkSatelliteTagLocations is not an error if the same zoneId is used multiple times
|
2018-07-05 13:00:13 -07:00 |
Evan Tschannen
|
21347df254
|
fix: getting metrics did not handle broken_promise errors
|
2018-07-05 12:30:11 -07:00 |
Evan Tschannen
|
da5a232d7e
|
fix: If we have not recruited the remote logs yet and detect a configuration change, we must fail the master to update the remote recruitment request
|
2018-07-05 12:17:41 -07:00 |
Evan Tschannen
|
7315e5da55
|
fix: isExcluded and isCleared were exactly wrong
fix: isCleared should mean the process is dead
|
2018-07-05 02:22:22 -07:00 |
Evan Tschannen
|
507b3bacb0
|
fix: killing all tlogs in one region prevents the remote logs from recovering in that region; do not allow that to prevent us from configuring usable_regions=1.
added more recovery states.
|
2018-07-05 00:08:51 -07:00 |
Evan Tschannen
|
99e2b06c2d
|
Merge pull request #562 from etschannen/feature-remote-logs
Added repopulate region anti-quorum to the configuration
|
2018-07-04 16:44:35 -04:00 |