Commit Graph

2499 Commits

Author SHA1 Message Date
Evan Tschannen eb9114fd52 fixed a few problems with the consistency check 2018-07-09 15:46:22 -07:00
Evan Tschannen 0e1c32e1d2 fix: consistency check was broken when the key server key space is sharded 2018-07-09 15:46:17 -07:00
Evan Tschannen ef6ccc7b9e fix: consistency check was not checking for data inconsistencies 2018-07-09 15:46:12 -07:00
Evan Tschannen 6b40f2764d fix: off by one error on popping missing tags 2018-07-09 15:43:22 -07:00
Alex Miller c6133ba711
Merge pull request #579 from satherton/release-5.2-leaks
Destroy peers that are inactive
2018-07-09 15:21:24 -07:00
Evan Tschannen e503dc975c fix: destroy peers that are inactive
do not open new connections to send replies
2018-07-09 13:37:06 -07:00
Bhaskar Muppana 4a9feece79
Merge pull request #566 from alecgrieser/bump-api-version
Bump API version from 520 to 600
2018-07-09 13:35:31 -07:00
Alec Grieser 46b8612566 bump API version from 520 to 600 2018-07-09 11:00:57 -07:00
Evan Tschannen f49569075c
Merge pull request #577 from etschannen/release-6.0
Added a trace event related to max_notifications being reached
2018-07-08 20:19:17 -07:00
Evan Tschannen ef62c98ef5 Added a trace event related to max_notifications being reached 2018-07-08 20:18:24 -07:00
Evan Tschannen 1a1d0c36da
Merge pull request #576 from etschannen/release-6.0
Merge 5.2 into 6.0
2018-07-08 20:15:18 -07:00
Evan Tschannen 5a2cb3037b merge 5.2 into 6.0 2018-07-08 20:14:06 -07:00
Evan Tschannen d894f4cd52 Merge branch 'release-5.2' of github.com:apple/foundationdb into release-5.2 2018-07-08 20:12:36 -07:00
Evan Tschannen 3b62fa5b89
Merge pull request #574 from satherton/release-5.2-leaks
Old memory leak fixes
2018-07-08 20:11:43 -07:00
Evan Tschannen 2ac65fc708
Merge pull request #575 from etschannen/feature-remote-logs
Allow peers to be deleted when no longer used
2018-07-08 20:10:14 -07:00
Evan Tschannen c5b5d02858 Merge branch 'release-5.2' of github.com:apple/foundationdb into release-5.2 2018-07-08 20:04:31 -07:00
Evan Tschannen d42c9914d2 fix: future quiet databases need to be able to continue the reconfigure if the first one completes the repopulate but is cancelled before changing usable_regions 2018-07-08 19:56:55 -07:00
Evan Tschannen ce6b0d4952 fix: consistency check must also configure usable regions to 1, because the remote log set might not be able to copy data 2018-07-08 18:25:01 -07:00
Evan Tschannen 380b2895f7 fix: we need to wait for the yield in the team tracker not just after the initial failure reaction delay, but also after zeroHealthyTeams changes 2018-07-08 17:44:19 -07:00
Evan Tschannen 0e97ce79b4 fix: destroy peers that are inactive
do not open new connections to send replies
2018-07-08 10:26:41 -07:00
Evan Tschannen 1c90e948f8
Merge pull request #573 from etschannen/feature-remote-logs
Data distribution bug fixes
2018-07-07 17:43:11 -07:00
Evan Tschannen 4dd18afb84 fix: we cannot make MAX_RECOVERY_VERSIONS lower than MAX_VERSIONS_IN_FLIGHT because we can mark a recovery as stalled before finishing the recovery, leading to an infinite loop of recoveries 2018-07-07 17:41:20 -07:00
Evan Tschannen d6c6e7d306 fix: do not attempt data movement to an unhealthy destination team
allow building more teams than desired if all teams are unhealthy
bestTeamStuck is an error in simulation again
2018-07-07 16:51:16 -07:00
Stephen Atherton a2f16e217e Memory waste fix, when a Peer disconnects an extra packet buffer block is allocated to copy unsent reliable bytes to even if there aren't any. 2018-07-06 19:44:30 -07:00
Stephen Atherton fee4234e6b Bug fixes in memory activity logging. 2018-07-06 19:02:42 -07:00
Stephen Atherton acee7eef1b Merge branch 'release-5.2' of github.com:etschannen/foundationdb into release-5.2-leaks 2018-07-06 18:32:21 -07:00
Balachandar Namasivayam 3892aa7263
Merge pull request #572 from etschannen/feature-remote-logs
the cluster controller needs to update its priority immediately
2018-07-06 18:31:27 -07:00
Evan Tschannen 13fb59cf11 fix: the cluster controller needs to update its priority immediately 2018-07-06 18:29:54 -07:00
Evan Tschannen cdafd542ee fix: fixed a memory leak where leaderInfo notifications are not cleared out 2018-07-06 17:40:29 -07:00
Evan Tschannen 72ffdf7204
Merge pull request #571 from etschannen/feature-remote-logs
added a trace event for forced recovery
2018-07-06 17:11:05 -07:00
Evan Tschannen f0494f18b1 added a trace event for forced recovery 2018-07-06 17:09:29 -07:00
Balachandar Namasivayam d58ccab22a
Merge pull request #570 from etschannen/feature-remote-logs
properly handle zero logRouterTags
2018-07-06 16:58:51 -07:00
Evan Tschannen 43b5cb28ba fix: properly handle zero logRouterTags, this is important for forced recovery 2018-07-06 16:52:25 -07:00
Evan Tschannen 6dbddab530
Merge pull request #568 from etschannen/feature-remote-logs
Many bug fixes
2018-07-06 16:12:56 -07:00
Evan Tschannen 2718176927 fix: remote logs did not pop all of the data for removed logs on recovery because data for the missing tag was not recorded yet at the time of recovery 2018-07-06 16:10:41 -07:00
Evan Tschannen 28c0d96c90 fix: treat the local region as best when version difference is too large
re-check requests when the version difference becomes small
2018-07-06 14:44:11 -07:00
Evan Tschannen 9015b8038f io_error should cause the process to die and restart, to prevent repeated recruitment of a bad disk 2018-07-06 14:42:36 -07:00
Evan Tschannen 7d54ca4dc2 fix: errors from disk should trump errors from workers 2018-07-06 14:41:36 -07:00
Evan Tschannen 6d7172ef7e fix: canKillProcesses did not take into account the remoteTLogPolicy when checking notEnoughLeft 2018-07-05 21:36:09 -07:00
Evan Tschannen 6f4ca2eba2 fix: get all processes did not include rebooting processes 2018-07-05 21:13:56 -07:00
Alex Miller bb2eb2fe53
Merge pull request #565 from etschannen/feature-remote-logs
Simulation did not permanently kill machines in most tests
2018-07-05 15:07:28 -07:00
Alex Miller 5b12414b74
Merge pull request #564 from alexmiller-apple/tlsplugin
Fix dependencies for TLS library stuff
2018-07-05 14:32:45 -07:00
Evan Tschannen cd4fb9285a waitForExlusion requires both regions to be healthy, which is only possible if we do not kill all logs in a region 2018-07-05 14:04:42 -07:00
Alex Miller 6c98aa8aac Fix not depending on FDBLibTLS.a and default TLS_LIBDIR to a sensible place. 2018-07-05 13:23:20 -07:00
Evan Tschannen 6cf5354425 checkSatelliteTagLocations is not an error if the same zoneId is used multiple times 2018-07-05 13:00:13 -07:00
Evan Tschannen 21347df254 fix: getting metrics did not handle broken_promise errors 2018-07-05 12:30:11 -07:00
Evan Tschannen da5a232d7e fix: If we have not recruited the remote logs yet and detect a configuration change, we must fail the master to update the remote recruitment request 2018-07-05 12:17:41 -07:00
Evan Tschannen 7315e5da55 fix: isExcluded and isCleared were exactly wrong
fix: isCleared should mean the process is dead
2018-07-05 02:22:22 -07:00
Evan Tschannen 507b3bacb0 fix: kill all tlogs in one region prevents the remote logs from recovering in that region, do not allow that to prevent us from configuring usable_regions=1.
added more recovery states.
2018-07-05 00:08:51 -07:00
Evan Tschannen 99e2b06c2d
Merge pull request #562 from etschannen/feature-remote-logs
Added repopulate region anti-quorum to the configuration
2018-07-04 16:44:35 -04:00