Commit Graph

1998 Commits

Author SHA1 Message Date
Evan Tschannen e12b7a79aa
Merge pull request #464 from etschannen/feature-remote-logs
fixed another trace event
2018-06-11 12:54:02 -07:00
Evan Tschannen 8dfda1e57b fixed another trace event 2018-06-11 12:53:07 -07:00
Evan Tschannen 3c50fb8d47
Merge pull request #462 from etschannen/feature-remote-logs
fixed trace event name
2018-06-11 12:44:08 -07:00
Evan Tschannen e28769b98e fixed trace event name 2018-06-11 12:43:08 -07:00
Evan Tschannen 33dd2b157a
Merge pull request #458 from etschannen/feature-remote-logs
Do not index tags on TLogs for remote localities
2018-06-11 12:23:21 -07:00
Evan Tschannen 372ed67497 Merge branch 'master' into feature-remote-logs
# Conflicts:
#	fdbserver/DataDistribution.actor.cpp
#	fdbserver/MasterProxyServer.actor.cpp
#	fdbserver/TLogServer.actor.cpp
#	fdbserver/TagPartitionedLogSystem.actor.cpp
2018-06-11 11:34:10 -07:00
Evan Tschannen 588eaf4b36 fix: previous delay 0 could still cause us to recruit a tlog before processing disk errors 2018-06-11 11:26:30 -07:00
Alec Grieser 7817c4ac92
Merge pull request #457 from brownleej/site-map-fix
Add administration and TLS sections to the site map.
2018-06-11 11:20:25 -07:00
John Brownlee cd4ce7843d Add administration and TLS sections to the site map.
#264
2018-06-11 11:13:44 -07:00
Evan Tschannen 64e0260085 fix: assert did not properly handle default constructed policies 2018-06-10 21:51:59 -07:00
Evan Tschannen b60264024a fix: we need to copy the txsTag on satellite logs 2018-06-10 20:30:44 -07:00
Evan Tschannen a5c2a8ee8a fix: allow disk errors to cancel the actor before recruiting logs 2018-06-10 20:27:19 -07:00
Evan Tschannen 134b5d6f65 fix: only consider data distribution started when remote has recovered so quite database works correctly 2018-06-10 20:25:15 -07:00
Evan Tschannen 2407e3774b fix: we cannot run with less storage replication than log replication because it breaks recruitment logic 2018-06-10 20:22:58 -07:00
Evan Tschannen 4903df5ce9 fix: give time to detect failed servers before building teams 2018-06-10 20:21:39 -07:00
Evan Tschannen 0bc7274d0e fix: hasSatelliteReplication was set incorrectly 2018-06-10 20:20:41 -07:00
Evan Tschannen 6e48d93d39 backed out the healthy team check because it was unnecessary 2018-06-10 12:43:32 -07:00
Evan Tschannen 8a24bf6124 describe did not list all the log sets 2018-06-10 12:38:50 -07:00
Evan Tschannen 82be52205b
Merge pull request #447 from bnamasivayam/save-fitness-info
Save fitness info of a process to become a cluster controller. This i…
2018-06-08 16:18:00 -07:00
Evan Tschannen b9826dc1cb fix: do not automatically reduce redundancy we move keys if the database does not have remote replicas. This is to prevent problems when dropping remote replicas from a configuration. 2018-06-08 16:17:27 -07:00
Balachandar Namasivayam 8360f71cbb Merge branch 'master' of github.com:apple/foundationdb into save-fitness-info
# Conflicts:
#	fdbserver/worker.actor.cpp
2018-06-08 16:09:59 -07:00
Evan Tschannen 35ab8b9b8e
Merge pull request #455 from ajbeamon/master
Some more trace event normalization
2018-06-08 16:03:34 -07:00
A.J. Beamon 06ccd9a500 Allow trace event type names to end with an underscore. 2018-06-08 15:49:31 -07:00
A.J. Beamon 1fdfe20908 Relax the rules on trace event Types a bit by allowing multiple underscores, as well as starting with an underscore and consecutive underscores. 2018-06-08 15:40:29 -07:00
Evan Tschannen 48fbc407fd fix: we cannot kill all of the remote tlogs, because we still need their data to copy to the next generation in the same data center 2018-06-08 15:28:44 -07:00
Balachandar Namasivayam 32285ee958 Don't crash if fitness file is corrupted in real production use case. 2018-06-08 14:03:36 -07:00
A.J. Beamon 99c9958db7 Some more trace event normalization 2018-06-08 13:57:00 -07:00
Balachandar Namasivayam 34995d4d64 Address review comments. 2018-06-08 11:51:51 -07:00
Evan Tschannen 943dfa278b
Merge pull request #454 from ajbeamon/normalize-trace-events
Attempt to normalize trace events
2018-06-08 11:28:44 -07:00
A.J. Beamon c12b235080 Fix case in a few commented out trace events 2018-06-08 11:20:06 -07:00
A.J. Beamon e5488419cc Attempt to normalize trace events:
* Detail names now all start with an uppercase character and contain no underscores. Ideally these should be head-first camel case, though that was harder to check.
* Type names have the same rules, except they allow one underscore (to support a usage pattern Context_Type). The first character after the underscore is also uppercase.
* Use seconds instead of milliseconds in details.

Added a check when events are logged in simulation that logs a message to stderr if the first two rules above aren't followed.

This probably doesn't address every instance of the above problems, but all of the events I was able to hit in simulation pass the check.
2018-06-08 11:11:08 -07:00
A.J. Beamon f1d389448c
Merge pull request #453 from apple/release-5.2
Merge release-5.2 into master
2018-06-08 10:41:44 -07:00
A.J. Beamon 6461478695
Merge pull request #452 from apple/release-5.1
Merge release-5.1 into release-5.2
2018-06-08 10:41:13 -07:00
Evan Tschannen d7d38c3544
Merge pull request #430 from ajbeamon/rename-logGroup-attribute
Rename trace file logGroup attribute to LogGroup
2018-06-08 10:30:45 -07:00
Evan Tschannen 953c27e570
Merge pull request #431 from ajbeamon/tlog-rename-variables
Rename several variables in TLogServer.actor.cpp to follow our normal camel case conventions.
2018-06-08 10:30:22 -07:00
Evan Tschannen 12c45ccf79
Merge pull request #451 from ajbeamon/release-5.1
Fix case of newSeverity detail in StderrSeverity trace event
2018-06-08 10:28:30 -07:00
A.J. Beamon c9543791fd Fix case of newSeverity detail in StderrSeverity trace event 2018-06-08 10:24:12 -07:00
Evan Tschannen 7d392689fe fix: only update metrics for healthy destinations, because unhealthy destinations are already in the source 2018-06-07 18:12:04 -07:00
Evan Tschannen e4d5817679 fix: we must server getTeam requests before readyToStart is set because we cannot complete relocateShard requests without getTeam responses from both team collections 2018-06-07 16:14:40 -07:00
Evan Tschannen 9f0c16f062 do not build teams which contain failed servers 2018-06-07 14:05:53 -07:00
Balachandar Namasivayam 11b79c6c94 Save fitness info of a process to become a cluster controller. This info is currently lost after a reboot. Save this info and reload it to avoid unnecessary re-recruitments. 2018-06-07 13:07:19 -07:00
Evan Tschannen b423d73b42 fix: do not finish a shard relocation until all of the storage servers have made the current recovery version durable. This is to prevent dropping a needed storage server as a source for a shard after dropping a remote configuration 2018-06-07 12:29:25 -07:00
A.J. Beamon f9cec3c6bb
Merge pull request #443 from ajbeamon/master
Merge release-5.2
2018-06-06 15:28:33 -07:00
A.J. Beamon 3fde6cbaa7 Change MSI GUID in 6.0 2018-06-06 15:27:36 -07:00
A.J. Beamon 216404de45 Merge branch 'release-5.2' of github.com:apple/foundationdb
# Conflicts:
#	documentation/sphinx/source/release-notes.rst
#	versions.target
2018-06-06 15:25:37 -07:00
A.J. Beamon b52681c763
Merge pull request #442 from ajbeamon/release-5.2
Update versions.target and MSI package for 5.2.4
2018-06-06 15:21:39 -07:00
A.J. Beamon cf7ab15c3e Update versions.target and MSI package for 5.2.4 2018-06-06 15:20:50 -07:00
A.J. Beamon b8efd4c2b5
Merge pull request #441 from ajbeamon/release-5.2
Updates for release 5.2.3. This excludes required changes to administ…
2018-06-06 14:14:23 -07:00
A.J. Beamon bd90cdbc59 Updates for release 5.2.3. This excludes required changes to administration.rst pending some clarification about how certain upgrades work. 2018-06-06 14:13:20 -07:00
A.J. Beamon 59caa968dd
Merge pull request #440 from etschannen/release-5.2
backup created large transactions when erasing log ranges
2018-06-06 13:45:43 -07:00