Evan Tschannen
e12b7a79aa
Merge pull request #464 from etschannen/feature-remote-logs
...
fixed another trace event
2018-06-11 12:54:02 -07:00
Evan Tschannen
8dfda1e57b
fixed another trace event
2018-06-11 12:53:07 -07:00
Evan Tschannen
3c50fb8d47
Merge pull request #462 from etschannen/feature-remote-logs
...
fixed trace event name
2018-06-11 12:44:08 -07:00
Evan Tschannen
e28769b98e
fixed trace event name
2018-06-11 12:43:08 -07:00
Evan Tschannen
33dd2b157a
Merge pull request #458 from etschannen/feature-remote-logs
...
Do not index tags on TLogs for remote localities
2018-06-11 12:23:21 -07:00
Evan Tschannen
372ed67497
Merge branch 'master' into feature-remote-logs
...
# Conflicts:
# fdbserver/DataDistribution.actor.cpp
# fdbserver/MasterProxyServer.actor.cpp
# fdbserver/TLogServer.actor.cpp
# fdbserver/TagPartitionedLogSystem.actor.cpp
2018-06-11 11:34:10 -07:00
Evan Tschannen
588eaf4b36
fix: previous delay 0 could still cause us to recruit a tlog before processing disk errors
2018-06-11 11:26:30 -07:00
Alec Grieser
7817c4ac92
Merge pull request #457 from brownleej/site-map-fix
...
Add administration and TLS sections to the site map.
2018-06-11 11:20:25 -07:00
John Brownlee
cd4ce7843d
Add administration and TLS sections to the site map.
...
#264
2018-06-11 11:13:44 -07:00
Evan Tschannen
64e0260085
fix: assert did not properly handle default constructed policies
2018-06-10 21:51:59 -07:00
Evan Tschannen
b60264024a
fix: we need to copy the txsTag on satellite logs
2018-06-10 20:30:44 -07:00
Evan Tschannen
a5c2a8ee8a
fix: allow disk errors to cancel the actor before recruiting logs
2018-06-10 20:27:19 -07:00
Evan Tschannen
134b5d6f65
fix: only consider data distribution started when remote has recovered so quiet database works correctly
2018-06-10 20:25:15 -07:00
Evan Tschannen
2407e3774b
fix: we cannot run with less storage replication than log replication because it breaks recruitment logic
2018-06-10 20:22:58 -07:00
Evan Tschannen
4903df5ce9
fix: give time to detect failed servers before building teams
2018-06-10 20:21:39 -07:00
Evan Tschannen
0bc7274d0e
fix: hasSatelliteReplication was set incorrectly
2018-06-10 20:20:41 -07:00
Evan Tschannen
6e48d93d39
backed out the healthy team check because it was unnecessary
2018-06-10 12:43:32 -07:00
Evan Tschannen
8a24bf6124
describe did not list all the log sets
2018-06-10 12:38:50 -07:00
Evan Tschannen
82be52205b
Merge pull request #447 from bnamasivayam/save-fitness-info
...
Save fitness info of a process to become a cluster controller. This i…
2018-06-08 16:18:00 -07:00
Evan Tschannen
b9826dc1cb
fix: do not automatically reduce redundancy when we move keys if the database does not have remote replicas. This is to prevent problems when dropping remote replicas from a configuration.
2018-06-08 16:17:27 -07:00
Balachandar Namasivayam
8360f71cbb
Merge branch 'master' of github.com:apple/foundationdb into save-fitness-info
...
# Conflicts:
# fdbserver/worker.actor.cpp
2018-06-08 16:09:59 -07:00
Evan Tschannen
35ab8b9b8e
Merge pull request #455 from ajbeamon/master
...
Some more trace event normalization
2018-06-08 16:03:34 -07:00
A.J. Beamon
06ccd9a500
Allow trace event type names to end with an underscore.
2018-06-08 15:49:31 -07:00
A.J. Beamon
1fdfe20908
Relax the rules on trace event Types a bit by allowing multiple underscores, as well as starting with an underscore and consecutive underscores.
2018-06-08 15:40:29 -07:00
Evan Tschannen
48fbc407fd
fix: we cannot kill all of the remote tlogs, because we still need their data to copy to the next generation in the same data center
2018-06-08 15:28:44 -07:00
Balachandar Namasivayam
32285ee958
Don't crash if fitness file is corrupted in real production use case.
2018-06-08 14:03:36 -07:00
A.J. Beamon
99c9958db7
Some more trace event normalization
2018-06-08 13:57:00 -07:00
Balachandar Namasivayam
34995d4d64
Address review comments.
2018-06-08 11:51:51 -07:00
Evan Tschannen
943dfa278b
Merge pull request #454 from ajbeamon/normalize-trace-events
...
Attempt to normalize trace events
2018-06-08 11:28:44 -07:00
A.J. Beamon
c12b235080
Fix case in a few commented out trace events
2018-06-08 11:20:06 -07:00
A.J. Beamon
e5488419cc
Attempt to normalize trace events:
...
* Detail names now all start with an uppercase character and contain no underscores. Ideally these should be head-first camel case, though that was harder to check.
* Type names have the same rules, except they allow one underscore (to support a usage pattern Context_Type). The first character after the underscore is also uppercase.
* Use seconds instead of milliseconds in details.
Added a check when events are logged in simulation that logs a message to stderr if the first two rules above aren't followed.
This probably doesn't address every instance of the above problems, but all of the events I was able to hit in simulation pass the check.
2018-06-08 11:11:08 -07:00
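The first two naming rules in the commit message above can be sketched as a small checker. This is only an illustration, not the check added in the commit: the function names and regular expressions are hypothetical, "uppercase" is assumed to mean ASCII A-Z, and the later relaxations (multiple, leading, consecutive, and trailing underscores in Type names) are not modeled.

```python
import re

# Assumption: "uppercase character" means ASCII A-Z.
# Detail names: first character uppercase, no underscores anywhere.
DETAIL_RE = re.compile(r"[A-Z][^_]*")

# Type names: same rule, plus at most one underscore (Context_Type pattern);
# the first character after the underscore must also be uppercase.
TYPE_RE = re.compile(r"[A-Z][^_]*(?:_[A-Z][^_]*)?")


def is_valid_detail_name(name: str) -> bool:
    """Return True if name satisfies the Detail naming rule (hypothetical helper)."""
    return DETAIL_RE.fullmatch(name) is not None


def is_valid_type_name(name: str) -> bool:
    """Return True if name satisfies the Type naming rule (hypothetical helper)."""
    return TYPE_RE.fullmatch(name) is not None


if __name__ == "__main__":
    for detail in ("NewSeverity", "newSeverity", "New_Severity"):
        print("detail", detail, is_valid_detail_name(detail))
    for type_name in ("StderrSeverity", "Context_Type", "Context_type", "A_B_C"):
        print("type", type_name, is_valid_type_name(type_name))
```

Under these assumptions, `NewSeverity` passes as a Detail name while `newSeverity` does not, which matches the case fix made in the StderrSeverity commits listed further down.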
A.J. Beamon
f1d389448c
Merge pull request #453 from apple/release-5.2
...
Merge release-5.2 into master
2018-06-08 10:41:44 -07:00
A.J. Beamon
6461478695
Merge pull request #452 from apple/release-5.1
...
Merge release-5.1 into release-5.2
2018-06-08 10:41:13 -07:00
Evan Tschannen
d7d38c3544
Merge pull request #430 from ajbeamon/rename-logGroup-attribute
...
Rename trace file logGroup attribute to LogGroup
2018-06-08 10:30:45 -07:00
Evan Tschannen
953c27e570
Merge pull request #431 from ajbeamon/tlog-rename-variables
...
Rename several variables in TLogServer.actor.cpp to follow our normal camel case conventions.
2018-06-08 10:30:22 -07:00
Evan Tschannen
12c45ccf79
Merge pull request #451 from ajbeamon/release-5.1
...
Fix case of newSeverity detail in StderrSeverity trace event
2018-06-08 10:28:30 -07:00
A.J. Beamon
c9543791fd
Fix case of newSeverity detail in StderrSeverity trace event
2018-06-08 10:24:12 -07:00
Evan Tschannen
7d392689fe
fix: only update metrics for healthy destinations, because unhealthy destinations are already in the source
2018-06-07 18:12:04 -07:00
Evan Tschannen
e4d5817679
fix: we must serve getTeam requests before readyToStart is set because we cannot complete relocateShard requests without getTeam responses from both team collections
2018-06-07 16:14:40 -07:00
Evan Tschannen
9f0c16f062
do not build teams which contain failed servers
2018-06-07 14:05:53 -07:00
Balachandar Namasivayam
11b79c6c94
Save fitness info of a process to become a cluster controller. This info is currently lost after a reboot. Save this info and reload it to avoid unnecessary re-recruitments.
2018-06-07 13:07:19 -07:00
Evan Tschannen
b423d73b42
fix: do not finish a shard relocation until all of the storage servers have made the current recovery version durable. This is to prevent dropping a needed storage server as a source for a shard after dropping a remote configuration
2018-06-07 12:29:25 -07:00
A.J. Beamon
f9cec3c6bb
Merge pull request #443 from ajbeamon/master
...
Merge release-5.2
2018-06-06 15:28:33 -07:00
A.J. Beamon
3fde6cbaa7
Change MSI GUID in 6.0
2018-06-06 15:27:36 -07:00
A.J. Beamon
216404de45
Merge branch 'release-5.2' of github.com:apple/foundationdb
...
# Conflicts:
# documentation/sphinx/source/release-notes.rst
# versions.target
2018-06-06 15:25:37 -07:00
A.J. Beamon
b52681c763
Merge pull request #442 from ajbeamon/release-5.2
...
Update versions.target and MSI package for 5.2.4
2018-06-06 15:21:39 -07:00
A.J. Beamon
cf7ab15c3e
Update versions.target and MSI package for 5.2.4
2018-06-06 15:20:50 -07:00
A.J. Beamon
b8efd4c2b5
Merge pull request #441 from ajbeamon/release-5.2
...
Updates for release 5.2.3. This excludes required changes to administ…
2018-06-06 14:14:23 -07:00
A.J. Beamon
bd90cdbc59
Updates for release 5.2.3. This excludes required changes to administration.rst pending some clarification about how certain upgrades work.
2018-06-06 14:13:20 -07:00
A.J. Beamon
59caa968dd
Merge pull request #440 from etschannen/release-5.2
...
backup created large transactions when erasing log ranges
2018-06-06 13:45:43 -07:00