Commit Graph

75 Commits

Author SHA1 Message Date
David Youngworth d0391db862 Merge branch 'release-6.2' into release-6.3 2020-11-16 10:15:23 -08:00
Vishesh Yadav 1976f2c371 FlowTransport: Sample connect latencies 2020-11-10 12:12:01 -08:00
Vishesh Yadav 7bdcb01bdf FlowTransport: Count number of incoming/outgoing/failed connections with logging 2020-11-02 18:51:48 -08:00
Jingyu Zhou d7420bb8c7 Fix merging 6.3 caused test failure 2020-10-16 21:11:01 -07:00
Jingyu Zhou 8f17a1a5d6 Merge branch 'release-6.2' into release-6.3 2020-10-16 15:25:39 -07:00
Evan Tschannen 52828f9e03 Added bytesSent to the PingLatency logging; increasing the logging interval from 1 second to 3 seconds 2020-10-11 23:05:21 -07:00
Evan Tschannen f03a355c4e Merge branch 'release-6.2' into release-6.3
# Conflicts:
#	CMakeLists.txt
#	documentation/sphinx/source/downloads.rst
#	fdbrpc/FlowTransport.actor.cpp
#	fdbrpc/FlowTransport.h
#	flow/Arena.h
#	flow/Knobs.cpp
#	flow/Knobs.h
#	packaging/msi/FDBInstaller.wxs
2020-10-07 12:28:22 -07:00
Evan Tschannen 29d4d87b56 added a knob for the number of samples 2020-10-06 17:38:09 -07:00
Evan Tschannen 12d2f3a8f3 fixed includes 2020-10-06 17:01:13 -07:00
Evan Tschannen 822312b31d only track latencies to public network addresses; use a continousSample to get median and p90 latencies 2020-10-06 16:55:35 -07:00
Evan Tschannen 2166f9a3dd added logging about bytes received 2020-10-06 16:07:35 -07:00
Evan Tschannen 9efda1fec5 added logging for the ping latencies for all network connections 2020-10-06 13:58:05 -07:00
A.J. Beamon b09dddc07e Merge branch 'release-6.2' into merge-release-6.2-into-release-6.3
# Conflicts:
#	cmake/ConfigureCompiler.cmake
#	documentation/sphinx/source/downloads.rst
#	fdbrpc/FlowTransport.actor.cpp
#	fdbrpc/fdbrpc.vcxproj
#	fdbserver/DataDistributionQueue.actor.cpp
#	fdbserver/Knobs.cpp
#	fdbserver/Knobs.h
#	fdbserver/LogSystemPeekCursor.actor.cpp
#	fdbserver/MasterProxyServer.actor.cpp
#	fdbserver/Status.actor.cpp
#	fdbserver/storageserver.actor.cpp
#	flow/flow.vcxproj
2020-07-10 15:06:34 -07:00
Evan Tschannen 717242a0ee reset WAN network connections every 5 minutes is responses take more than 500ms 2020-07-09 22:50:47 -07:00
sfc-gh-tclinkenbeard dd5cd762a8 Removed dead code 2020-07-04 12:50:40 -07:00
Evan Tschannen 6a006c4c1f use the first RequestStream as the base endpoint 2020-05-20 13:52:22 -07:00
Evan Tschannen 459186dad0 revert previous change to bit shift 2020-05-20 12:32:50 -07:00
Evan Tschannen b999b93fa2 added a hash function for endpoint 2020-05-20 12:21:57 -07:00
Evan Tschannen dfc152ebb3 shift the bits farther 2020-05-20 11:34:24 -07:00
Evan Tschannen ed9d5dd857 cannot change sort order of endpoints 2020-05-20 00:00:06 -07:00
Evan Tschannen c09a28b0db speed up comparison of endpoints 2020-05-19 20:41:56 -07:00
Evan Tschannen bf9f24beb5 fixed mismatch of Endpoint UIDs 2020-05-19 17:16:41 -07:00
Evan Tschannen 9313e490aa change the first part of the UID to make comparisons cheaper 2020-05-19 16:48:20 -07:00
Evan Tschannen fd0ee72293 Merge branch 'master' into feature-small-endpoint 2020-04-29 18:43:10 -07:00
Evan Tschannen 76fb345dd1 Merge branch 'master' into feature-tree-broadcast
# Conflicts:
#	fdbrpc/FailureMonitor.actor.cpp
2020-04-29 09:51:22 -07:00
Vishesh Yadav 3a5315d10c FlowTransport: Don't immediately mark connections failed
In connectionKeeper(), when a connection is failed for
FAILURE_DETECTION_DELAY, then only mark connection as failed.

This is much closer to the original centralized behaviour, and also
adds more confidence on whether the connection is actually failed.
2020-04-22 19:46:04 -07:00
Vishesh Yadav da7d0093ee Cleanup unused code 2020-04-15 19:48:25 -07:00
Evan Tschannen ff5543b579 working implementation 2020-04-12 22:18:51 -07:00
Evan Tschannen 0c2e8b9462 only serialize a single endpoint for an interface 2020-04-12 16:04:48 -07:00
Evan Tschannen ce4493f679 many bug fixes 2020-04-10 13:45:16 -07:00
Evan Tschannen a51c92854a Merge branch 'master' into feature-tree-broadcast
# Conflicts:
#	fdbserver/WorkerInterface.actor.h
#	fdbserver/worker.actor.cpp
2020-04-06 21:09:44 -07:00
Evan Tschannen 2a1bd97120 fix compilation errors 2020-04-06 20:58:43 -07:00
Evan Tschannen 477d66b46d implemented a tree broadcast for txn state message for proxies, and serverDBInfo for workers 2020-04-05 23:09:36 -07:00
Vishesh Yadav d90e168e24 Add HealthMonitoring skeleton code 2020-04-03 15:53:00 -07:00
Meng Xu 7f559bc712 Cleanup code and apply clang-format
Self code review
2020-03-16 15:08:32 -07:00
Meng Xu 1759d5c8c4 Apply clang-format 2020-03-12 10:18:53 -07:00
Meng Xu a9136f3f72 Add waitForUnreliableExtraStoreReboot to wait for extra store to reboot 2020-03-12 10:18:31 -07:00
Meng Xu bd345f85db ConsistencyCheck:Fix failue due to address inconsistency between process and worker
With TLS, a worker (or process) can have a TLS address and non-TLS address.
When a process is created in simulation, the primary address is TLS by default.
The non-TLS one is the TLS address port plus one.

In a connection between two workers, if their primary addresses do not enable
or disable TLS together, one worker will swap its primary address and secondary address
so that the TLS config of the two endpoints can match.

The swap can make the primary address no longer the TLS one that was created
when the process is created. And the swap only happens for worker instead of
process struct in simulation.

This swap can cause worker->address != process->address.
In checkForExtraDataStores actor, we use worker->address to check if a process
is killable and use the process->address to kill the process. The inconsistency
can cause simulation to kill a protected process that is not killable and leads
to simulation failure.
2020-03-10 21:07:16 -07:00
mpilman aac94a766b Removed dead code 2020-02-15 21:56:48 -08:00
A.J. Beamon 562ce17eca Initialize outgoingConnectionIdle in the constructor. Add back line to connectionKeeper that is needed in some looping cases 2019-10-10 12:48:35 -07:00
Evan Tschannen 1c0484cffc fix: do not close connections which have outstanding tryGetReplies with the peer 2019-08-29 16:49:57 -07:00
Vishesh Yadav 3f4f71ff9f fdbrpc: Increment peerReferences correctly
The constructor of FlowReceiver which handled reference counting
peerReferences relied on calling a virtual method from constructor
whose behaviour isn't correct. This patch, bubbles down result of that
virtual method from derived constructor to base contructor.
2019-07-09 14:24:16 -07:00
Alex Miller 8e1ab6e7db Merge remote-tracking branch 'upstream/master' into flowlock-api 2019-06-28 17:32:54 -07:00
Evan Tschannen 5ccffe3cb3
Merge pull request #1684 from jzhou77/large-packet
Better handling for large packets
2019-06-28 16:19:01 -07:00
Alex Miller bf883d7055 Merge remote-tracking branch 'upstream/master' into flowlock-api 2019-06-25 14:26:50 -07:00
Evan Tschannen 24937d8125
Merge pull request #1744 from vishesh/task/monitor-leader-on-demand
Fix setting enClientFailureMonitor global for client
2019-06-25 13:38:59 -07:00
Jingyu Zhou c4e44e6697 Refactor: add missing include 2019-06-25 10:18:56 -07:00
Alex Miller 7a500cd37f A giant translation of TaskFooPriority -> TaskPriority::Foo
This is so that APIs that take priorities don't take ints, which are
common and easy to accidentally pass the wrong thing.
2019-06-25 02:47:35 -07:00
Vishesh Yadav cbc2398254 fdbrpc: Remove default parameter from FlowTransport::createInstance 2019-06-25 01:17:38 -07:00
mpilman 68ce9a5e75 ProtocolVersion type - second try 2019-06-18 17:55:27 -07:00