Commit Graph

3499 Commits

Author SHA1 Message Date
A.J. Beamon fe811143a5
Merge pull request #4879 from sfc-gh-ajbeamon/reduce-client-read-failure-frequency
Reduce the frequency that buggified reads are failed
2021-05-28 15:02:45 -07:00
Evan Tschannen 4a753a0bce
Merge pull request #4623 from dlambrig/issue4252
CDs reject requests using different connection descriptor
2021-05-28 10:45:51 -07:00
Dan Lambright cc3175fc50 remove forwarding 2021-05-28 11:41:42 -04:00
A.J. Beamon 750901dd1d Reduce the frequency that buggified reads are failed so that transactions with a lot of reads aren't doomed to almost always fail. 2021-05-27 21:54:59 -07:00
Dan Lambright 10289ef8f1 Respond to AJs comments 2021-05-27 09:14:32 -04:00
Dan Lambright 64c10d3625 fix joshua failures, formatting 2021-05-27 08:08:07 -04:00
Dan Lambright 53d0ecc2fa respond to comments made on 5/4 2021-05-27 08:08:07 -04:00
Dan Lambright fc65154b5d forward back new coordinator 2021-05-27 08:08:07 -04:00
Dan Lambright fcfb78162c misc cleanup for publishing 2021-05-27 08:08:07 -04:00
Dan Lambright 742c22cef2 Don't allow changing desriptor if knob is set 2021-05-27 08:08:07 -04:00
Meng Xu ac15642273
Merge pull request #4850 from jzhou77/apiversion
Only enable backup worker when using partitioned logs
2021-05-24 15:07:43 -07:00
Markus Pilman e1254d38a0
Merge pull request #4838 from sfc-gh-xwang/ppwiggle
perpetual storage wiggling command line support
2021-05-24 14:48:00 -06:00
Jingyu Zhou bb076115c9 Only enable backup worker when using partitioned logs
This addresses issue #4849.
2021-05-21 16:40:29 -07:00
A.J. Beamon 4133e79c1f
Merge pull request #4770 from sbodagala/master
Expose "bounce impact" and Storage Server "version catch-up rate" metrics
2021-05-21 08:34:18 -07:00
Xiaoxi Wang a57061a5ed change UID to PID meaning Process ID 2021-05-20 00:06:53 +00:00
Xiaoxi Wang 93c809764f fix Schema check error 2021-05-19 23:52:16 +00:00
Sreenath Bodagala 2fa80e7912 Address review comments 2021-05-19 22:04:43 +00:00
Sreenath Bodagala 622f43474a Expose "bounce impact" and Storage Server "version catch-up rate" metrics
Changes:

Schemas.cpp: Extend the JSON schema to report the new metrics that have
been added.

mr-status-json-schemas.rst.inc: Update the schema to reflect the changes
made to the JSON schema.

release-notes-700.rst: Add a note about the new metrics in "Status"
section.
2021-05-19 19:54:49 +00:00
Xiaoxi Wang 6065d247f8 fix perpetualStorageWiggleKey 2021-05-17 20:59:38 +00:00
Xiaoxi Wang 8b1f9f7337 Add command line support 2021-05-17 20:57:54 +00:00
Sreenath Bodagala d8cad8efca Report bounce impact info as part of cluster JSON object. 2021-05-13 16:36:57 +00:00
Sreenath Bodagala 160293bd54 Report bounce impact in fdbcli status
Changes:

Schemas.cpp: Extend the JSON schema to report whether the cluster is
bounceable and if not, report the reason for why it is not bounceable.

Status.actor.cpp: Extend recoveryStateStatusFetcher() to populate the
bounce related field(s).

mr-status-json-schemas.rst.inc: Update the schema to reflect the change
made in Schemas.cpp.

release-notes-700.rst: Add a note about the new status fields in "Status"
section.
2021-05-13 14:28:06 +00:00
Lukas Joswiak 6499fa178e
Merge pull request #4802 from sfc-gh-ljoswiak/revert/actor-lineage
Revert "Merge pull request #4136 from sfc-gh-mpilman/features/actor-l…
2021-05-11 18:50:42 -07:00
Lukas Joswiak 4ea760b2a9 Revert "Merge pull request #4136 from sfc-gh-mpilman/features/actor-lineage"
This reverts commit da41534618, reversing
changes made to e6300905d6.
2021-05-10 20:26:12 -07:00
Lukas Joswiak cdf98f987d
Revert "Fix global config not triggering changes on server processes" 2021-05-10 12:18:28 -07:00
Lukas Joswiak 3f6ef14384 Fix OOM 2021-05-07 16:28:39 -07:00
Lukas Joswiak 71ba6f4501 Switch std::function to AsyncTrigger 2021-05-07 16:05:03 -07:00
Lukas Joswiak 0fc1552683 Fix global config updates not triggering on fdbserver processes 2021-05-06 15:46:34 -07:00
Chaoguang Lin 7da5004690 Merge branch 'master' of github.com:apple/foundationdb into refactor-fdbcli 2021-05-06 15:44:26 -07:00
Sreenath Bodagala 336a9bff66 Provide "time since last full recovery" in fdbcli status
Changes:

Schemas.cpp: Extend the JSON schema to include a new field that reports
the number of seconds since last full recovery.

Status.actor.cpp: Extend recoveryStateStatusFetcher() to populate the
new field that has been added to Schemas.cpp.

mr-status-json-schemas.rst.inc: Update the schema to reflect the change
made in Schemas.cpp.
2021-05-05 19:43:44 +00:00
Andrew Noyes ed464d3514 Deduplicate 2021-05-05 16:30:59 +00:00
Andrew Noyes 946e3f989f Use a deterministic iteration order for committing in the special key space 2021-05-05 16:30:59 +00:00
A.J. Beamon 1537019237
Merge pull request #4729 from sfc-gh-etschannen/feature-bypass-unreadable
Added a new option to bypass unreadable protection in read your writes for calls to get
2021-05-04 11:35:44 -07:00
Evan Tschannen f5f9de48e7
Update fdbclient/vexillographer/fdb.options
Co-authored-by: A.J. Beamon <aj.beamon@snowflake.com>
2021-05-04 09:31:24 -07:00
Evan Tschannen 3175e82393 updated documentation 2021-05-04 09:26:09 -07:00
Lukas Joswiak e7d7b39f12
Merge pull request #4744 from sfc-gh-tclinkenbeard/add-rangeresult-type-alias
Create RangeResult type alias
2021-05-03 16:29:33 -07:00
sfc-gh-tclinkenbeard 56e719533e Move all Standalone<*Ref> type alias definitions to the same location 2021-05-03 15:47:55 -07:00
Jingyu Zhou d49e0091ce
Merge pull request #4727 from sfc-gh-etschannen/fix-rewrite-bme
Simulation could still stall writes for 10 seconds even when speedUpSimulation was on
2021-05-03 13:37:04 -07:00
sfc-gh-tclinkenbeard 5c2d7b6080 Create RangeResult type alias 2021-05-03 13:14:16 -07:00
sfc-gh-ngoyal 63d82b9b63
Merge pull request #4616 from sfc-gh-satherton/redwood-improvements
Redwood page splitting/building refactor
2021-05-03 13:07:41 -07:00
Lukas Joswiak f275fd3c32 Fix gcc compilation 2021-05-02 17:27:18 -07:00
Lukas Joswiak 8dcd779fc4
Merge branch 'master' into features/actor-lineage 2021-05-02 14:11:42 -07:00
Lukas Joswiak c016e154a7 Remove global config fdbserver fix
This is causing problems with the 5.2.0 restarting test. Removing this
line disables fdbserver processes from receiving global config updates,
instead requiring a restart to see them.
2021-05-02 11:03:07 -07:00
Lukas Joswiak 637699be32 Fix issue with fdbserver not receiving global config change
notifications
2021-05-01 21:41:10 -07:00
Lukas Joswiak cf4218dfd1 Fixes simulation failures
Fixes the following issues:

1. Use the right index when initializing the WriteOnlySet's vector of
   atomics. Also switch to std::atomic_init to initialize each atomic in
   the vector (cannot default construct the atomics in the vector
   because std::atomic does not have a copy constructor).
2. Add failure check for when items cannot be inserted into the
   WriteOnlySet due to capacity constraints. This situation occurs when
   `copy` is not called on the WriteOnlySet, such as when sampling is
   disabled. The `copy` function is what clears the WriteOnlySet.
3. Remove a global config feature I added to update the ClientDBInfo
   object used by the global config listener function. This needs more
   investigation, but the effect of this change could be that global
   config changes are not correctly recognized on fdbserver processes.
4. Add various ASSERTs to verify data in WriteOnlySet.
2021-05-01 15:26:28 -07:00
A.J. Beamon e6300905d6
Merge pull request #4733 from sfc-gh-ajbeamon/fix-mvc-thread-future-cancellation
Fix: explicitly cancel MVC thread futures for version monitors
2021-04-30 16:05:17 -07:00
sbodagala f7e28c50d4
Merge pull request #4735 from sbodagala/master
Expose CommitBatchingWindowSize metric to fdbcli status
2021-04-30 15:52:29 -04:00
A.J. Beamon ab3f96f16d Fix: simulation doesn't have a protocol version monitor and can't cancel it. 2021-04-30 11:56:35 -07:00
A.J. Beamon eaf1e0f64e Be more defensive with cancellation by not capturing the this pointer in lambdas, instead capturing a full Reference. 2021-04-30 11:11:02 -07:00
A.J. Beamon cb3d2bfec7 Add cancellation of the protocol monitor when the database is destroyed. Avoid using any state when cancelled. Fix race between setting up the protocol version monitor and destroying the database. 2021-04-30 10:35:44 -07:00