foundationdb

Commit Graph

Author	SHA1	Message	Date
Alex Miller	fd769ad878	Fix parallel peek stalling for 10min when a TLog generation is destroyed. `peekTracker` was held on the Shared TLog (TLogData), whereas peeks are received and replied to as part of a TLog instance (LogData). When a peek was received on a TLog, it was registered into peekTracker along with the ReplyPromise. If the TLog was then removed as part of a no-longer-needed generation of TLogs, there is nothing left to reply to the request, but by holding onto the ReplyPromise in peekTracker, we leave the remote end with an expectation that we will reply. Then, 10min later, peekTrackerCleanup runs and finally times out the peek cursor, thus preventing FDB from being completely stuck. Now, each TLog generation has its own `peekTracker`, and when a TLog is destroyed, it times out all of the pending peek curors that are still expecting a response. This will then trigger the client to re-issue them to the next generation of TLogs, thus removing the 10min gap to do so.	2019-07-09 17:27:36 -07:00
Alex Miller	44f11702a8	Log Routers will prefer to peek from satellite logs. Formerly, they would prefer to peek from the primary's logs. Testing of a failed region rejoining the cluster revealed that this becomes quite a strain on the primary logs when extremely large volumes of peek requests are coming from the Log Routers. It happens that we have satellites that contain the same mutations with Log Router tags, that have no other peeking load, so we can prefer to use the satellite to peek rather than the primary to distribute load across TLogs better. Unfortunately, this revealed a latent bug in how tagged mutations in the KnownCommittedVersion->RecoveryVersion gap were copied across generations when the number of log router tags were decreased. Satellite TLogs would be assigned log router tags using the team-building based logic in getPushLocations(), whereas TLogs would internally re-index tags according to tag.id%logRouterTags. This mismatch would mean that we could have: Log0 -2:0 ----- -2:0 Log 0 Log1 -2:1 \ >--- -2:1,-2:0 (-2:2 mod 2 becomes -2:0) Log 1 Log2 -2:2 / And now we have data that's tagged as -2:0 on a TLog that's not the preferred location for -2:0, and therefore a BestLocationOnly cursor would miss the mutations. This was never noticed before, as we never used a satellite as a preferred location to peek from. Merge cursors always peek from all locations, and thus a peek for -2:0 that needed data from the satellites would have gone to both TLogs and merged the results. We now take this mod-based re-indexing into account when assigning which TLogs need to recover which tags from the previous generation, to make sure that tag.id%logRouterTags always results in the assigned TLog being the preferred location. Unfortunately, previously existing will potentially have existing satellites with log router tags indexed incorrectly, so this transition needs to be gated on a `log_version` transition. Old LogSets will have an old LogVersion, and we won't prefer the sattelite for peeking. Log Sets post-6.2 (opt-in) or post-6.3 (default) will be indexed correctly, and therefore we can safely offload peeking onto the satellites.	2019-07-08 22:25:01 -07:00
Alex Miller	6c8f50ca66	Improve the behavior of parallelPeekMore+onlySpilled. When onlySpilled transitions from true (don't peek memory) to false (do peek memory) as part of a parallel peek, we'll end up wasting the rest of the replies because we'll honor their onlySpilled=true setting and thus not have any additional data to return. Instead, we thread the onlySpilled back through in the same way that the ending version of the last peek is used overrides the requested starting version of the next peek. This simulated the same behavior that the client has, where the value of onlySpilled that we reply with comes back in the next request. I haven't actually seen it be a problem, but this should help make sure the onlySpilled transition when catching up doesn't ever cause any ill effects if a process starts riding the line between onlySpilled settings.	2019-07-08 22:13:09 -07:00
Evan Tschannen	15e894c724	Merge in master	2019-07-05 15:49:24 -07:00
Evan Tschannen	235697f688	fix: txsTags are not popped at the recovery version	2019-06-27 23:18:26 -07:00
Alex Miller	bf883d7055	Merge remote-tracking branch 'upstream/master' into flowlock-api	2019-06-25 14:26:50 -07:00
Alex Miller	7a500cd37f	A giant translation of TaskFooPriority -> TaskPriority::Foo This is so that APIs that take priorities don't take ints, which are common and easy to accidentally pass the wrong thing.	2019-06-25 02:47:35 -07:00
Evan Tschannen	1c005d5878	Merge pull request #1584 from alexmiller-apple/spilled-only-peek Save TLog resources by letting peek request only spilled data.	2019-06-20 18:22:31 -07:00
Evan Tschannen	e0be631414	shard the txs tag so that more transaction logs are involved in its recovery	2019-06-19 18:15:09 -07:00
mpilman	68ce9a5e75	ProtocolVersion type - second try	2019-06-18 17:55:27 -07:00
Alex Miller	51fd42a4d2	Merge remote-tracking branch 'upstream/master' into spilled-only-peek	2019-06-18 17:33:52 -07:00
mpilman	8576665a90	Revert "Revert "Make protocol version a type"" This reverts commit `455bf3b3ec`.	2019-06-18 14:49:04 -07:00
Alex Miller	455bf3b3ec	Revert "Make protocol version a type"	2019-06-18 10:59:17 -07:00
mpilman	da53a92bec	Make protocol version a type This fixes #1214 The basic idea is that ProtocolVersion is now its own type. This alone is an improvement as it makes many things more typesafe. For each version, we can now add breaking features (for example Fearless). After that, there's no need to test against actual (confusing) version numbers. Instead a developer can simply test `protocolVersion->hasFearless()` and this will return true iff the protocolVersion is newer than the newest version that didn't support fearless.	2019-06-16 09:59:15 -07:00
sramamoorthy	1190f2f33d	rebased related changes	2019-05-28 22:07:46 -07:00
sramamoorthy	b43c100e57	TLog bug fixes	2019-05-28 22:07:46 -07:00
sramamoorthy	3877f87481	comment change in tLogCommit	2019-05-28 22:07:46 -07:00
sramamoorthy	31b6c86650	ignorePopDeadline to have high limit in simulator - ignorePopDeadline to have highier limit in simulator to accommdate for the buggify delays and make snapshot succeed. - introduce a new knob for auto resetting the disabling of tlog pop	2019-05-28 22:07:46 -07:00
sramamoorthy	b1b96946af	logData->stop check right after execOpHold wait	2019-05-28 22:07:46 -07:00
sramamoorthy	5749e220bd	use FlowLock for implementing critical section Instead of using Promises and future to implement critcal section use FlowLock	2019-05-28 22:07:46 -07:00
sramamoorthy	e6c0b87a4d	remove unused variable	2019-05-28 22:07:46 -07:00
sramamoorthy	f27a40f118	execProcessingHelper made synchronous tLogCommit exects no blocking between duplicate check and setting of the new version, that constraint was broken when synchronous execProcessingHelper was introduced. As a fix, execProcessingHelper was made asynchronous.	2019-05-28 22:07:46 -07:00
sramamoorthy	d3a179b6f9	Multiple bug fixes - wait for snapTLogFailKeys in a loop, otherwise in some race condition it can cause a false assert - in single region, there does not seem to be a guarantee of tagLocalityListKey for a given DC ID, avoiding that assert for now - to find the workers that are coordinators, looking up by primary address is not sufficient in some cases, hence looking by both primary and secondary address - test make files to reflect the location of the new test cases	2019-05-28 22:07:46 -07:00
sramamoorthy	dcd2d96751	make spawnProcess predictable in the simulator	2019-05-28 22:07:46 -07:00
sramamoorthy	4083af0b01	Avoid using trackLatest for TLog pop test cases	2019-05-28 22:07:46 -07:00
sramamoorthy	ec7834e2f7	code re-orgnaization and address comments	2019-05-28 22:07:46 -07:00
sramamoorthy	b6e037ffbc	Replace fork with boost::process::child	2019-05-28 22:07:46 -07:00
sramamoorthy	e91c76834e	tlog: move snap create part to indepdendent funcs	2019-05-28 22:07:46 -07:00
sramamoorthy	61e93a9304	Address review comments and minor fixes	2019-05-28 22:07:46 -07:00
sramamoorthy	9e3104c2d4	Fix: races in async exec leading to bad backup	2019-05-28 22:07:46 -07:00
sramamoorthy	cfdad0c5e6	tlog to snapshot exactly at exec version	2019-05-28 22:07:46 -07:00
sramamoorthy	539e65efad	Skip parsing mutations if it is tagged for TxsTag In Tlog, if a mutation is targetted for TxsTag then skip from parsing them.	2019-05-28 22:07:46 -07:00
sramamoorthy	17ecba8313	trace cleanup and other indentation changes	2019-05-28 22:07:46 -07:00
sramamoorthy	aa79480d69	changes to make fdbfork asynchronous	2019-05-28 22:07:46 -07:00
sramamoorthy	4016f16c76	Fix few compilation and bugs in rebase	2019-05-28 22:07:46 -07:00
sramamoorthy	3d5998e9dd	tlog: when pops are disabled, store them & replay In Tlogs, disable pop is done whlie taking snapshots. Earlier, tlogs were ignoring the pops if it got pop requests when pops were disabled. In this change, instead of ignoring the pop - it remembers the list of pops in-memory and plays them once the popping is enabled.	2019-05-28 22:07:46 -07:00
sramamoorthy	4bc4c615da	exec op to all tlog, restore change in test &other - exec operation to go to all the TLogs - minor bug fix in tlog - restore implementation for the simulator - restore snap UID to be stored in restartInfo.ini - test cases added - indentation and trace file fixes	2019-05-28 22:07:46 -07:00
sramamoorthy	72dd067173	Trace message changes and fix few FIXMEs	2019-05-28 22:07:46 -07:00
sramamoorthy	69edefe68b	Snapshot based backup and resotre implementation	2019-05-28 22:07:46 -07:00
A.J. Beamon	f417e60264	Merge branch 'merge-release-6.1-into-master' into thread-safe-random-number-generation # Conflicts: # fdbserver/QuietDatabase.actor.cpp	2019-05-23 09:52:00 -07:00
A.J. Beamon	d29c7e4c9b	Merge branch 'release-6.1' into merge-release-6.1-into-master # Conflicts: # documentation/sphinx/source/release-notes.rst # fdbserver/QuietDatabase.actor.cpp # versions.target	2019-05-23 09:28:45 -07:00
Evan Tschannen	003cc6be18	fix: nothingPersistent could be incorrect when popped is equal to persistentDataVersion	2019-05-22 20:23:35 -10:00
Evan Tschannen	ee04c583fa	fix: do not pop the disk queue past the persistentDataVersion	2019-05-21 10:40:30 -07:00
Evan Tschannen	4059d68348	fix: the tlog would not pop data from the disk queue after a storage server was removed, because the tag still exists in memory on the logs fix: we could incorrectly make data durable if eraseMessagesFromMemory was in progress while running updatePersistentData the quiet database check now ensure that tlogs have no more than 30 seconds of versions unpopped from the disk queue	2019-05-20 23:58:45 -07:00
Alex Miller	4eb4c03ce5	Save TLog resources by letting peek request only spilled data. If a peek is entirely fulfilled from spilled data, then it's likely that the next peek will be also. It is thus wasteful for each of these peeks to call peekMessagesFromMemory, which memcpy's excessively, and then throw all that data away without using it. Now, TLogs will give a hint back to peek cursors about if the provided reply was served entirely from the spilled data, which peek curors then feed back as the hint into their next request. At some point, a cursor will send a request for only spilled data, get an incomplete response, and then be told to send its next request as one that peeks from memory as well, and then it will fully catch up.	2019-05-14 15:38:48 -10:00
A.J. Beamon	5f55f3f613	Replace g_random and g_nondeterministic_random with functions deterministicRandom() and nondeterministicRandom() that return thread_local random number generators. Delete g_debug_random and trace_random. Allow only deterministicRandom() to be seeded, and require it to be seeded from each thread on which it is used.	2019-05-10 14:01:52 -07:00
Evan Tschannen	22499666d0	Merge branch 'release-6.1' # Conflicts: # documentation/sphinx/source/release-notes.rst # fdbserver/LogRouter.actor.cpp # flow/Trace.cpp # versions.target	2019-05-08 18:19:35 -07:00
Evan Tschannen	93eb2a9395	Merge pull request #1527 from alexmiller-apple/tstlog-6.1 Spill-by-reference knob + TLog6.0 Spilled Peek deprioritization	2019-05-03 17:19:45 -07:00
Alex Miller	c918b21137	Deprioritize spilled peeks in spill-by-value, and improve its logic. This deprioritizes before calling peekMessagesFromMemory, which should improve the memory usage of the TLog, and makes sure to keep txsTag peeks at a high priority to help recoveries stay fast.	2019-05-03 15:27:11 -07:00
Alex Miller	4052f3826a	Add a knob to limit the number of commits indexed per key. Theoretically, we could spill 20MB of 22B mutations for one key, which would generate a very long value being stored in SQLite, and very inefficiently read back. This stops that from being a problem, at the cost of some extra write calls.	2019-05-03 15:27:10 -07:00
Evan Tschannen	12088119d2	Merge pull request #1517 from alexmiller-apple/tstlog-6.1 Add a knob to limit amount of data read from sqlite for one PeekRequest.	2019-05-03 11:01:11 -07:00
Alex Miller	f4e48c3851	Add a knob to limit amount of data read from sqlite for one PeekRequest. This prevents peeking from degrading over time if there are a very large number of SpilledData entries for one particular tag.	2019-05-02 17:26:45 -07:00
Evan Tschannen	8590b710bf	added additional logging on the logs and log routers	2019-05-02 17:24:39 -07:00
Jingyu Zhou	8b5449e608	Fix review comments for PR #1473	2019-04-29 16:45:42 -07:00
Jingyu Zhou	5462f560e7	Add pseudo locality for log routers and tlogs This changes the logic of pop operations from log routers (LG): - LG pops tagLocalityLogRouterMapped from TLogs; - TLog converts tagLocalityLogRouterMapped back to tagLocalityLogRouter before popping. Later when we add more psuedo localities, the same pattern can be used.	2019-04-23 21:35:56 -07:00
Jingyu Zhou	0b1984978a	Small code refactoring.	2019-04-21 10:41:07 -07:00
Jingyu Zhou	ec1bc5cfca	Add LogSystemType enum	2019-04-21 10:41:07 -07:00
Evan Tschannen	6220a5ce0f	Merge pull request #1370 from jzhou77/fix-unreferenced Remove unused functions	2019-04-09 11:49:45 -07:00
mpilman	1c16f87a4e	Remove trace-calls to printable (in non-workloads)	2019-04-05 13:12:19 -07:00
Jingyu Zhou	47b4b82628	Merge branch 'master' into fix-unreferenced	2019-04-01 14:07:19 -07:00
Alex Miller	e7ad39246c	Fix typo	2019-03-29 20:16:26 -07:00
Evan Tschannen	a44ffd851e	fix: the shared tlog could fail to update a stopped tlog’s queueCommitVersion to version if a second tlog registered before it could issue the first commit for the tlog	2019-03-29 20:11:30 -07:00
Evan Tschannen	b6008558d3	renamed BinaryWriter.toStringRef() to .toValue(), because the function now returns a Standalone<StringRef>() eliminated an unnecessary copy from the proxy commit path eliminated an unnecessary copy from buffered peek cursor	2019-03-28 11:52:50 -07:00
Jingyu Zhou	a55f06e082	Remove unused functions Found with -Wunused-function flag.	2019-03-27 15:45:28 -07:00
Evan Tschannen	c705a1af74	fix: make sure recoveryLocation is always a valid page	2019-03-20 19:33:09 -07:00
Evan Tschannen	1c6ad6d307	fix: change the location where stopped is checked, because a yield could cause cause stopped to be set after the existing check	2019-03-20 19:33:09 -07:00
Alex Miller	b11ecb3210	Remove random bits of code that were either unneeded or leftover from debugging.	2019-03-18 15:47:20 -07:00
Alex Miller	37ea71b117	Implement limiting how many bytes recovery will read. This time, track what location in the DiskQueue has been spilled in persistent state, and then feed it back into the disk queue before recovery. This also introduces an ASSERT that recovery only reads exactly the bytes that it needs to have in memory.	2019-03-18 15:09:43 -07:00
Alex Miller	29ab7370cd	Clear versionLocation when spilling, and pop DQ separately. Popping the disk queue now requires potentially recovering the location to which we can pop from the spilled data itself, and for each tag we must maintain the first location with relevant data. The previous queue we had to represent the ordering, queueOrder, was used by spilling, and popped when a TLog had been spilled. This means that as soon as a TLog has been fully spilled, we have no idea how it relates in order to other fully spilled TLogs. Instead, use queueOrder to keep track of all the TLog UIDs until they're removed, and use spillOrder to keep track of the order only for spilling.	2019-03-18 15:09:22 -07:00
Alex Miller	7f5bc2981f	Checksum DiskQueue pages on read, but at a lower priority. If a server has its data spilled, then it's behind the 5s window. Feeding it data is less important than committing, so we can hide the extra CPU usage from checksumming the read amplified disk queue pages.	2019-03-15 21:01:19 -07:00
Alex Miller	ee4721a63f	Make checking or ignoring checksums part of the IDiskQueue::read API.	2019-03-15 21:01:18 -07:00
Alex Miller	81c59e88a8	Persist the protocol version of a TLog instance when it is created. This allows us to do easy upgrades of SpilledData in the future, if the need arises, because we then have a protocol version to compare against.	2019-03-15 21:01:17 -07:00
Alex Miller	686b097397	Remove verification code from DiskQueue and TLogServer.	2019-03-15 21:01:15 -07:00
Alex Miller	77f596743f	Bump persistFormat in TLogServer to differ from OldTLogServer* Though this format is being deprecated in favor of an eventual plumbing through of TLogVersion, we should probably bump it anyway. And also remove the fallback to OldTLogServer code. It should never be executed, as OldTLogServer_6_0 is entirely relied upon to execute OldTLogServer_4_6.	2019-03-15 21:01:13 -07:00
Alex Miller	4f98634f59	Add LogId to all TLog TraceEvents that have it.	2019-03-15 21:01:12 -07:00
Evan Tschannen	5873705228	tlog commits very rarely take an additional 6 seconds	2019-03-11 12:11:17 -07:00
Evan Tschannen	80c3f2f8e2	added status fields detailing which processes are degraded, and also the total number of degraded processes	2019-03-10 22:58:15 -07:00
Evan Tschannen	044b6b4f8a	Merge branch 'master' into feature-degraded-tlog # Conflicts: # fdbserver/ClusterController.actor.cpp	2019-03-08 22:50:41 -05:00
Evan Tschannen	53f16b5347	when a tlog queue commit takes longer than 5 seconds, its process is marked as degraded	2019-03-08 11:46:34 -05:00
Alex Miller	c6a65389ae	Remove noexcept macro and replace with BOOST_NOEXCEPT. BOOST_NOEXCEPT does what the noexcept macro was supposed to do, but in a way that is correctly maintained over time.	2019-03-05 22:06:12 -08:00
Alex Miller	244903a9de	Spill txsTag by value under TagMsg/ and not TagMsgRef/ There's not a tremendous reason as to why this matters now, but I feel like I might regret sometime later not keeping the same schema under the same key.	2019-03-04 01:42:39 -08:00
Alex Miller	72c2cf11ab	Replace ResourceLimiter with FlowLock.	2019-03-04 01:42:38 -08:00
Alex Miller	aff9ebe21a	Spill (start,length) instead of (begin,end) to save a few bytes.	2019-03-04 01:42:38 -08:00
Alex Miller	9ef283d4e7	Implement hard limiting of memory used to serve peek requests.	2019-03-04 01:42:38 -08:00
Alex Miller	e3506ad9af	Add a yield to parseMessagesForTag	2019-03-04 01:42:38 -08:00
Alex Miller	742f6e1847	Solve overreading via pre-calculating tag bytes per commit	2019-03-04 01:42:38 -08:00
Alex Miller	e7d8520c63	Batch more when spilling data.	2019-03-04 01:42:38 -08:00
Alex Miller	04e1170c88	Spill txsTag by value	2019-03-04 01:42:38 -08:00
Alex Miller	ba31f8f1f9	Remove all code related to writing and cleaning up old-style spilling.	2019-02-26 18:13:49 -08:00
Alex Miller	0539c1df00	Peek from new spilled data and not old spilled data.	2019-02-26 18:13:49 -08:00
Alex Miller	d687a4bb85	Implement proper cleanup of disk queue when spilling refs.	2019-02-26 18:00:55 -08:00
Alex Miller	84dc41c206	Add a comment.	2019-02-26 18:00:55 -08:00
Alex Miller	cade914645	Temporarily add verification of spilled tag data.	2019-02-26 18:00:55 -08:00
Alex Miller	f659825575	Persist start,end of message when spilling tags.	2019-02-26 18:00:55 -08:00
Alex Miller	8d76cbed02	Track both start and end versions in versionLocation.	2019-02-26 18:00:55 -08:00
Evan Tschannen	8afb7fbb9d	Merge pull request #1160 from alexmiller-apple/tstlog-fork Spill-By-Reference TLog Part 2: New and Old TLogServers co-exist harmoniously	2019-02-26 18:00:04 -08:00
Alex Miller	2dc57568cb	Change many things about log_version. * log_version in the database (`/conf/log_version`) is now a hint that gets rounded to the nearest supported version. * fdbcli and FDB enforce that only a valid log_version can be configured to * TLogVersion is persisted in CoreTLogSet (and LogSet and TLogSet) * Some comments here and there * Add an assert on filename length to make sure KV-pairs in filename don't exceed a maximum length.	2019-02-26 16:47:04 -08:00
Evan Tschannen	b8910ba7cd	Merge branch 'master' into feature-fix-force-recovery # Conflicts: # fdbclient/ManagementAPI.actor.h # fdbserver/DataDistribution.actor.cpp # fdbserver/storageserver.actor.cpp # fdbserver/workloads/KillRegion.actor.cpp	2019-02-22 14:38:13 -08:00
Alex Miller	91e05575a2	Rename OldTLogServer -> OldTLogServer_4_6	2019-02-19 22:18:10 -08:00
mpilman	3f0fd2a20c	Use fwd decls in WorkerInterface Also WorkerInterface.h -> WorkerInterface.actor.h	2019-02-19 15:16:59 -08:00

1 2 3 4 5 ...

341 Commits