foundationdb

Commit Graph

Author	SHA1	Message	Date
A.J. Beamon	99c9958db7	Some more trace event normalization	2018-06-08 13:57:00 -07:00
A.J. Beamon	e5488419cc	Attempt to normalize trace events: * Detail names now all start with an uppercase character and contain no underscores. Ideally these should be head-first camel case, though that was harder to check. * Type names have the same rules, except they allow one underscore (to support a usage pattern Context_Type). The first character after the underscore is also uppercase. * Use seconds instead of milliseconds in details. Added a check when events are logged in simulation that logs a message to stderr if the first two rules above aren't followed. This probably doesn't address every instance of the above problems, but all of the events I was able to hit in simulation pass the check.	2018-06-08 11:11:08 -07:00
Evan Tschannen	e82985aea2	fix: continue setting beginVersion so that versions between 5.2.0 and 5.2.2 do not crash when decoding tasks created by 5.2.3	2018-06-06 13:34:22 -07:00
Evan Tschannen	4120062bb9	fix: backup initialized its begin version at 1 instead of the read version of the starting transaction fix: erasing log ranges did not properly divide up work between transactions to prevent making transactions which were too large	2018-06-06 13:05:53 -07:00
Evan Tschannen	8930c2e3db	DR upgrade tests now test the durability of the data.	2018-05-09 15:11:05 -07:00
Yichi Chiang	c721ab6854	Fix review comments	2018-04-27 13:54:34 -07:00
Yichi Chiang	6bddf8aefa	Upgrade DR from 5.1 to 5.2	2018-04-26 17:24:40 -07:00
Evan Tschannen	57d650062a	merge 5.1 into 5.2	2018-04-18 20:44:31 -07:00
Evan Tschannen	77d100e1e6	fix: if a DR cluster has a much lower version than the primary database, it would take a long time to process the empty versions. But the version of the DR cluster before starting the DR to avoid this problem.	2018-04-18 19:37:24 -07:00
yichic	ede5cab192	Merge pull request #89 from yichic/share-log-mutations-5.2 Share log mutations 5.2	2018-03-19 12:01:26 -07:00
Yichi Chiang	ec02e54f64	Refactor EraseLogData()	2018-03-19 11:56:01 -07:00
Yichi Chiang	1f2602d2b3	Fix all review comments	2018-03-19 11:33:33 -07:00
Yichi Chiang	d6559b144f	Share log mutations between backups and DRs which have the same backup range	2018-03-19 11:32:50 -07:00
Stephen Atherton	dcf5b2e35d	All readCommitted() functions now use Transaction instead of ReadYourWritesTransaction to reduce memory consumption in Backup and DR. Also removed one readCommitted() variant as it is just a special case of another definition.	2018-03-07 13:56:34 -08:00
Alec Grieser	0bae9880f1	remove trailing whitespace from our copyright headers ; fixed formatting of python setup.py	2018-02-21 10:25:11 -08:00
Alex Miller	f021934792	Fix yet another VersionStamp DR bug. In this episode, we discover that having a transaction retry loop in which the transaction conditionally has write conflict ranges is potentially troublesome. To simplify the problem, if we have two concurrent transaction loops: retry { if (rand() > .5) tr->set('x', rand()); if (rand() > .5) tr->set('y', rand()); } and retry { x = tr->get('x') y = tr->get('y') if (x > y) { tr->set('y', x) } tr->commit(); } Is not guaranteed that x > y in the database after the second transaction commits. This is because it could read an older snapshot of x and y, in which x was greater than y, and thus not invoke set. This means that `tr` is now a read-only transaction, which no-ops out of committing as an "optimization". If we add any write conflict range to `tr`, it then will conflict checked and committed, which would guarantee that x>y when it commits. Replace the first transaction with dumpData, and the second with version upgrade transaction, and you have the bug that we're fixing, why, and how.	2018-01-05 14:23:11 -08:00
Alex Miller	b264a98aea	Fix yet another VersionStamp DR bug. In this episode, we discover that having a transaction retry loop in which the transaction conditionally has write conflict ranges is potentially troublesome. To simplify the problem, if we have two concurrent transaction loops: retry { if (rand() > .5) tr->set('x', rand()); if (rand() > .5) tr->set('y', rand()); } and retry { x = tr->get('x') y = tr->get('y') if (x > y) { tr->set('y', x) } tr->commit(); } Is not guaranteed that x > y in the database after the second transaction commits. This is because it could read an older snapshot of x and y, in which x was greater than y, and thus not invoke set. This means that `tr` is now a read-only transaction, which no-ops out of committing as an "optimization". If we add any write conflict range to `tr`, it then will conflict checked and committed, which would guarantee that x>y when it commits. Replace the first transaction with dumpData, and the second with version upgrade transaction, and you have the bug that we're fixing, why, and how.	2018-01-04 17:29:43 -08:00
Alex Miller	f70e3b9fe8	Add or change a bunch of comments to provide descriptions of function contracts. This cleans up a bit of the VersionStamp DR work I did, and leaves hints and advice for anyone who will be touching mutation applying code in the future.	2017-12-20 16:57:14 -08:00
Evan Tschannen	38cff7d4a5	every transaction which clears applyMutation keys does so on the first proxy	2017-12-20 15:41:47 -08:00
Evan Tschannen	982f0dcb1e	Merge pull request #222 from cie/alexmiller/drtimefix2 Fix yet another VersionStamp DR issue.	2017-12-20 15:09:23 -08:00
Alex Miller	b5a6bc0ab7	Fix VersionStamp problems by instead adding a COMMIT_ON_FIRST_PROXY transaction option. Simulation identified the fact that we can violate the VersionStamps-are-always-increasing promise via the following series of events: 1. On proxy 0, dumpData adds commit requests to proxy 0's commit promise stream 2. To any proxy, a client submits the first transaction of abortBackup, which stops further dumpData calls on proxy 0. 3. To any proxy that is not proxy 0, submit a transaction that checks if it needs to upgrade the destination version. 4. The transaction from (3) is committed 5. Transactions from (1) are committed This is possible because the dumpData transactions have no read conflict ranges, and thus it's impossible to make them abort due to "conflicting" transactions. There's also no promise that if client C sends a commit to proxy A, and later a client D sends a commit to proxy B, that B must log its commit after A. (We only promise that if C is told it was committed before D is told it was committed, then A committed before B.) There was a failed attempt to fix this problem. We tried to add read conflict ranges to dumpData transactions so that they could be aborted by "conflicting" transactions. However, this failed because this now means that dumpData transactions require conflict resolution, and the stale read version that they use can cause them to be aborted with a transaction_too_old error. (Transactions that don't have read conflict ranges will never return transaction_too_old, because with no reads, the read snapshot version is effectively meaningless.) This was never previously possible, so the existing code doesn't retry commits, and to make things more complicated, the dumpData commits must be applied in order. This would require either adding dependencies to transactions (if A is going to commit then B must also be/have committed), which would be complicated, or submitting transactions with a fixed read version, and replaying the failed commits with a higher read version once we get a transaction_too_old error, which would unacceptably slow down the maximum throughput of dumpData. Thus, we've instead elected to add a special transaction option that bypasses proxy load balancing for commits, and always commits against proxy 0. We can know for certain that after the transaction from (2) is committed, all of the dumpData transactions that will be committed have been added to the commit promise stream on proxy 0. Thus, if we enqueue another transaction against proxy 0, we can know that it will be placed into the promise stream after all of the dumpData transactions, thus providing the semantics that we require: no dumpData transaction can commit after the destination version upgrade transaction.	2017-12-20 15:04:04 -08:00
Stephen Atherton	d87aa521e9	Merge branch 'backup-container-refactor' into continuous-backup	2017-12-19 23:39:00 -08:00
Stephen Atherton	e0d9cea008	Merge branch 'master' into continuous-backup # Conflicts: # fdbclient/FileBackupAgent.actor.cpp # fdbrpc/BlobStore.actor.cpp	2017-12-19 23:02:14 -08:00
Alex Miller	c7dbd31a1e	Refactoring: Create a common prefixRange and do UID->Key once in backup.	2017-12-19 17:17:50 -08:00
Yichi Chiang	50c154fed4	Add fdbbackup interface	2017-12-14 13:54:01 -08:00
Stephen Atherton	20a8aae241	Old bug fix, transaction reset() not being called in a retry loop.	2017-12-02 07:02:26 -08:00
Alex Miller	e583beb8f6	Fix a race between dumpData and version upgrades. This fixes the occasional VersionStampBackupToDB failures, that were caused by the version upgrade comarision happening before dumpData invocations were stopped. Committing the first transaction stops dumpData, and thus we can then do the primary vs secondary version check correctly.	2017-11-30 17:37:00 -08:00
Stephen Atherton	aeebe711ce	TaskBucket’s saveAndExtend() is now accomplished through extendTimeout() with an option to save parameters. SaveAndExtendIncrementally() has been removed as it is no longer needed because TaskBucket’s normal execution loop calls extendTimeout() periodically as long as the TaskFunc’s execute() actor has not finished or thrown. If a TaskFunc wants to save changes to task parameters to checkpoint progress for task restarts to benefit from it can call extendTimeout() explicitly with the updateParams flag set to true.	2017-11-30 17:18:57 -08:00
Stephen Atherton	d9c2f6d705	Bug fix. The terminator argument of readCommitted() previously did nothing, and end_of_stream() was always sent to the output stream. The parameter was fixed to enable changing this behavior but original the behavior was not being correctly preserved in at least one case.	2017-11-26 22:52:47 -08:00
Evan Tschannen	98b4270703	fix: disableKey was read before options were set	2017-10-30 13:11:54 -07:00
Evan Tschannen	fb89ae9f85	added the ability to enable and disable all backup and DR agents from fdbbackup and fdbdr.	2017-10-30 12:35:00 -07:00
Alex Miller	11668bb359	Fixing code review comments.	2017-09-29 15:58:36 -07:00
Alex Miller	87a1581871	Ensure VersionStamps are strictly increasing with DR ACI switchovers. This should be the final change in making sure that versionstamps are never higher than the read version of a database that they're read from.	2017-09-29 15:58:36 -07:00
Alex Miller	8f4c45418b	Make atomicSwitchover preserve an ever-increasing commit version.	2017-09-29 15:58:36 -07:00
Evan Tschannen	1626e16377	Merge branch 'release-4.6' into release-5.0	2017-05-31 16:23:37 -07:00
FDB Dev Team	a674cb4ef4	Initial repository commit	2017-05-25 13:48:44 -07:00

36 Commits