Simulation identified that we can violate the
VersionStamps-are-always-increasing promise via the following series of events:
1. On proxy 0, dumpData adds commit requests to proxy 0's commit promise stream.
2. To any proxy, a client submits the first transaction of abortBackup, which stops further dumpData calls on proxy 0.
3. To any proxy that is not proxy 0, a client submits a transaction that checks whether it needs to upgrade the destination version.
4. The transaction from (3) is committed.
5. The transactions from (1) are committed.
This is possible because the dumpData transactions have no read conflict
ranges, and thus it's impossible to make them abort due to "conflicting"
transactions. There's also no promise that if client C sends a commit to proxy
A, and later a client D sends a commit to proxy B, then B must log its commit
after A. (We only promise that if C is told it was committed before D is told
it was committed, then A committed before B.)
There was a failed attempt to fix this problem: we tried to add read conflict
ranges to dumpData transactions so that they could be aborted by "conflicting"
transactions. This failed because the dumpData transactions then require
conflict resolution, and the stale read version that they use can cause them
to be aborted with a transaction_too_old error. (Transactions that have no
read conflict ranges will never return transaction_too_old, because with no
reads, the read snapshot version is effectively meaningless.) This was never
previously possible, so the existing code doesn't retry commits, and, to make
things more complicated, the dumpData commits must be applied in order.
Handling the retries would require either adding dependencies between
transactions (if A is going to commit, then B must also commit or have
committed), which would be complicated, or submitting transactions with a
fixed read version and replaying the failed commits with a higher read version
once we get a transaction_too_old error, which would unacceptably reduce the
maximum throughput of dumpData.
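For reference, the rejected approach amounted to something like the following sketch, assuming the C++ client's `Transaction` API (the helper name is illustrative):

```cpp
#include "fdbclient/NativeAPI.actor.h"

// Sketch of the rejected fix: give the otherwise write-only dumpData
// transaction a read conflict range so that a conflicting write can abort it.
void makeAbortable(Transaction& tr, KeyRangeRef range) {
	tr.addReadConflictRange(range);
	// Side effect: the transaction now goes through conflict resolution, so
	// its stale read version can surface as transaction_too_old at commit.
}
```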
Thus, we've instead elected to add a special transaction option that bypasses
proxy load balancing for commits and always commits against proxy 0. We know
for certain that after the transaction from (2) is committed, all of the
dumpData transactions that will ever be committed have already been added to
the commit promise stream on proxy 0. Thus, if we enqueue another transaction
against proxy 0, we know it will be placed into the promise stream after all
of the dumpData transactions, providing exactly the semantics we require: no
dumpData transaction can commit after the destination version upgrade
transaction.
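A minimal sketch of such a commit with the C++ client, assuming the new option is exposed as `FDBTransactionOptions::COMMIT_ON_FIRST_PROXY` (the `upgradeDestinationVersion` helper and its parameters are illustrative):

```cpp
#include "fdbclient/NativeAPI.actor.h"
#include "flow/actorcompiler.h"  // must be the last include

// Illustrative helper: commit the destination-version upgrade through proxy 0
// so it is queued behind every dumpData commit already in the promise stream.
ACTOR Future<Void> upgradeDestinationVersion(Database cx, Key versionKey, Value newVersion) {
	state Transaction tr(cx);
	loop {
		try {
			// Bypass proxy load balancing for this commit.
			tr.setOption(FDBTransactionOptions::COMMIT_ON_FIRST_PROXY);
			tr.set(versionKey, newVersion);
			wait(tr.commit());
			return Void();
		} catch (Error& e) {
			wait(tr.onError(e));
		}
	}
}
```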
This means that loops like `seed=1; while ./fdbserver -r simulation -s $seed;
do seed=$(($seed+1)); done` can be used to find an example of an often-failing
test. This also means joshua will report ExitCode errors on anything that has
a SevError in the log.
As part of this, we also implicitly downgrade any injected errors to SevWarnAlways.
If we're going to do the work to provide more optimized ways to zero files,
then I'd feel better with this living in a more common place, so that any other
zero-ers are likely to reuse it. It also makes testing easier and more obvious.
Also, because it's needed for correctness, fix aligned_alloc on OSX, which
didn't actually return aligned memory, by using a genuinely aligned allocation
function.
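A minimal sketch of the fix, assuming posix_memalign is the replacement (the name `alignedAlloc` stands in for the platform shim being fixed):

```cpp
#include <cstdlib>

// Return `size` bytes aligned to `alignment` (a power of two that is a
// multiple of sizeof(void*)), or nullptr on failure. Unlike the previous
// OSX implementation, posix_memalign actually guarantees the alignment.
void* alignedAlloc(size_t alignment, size_t size) {
	void* ptr = nullptr;
	if (posix_memalign(&ptr, alignment, size) != 0)
		return nullptr;
	return ptr;
}
```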
Several backup tasks have been cleaned up / simplified because they no longer need to manage the 'raw' structure of the backup. The addition of IBackupFile and its finish() method simplified the log and range writer tasks. Other changes:
- Updated BlobStoreEndpoint to support now-required bucket creation and bucket listing prefix/delimiter options for finding common prefixes.
- Added the KeyBackedSet<T> type.
- Moved JSONDoc to its own header.
- Added platform::findFilesRecursively().
Still to do: update command line tool to use new IBackupContainer interface, fix bugs in Restore startup.
Added an optimization that keeps throttled events in a separate set. Since this set is expected to be small, comparing every event against it is cheap.
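A minimal sketch of the idea, with hypothetical names (the real change lives in the trace-event throttling code):

```cpp
#include <set>
#include <string>

// Hypothetical illustration: keep only the currently throttled event types
// in a small side set, so the common case is a lookup in a near-empty set.
std::set<std::string> throttledEvents;

bool isThrottled(const std::string& eventType) {
	// Expected to be small (usually empty), so this check is cheap.
	return throttledEvents.count(eventType) != 0;
}

void throttle(const std::string& eventType) {
	throttledEvents.insert(eventType);
}
```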
std::is_pod<> being less restrictive than is_binary_serializable<> meant that
structs that were both POD and had a serialize method defined would be binary
serialized instead of using the defined serialize(). This meant that any
padding the struct contained was serialized as well, which caused mass waves
of valgrind failures from uninitialized memory.
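An illustration of the hazard, with a hypothetical struct (the serialize() shape mirrors the codebase's convention but is illustrative here):

```cpp
#include <cstdint>
#include <type_traits>

struct Example {
	int8_t a;  // 1 byte
	// The compiler inserts 3 bytes of padding here to align b.
	int32_t b; // 4 bytes

	// Never called if the binary-serialization path wins the overload.
	template <class Ar>
	void serialize(Ar& ar) { ar & a & b; }
};

// Member functions don't affect POD-ness, so std::is_pod<> still matches...
static_assert(std::is_pod<Example>::value, "still POD despite serialize()");
// ...and copying the raw struct bytes ships 3 uninitialized padding bytes.
static_assert(sizeof(Example) == 8, "3 of these 8 bytes are padding");
```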
Included in this change are additional uses of valgrind client requests, so
that attempts to send uninitialized memory are reported at the sending site,
rather than during the checksum calculation when the packet is sent.
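A minimal sketch of the technique, using valgrind's memcheck client requests (`sendPacket` is a hypothetical stand-in for the network send path):

```cpp
#include <cstddef>
#include <valgrind/memcheck.h>

void sendPacket(const void* data, size_t length) {
	// Flag uninitialized bytes here, at the call site that provided them,
	// instead of later when the packet checksum first reads the memory.
	(void)VALGRIND_CHECK_MEM_IS_DEFINED(data, length);
	// ... serialize and enqueue the bytes for the network ...
}
```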