foundationdb

Commit Graph

Author	SHA1	Message	Date
Evan Tschannen	86958cb08d	Merge pull request #226 from cie/fix-taskBucket-unblockFuture Modify TaskBucketCorrectness to support chain and multiple tasks	2017-12-20 18:00:54 -08:00
Yichi Chiang	91e5abeaa6	Modify TaskBucketCorrectness to support chain and multiple tasks	2017-12-20 17:02:49 -08:00
Alex Miller	f70e3b9fe8	Add or change a bunch of comments to provide descriptions of function contracts. This cleans up a bit of the VersionStamp DR work I did, and leaves hints and advice for anyone who will be touching mutation applying code in the future.	2017-12-20 16:57:14 -08:00
Evan Tschannen	982f0dcb1e	Merge pull request #222 from cie/alexmiller/drtimefix2 Fix yet another VersionStamp DR issue.	2017-12-20 15:09:23 -08:00
Alex Miller	b5a6bc0ab7	Fix VersionStamp problems by instead adding a COMMIT_ON_FIRST_PROXY transaction option. Simulation identified the fact that we can violate the VersionStamps-are-always-increasing promise via the following series of events: 1. On proxy 0, dumpData adds commit requests to proxy 0's commit promise stream 2. To any proxy, a client submits the first transaction of abortBackup, which stops further dumpData calls on proxy 0. 3. To any proxy that is not proxy 0, submit a transaction that checks if it needs to upgrade the destination version. 4. The transaction from (3) is committed 5. Transactions from (1) are committed This is possible because the dumpData transactions have no read conflict ranges, and thus it's impossible to make them abort due to "conflicting" transactions. There's also no promise that if client C sends a commit to proxy A, and later a client D sends a commit to proxy B, that B must log its commit after A. (We only promise that if C is told it was committed before D is told it was committed, then A committed before B.) There was a failed attempt to fix this problem. We tried to add read conflict ranges to dumpData transactions so that they could be aborted by "conflicting" transactions. However, this failed because this now means that dumpData transactions require conflict resolution, and the stale read version that they use can cause them to be aborted with a transaction_too_old error. (Transactions that don't have read conflict ranges will never return transaction_too_old, because with no reads, the read snapshot version is effectively meaningless.) This was never previously possible, so the existing code doesn't retry commits, and to make things more complicated, the dumpData commits must be applied in order. This would require either adding dependencies to transactions (if A is going to commit then B must also be/have committed), which would be complicated, or submitting transactions with a fixed read version, and replaying the failed commits with a higher read version once we get a transaction_too_old error, which would unacceptably slow down the maximum throughput of dumpData. Thus, we've instead elected to add a special transaction option that bypasses proxy load balancing for commits, and always commits against proxy 0. We can know for certain that after the transaction from (2) is committed, all of the dumpData transactions that will be committed have been added to the commit promise stream on proxy 0. Thus, if we enqueue another transaction against proxy 0, we can know that it will be placed into the promise stream after all of the dumpData transactions, thus providing the semantics that we require: no dumpData transaction can commit after the destination version upgrade transaction.	2017-12-20 15:04:04 -08:00
Stephen Atherton	e0d9cea008	Merge branch 'master' into continuous-backup # Conflicts: # fdbclient/FileBackupAgent.actor.cpp # fdbrpc/BlobStore.actor.cpp	2017-12-19 23:02:14 -08:00
Alex Miller	c7dbd31a1e	Refactoring: Create a common prefixRange and do UID->Key once in backup.	2017-12-19 17:17:50 -08:00
Alex Miller	1488c12c18	Simulation will return and error and print if any non-suppressed SevError events were logged. This means that loops like `seed=1; while ./fdbserver -r simulation -s $seed; do seed=$(($seed+1)); done` to find an example of an often failing test. This also means joshua will report ExitCode errors on anything that has a SevError in the log. As a part of this, we also implicitly downgrade any injected errors to SevWarnAlways.	2017-12-19 17:17:50 -08:00
Stephen Atherton	e28641886d	TraceEvent improvements. Minor bug fix, restore log writing tasks didn't have the log file endVersion but it's only for logging purposes.	2017-12-19 15:27:04 -08:00
Evan Tschannen	a5601877b3	fix: valgrind issue with destruction ordering	2017-12-18 15:31:59 -08:00
Evan Tschannen	1dc9eceb6d	optimize GetKeyLocationRequests on the proxy so they only require a single map lookup, instead of doing 3 + (3* [number of ranges]) lookups	2017-12-15 20:13:44 -08:00
Stephen Atherton	33f9f1a95c	Added SnapshotDispatch task for writing snapshots in random order over a specified period of time and adapting speed to a growing or shrinking database. TaskBucket now supports scheduling tasks. TaskFuture now correctly recognizes multiple tasks in its callback space. TaskBucket extendTimeout() now supports specifying the new timeout version. Submitting a backup now requires a snapshot duration.	2017-12-14 01:44:38 -08:00
Evan Tschannen	7ce93426ed	fix: connection disabler in removeServerSafely needs to run for the whole test to avoid getting stuck on include all	2017-12-12 18:38:57 -08:00
Alec Grieser	4495a19299	Merge pull request #220 from cie/alexmiller/flowprofcircus Add class restrictions to CpuProfiler, and fix metric crash.	2017-12-11 14:13:22 -08:00
Evan Tschannen	73a0a07eac	clients ask for key location information directly from the proxy, instead of reading it from the database	2017-12-09 16:10:22 -08:00
Alex Miller	48660e9ce5	Add class restrictions to CpuProfiler, and fix metric crash. This change largely refactors away the old meaning of the value given to flow_profiler, which was the number of machines that we'd be profiling, and instead replaces it with the classes of processes to profile for the duration of the test. Most importantly, this means that one can profile in circus with a configuration that has "ssd" in it, and the circus run will still complete (as long as the argument isn't "storage"). And also finally add some other fixes I had to the same file to conditionally change the name of the metric we're looking for to comply with what's actually written.	2017-12-07 19:28:29 -08:00
Stephen Atherton	abb2dd1ebc	Merge pull request #214 from cie/alexmiller/fallocate Use fallocate to zero ranges instead of writing zeroes	2017-12-06 13:47:40 -08:00
Evan Tschannen	5a947212ed	fix: ensure all prior commits have completed before returning that a commit has committed from the disk queue	2017-12-06 12:31:07 -08:00
Stephen Atherton	f8e89a40ac	Bug fixes, take(1) is incorrect usage of FlowLock.	2017-12-04 10:25:47 -08:00
Evan Tschannen	49dac11a5f	added a SevWarnAlways for when a disk queue file grows larger than 20GB	2017-12-01 15:05:17 -08:00
Evan Tschannen	482ac38ca6	added knobs so that the client failure monitoring update rate and the server failure monitoring update rate are separate knobs	2017-12-01 13:04:32 -08:00
Evan Tschannen	c3918d892a	do not use bandwidth splitting on the keyServer shard, lots of sets and clears to this shard generally means you do not want to create additional data distribution work	2017-11-30 18:28:16 -08:00
Alex Miller	196258080b	Refactor zeroing a chunk of a file from DiskQueue into IAsyncFile. If we're going to do the work to provide more optimized ways to zero files, then I'd feel better with this being in a more common place, so that any other zero-ers are likely to reuse it. It also makes testing easier/more obvious. Also, because it's needed for correctness, fix the aligned_alloc for OSX, which wasn't aligned, and use an actually aligned allocation function.	2017-11-30 17:57:55 -08:00
Alex Miller	c7a120c59d	Rename IAsyncFile::incrementalDelete -> IAsyncFileSystem::incrementalDeleteFile. `deleteFile` existed in IAsyncFileSystem, so an incremental delete function seems to belong more as a virtual method on IAsyncFileSystem than a static method on IAsyncFile, and the naming should match. As long as we're here, change IAsyncFile to declare a virtual destructor, so that it has good and proper C++ behavior. I presume this is what was vaguely intended by the default constructor definition that previously existed?	2017-11-30 17:19:10 -08:00
Evan Tschannen	7f72aa7de5	fix: a storage server does not ever need to rollback before a version restored from disk	2017-11-30 11:19:43 -08:00
Evan Tschannen	e5a682948c	Merge pull request #212 from cie/check-cluster-controller-desired-class Check cluster controller using desired process class in consistency c…	2017-11-29 15:57:51 -08:00
Yichi Chiang	8ba0eaebff	Check cluster controller using desired process class in consistency check	2017-11-29 15:09:23 -08:00
Evan Tschannen	8c51bc4ac4	fixed low latency tests in a way that gives us better test coverage	2017-11-28 18:20:29 -08:00
Evan Tschannen	dc624a54dc	fix: avoid flushing large queues in simulation when checking latency	2017-11-27 17:23:20 -08:00
Stephen Atherton	1b1c8e985a	Merge branch 'master' into backup-container-refactor # Conflicts: # fdbclient/FileBackupAgent.actor.cpp	2017-11-25 19:54:51 -08:00
Stephen Atherton	6695c9e6a2	Bug fixes and improvements to error handling and trace events. The most serious bug was that restore would start at the wrong version, possibly skipping early log and range files.	2017-11-25 00:46:16 -08:00
Alex Miller	f19cb3bbbd	Merge pull request #208 from cie/alexmiller/grvtfix Fix the GRV performance regression	2017-11-17 15:00:44 -08:00
Yichi Chiang	d9a98aa968	Remove commented code	2017-11-16 17:25:37 -08:00
Yichi Chiang	0d5dc15ac8	Fix double recoveries	2017-11-16 16:58:55 -08:00
Alex Miller	e9412bbb11	Fix the GRV performance regression introduced by adding the policy engine to GRV calculations. Construction of LocalityGroup from LocalityData is expensive, and the previous code greatly ran afoul of that. The policy engine does a large amount of interning of strings and building compressed maps to make the expected many future selectReplica calls cheap. Unfortunately we don't call selectReplicas, so much of this work is undesireable for us, and a large amount of CPU time is spent doing this initialization work. The new changes aggressively do the minimal LocalityGroup::add() calls necessary, and make them as cheap as possibly by removing all elements from LocalityData that don't need to be considered by the policy. This optimization was also applied to the PeekCursor used during recovery, which should speed recoveries up by a small amount.	2017-11-16 16:15:52 -08:00
Evan Tschannen	ad456a939a	Merge pull request #206 from cie/change-excluded-cluster-controller Change excluded cluster controller	2017-11-15 17:28:33 -08:00
Yichi Chiang	f96faf72d9	Add fullyRecoveredConfig for checking exclusions	2017-11-15 17:15:24 -08:00
Evan Tschannen	30464e943c	Merge pull request #205 from cie/cleanup-spammy-traceevents Cleanup spammy traceevents	2017-11-15 12:41:37 -08:00
Evan Tschannen	e113dba0e3	added a new trace event tracking master recovery durations	2017-11-15 12:38:26 -08:00
Stephen Atherton	a77162b53d	Merge branch 'master' into backup-container-refactor # Conflicts: # fdbclient/BackupAgent.h # fdbclient/FileBackupAgent.actor.cpp # fdbclient/KeyBackedTypes.h	2017-11-15 08:14:47 -08:00
Stephen Atherton	3dfaf13b67	IBackupContainer has been rewritten to be a logical interface for storing, reading, deleting, expiring, and querying backup data. The details of how the data is organized or stored is now hidden from users of the interface. Both the local and blobstore containers have been rewritten, the key changes being a multi level directory structure and no more use of temporary files or pseudo-symlinks in the blob store implementation. This refactor has a large impact radius as the previous backup container was just a thin wrapper that presented a single level list of files and offered no methods for managing or interpreting the file structure so all of that logic was spread around other places in the code base. This made moving to the new blob store schema very messy, and without this refactor further changes in the future would only be worse. Several backup tasks have been cleaned up / simplified because they no longer need to manage the ‘raw’ structure of the backup. The addition of IBackupFile and its finish() method simplified the log and range writer tasks. Updated BlobStoreEndpoint to support now-required bucket creation and bucket listing prefix/delimiter options for finding common prefixes. Added KeyBackedSet<T> type. Moved JSONDoc to its own header. Added platform::findFilesRecursively(). Still to do: update command line tool to use new IBackupContainer interface, fix bugs in Restore startup.	2017-11-14 23:33:17 -08:00
Yichi Chiang	df922bc973	Change excluded cluster controller	2017-11-14 13:57:37 -08:00
A.J. Beamon	bb1297c686	Remove RkServerQueueInfo and RkTLogQueueInfo trace events, since this information is more or less already logged on the storage servers and tlogs. Update the quiet database check and magnesium to use the information from the logs and storage servers.	2017-11-14 12:59:42 -08:00
A.J. Beamon	3b952efb4e	Remove events from cluster controller that get logged for roughly every worker upon recovery, master registration, etc.	2017-11-14 10:15:45 -08:00
A.J. Beamon	0fea5e9c2f	Convert client_invalid_operation errors to ASSERTs.	2017-11-13 11:38:34 -08:00
A.J. Beamon	cd085764f1	Do not automatically change a cluster file that does not match what you expect.	2017-11-10 14:12:45 -08:00
Alex Miller	311d1ca87d	A variety of fixes that collectively fix using flow profiling in circus. To run, use --co=flow_profiling=-1, because reasons.	2017-11-07 13:55:16 -08:00
Evan Tschannen	706bf1e018	fix: we cannot trigger better master exists before a master is fully recovered because exclusions changed by the provisional master will not be committed until the master is fully recovered	2017-11-04 12:48:04 -07:00
Evan Tschannen	57aba0b3bc	fix: excluded servers were the same fitness as storage servers for the master role fix: better master exists did not considers exclusion for master fitness	2017-11-03 17:09:14 -07:00
Yichi Chiang	42fad5efe5	Introduce cluster controller process class in circus	2017-11-03 14:22:55 -07:00

1 2 3 4 5

244 Commits