This started with ShardRanges and its CLI. The sharder is at the
bottom of the dependency chain. Even the container backend needs it.
Once we started tinkering with the sharder, it all snowballed to
include the rest of the container services.
Beware, this does affect some of the Python 2 code. Most of it is
trivial and obviously correct, but it needs checking by reviewers.
About killing the stray "from __future__ import unicode_literals":
we do not do it in general. The specific problem it caused here was
a functional test failure: unicode leaked into a field that was
supposed to be encoded. It is just too hard to track the
types when rules change from file to file, so off with its head.
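For illustration (a minimal Python 2 sketch, not code from this
change), this is the kind of leak that import causes:

    from __future__ import unicode_literals

    name = 'some-container'   # unicode under Python 2, not str
    # anything expecting encoded bytes now gets unicode unless every
    # call site remembers to encode explicitly
    wire_value = name.encode('utf-8')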
Change-Id: Iba4e65d0e46d8c1f5a91feb96c2c07f99ca7c666
Previously, _check_node() wouldn't catch the ValueError raised when
a drive was unmounted. The error would bubble up, uncaught, and stop
the shard cycle. The practical effect was that an unmounted drive on
a node would prevent sharding from happening.
This patch updates _check_node() to properly use the check_drive()
function. Furthermore, the _check_node() return value has been modified
to be more similar to what check_drive() actually returns. This
should help prevent similar errors from being introduced in the future.
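The shape of the fix, as a rough sketch (check_drive() is the real
helper; the surrounding body is illustrative, not the actual patch):

    from swift.common.constraints import check_drive

    def _check_node(self, node):
        # let check_drive() decide whether the drive is usable; an
        # unmounted drive is logged and skipped instead of raising
        # out of the shard cycle
        try:
            return check_drive(self.root, node['device'],
                               self.mount_check)
        except ValueError:
            self.logger.warning('Skipping %s as it is not mounted',
                                node['device'])
            return None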
Closes-Bug: #1806500
Change-Id: I3da9b5b120a5980e77ef5c4dc8fa1697e462ce0d
Most daemons have a "go as fast as you can, then sleep for 30 seconds"
strategy towards resource utilization; the object-updater and
object-auditor, however, have some "X_per_second" options that allow
operators much better control over how they spend their I/O budget.
This change extends that pattern into the account-replicator,
container-replicator, and container-sharder, which have been known to
peg CPUs when they're not I/O limited.
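A minimal sketch of the "X_per_second" pattern (a generic
illustration, not the actual Swift helper):

    import time

    def ratelimit(iterable, max_rate):
        """Yield items at no more than max_rate per second."""
        if max_rate <= 0:          # zero/negative disables limiting
            for item in iterable:
                yield item
            return
        interval = 1.0 / max_rate
        next_allowed = time.time()
        for item in iterable:
            now = time.time()
            if now < next_allowed:
                time.sleep(next_allowed - now)
            next_allowed = max(now, next_allowed) + interval
            yield item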
Partial-Bug: #1784753
Change-Id: Ib7f2497794fa2f384a1a6ab500b657c624426384
...instead of 10,000,000. The sample configs were already using one
million, all of our testing with non-SAIO containers was done with
one million, and the resulting container DBs were around 100MB, which
seems like a comfortable size. Pretty sure this was just a typo during
some code cleanup.
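Assuming the option in question is shard_container_threshold, the
sample-config default now reads:

    [container-sharder]
    # one million objects per container before sharding; container
    # DBs come out around 100MB at that size
    shard_container_threshold = 1000000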
Change-Id: Icd31f9d8efaac2d5dc0f021cad550687859558b9
The sharder daemon visits container dbs and, when necessary, executes
the sharding workflow on the db.
The workflow is, in overview (see the sketch after this list):
- perform an audit of the container for sharding purposes.
- move any misplaced objects that do not belong in the container
to their correct shard.
- move shard ranges from FOUND state to CREATED state by creating
shard containers.
- move shard ranges from CREATED to CLEAVED state by cleaving objects
to shard dbs and replicating those dbs. By default this is done in
batches of 2 shard ranges per visit.
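In code, one visit might look roughly like the outline below (all
helper names are hypothetical stand-ins, not Swift's actual API; only
the states and the batch size come from the workflow above):

    def process_container(broker, cleave_batch_size=2):
        # hypothetical outline of one sharder visit to a container db
        audit_container(broker)          # audit for sharding purposes
        move_misplaced_objects(broker)   # objects -> correct shards
        for shard_range in get_shard_ranges(broker):
            if shard_range.state == 'FOUND':
                # FOUND -> CREATED: create the shard container
                create_shard_container(shard_range)
        # CREATED -> CLEAVED: cleave objects into shard dbs and
        # replicate them, at most cleave_batch_size ranges per visit
        cleave(broker, batch_size=cleave_batch_size)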
Additionally, when the auto_shard option is True (NOT yet recommended
in production), the sharder will identify shard ranges for containers
that have exceeded the threshold for sharding, and will also manage
the sharding and shrinking of shard containers.
The manage_shard_ranges tool provides a means to manually identify
shard ranges and merge them into a container in order to trigger
sharding. This is currently the recommended way to shard a container.
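A typical invocation, assuming the swift-manage-shard-ranges entry
point and its find_and_replace subcommand (the object count is
illustrative):

    # find shard ranges of roughly 500000 objects each, merge them
    # into the container db, and mark the db ready for sharding
    swift-manage-shard-ranges <container_db_path> \
        find_and_replace 500000 --enable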
Co-Authored-By: Alistair Coles <alistairncoles@gmail.com>
Co-Authored-By: Tim Burke <tim.burke@gmail.com>
Co-Authored-By: Clay Gerrard <clay.gerrard@gmail.com>
Change-Id: I7f192209d4d5580f5a0aa6838f9f04e436cf6b1f