swift

Author	SHA1	Message	Date
Clay Gerrard	aab45880f8	Breakup reclaim into batches We want to do the table scan without locking and group the locking deletes into small indexed operations to minimize the impact of background processes calling reclaim each cycle. Change-Id: I3ccd145c14a9b68ff8a9da61f79034549c9bc127 Co-Authored-By: Tim Burke <tim.burke@gmail.com> Closes-Bug: #1877651	2020-05-22 21:15:57 +00:00
Tim Burke	0fd23ee546	Fix pep8 job New flake8 came out with new & improved rules. Ignore E741; it would be too much churn. Fix the rest. Change-Id: I9125c8c53423232309a75cbcc5b695b378864c1b	2020-05-13 00:24:13 -07:00
Tim Burke	3f88907012	sharding: Better-handle newlines in container names Previously, if you were on Python 2.7.10+ [0], such a newline would cause the sharder to fail, complaining about invalid header values when trying to create the shard containers. On older versions of Python, it would most likely cause a parsing error in the container-server that was trying to handle the PUT. Now, quote all places that we pass around container paths. This includes: * The X-Container-Sysmeta-Shard-(Quoted-)Root sent when creating the (empty) remote shards * The X-Container-Sysmeta-Shard-(Quoted-)Root included when initializing the local handoff for cleaving * The X-Backend-(Quoted-)Container-Path the proxy sends to the object-server for container updates * The Location header the container-server sends to the object-updater Note that a new header was required in requests so that servers would know whether the value should be unquoted or not. We can get away with reusing Location in responses by having clients opt-in to quoting with a new X-Backend-Accept-Quoted-Location header. During a rolling upgrade, * old object-servers servicing requests from new proxy-servers will not know about the container path override and so will try to update the root container, * in general, object updates are more likely to land in the root container; the sharder will deal with them as misplaced objects, and * shard containers created by new code on servers running old code will think they are root containers until the server is running new code, too; during this time they'll fail the sharder audit and report stats to their account, but both of these should get cleared up upon upgrade. Drive-by: fix a "conainer_name" typo that prevented us from testing that we can shard a container with unicode in its name. Also, add more UTF8 probe tests. [0] See https://bugs.python.org/issue22928 Change-Id: Ie08f36e31a448a547468dd85911c3a3bc30e89f1 Closes-Bug: 1856894	2020-01-03 16:04:57 -08:00
Tim Burke	30d99f5140	Get rid of _delete_db It was parctically the same between both, only the table name was different -- and we already have a way to accomodate that. Change-Id: I112bef1cf44c7d6eeaacd30cea27cb71fb2e2406	2019-11-07 00:22:14 +00:00
Clay Gerrard	b75d593935	Refactor timestamp iter in db backend tests Change-Id: I61d044a6d778e158af73a3f4d9fc75d642af0beb	2019-10-26 16:50:32 -05:00
Tim Burke	ef664700d4	py3: Specify an encoding when loading db.pending pickles Otherwise you can hit UnicodeDecodeErrors when upgrading from py2 to py3. Change-Id: Iab808b3d0c2051badda0450015077ea8b1e5adb8	2019-06-17 08:06:39 -07:00
Tim Burke	4ca605c51e	Move base64-decoding/unpickling into DatabaseBroker ...which was the one responsible for pickling and encoding the data in the first place. Change-Id: Ia7fad7ddd7cf95c8a0168dd6c899c80ddfdd3521	2018-10-05 22:07:46 +00:00
Tim Burke	6e7ca26169	DB follow-up Change-Id: I32cff110f0771d6d92a881ef52e482aa17d8dbe4 Related-Change: I0dab4ff013f300396cff7313bc27b9d5644fe8a7	2018-10-05 21:42:46 +00:00
Pete Zaitcev	e3a5b63cc7	py3: Adapt db.py Change-Id: I0dab4ff013f300396cff7313bc27b9d5644fe8a7	2018-10-04 17:09:34 -05:00
Clay Gerrard	6714af8042	Use maybe_get more Some suggestions in the review of the related change: * use maybe_get in another obvious place * make maybe_gets test assertions stronger Related-Change-Id: Ifd54d76ab1a5a9d82848f3cae89c3e53134aa129 Change-Id: I751f1086d885c18d938f18e8afe1dd0e9c0c57e5	2018-05-22 15:03:50 -07:00
Clay Gerrard	2e321d94ed	Add a helper context for optional connection args We refactor a bunch of methods to support re-using an open connection when available. There's some code in the connection manager to support nesting connections to avoid deadlocks when we call a method that can open a connection while we already have a connection checked out - but it might be better in the long run if we just got better at passing open connections around whenever possible. Add a helper method on the base db broker class to make it easier to write methods that can optionally take an existing connection. Change-Id: Ifd54d76ab1a5a9d82848f3cae89c3e53134aa129	2018-05-22 14:17:13 -07:00
Tim Burke	83732a6813	Use assertRaises more It saves us some boilerplate and makes tighter assertions about the type of error. Change-Id: I34bf8017b5740203ee0b9f253e24f1869164f7c6	2018-05-21 11:13:05 -07:00
Alistair Coles	9d742b85ad	Refactoring, test infrastructure changes and cleanup ...in preparation for the container sharding feature. Co-Authored-By: Matthew Oliver <matt@oliver.net.au> Co-Authored-By: Tim Burke <tim.burke@gmail.com> Co-Authored-By: Clay Gerrard <clay.gerrard@gmail.com> Change-Id: I4455677abb114a645cff93cd41b394d227e805de	2018-05-15 18:18:25 +01:00
Ondřej Nový	bfe52a2e35	Quarantine DB without *_stat row Closes-Bug: #1747689 Change-Id: Ief6bd0ba6cf675edd8ba939a36fb9d90d3f4447f	2018-02-07 19:35:05 +01:00
Jenkins	21a0df4ae0	Merge "More assertion cleanup"	2017-09-13 06:44:02 +00:00
Tim Burke	85d6cd30be	Add Timestamp.now() helper Often, we want the current timestamp. May as well improve the ergonomics a bit and provide a class method for it. Change-Id: I3581c635c094a8c4339e9b770331a03eab704074	2017-04-27 14:19:00 -07:00
Tim Burke	c6b9195db8	More assertion cleanup Change-Id: Id88af19c5bfd0bcbbeabcf4eeb23beef4c50b1cb Related-Change: I416831c8ad92f8445bc8d9560040a5ebf5c90702	2016-12-12 14:08:07 -08:00
Cao Xuan Hoang	3da144a3af	Replace 'assertTrue(a not in b)' with 'assertNotIn(a, b)' trivialfix Change-Id: I416831c8ad92f8445bc8d9560040a5ebf5c90702	2016-12-12 16:23:09 +07:00
Matthew Oliver	3bde14b5cf	Quarantine malformed database schema SQLite errors Currently if an sqlite3.DatabaseError is thrown when caused by a corrupted database schema, it get logged and the database is isn't quarantined. This patch adds the malformed database schema case to the list of SQLite errors in possibly_quarantine that will trigger the db to be quarantined. Also it improved the possibly_quarantined unit test to test all existing exceptions, and catches exceptions based on the real world except we use in code. Closes-Bug: #1646247 Change-Id: Id9452c88f8394a2a910c34c69361442543aa206d	2016-12-01 14:17:02 +11:00
Victor Stinner	b923e0f892	Python 3: dict.iteritems() and dict.itervalues() * Replace dict.itervalues() with dict.values(). The Python 3 dict type has no itervalues() method, the old itervalues() method was renamed to values(). * Same change for dict.iteritems(), replaced with dict.items() * Exception: use six.itervalues() to yield on sock_data_by_port Using six.itervalues() and six.iteritems() would make the code less readable. The overhead of creating a temporary list is considered as negligible: http://lists.openstack.org/pipermail/openstack-dev/2015-June/066391.html Change-Id: Ifbe7faa16d419e7fe26f1fb464019b83c9171c45	2016-06-23 17:31:25 +02:00
Tim Burke	fe70898dac	Require account/container metadata be UTF-8 Otherwise, we get a UnicodeDecodeError when we call json.dumps() Change-Id: Ie200d029e1fd7f0ff0956c8ced98207e11ef9080	2016-02-29 10:27:29 +00:00
Zack M. Davis	1b8b08039a	remove remaining simplejson uses, prefer standard library import a1c32702, 736cf54a, and 38787d0f remove uses of `simplejson` from various parts of Swift in favor of the standard libary `json` module (introduced in Python 2.6). This commit performs the remaining `simplejson` to `json` replacements, removes two comments highlighting quirks of simplejson with respect to Unicode, and removes the references to it in setup documentation and requirements.txt. There were a lot of places where we were importing json from swift.common.utils, which is less intuitive than a direct `import json`, so that replacement is made as well. (And in two more tiny drive-bys, we add some pretty-indenting to an XML fragment and use `super` rather than naming a base class explicitly.) Change-Id: I769e88dda7f76ce15cf7ce930dc1874d24f9498a	2015-11-16 12:34:24 -08:00
janonymous	1882801be1	pep8 fix: assertNotEquals -> assertNotEqual assertNotEquals is deprecated in py3 Change-Id: Ib611351987bed1199fb8f73a750955a61d022d0a	2015-10-12 07:40:07 +00:00
janonymous	f5f9d791b0	pep8 fix: assertEquals -> assertEqual assertEquals is deprecated in py3, replacing it. Change-Id: Ida206abbb13c320095bb9e3b25a2b66cc31bfba8 Co-Authored-By: Ondřej Nový <ondrej.novy@firma.seznam.cz>	2015-10-11 12:57:25 +02:00
Jenkins	e7205fd7d6	Merge "cPickle is deprecated in py3, replacing it from six.moves"	2015-07-28 12:33:24 +00:00
janonymous	cd7b2db550	unit tests: Replace "self.assert_" by "self.assertTrue" The assert_() method is deprecated and can be safely replaced by assertTrue(). This patch makes sure that running the tests does not create undesired warnings. Change-Id: I0602ba39ef93263386644ee68088d5f65fcb4a71	2015-07-21 19:23:00 +05:30
janonymous	c907107fe4	cPickle is deprecated in py3, replacing it from six.moves cPickle is deprecated and should be replaced with six.moves to provide py2 and py3 compatibility. Change-Id: Ibad990708722360d188c641e61444d50a16a1e93	2015-07-07 22:46:37 +05:30
Victor Stinner	e5c962a28c	Replace xrange() with six.moves.range() Patch generated by the xrange operation of the sixer tool: https://pypi.python.org/pypi/sixer Manual changes: * Fix indentation for pep8 checks * Fix TestGreenthreadSafeIterator.test_access_is_serialized of test.unit.common.test_utils: replace range(1, 11) with list(range(1, 11)) * Fix UnsafeXrange docstring, revert change Change-Id: Icb7e26135c5e57b5302b8bfe066b33cafe69fe4d	2015-06-23 07:29:15 +00:00
janonymous	09e7477a39	Replace it.next() with next(it) for py3 compat The Python 2 next() method of iterators was renamed to __next__() on Python 3. Use the builtin next() function instead which works on Python 2 and Python 3. Change-Id: Ic948bc574b58f1d28c5c58e3985906dee17fa51d	2015-06-15 22:10:45 +05:30
Pradeep Kumar Singh	2759a6f159	Handle Disk IO error Exception in swift account auditor Swift account auditor fails to quarantine corrupt db due to Disk IO error. This patch fixes that by handling Disk IO Error Exception. Closes-Bug:1169189 Change-Id: I031ee2a5775e4a88d4fb00d972d553936147c42e	2015-06-02 19:20:27 +05:30
Prashanth Pai	a6f630f27c	fsync() on directories renamer() method now does a fsync on containing directory of target path and also on parent dirs of newly created directories, by default. This can be explicitly turned off in cases where it is not necessary (For example- quarantines). The following article explains why this is necessary: http://lwn.net/Articles/457667/ Although, it may seem like the right thing to do, this change does come at a performance penalty. However, no configurable option is provided to turn it off. Also, lock_path() inside invalidate_hash() was always creating part of object path in filesystem. Those are never fsync'd. This has been fixed. Change-Id: Id8e02f84f48370edda7fb0c46e030db3b53a71e3 Signed-off-by: Prashanth Pai <ppai@redhat.com>	2015-03-04 12:33:56 +05:30
Clay Gerrard	233e0aebf7	Fix reclaim on deleted containers The common db replicator's code path for reclaiming deleted db's beyond the reclaim age was not covered by unittests, and a AttributeError snuck in. In writing the test that would cover the common code both for accounts and containers I discovered another KeyError with the container conditional for validating the container's fully reported status. This fixes both those issues and adds additional tests for the cleanup empty account container partition and suffix directories. Change-Id: I2a1bfaefebd05b01231bf71dd908fcc49adb4c36	2014-12-03 17:10:15 -08:00
Richard (Rick) Hawkins	5b2c27a587	Fix metadata overall limits bug Currently metadata limits are checked on a per request basis. If multiple requests are sent within the per request limits, it is possible to exceed the overall limits. This patch adds an overall metadata check to ensure that multiple requests to add metadata to an account/container will check overall limits before adding the additional metadata. Change-Id: Ib9401a4ee05a9cb737939541bd9b84e8dc239c70 Closes-Bug: 1365350	2014-10-02 22:38:13 +00:00
Clay Gerrard	c1dc2fa624	Add two vector timestamps The normalized form of the X-Timestamp header looks like a float with a fixed width to ensure stable string sorting - normalized timestamps look like "1402464677.04188" To support overwrites of existing data without modifying the original timestamp but still maintain consistency a second internal offset vector is append to the normalized timestamp form which compares and sorts greater than the fixed width float format but less than a newer timestamp. The internalized format of timestamps looks like "1402464677.04188_0000000000000000" - the portion after the underscore is the offset and is a formatted hexadecimal integer. The internalized form is not exposed to clients in responses from Swift. Normal client operations will not create a timestamp with an offset. The Timestamp class in common.utils supports internalized and normalized formatting of timestamps and also comparison of timestamp values. When the offset value of a Timestamp is 0 - it's considered insignificant and need not be represented in the string format; to support backwards compatibility during a Swift upgrade the internalized and normalized form of a Timestamp with an insignificant offset are identical. When a timestamp includes an offset it will always be represented in the internalized form, but is still excluded from the normalized form. Timestamps with an equivalent timestamp portion (the float part) will compare and order by their offset. Timestamps with a greater timestamp portion will always compare and order greater than a Timestamp with a lesser timestamp regardless of it's offset. String comparison and ordering is guaranteed for the internalized string format, and is backwards compatible for normalized timestamps which do not include an offset. The reconciler currently uses a offset bump to ensure that objects can move to the wrong storage policy and be moved back. This use-case is valid because the content represented by the user-facing timestamp is not modified in way. Future consumers of the offset vector of timestamps should be mindful of HTTP semantics of If-Modified and take care to avoid deviation in the response from the object server without an accompanying change to the user facing timestamp. DocImpact Implements: blueprint storage-policies Change-Id: Id85c960b126ec919a481dc62469bf172b7fb8549	2014-06-19 10:18:06 -07:00
Pete Zaitcev	b02f0db126	Refactoring storage policies merge_timestamps * base implementation of is_deleted phrased to use _is_deleted * wrap pre-conn coded _is_deleted inside a transation for merge_timestamps Implements: blueprint storage-policies Change-Id: I6a948908c3e45b70707981d87171cb2cb910fe1e	2014-06-18 20:57:09 -07:00
Clay Gerrard	4321bb0af6	Add Storage Policy support to Containers Containers now have a storage policy index associated with them, stored in the container_stat table. This index is only settable at container creation time (PUT request), and cannot be changed without deleting and recreating the container. This is because a container's policy index will apply to all its objects, so changing a container's policy index would require moving large amounts of object data around. If a user wants to change the policy for data in a container, they must create a new container with the desired policy and move the data over. Keep status_changed_at up-to-date with status changes. In particular during container recreation and replication. When a container-server receives a PUT for a deleted database an extra UPDATE is issued against the container_stat table to notate the x-timestamp of the request. During replication if merge_timestamps causes a container's status to change (from DELETED to ACTIVE or vice-versa) the status_changed_at field is set to the current time. Accurate reporting of status_changed_at is useful for container replication forensics and allows resolution of "set on create" attributes like the upcoming storage_policy_index. Expose Backend container info on deleted containers. Include basic container info in backend headers on 404 responses from the container server. Default empty values are used as placeholders if the database does not exist. Specifically the X-Backend-Status-Changed-At, X-Backend-DELETE-Timestamp and the X-Backend-Storage-Policy-Index value will be needed by the reconciler to deal with reconciling out of order object writes in the face of recently deleted containers. * Add "status_changed_at" key to the response from ContainerBroker.get_info. * Add "Status Timestamp" field to swift.cli.info.print_db_info_metadata. * Add "status_changed_at" key to the response from AccountBroker.get_info. DocImpact Implements: blueprint storage-policies Change-Id: Ie6d388f067f5b096b0f96faef151120ba23c8748	2014-06-18 17:31:38 -07:00
Shane Wang	a94be9443d	Fix misspellings in swift Fix misspellings detected by: * pip install misspellings * git ls-files \| grep -v locale \| misspellings -f - Change-Id: I6594fc4ca5ae10bd30eac8a2f2493a376adcadee Closes-Bug: #1257295	2014-02-20 16:15:48 +08:00
Peter Portante	ca87827db9	Use a tempfile.mkdtemp() based temporary directory Change-Id: Ie0e8137615348b130b67323d0d2913dc5ebfd5fb	2014-01-19 16:29:45 -05:00
Samuel Merritt	b5b0b78fc7	Remove obsolete future imports The with statement has been standard since Python 2.5, so we can get rid of these imports. Change-Id: I280971c3d8c01e94cc2c17cacaedcbe9d9c8a3c3	2013-11-22 12:23:58 -08:00
Steven Lang	700479bd67	Fix DB locked error on commit This bug was introduced in ef7f9e27; while moving timeout for execute to the cursor wrapper, commit was moved as well; however commit is purely a connection method, only execute is passed on to a cursor. Added unit tests to check both methods for correct timeouts. This manifested in a test failure as: ERROR __call__ error with POST /sdb1/418/AUTH_d1c4b610b16a48de83219c696261009c/TestContainer-tempest-1572414684 : Traceback (most recent call last): File "/opt/stack/new/swift/swift/container/server.py", line 486, in __call__ res = method(req) File "/opt/stack/new/swift/swift/common/utils.py", line 1915, in wrapped return func(a, kw) File "/opt/stack/new/swift/swift/common/utils.py", line 687, in _timing_stats resp = func(ctrl, args, **kwargs) File "/opt/stack/new/swift/swift/container/server.py", line 464, in POST broker.update_metadata(metadata) File "/opt/stack/new/swift/swift/common/db.py", line 677, in update_metadata conn.commit() OperationalError: database is locked (txn: tx5065394f288740e69fcec-00528e184e) Change-Id: I269b133fac53d4792d21b62f801cc0c0ccf337ea	2013-11-21 12:02:33 -08:00
Jenkins	b6727dbb49	Merge "Use utils.ismount in place of os.path.ismount"	2013-11-21 08:37:10 +00:00
Steven Lang	ef7f9e27a2	Fixed concurrent PUT requests to accounts or containers When concurrent PUT requests come in for an account or container, the resulting DB access will try to lock the DB for writing. Normal access will retry when it encounters a locked DB, change 0fdad0d9 introduced a cursor for doing the initialization which did not have the retry capability, resulting in a hard failure. Fixes bug 1243973 Change-Id: I73b219e0f5eacf314d87b4d5e56c03daf51b2eca	2013-11-20 13:26:50 -08:00
Peter Portante	b1f51d00cd	Use utils.ismount in place of os.path.ismount See comments from: https://review.openstack.org/55991 Change-Id: Ibb4153702b3dc4c60f66abb11cd3fa1953449827 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-11-19 11:27:40 -05:00
Peter Portante	9411a24ba7	Revert "Refactor common/utils methods to common/ondisk" This reverts commit 7760f41c3ce436cb23b4b8425db3749a3da33d32 Change-Id: I95e57a2563784a8cd5e995cc826afeac0eadbe62 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-10-07 17:18:09 -04:00
ZhiQiang Fan	f72704fc82	Change OpenStack LLC to Foundation Change-Id: I7c3df47c31759dbeb3105f8883e2688ada848d58 Closes-bug: #1214176	2013-09-20 01:02:31 +08:00
Peter Portante	7760f41c3c	Refactor common/utils methods to common/ondisk Place all the methods related to on-disk layout and / or configuration into a new common module that can be shared by the various modules using the same on-disk layout. Change-Id: I27ffd4665d5115ffdde649c48a4d18e12017e6a9 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-09-17 17:32:04 -04:00
Pete Zaitcev	d4b024ad7d	Split backends off swift/common/db.py The main purpose of this patch is to lay the groundwork for allowing the container and account servers to optionally use pluggable backend implementations. The backend.py files will eventually be the module where the backend APIs are defined via docstrings of this reference implementation. The swift/common/db.py module will remain an internal module used by the reference implementation. We have a raft of changes to docstrings staged for later, but this patch takes care to relocate ContainerBroker and AccountBroker into their new home intact. Change-Id: Ibab5c7605860ab768c8aa5a3161a705705689b04	2013-09-10 13:30:28 -06:00
Dirk Mueller	3d36a76156	Use Python 3.x compatible except construct except x,y: was deprected and is removed in Python 3.x. Use "except x as y:" instead which works in any Python version >= 2.6. Change-Id: I7008c74b807340f3457d3a0c8bd0b83f23169d14	2013-09-07 10:50:54 +02:00
Peter Portante	b5a0b830e2	Pep8 remaining unit test modules in common (8 of 12) Change-Id: I6fa3291eeacb7ee5c095ad9bccbd33f027bf11e3 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-09-01 16:12:51 -04:00
Peter Portante	c067abd21e	Pep8 unit test modules for hacking and one liners (4 of 12) Address all the "hacking" lines that are flagged, and all the modules that just have one item flagged. Change-Id: I372a4bdf9c7748f73e38c4fd55e5954f1afade5b Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-09-01 15:12:39 -04:00

1 2

81 Commits