swift

Author	SHA1	Message	Date
Jenkins	b4121d8f36	Merge "Add debug logging to container sync requests"	2014-06-13 04:47:24 +00:00
gholt	11bbc09bd7	Add debug logging to container sync requests Also fixed a bug where SimpleClient would send ?format=json for object requests, which is not necessary. Change-Id: If06a7dcebc9de2d7c8b28a046d60b902dae821c1	2014-05-28 01:18:28 +00:00
gholt	4350152828	Container sync no longer sending swift_bytes value Container sync had a bug where it'd send out the trailing "; swift_bytes=xxx" part of the content-type header. That trailing part is just for internal cluster usage by SLO. Since that needed to be stripped in two places now, I separated it out to a function that both spots call. Change-Id: Ibd6035d7a6b78205344bcc9d98bc1b7a9d463427	2014-05-21 13:32:17 +00:00
gholt	bf292ae0cb	Fixed error in container sync exc handling If container sync got a non-ClientException and then a ClientException during a get/put operation, it previously would break in the error handling. Change-Id: Ib2fa70270f3ec870bf60d5bbb3e8f514aeeb0927	2014-04-03 23:06:13 +00:00
gholt	69d331d0d6	Container Sync: Simple HTTP Proxy load balancing Change-Id: I021b043b927153bacff48cae648d4d8c5bbad765	2014-01-23 01:16:05 +00:00
gholt	f60d05686f	New container sync configuration option Summary of the new configuration option: The cluster operators add the container_sync middleware to their proxy pipeline and create a container-sync-realms.conf for their cluster and copy this out to all their proxy and container servers. This file specifies the available container sync "realms". A container sync realm is a group of clusters with a shared key that have agreed to provide container syncing to one another. The end user can then set the X-Container-Sync-To value on a container to //realm/cluster/account/container instead of the previously required URL. The allowed hosts list is not used with this configuration and instead every container sync request sent is signed using the realm key and user key. This offers better security as source hosts can be faked much more easily than faking per request signatures. Replaying signed requests, assuming it could easily be done, shouldn't be an issue as the X-Timestamp is part of the signature and so would just short-circuit as already current or as superceded. This also makes configuration easier for the end user, especially with difficult networking situations where a different host might need to be used for the container sync daemon since it's connecting from within a cluster. With this new configuration option, the end user just specifies the realm and cluster names and that is resolved to the proper endpoint configured by the operator. If the operator changes their configuration (key or endpoint), the end user does not need to change theirs. DocImpact Change-Id: Ie1704990b66d0434e4991e26ed1da8b08cb05a37	2014-01-10 23:48:00 +00:00
Chmouel Boudjnah	150f338fc2	Remove swiftclient dep on direct_client Partial Implements: blueprint remove-swiftclient-dependency Change-Id: I9af7150e5d21d50e5f880e57796314b8f05822d2	2013-12-24 03:11:43 -08:00
Peter Portante	4ed1c8473f	Handle optional arguments for run_forever() All the other daemons do this, and since the out deamon wrapper scripts pass all the command line options through directly, seems simple enough to handle them by ignoring. This is also applied to run_once(). Change-Id: I1df83bdf78f0dc3d911019f67f78301967b5da72 Closes-Bug: #1253891 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-11-22 14:39:30 -05:00
Peter Portante	5202b0e586	DiskFile API, with reference implementation Refactor on-disk knowledge out of the object server by pushing the async update pickle creation to the new DiskFileManager class (name is not the best, so suggestions welcome), along with the REPLICATOR method logic. We also move the mount checking and thread pool storage to the new ondisk.Devices object, which then also becomes the new home of the audit_location_generator method. For the object server, a new setup() method is now called at the end of the controller's construction, and the _diskfile() method has been renamed to get_diskfile(), to allow implementation specific behavior. We then hide the need for the REST API layer to know how and where quarantining needs to be performed. There are now two places it is checked internally, on open() where we verify the content-length, name, and x-timestamp metadata, and in the reader on close where the etag metadata is checked if the entire file was read. We add a reader class to allow implementations to isolate the WSGI handling code for that specific environment (it is used no-where else in the REST APIs). This simplifies the caller's code to just use a "with" statement once open to avoid multiple points where close needs to be called. For a full historical comparison, including the usage patterns see: https://gist.github.com/portante/5488238 (as of master, 2b639f5, Merge "Fix 500 from account-quota This Commit middleware") --------------------------------+------------------------------------ DiskFileManager(conf) Methods: .pickle_async_update() .get_diskfile() .get_hashes() Attributes: .devices .logger .disk_chunk_size .keep_cache_size .bytes_per_sync DiskFile(a,c,o,keep_data_fp=) DiskFile(a,c,o) Methods: Methods: .__iter__() .close(verify_file=) .is_deleted() .is_expired() .quarantine() .get_data_file_size() .open() .read_metadata() .create() .create() .write_metadata() .delete() .delete() Attributes: Attributes: .quarantined_dir .keep_cache .metadata DiskFileReader() Methods: .__iter__() .close() Attributes: +.was_quarantined DiskWriter() DiskFileWriter() Methods: Methods: .write() .write() .put() .put() * Note that the DiskFile class * Note that the DiskReader() object implements all the methods returned by the necessary for a WSGI app DiskFileOpened.reader() method iterator implements all the methods necessary for a WSGI app iterator + Note that if the auditor is refactored to not use the DiskFile class, see https://review.openstack.org/44787 then we don't need the was_quarantined attribute A reference "in-memory" object server implementation of a backend DiskFile class in swift/obj/mem_server.py and swift/obj/mem_diskfile.py. One can also reference https://github.com/portante/gluster-swift/commits/diskfile for the proposed integration with the gluster-swift code based on these changes. Change-Id: I44e153fdb405a5743e9c05349008f94136764916 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-10-17 15:03:31 -04:00
ZhiQiang Fan	f72704fc82	Change OpenStack LLC to Foundation Change-Id: I7c3df47c31759dbeb3105f8883e2688ada848d58 Closes-bug: #1214176	2013-09-20 01:02:31 +08:00
Chmouel Boudjnah	d3bd30da41	Fix sync test when localhost on port 80 is binded - When localhost:80 was binding the tests was trying to connect into it. - To test you can simply run sudo python -m SimpleHTTPServer 80 which should show : 1.0.0.127.in-addr.arpa - - [06/Aug/2013 14:10:42] code 501, message Unsupported method ('DELETE') 1.0.0.127.in-addr.arpa - - [06/Aug/2013 14:10:42] "DELETE /a/c/o HTTP/1.1" 501 - (the test was passing since 501 would raise ClientException). mock delete_object in the fourth test to fix that - Refactor the code to use mock.patch as well. Closes-Bug: 1208802 Change-Id: I5ddd4ac3a97879f51cf5883fcfc0fe0f0adaeff6	2013-09-11 15:49:48 -04:00
Dirk Mueller	3d36a76156	Use Python 3.x compatible except construct except x,y: was deprected and is removed in Python 3.x. Use "except x as y:" instead which works in any Python version >= 2.6. Change-Id: I7008c74b807340f3457d3a0c8bd0b83f23169d14	2013-09-07 10:50:54 +02:00
Peter Portante	1d9213bed4	Pep8 container test_sync and test_diskfile (9 of 12) Change-Id: I469e3b75b6846fddb5e9d2b8317506129836d0a6 Signed-off-by: Peter Portante <peter.portante@redhat.com>	2013-09-01 16:12:51 -04:00
Peter Portante	9480ff8a28	Refactor auditors to rely on expected gen names Refactor the various auditors to rely on the audit_location_generator yielding tuples containing paths with the expected suffix. We also fix the exception handling for container_sync to not expect a broker object (since the act of creating a broker object can raise an exception). For the object auditor we removed an unneeded check for disk_file since get_data_file_size() will raise DiskFileNotExist under the same condition (raises code coverage slightly). Change-Id: I11d405e629063177ef21543b75e9076da1a03b61	2013-07-15 17:29:42 -04:00
gholt	0a77f04893	Added utils.FileLikeIter A really simple version of this was in container sync already, and I needed a more complete version for work I'm doing, and I noticed https://review.openstack.org/#/c/33405/ was also making use of it. So, here's a more full version. If https://review.openstack.org/#/c/33405/ lands before this, I'll update it accordingly. Change-Id: Iba66b6a97f65e312e04fdba273e8f4ad1d3e1594	2013-06-19 18:21:52 +00:00
David Hadas	a979c8007b	Add support for Hash Prefix A new configuration parameter is added to /etc/swift/swift.conf [swift-hash] swift_hash_path_prefix = 'random unique string' New installations are advised to set this parameter to a random secret, which would not be disclosed ouside the organization. The same secret needs to be used by all swift servers of the same cluster. Existing installations should set this parameter to an empty string (the default) DocImpact Fixes: Bug #1157454 Change-Id: I63b10d0b7d6dd3f74e0f10bb41b5f240fa03578a	2013-03-22 19:41:55 +02:00
David Hadas	8b140033f0	Improved container-sync resiliency container-sync now skips faulty objects in the first and second rounds. All replicas try in the second round. No server will give up until the faulty object suceeds Fixes: bug #1068423 Change-Id: I0defc174b2ce3796a6acf410a2d2eae138e8193d	2013-03-08 22:28:06 +02:00
Joe Gordon	45f0502b52	Fix spelling mistakes git ls-files \| misspellings -f - Source: https://github.com/lyda/misspell-check Change-Id: I4132e6a276e44e2a8985238358533d315ee8d9c4	2013-02-12 16:39:40 -08:00
Dan Hersam	923b8b95c3	Fix for container sync not syncing last rowid Bug 1079439 Change-Id: I1985a7176f34931ecb0f24c0289f18df5e934598	2012-11-19 23:09:17 +00:00
Donagh McCabe	00c3fde8f8	Handle down object servers in container-sync If an object server is down, container-sync stops syncing the container even if the it gets object copies from "up" obejct servers. Bug 1069910 In case the git history gets mangled, this fix was done almost entirely by Donagh McCabe <donagh.mccabe@hp.com>. Change-Id: Ieeadcfeb4e880fe5f08e284d7c12492bf7a29460	2012-11-01 16:45:35 -07:00
Chmouel Boudjnah	22572b506a	Remove swift.common.client from here. - It has been to its own gerrit project. - direct_client should follow next. - Implements blueprint clientbindings. Change-Id: I3bb50c95eba81302bfec71cb7ce5288b85a41dc0	2012-06-07 16:36:49 +02:00
Darrell Bishop	3d3ed34f44	Adding StatsD logging to Swift. Documentation, including a list of metrics reported and their semantics, is in the Admin Guide in a new section, "Reporting Metrics to StatsD". An optional "metric prefix" may be configured which will be prepended to every metric name sent to StatsD. Here is the rationale for doing a deep integration like this versus only sending metrics to StatsD in middleware. It's the only way to report some internal activities of Swift in a real-time manner. So to have one way of reporting to StatsD and one place/style of configuration, even some things (like, say, timing of PUT requests into the proxy-server) which could be logged via middleware are consistently logged the same way (deep integration via the logger delegate methods). When log_statsd_host is configured, get_logger() injects a swift.common.utils.StatsdClient object into the logger as logger.statsd_client. Then a set of delegate methods on LogAdapter either pass through to the StatsdClient object or become no-ops. This allows StatsD logging to look like: self.logger.increment('some.metric.here') and do the right thing in all cases and with no messy conditional logic. I wanted to use the pystatsd module for the StatsD client, but the version on PyPi is lagging the git repo (and is missing both the prefix functionality and timing_since() method). So I wrote my swift.common.utils.StatsdClient. The interface is the same as pystatsd.Client, but the code was written from scratch. It's pretty simple, and the tests I added cover it. This also frees Swift from an optional dependency on the pystatsd module, making this feature easier to enable. There's test coverage for the new code and all existing tests continue to pass. Refactored out _one_audit_pass() method in swift/account/auditor.py and swift/container/auditor.py. Fixed some misc. PEP8 violations. Misc test cleanups and refactorings (particularly the way "fake logging" is handled). Change-Id: Ie968a9ae8771f59ee7591e2ae11999c44bfe33b2	2012-05-11 15:25:38 -07:00
Samuel Merritt	bb509dd863	As-unique-as-possible partition replica placement. This commit introduces a new algorithm for assigning partition replicas to devices. Basically, the ring builder organizes the devices into tiers (first zone, then IP/port, then device ID). When placing a replica, the ring builder looks for the emptiest device (biggest parts_wanted) in the furthest-away tier. In the case where zone-count >= replica-count, the new algorithm will give the same results as the one it replaces. Thus, no migration is needed. In the case where zone-count < replica-count, the new algorithm behaves differently from the old algorithm. The new algorithm will distribute things evenly at each tier so that the replication is as high-quality as possible, given the circumstances. The old algorithm would just crash, so again, no migration is needed. Handoffs have also been updated to use the new algorithm. When generating handoff nodes, first the ring looks for nodes in other zones, then other ips/ports, then any other drive. The first handoff nodes (the ones in other zones) will be the same as before; this commit just extends the list of handoff nodes. The proxy server and replicators have been altered to avoid looking at the ring's replica count directly. Previously, with a replica count of C, RingData.get_nodes() and RingData.get_part_nodes() would return lists of length C, so some other code used the replica count when it needed the number of nodes. If two of a partition's replicas are on the same device (e.g. with 3 replicas, 2 devices), then that assumption is no longer true. Fortunately, all the proxy server and replicators really needed was the number of nodes returned, which they already had. (Bonus: now the only code that mentions replica_count directly is in the ring and the ring builder.) Change-Id: Iba2929edfc6ece89791890d0635d4763d821a3aa	2012-05-09 15:56:06 -07:00
John Dickinson	1ecf5ebba1	updated copyright date for all files Change-Id: Ifd909d3561c2647770a7e0caa3cd91acd1b4f298	2012-03-19 13:45:34 -05:00
gholt	f9fb4c739c	consync: Now queries all primary nodes for a put and uses the newest object if it is newer or equal to the object to sync	2011-06-16 01:31:51 +00:00
gholt	78d417dda7	consync: More tests and slight refactor to be more testable	2011-06-16 00:59:55 +00:00
gholt	6587fd914e	consync: Some more tests and bugfixes.	2011-06-15 02:01:01 +00:00

27 Commits