Currently, the rsync module to which the replicators send data is static.
It prevents administrators from adjusting the rsync configuration to their
current deployment or needs.
As an example, the rsyncd configuration example encourages setting a
connection limit for the account, container and object modules. This
protects devices from excessive parallel connections, which would hurt
performance.
On a server with many devices, it is tempting to increase this limit
proportionally, but nothing guarantees that the connections will be
distributed evenly. In the worst case, a single device can receive all the
connections, which severely impacts performance.
This commit adds a new option named 'rsync_module' to the *-replicator
sections of the *-server configuration file. Device attributes like ip,
port, device, zone, ... can be interpolated into this value using the
format {NAME}, e.g.:
rsync_module = {replication_ip}::object_{device}
With this configuration, an administrator can solve the connection
distribution problem by creating one module per device in the rsyncd
configuration.
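For illustration, the interpolation behaves like a plain Python
str.format() call over the device dict from the ring (a minimal sketch;
the attribute values below are made up):

    rsync_module = '{replication_ip}::object_{device}'
    device = {'replication_ip': '10.0.0.1', 'replication_port': 6000,
              'device': 'sdb1', 'zone': 1}
    print(rsync_module.format(**device))   # -> 10.0.0.1::object_sdb1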
The default values are backward compatible:
{replication_ip}::account
{replication_ip}::container
{replication_ip}::object
Option vm_test_mode is deprecated by this commit, but backward compatibility is
maintained. The option is only effective when rsync_module is not set. In that
case, {replication_port} is appended to the default value of rsync_module.
Change-Id: Iad91df50dadbe96c921181797799b4444323ce2e
And if they are not, exhaust the node iter to go get more. The
problem without this implementation is a simple overwrite where
a GET arrives before the handoff has put the newer object back on
the 'alive again' node, so that the proxy gets n-1 fragments
of the newest set and 1 of the older.
This patch bucketizes the fragments by etag and, if it doesn't
have enough, continues to exhaust the node iterator until it
has a large enough matching set.
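A rough sketch of the bucketing idea (hypothetical names, not the actual
proxy code):

    from collections import defaultdict

    def first_full_bucket(responses, needed):
        # responses: iterable of (etag, fragment) pairs pulled lazily from
        # the node iterator; keep going until one etag bucket is big enough.
        buckets = defaultdict(list)
        for etag, frag in responses:
            buckets[etag].append(frag)
            if len(buckets[etag]) >= needed:
                return buckets[etag]
        return None  # exhausted the nodes without a large enough match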
Change-Id: Ib710a133ce1be278365067fd0d6610d80f1f7372
Co-Authored-By: Clay Gerrard <clay.gerrard@gmail.com>
Co-Authored-By: Alistair Coles <alistair.coles@hp.com>
Closes-Bug: 1457691
The 'print' function is compatible with Python 2.x and 3.x.
Link: https://www.python.org/dev/peps/pep-3105/
Python 2.6 has a __future__ import that removes print as language syntax,
letting you use the functional form instead.
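For example:

    from __future__ import print_function

    print('same syntax on Python 2.6+ and Python 3')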
Change-Id: I94e1bc6bd83ad6b05695c7ebdf7cbfd8f6d9f9af
Get configparser, queue, http_client modules from six.moves.
Patch generated by the six_moves operation of the sixer tool:
https://pypi.python.org/pypi/sixer
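For example, the rewritten imports look like:

    from six.moves import configparser, http_client, queue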
Change-Id: I666241ab50101b8cc6f992dd80134ce27327bd7d
The assert_() method is deprecated and can be safely replaced by assertTrue().
This patch makes sure that running the tests does not create undesired
warnings.
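For example (illustrative only):

    # deprecated, may emit a DeprecationWarning:
    self.assert_(resp.status_int == 200)
    # preferred:
    self.assertTrue(resp.status_int == 200)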
Change-Id: I0602ba39ef93263386644ee68088d5f65fcb4a71
* replace "from cStringIO import StringIO"
with "from six.moves import cStringIO as StringIO"
* replace "from StringIO import StringIO"
with "from six import StringIO"
* replace "import cStringIO" and "cStringIO.StringIO()"
with "from six import moves" and "moves.cStringIO()"
* replace "import StringIO" and "StringIO.StringIO()"
with "import six" and "six.StringIO()"
This patch was generated by the stringio operation of the sixer tool:
https://pypi.python.org/pypi/sixer
Change-Id: Iacba77fec3045f96773d1090c0bd48613729a561
Enabled by a new > 0 integer config value, "servers_per_port" in the
[DEFAULT] config section for object-server and/or replication server
configs. The setting's integer value determines how many different
object-server workers handle requests for any single unique local port
in the ring. In this mode, the parent swift-object-server process
continues to run as the original user (i.e. root if low-port binding
is required), binds to all ports as defined in the ring, and forks off
the specified number of workers per listen socket. The child, per-port
servers drop privileges and behave pretty much how object-server workers
always have, except that because the ring has unique ports per disk, the
object-servers will only be handling requests for a single disk. The
parent process detects dead servers and restarts them (with the correct
listen socket), starts missing servers when an updated ring file is
found with a device on the server with a new port, and kills extraneous
servers when their port is found to no longer be in the ring. The ring
files are stat'ed at most every "ring_check_interval" seconds, as
configured in the object-server config (same default of 15s).
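For example (a minimal sketch of an object-server config; the value is
illustrative):

    [DEFAULT]
    servers_per_port = 3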
Immediately stopping all swift-object-worker processes still works by
sending the parent a SIGTERM. Likewise, a SIGHUP to the parent process
still causes the parent process to close all listen sockets and exit,
allowing existing children to finish serving their existing requests.
The drop_privileges helper function now has an optional param to
suppress the setsid() call, which otherwise screws up the child workers'
process management.
The class method RingData.load() can be told to only load the ring
metadata (i.e. everything except replica2part2dev_id) with the optional
kwarg, header_only=True. This is used to keep the parent and all
forked off workers from unnecessarily having full copies of all storage
policy rings in memory.
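A minimal usage sketch (assuming RingData is importable from
swift.common.ring.ring; the path is illustrative):

    from swift.common.ring.ring import RingData

    # Load only the ring metadata -- everything except replica2part2dev_id.
    meta = RingData.load('/etc/swift/object.ring.gz', header_only=True)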
A new helper class, swift.common.storage_policy.BindPortsCache,
provides a method to return a set of all device ports in all rings for
the server on which it is instantiated (identified by its set of IP
addresses). The BindPortsCache instance will track mtimes of ring
files, so they are not opened more frequently than necessary.
This patch includes enhancements to the probe tests and
object-replicator/object-reconstructor config plumbing to allow the
probe tests to work correctly both in the "normal" config (same IP but
unique ports for each SAIO "server") and a server-per-port setup where
each SAIO "server" must have a unique IP address and unique port per
disk within each "server". The main probe tests only work with 4
servers and 4 disks, but you can see the difference in the rings for the
EC probe tests where there are 2 disks per server for a total of 8
disks. Specifically, swift.common.ring.utils.is_local_device() will
ignore the ports when the "my_port" argument is None. Then,
object-replicator and object-reconstructor both set self.bind_port to
None if servers_per_port is enabled. Bonus improvement for IPv6
addresses in is_local_device().
This PR for vagrant-swift-all-in-one will aid in testing this patch:
https://github.com/swiftstack/vagrant-swift-all-in-one/pull/16/
Also allow SAIO to answer is_local_device() better; common SAIO setups
have multiple "servers" all on the same host with different ports for
the different "servers" (which happen to match the IPs specified in the
rings for the devices on each of those "servers").
However, you can configure the SAIO to have different localhost IP
addresses (e.g. 127.0.0.1, 127.0.0.2, etc.) in the ring and in the
servers' config files' bind_ip setting.
This new whataremyips() implementation combined with a little plumbing
allows is_local_device() to accurately answer, even on an SAIO.
In the default case (an unspecified bind_ip defaults to '0.0.0.0') as
well as an explicit "bind to everything" like '0.0.0.0' or '::',
whataremyips() behaves as it always has, returning all IP addresses for
the server.
Also updated probe tests to handle each "server" in the SAIO having a
unique IP address.
For some (noisy) benchmarks that show servers_per_port=X is at least as
good as the same number of "normal" workers:
https://gist.github.com/dbishop/c214f89ca708a6b1624a#file-summary-md
Benchmarks showing the benefits of I/O isolation with a small number of
slow disks:
https://gist.github.com/dbishop/fd0ab067babdecfb07ca#file-results-md
If you were wondering what the overhead of threads_per_disk looks like:
https://gist.github.com/dbishop/1d14755fedc86a161718#file-tabular_results-md
DocImpact
Change-Id: I2239a4000b41a7e7cc53465ce794af49d44796c6
The Python 2 next() method of iterators was renamed to __next__() on
Python 3. Use the builtin next() function instead which works on Python
2 and Python 3.
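For example:

    it = iter([1, 2, 3])
    value = next(it)   # works on both; it.next() is Python 2 only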
Change-Id: Ic948bc574b58f1d28c5c58e3985906dee17fa51d
This commit lets clients receive multipart/byteranges responses (see
RFC 7233, Appendix A) for erasure-coded objects. Clients can already
do this for replicated objects, so this brings EC closer to feature
parity (ha!).
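For reference, such a response looks roughly like this (per RFC 7233,
Appendix A; boundary and ranges are illustrative):

    HTTP/1.1 206 Partial Content
    Content-Type: multipart/byteranges; boundary=b0und4ry

    --b0und4ry
    Content-Type: application/octet-stream
    Content-Range: bytes 0-4/1000

    <first 5 bytes of the object>
    --b0und4ry
    Content-Type: application/octet-stream
    Content-Range: bytes 100-104/1000

    <5 more bytes of the object>
    --b0und4ry--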
GetOrHeadHandler got a base class extracted from it that treats an
HTTP response as a sequence of byte-range responses. This way, it can
continue to yield whole fragments, not just N-byte pieces of the raw
HTTP response, since an N-byte piece of a multipart/byteranges
response is pretty much useless.
There are a couple of bonus fixes in here, too. For starters, download
resuming now works on multipart/byteranges responses. Before, it only
worked on 200 responses or 206 responses for a single byte
range. Also, BufferedHTTPResponse grew a readline() method.
Also, the MIME response for replicated objects got tightened up a
little. Before, it had some leading and trailing CRLFs which, while
allowed by RFC 7233, provide no benefit. Now, both replicated and EC
multipart/byteranges avoid extraneous bytes. This let me re-use the
Content-Length calculation in swob instead of having to either hack
around it or add extraneous whitespace to match.
Change-Id: I16fc65e0ec4e356706d327bdb02a3741e36330a0
The Content-Disposition spec does not require a space character after
the comma: http://www.ietf.org/rfc/rfc2183.txt
Change-Id: Iff438dc36ce78c6a79bb66ab3d889a8dae7c0e1f
Closes-Bug: #1458497
Got a slow crappy VM like I do? You might see this fail
occasionally. Bump up the timeout a little to help it out.
Change-Id: I8c0e5b99012830ea3525fa55b0811268db3da2a2
This commit makes it possible to PUT an object into Swift and have it
stored using erasure coding instead of replication, and also to GET
the object back from Swift at a later time.
This works by splitting the incoming object into a number of segments,
erasure-coding each segment in turn to get fragments, then
concatenating the fragments into fragment archives. Segments are 1 MiB
in size, except the last, which is between 1 B and 1 MiB.
+====================================================================+
| object data |
+====================================================================+
|
+------------------------+----------------------+
| | |
v v v
+===================+ +===================+ +==============+
| segment 1 | | segment 2 | ... | segment N |
+===================+ +===================+ +==============+
| |
| |
v v
/=========\ /=========\
| pyeclib | | pyeclib | ...
\=========/ \=========/
| |
| |
+--> fragment A-1 +--> fragment A-2
| |
| |
| |
| |
| |
+--> fragment B-1 +--> fragment B-2
| |
| |
... ...
Then, object server A gets the concatenation of fragment A-1, A-2,
..., A-N, so its .data file looks like this (called a "fragment archive"):
+=====================================================================+
| fragment A-1 | fragment A-2 | ... | fragment A-N |
+=====================================================================+
Since this means that the object server never sees the object data as
the client sent it, we have to do a few things to ensure data
integrity.
First, the proxy has to check the Etag if the client provided it; the
object server can't do it since the object server doesn't see the raw
data.
Second, if the client does not provide an Etag, the proxy computes it
and uses the MIME-PUT mechanism to provide it to the object servers
after the object body. Otherwise, the object would not have an Etag at
all.
Third, the proxy computes the MD5 of each fragment archive and sends
it to the object server using the MIME-PUT mechanism. With replicated
objects, the proxy checks that the Etags from all the object servers
match, and if they don't, returns a 500 to the client. This mitigates
the risk of data corruption in one of the proxy --> object connections,
and signals to the client when it happens. With EC objects, we can't
use that same mechanism, so we must send the checksum with each
fragment archive to get comparable protection.
On the GET path, the inverse happens: the proxy connects to a bunch of
object servers (M of them, for an M+K scheme), reads one fragment at a
time from each fragment archive, decodes those fragments into a
segment, and serves the segment to the client.
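Conceptually, the per-segment encode/decode is just pyeclib (a minimal
sketch, not the proxy code; scheme parameters are illustrative):

    from pyeclib.ec_iface import ECDriver

    ec = ECDriver(k=4, m=2, ec_type='liberasurecode_rs_vand')
    segment = b'x' * 1048576            # one 1 MiB segment
    fragments = ec.encode(segment)      # k + m fragments, one per archive
    assert ec.decode(fragments[:4]) == segment   # any k fragments suffice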
When an object server dies partway through a GET response, any
partially-fetched fragment is discarded, the resumption point is wound
back to the nearest fragment boundary, and the GET is retried with the
next object server.
GET requests for a single byterange work; GET requests for multiple
byteranges do not.
There are a number of things _not_ included in this commit. Some of
them are listed here:
* multi-range GET
* deferred cleanup of old .data files
* durability (daemon to reconstruct missing archives)
Co-Authored-By: Alistair Coles <alistair.coles@hp.com>
Co-Authored-By: Thiago da Silva <thiago@redhat.com>
Co-Authored-By: John Dickinson <me@not.mn>
Co-Authored-By: Clay Gerrard <clay.gerrard@gmail.com>
Co-Authored-By: Tushar Gohad <tushar.gohad@intel.com>
Co-Authored-By: Paul Luse <paul.e.luse@intel.com>
Co-Authored-By: Christian Schwede <christian.schwede@enovance.com>
Co-Authored-By: Yuan Zhou <yuan.zhou@intel.com>
Change-Id: I9c13c03616489f8eab7dcd7c5f21237ed4cb6fd2
Swift captures both sys.stdout and sys.stderr messages and
transfers them to syslog. However, all of the messages currently
use the "STDOUT:" prefix.
This is quite confusing for debugging purposes because we cannot
tell whether "Swift didn't capture stderr" or "Swift captured
stderr but didn't show it" when we want to log some stderr
messages.
This patch adds a new prefix, "STDERR:", and uses it for
sys.stderr.
Change-Id: Idf4649e9860b77c3600a5cd0a3e0bd674b31fb4f
The renamer() method now does an fsync on the containing directory of the
target path and also on the parent dirs of newly created directories, by
default. This can be explicitly turned off in cases where it is not
necessary (for example, quarantines).
The following article explains why this is necessary:
http://lwn.net/Articles/457667/
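The underlying pattern is roughly (a simplified sketch, not the actual
renamer() code):

    import os

    def rename_durably(old, new):
        os.rename(old, new)
        # fsync the directory containing the target so the rename itself
        # survives a crash, not just the file contents
        dirfd = os.open(os.path.dirname(new), os.O_DIRECTORY)
        try:
            os.fsync(dirfd)
        finally:
            os.close(dirfd)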
Although it may seem like the right thing to do, this change does come
at a performance penalty. However, no configurable option is provided to
turn it off.
Also, lock_path() inside invalidate_hash() was always creating part of
the object path in the filesystem. Those directories were never fsync'd.
This has been fixed.
Change-Id: Id8e02f84f48370edda7fb0c46e030db3b53a71e3
Signed-off-by: Prashanth Pai <ppai@redhat.com>
Two tests fail if test/unit/common/test_daemon.py is excluded
from the test set:
test.unit.common.test_utils.TestUtils.test_swift_log_formatter
test.unit.common.test_utils.TestStatsdLoggingDelegation.test_thread_locals
They fail because they assert that logger.txn_id is None,
when previous tests cause txn_id to be set. This coupling is masked
when test_daemon.py is included because it reloads the common/utils
module, and is executed just before test.unit.common.test_utils.
The coupling can be observed by running, for example:
nosetests ./test/unit/common/middleware/test_except.py ./test/unit/common/test_utils.py
or
nosetests ./test/unit/proxy ./test/unit/common/test_utils.py
The failing tests should reset logger thread_locals before making
assertions. test_utils.reset_loggers() attempts this but is broken
because it sets thread_local on the wrong object.
Changes in this patch:
- fix reset_loggers() to reset the LogAdapter thread local state
- add a reset_logger_state decorator to call reset_loggers
before and after a decorated method
- use the decorator for increased isolation of tests that previously
called reset_loggers only on exit
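A minimal sketch of such a decorator (assuming reset_loggers() as
described above):

    from functools import wraps

    def reset_logger_state(f):
        @wraps(f)
        def wrapper(self, *args, **kwargs):
            reset_loggers()
            try:
                return f(self, *args, **kwargs)
            finally:
                reset_loggers()
        return wrapper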
Change-Id: If9aa781a2dd1929a47ef69322ec8c53263d47660
This change modifies swift-ring-builder and introduces a new format for
the sub-commands (search, list_parts, set_weight, set_info and remove),
in addition to the add sub-command, so that hostnames can be used in
place of an IP address for these sub-commands.
The account reaper, container synchronizer, and replicators were also
updated so that they still have a way to identify a particular device
as being "local".
Previously this was Change-Id:
Ie471902413002872fc6755bacd36af3b9c613b74
Change-Id: Ieff583ffb932133e3820744a3f8f9f491686b08d
Co-Authored-By: Alex Pecoraro <alex.pecoraro@emc.com>
Implements: blueprint allow-hostnames-for-nodes-in-rings
To make it easier for Swift operators to pinpoint problematic devices,
a policy index will be recorded in the log files of the proxy and storage
servers for each user request that relates to a storage policy.
This patch simply adds a 'storage_policy_index' field to the log format.
If no policy index is specified, '-' is output in this field.
Extra fix: the documentation of the storage node log line now properly
reflects the 'server_pid' field.
DocImpact
Change-Id: I7286ae85bcbcec73b5377dc115cbdb0f57d1b025
Implements: blueprint logging-policy-number
Currently, a ThreadPool acquires resources that last until process
exit. You can let the ThreadPool go out of scope, but that doesn't
terminate the worker threads or close file descriptors or anything.
This commit makes it so you can .terminate() a ThreadPool object and
get its resources back. Also, after you call .terminate(), trying to
use the ThreadPool raises an exception so you know you've goofed.
I have some internal code that could really use this, plus it makes
the unit test run not leak resources, which is nice.
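Usage sketch (assuming swift.common.utils.ThreadPool and its
run_in_thread() method):

    from swift.common.utils import ThreadPool

    pool = ThreadPool(nthreads=4)
    result = pool.run_in_thread(sum, [1, 2, 3])   # runs in a worker thread
    pool.terminate()   # workers exit, file descriptors are closed
    # any further call like pool.run_in_thread(sum, [4]) now raises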
Change-Id: Ibf7c6dc14c14f379421a79afb6c90a5e64b235fa
We can't order a Timestamp with an offset larger than 16 hex digits
correctly, so we raise a ValueError if you try to create one.
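e.g. (illustrative):

    from swift.common.utils import Timestamp

    Timestamp('1402464677.04188', offset=1)         # fine
    Timestamp('1402464677.04188', offset=16 ** 16)  # 17 hex digits -> ValueError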
Change-Id: I8c8d4cf13785a1a8eb7416392263eae5242aa407
This patch fixes the unit tests to remove the temporary directories
created during a run of the unit tests. Some unit tests did not correctly
tear down whatever they had set up for the run. Over time this would
bloat up the tmp directory. As of now, around 49 tmp directories were
left uncleared per round of unit tests. This patch fixes that.
Change-Id: If591375ca9cc87d52c7c9c6dc16c9fb4b49e99fc
RFC 7233 says that servers MAY reject egregious range-GET requests
such as requests with hundreds of ranges, requests with non-ascending
ranges, and so on.
Such requests are fairly hard for Swift to process. Consider a Range
header that asks for the first byte of every 10th MiB in a 4 GiB
object, but in some random order. That'll cause a lot of seeks on the
object server, but the corresponding response body is quite small in
comparison to the workload.
This commit makes Swift reject, with a 416 response, any ranged GET
request with more than fifty ranges, more than three overlapping
ranges, or more than eight non-increasing ranges.
This is a necessary prerequisite for supporting multi-range GETs on
large objects. Otherwise, a malicious user could construct a Range
header with hundreds of byte ranges where each individual byterange
requires the proxy to contact a different object server. If seeking
all over a disk is bad, connecting all over the cluster is way worse.
DocImpact
Change-Id: I4dcedcaae6c3deada06a0223479e611094d57234
This commit lets the object server use splice() and tee() to move data
from disk to the network without ever copying it into user space.
Requires Linux. Sorry, FreeBSD folks. You still have the old
mechanism, as does anyone who doesn't want to use splice. This
requires a relatively recent kernel (2.6.38+) to work, which includes
the two most recent Ubuntu LTS releases (Precise and Trusty) as well
as RHEL 7. However, it excludes Lucid and RHEL 6. On those systems,
setting "splice = on" will result in warnings in the logs but no
actual use of splice.
Note that this only applies to GET responses without Range headers. It
can easily be extended to single-range GET requests, but this commit
leaves that for future work. Same goes for PUT requests, or at least
non-chunked ones.
On some real hardware I had laying around (not a VM), this produced a
37% reduction in CPU usage for GETs made directly to the object
server. Measurements were done by looking at /proc/<pid>/stat,
specifically the utime and stime fields (user and kernel CPU jiffies,
respectively).
Note: There is a Python module called "splicetee" available on PyPi,
but it's licensed under the GPL, so it cannot easily be added to
OpenStack's requirements. That's why this patch uses ctypes instead.
Also fixed a long-standing annoyance in FakeLogger:
>>> fake_logger.warn('stuff')
>>> fake_logger.get_lines_for_level('warn')
[]
>>>
This, of course, is because the correct log level is 'warning'. Now
you get a KeyError if you call get_lines_for_level with a bogus log
level.
Change-Id: Ic6d6b833a5b04ca2019be94b1b90d941929d21c8
Over on the EC branch, we need to be able to parse multipart MIME
documents in the object server. The formpost middleware has a
perfectly good MIME parser, but it seems sort of awful to import
things from formpost in swift/obj/server.py, so I pulled it out into
common.utils.
Change-Id: Ieb4c05d02d8e4ef51a3a11d26c503786b1897f60
Before, we were calling datetime.datetime.strftime('%s.%f') to convert
a datetime to epoch seconds + microseconds. However, the '%s' format
isn't actually part of Python's library. Rather, Python passes that on
to the system C library, which is typically glibc. Now, glibc takes
the '%s' format and helpfully* applies the current timezone as an
offset. This gives bogus results on machines where UTC is not the
system timezone. (Yes, some people really do that.)
For example:
>>> import os
>>> from swift.common import utils
>>> os.environ['TZ'] = 'PST8PDT,M3.2.0,M11.1.0'
>>> float(utils.last_modified_date_to_timestamp('1970-01-01T00:00:00.000000'))
28800.0
>>>
That timestamp should obviously be 0.
This patch replaces the strftime() call with datetime arithmetic,
which is entirely in Python so the system timezone doesn't mess it up.
* unhelpfully
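The timezone-safe arithmetic looks roughly like this (a sketch, not the
exact helper):

    import datetime

    EPOCH = datetime.datetime(1970, 1, 1)

    def datetime_to_epoch(dt):
        # pure datetime arithmetic; the system timezone never enters into it
        return (dt - EPOCH).total_seconds()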
Change-Id: I56855acd79a5d8f2c98a771fa9fd2729e4f490b1
When lock_path is called and the lock waits for the whole 10 seconds,
flock is called 1000 times. With this patch, the short 0.01s sleep
is used for the first 1% of the total lock time; after that, the sleep
interval becomes 1% of the total lock time.
Change-Id: Ibed6bdb49bddcdb868742c41f86d2482a7edfd29
I attempted to use this function and found a few problems.
We shouldn’t unlink the file after closing it, because someone else could lock
it in between. Switch to unlink before close.
If someone else locked the file between our open and flock, they are likely to
unlink it out from underneath us. Then we have a lock on a file that no longer
exists. So stat the filename after locking to make sure the inode hasn't
changed or gone away.
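The check looks roughly like this (a simplified sketch of the idea, not
the exact code):

    import fcntl
    import os

    def lock_and_verify(filename):
        fd = os.open(filename, os.O_CREAT | os.O_RDWR)
        fcntl.flock(fd, fcntl.LOCK_EX)
        # Make sure the path still refers to the inode we locked; if another
        # process unlinked it between our open() and flock(), bail out.
        try:
            same = os.fstat(fd).st_ino == os.stat(filename).st_ino
        except OSError:
            same = False
        if not same:
            os.close(fd)
            raise RuntimeError('lock file changed underneath us')
        return fd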
We probably shouldn’t unlink the file if we time out waiting for a lock. So
move that to before the finally block.
Change-Id: Id1858c97805d3ab81c584eaee8ce0d43d34a8089
If the devices path configured in container-server.conf contains a file,
then an uncaught exception is seen in the logs. For example, if a file foo
exists as /srv/1/node/foo, then when the container-auditor runs, the
exception that foo/containers is not a directory is seen in the logs.
This patch is essentially clayg's and can be found in the bug report.
I tested it and wanted to get a feel for the OpenStack workflow, so I am
going through the commit process.
I have added a unit test as well as cleaned up and improved the unit test
coverage for this module.
- unit test for above fix is added
- unit test to verify exceptions that are raised in the module
- unit test to verify the logger's behavior
- unit test to verify mount_check behavior
Change-Id: I903b2b1e11646404cfb0551ee582a514d008c844
Closes-Bug: #1317257
The backend HTTP servers emit StatsD metrics of the form
<server>.<method>.timing and <server>.<method>.errors.timing
(e.g. object-server.GET.timing). Whether something counts as an error
or not is based on the response's HTTP status code.
Prior to this commit, "success" was 2xx, 3xx, or 404, while everything
else was considered "error". This adds 412 and 416 to the "success"
set. Like 404, these status codes indicate that we got the request and
processed it without error, but the response was "no". They shouldn't
be in the same category as statuses like 507 that indicate something
stopped the request from being processed.
Change-Id: I5582a51cf6f64aa22c890da01aaaa077f3a54202
The normalized form of the X-Timestamp header looks like a float with a fixed
width to ensure stable string sorting - normalized timestamps look like
"1402464677.04188"
To support overwrites of existing data without modifying the original
timestamp but still maintain consistency, a second internal offset
vector is appended to the normalized timestamp form; it compares and
sorts greater than the fixed-width float format but less than a newer
timestamp. The internalized format of timestamps looks like
"1402464677.04188_0000000000000000" - the portion after the underscore
is the offset and is a formatted hexadecimal integer.
The internalized form is not exposed to clients in responses from Swift.
Normal client operations will not create a timestamp with an offset.
The Timestamp class in common.utils supports internalized and normalized
formatting of timestamps and also comparison of timestamp values. When the
offset value of a Timestamp is 0 - it's considered insignificant and need not
be represented in the string format; to support backwards compatibility during
a Swift upgrade the internalized and normalized form of a Timestamp with an
insignificant offset are identical. When a timestamp includes an offset it
will always be represented in the internalized form, but is still excluded
from the normalized form. Timestamps with an equivalent timestamp portion
(the float part) will compare and order by their offset. Timestamps with a
greater timestamp portion will always compare and order greater than a
Timestamp with a lesser timestamp regardless of its offset. String
comparison and ordering is guaranteed for the internalized string format, and
is backwards compatible for normalized timestamps which do not include an
offset.
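For example (assuming the .normal and .internal properties of the
Timestamp class):

    from swift.common.utils import Timestamp

    t = Timestamp('1402464677.04188', offset=3)
    t.normal     # '1402464677.04188'
    t.internal   # '1402464677.04188_0000000000000003'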
The reconciler currently uses an offset bump to ensure that objects can move to
the wrong storage policy and be moved back. This use case is valid because
the content represented by the user-facing timestamp is not modified in any way.
Future consumers of the offset vector of timestamps should be mindful of the
HTTP semantics of If-Modified and take care to avoid deviation in the response
from the object server without an accompanying change to the user-facing
timestamp.
DocImpact
Implements: blueprint storage-policies
Change-Id: Id85c960b126ec919a481dc62469bf172b7fb8549
This decorator will memoize a function using a fixed-size cache that evicts
the oldest entries. It also supports a maxtime parameter to configure a
"time-to-live" for entries in the cache.
The reconciler code uses this to cache computations of the correct storage
policy index for a container for 30 seconds.
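Usage looks something like this (a sketch; assuming the decorator is the
LRUCache helper in swift.common.utils, with the maxsize/maxtime parameters
described above):

    from swift.common.utils import LRUCache

    @LRUCache(maxsize=1000, maxtime=30)
    def slow_square(x):
        return x * x   # stand-in for an expensive lookup

    slow_square(3)   # computed
    slow_square(3)   # served from the cache for up to 30 seconds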
DocImpact
Implements: blueprint storage-policies
Change-Id: I0f220869e33c461a4100b21c6324ad725da864fa
This daemon will take objects that are in the wrong storage policy and
move them to the right ones, or delete requests that went to the wrong
storage policy and apply them to the right ones. It operates on a
queue similar to the object-expirer's queue.
Discovering that the object is in the wrong policy will be done in
subsequent commits by the container replicator; this is the daemon
that handles them once they happen.
Like the object expirer, you only need to run one of these per cluster;
see etc/container-reconciler.conf.
DocImpact
Implements: blueprint storage-policies
Change-Id: I5ea62eb77ddcbc7cfebf903429f2ee4c098771c9