2015-01-02 14:24:57 -05:00
|
|
|
# Copyright 2010 United States Government as represented by the
|
|
|
|
# Administrator of the National Aeronautics and Space Administration.
|
|
|
|
# All Rights Reserved.
|
|
|
|
# Copyright 2013 Red Hat, Inc.
|
|
|
|
# Copyright 2013 New Dream Network, LLC (DreamHost)
|
|
|
|
#
|
|
|
|
# Licensed under the Apache License, Version 2.0 (the "License"); you may
|
|
|
|
# not use this file except in compliance with the License. You may obtain
|
|
|
|
# a copy of the License at
|
|
|
|
#
|
|
|
|
# http://www.apache.org/licenses/LICENSE-2.0
|
|
|
|
#
|
|
|
|
# Unless required by applicable law or agreed to in writing, software
|
|
|
|
# distributed under the License is distributed on an "AS IS" BASIS, WITHOUT
|
|
|
|
# WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the
|
|
|
|
# License for the specific language governing permissions and limitations
|
|
|
|
# under the License.
|
|
|
|
|
2016-03-25 13:39:53 +02:00
|
|
|
import abc
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
import functools
|
|
|
|
import inspect
|
2015-08-16 17:30:46 -04:00
|
|
|
import logging
|
2015-09-16 14:07:52 +02:00
|
|
|
import threading
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
import traceback
|
2015-08-16 17:30:46 -04:00
|
|
|
|
2016-02-20 11:31:15 -05:00
|
|
|
from oslo_config import cfg
|
2015-07-01 16:12:08 +03:00
|
|
|
from oslo_service import service
|
2016-02-20 11:31:15 -05:00
|
|
|
from oslo_utils import eventletutils
|
2015-10-13 18:30:50 +02:00
|
|
|
from oslo_utils import timeutils
|
2015-01-02 14:24:57 -05:00
|
|
|
from stevedore import driver
|
|
|
|
|
|
|
|
from oslo_messaging._drivers import base as driver_base
|
2020-01-17 16:24:29 +01:00
|
|
|
from oslo_messaging import _utils as utils
|
2015-01-02 14:24:57 -05:00
|
|
|
from oslo_messaging import exceptions
|
|
|
|
|
2018-12-28 22:52:08 +08:00
|
|
|
__all__ = [
|
|
|
|
'ExecutorLoadFailure',
|
|
|
|
'MessageHandlingServer',
|
|
|
|
'MessagingServerError',
|
|
|
|
'ServerListenError',
|
|
|
|
]
|
|
|
|
|
2015-08-16 17:30:46 -04:00
|
|
|
LOG = logging.getLogger(__name__)
|
|
|
|
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
# The default number of seconds of waiting after which we will emit a log
|
|
|
|
# message
|
|
|
|
DEFAULT_LOG_AFTER = 30
|
|
|
|
|
2015-01-02 14:24:57 -05:00
|
|
|
|
2016-02-20 11:31:15 -05:00
|
|
|
_pool_opts = [
|
|
|
|
cfg.IntOpt('executor_thread_pool_size',
|
|
|
|
default=64,
|
|
|
|
deprecated_name="rpc_thread_pool_size",
|
2017-03-21 11:34:21 +08:00
|
|
|
help='Size of executor thread pool when'
|
|
|
|
' executor is threading or eventlet.'),
|
2016-02-20 11:31:15 -05:00
|
|
|
]
|
|
|
|
|
|
|
|
|
2015-01-02 14:24:57 -05:00
|
|
|
class MessagingServerError(exceptions.MessagingException):
|
|
|
|
"""Base class for all MessageHandlingServer exceptions."""
|
|
|
|
|
|
|
|
|
|
|
|
class ExecutorLoadFailure(MessagingServerError):
|
|
|
|
"""Raised if an executor can't be loaded."""
|
|
|
|
|
|
|
|
def __init__(self, executor, ex):
|
|
|
|
msg = 'Failed to load executor "%s": %s' % (executor, ex)
|
|
|
|
super(ExecutorLoadFailure, self).__init__(msg)
|
|
|
|
self.executor = executor
|
|
|
|
self.ex = ex
|
|
|
|
|
|
|
|
|
|
|
|
class ServerListenError(MessagingServerError):
|
|
|
|
"""Raised if we failed to listen on a target."""
|
|
|
|
|
|
|
|
def __init__(self, target, ex):
|
|
|
|
msg = 'Failed to listen on target "%s": %s' % (target, ex)
|
|
|
|
super(ServerListenError, self).__init__(msg)
|
|
|
|
self.target = target
|
|
|
|
self.ex = ex
|
|
|
|
|
|
|
|
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
class TaskTimeout(MessagingServerError):
|
|
|
|
"""Raised if we timed out waiting for a task to complete."""
|
|
|
|
|
|
|
|
|
|
|
|
class _OrderedTask(object):
|
|
|
|
"""A task which must be executed in a particular order.
|
|
|
|
|
|
|
|
A caller may wait for this task to complete by calling
|
|
|
|
`wait_for_completion`.
|
|
|
|
|
|
|
|
A caller may run this task with `run_once`, which will ensure that however
|
|
|
|
many times the task is called it only runs once. Simultaneous callers will
|
|
|
|
block until the running task completes, which means that any caller can be
|
|
|
|
sure that the task has completed after run_once returns.
|
|
|
|
"""
|
|
|
|
|
|
|
|
INIT = 0 # The task has not yet started
|
|
|
|
RUNNING = 1 # The task is running somewhere
|
|
|
|
COMPLETE = 2 # The task has run somewhere
|
|
|
|
|
|
|
|
def __init__(self, name):
|
|
|
|
"""Create a new _OrderedTask.
|
|
|
|
|
|
|
|
:param name: The name of this task. Used in log messages.
|
|
|
|
"""
|
|
|
|
super(_OrderedTask, self).__init__()
|
|
|
|
|
|
|
|
self._name = name
|
|
|
|
self._cond = threading.Condition()
|
|
|
|
self._state = self.INIT
|
|
|
|
|
|
|
|
def _wait(self, condition, msg, log_after, timeout_timer):
|
|
|
|
"""Wait while condition() is true. Write a log message if condition()
|
|
|
|
has not become false within `log_after` seconds. Raise TaskTimeout if
|
|
|
|
timeout_timer expires while waiting.
|
|
|
|
"""
|
|
|
|
|
|
|
|
log_timer = None
|
|
|
|
if log_after != 0:
|
|
|
|
log_timer = timeutils.StopWatch(duration=log_after)
|
|
|
|
log_timer.start()
|
|
|
|
|
|
|
|
while condition():
|
|
|
|
if log_timer is not None and log_timer.expired():
|
2019-04-11 10:47:24 +02:00
|
|
|
LOG.warning('Possible hang: %s', msg)
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
LOG.debug(''.join(traceback.format_stack()))
|
|
|
|
# Only log once. After than we wait indefinitely without
|
|
|
|
# logging.
|
|
|
|
log_timer = None
|
|
|
|
|
|
|
|
if timeout_timer is not None and timeout_timer.expired():
|
|
|
|
raise TaskTimeout(msg)
|
|
|
|
|
|
|
|
timeouts = []
|
|
|
|
if log_timer is not None:
|
|
|
|
timeouts.append(log_timer.leftover())
|
|
|
|
if timeout_timer is not None:
|
|
|
|
timeouts.append(timeout_timer.leftover())
|
|
|
|
|
|
|
|
wait = None
|
|
|
|
if timeouts:
|
|
|
|
wait = min(timeouts)
|
|
|
|
self._cond.wait(wait)
|
|
|
|
|
|
|
|
@property
|
|
|
|
def complete(self):
|
|
|
|
return self._state == self.COMPLETE
|
|
|
|
|
|
|
|
def wait_for_completion(self, caller, log_after, timeout_timer):
|
|
|
|
"""Wait until this task has completed.
|
|
|
|
|
|
|
|
:param caller: The name of the task which is waiting.
|
|
|
|
:param log_after: Emit a log message if waiting longer than `log_after`
|
|
|
|
seconds.
|
|
|
|
:param timeout_timer: Raise TaskTimeout if StopWatch object
|
|
|
|
`timeout_timer` expires while waiting.
|
|
|
|
"""
|
|
|
|
with self._cond:
|
|
|
|
msg = '%s is waiting for %s to complete' % (caller, self._name)
|
|
|
|
self._wait(lambda: not self.complete,
|
|
|
|
msg, log_after, timeout_timer)
|
|
|
|
|
|
|
|
def run_once(self, fn, log_after, timeout_timer):
|
|
|
|
"""Run a task exactly once. If it is currently running in another
|
|
|
|
thread, wait for it to complete. If it has already run, return
|
|
|
|
immediately without running it again.
|
|
|
|
|
|
|
|
:param fn: The task to run. It must be a callable taking no arguments.
|
|
|
|
It may optionally return another callable, which also takes
|
|
|
|
no arguments, which will be executed after completion has
|
|
|
|
been signaled to other threads.
|
|
|
|
:param log_after: Emit a log message if waiting longer than `log_after`
|
|
|
|
seconds.
|
|
|
|
:param timeout_timer: Raise TaskTimeout if StopWatch object
|
|
|
|
`timeout_timer` expires while waiting.
|
|
|
|
"""
|
|
|
|
with self._cond:
|
|
|
|
if self._state == self.INIT:
|
|
|
|
self._state = self.RUNNING
|
|
|
|
# Note that nothing waits on RUNNING, so no need to notify
|
|
|
|
|
|
|
|
# We need to release the condition lock before calling out to
|
|
|
|
# prevent deadlocks. Reacquire it immediately afterwards.
|
|
|
|
self._cond.release()
|
|
|
|
try:
|
|
|
|
post_fn = fn()
|
|
|
|
finally:
|
|
|
|
self._cond.acquire()
|
|
|
|
self._state = self.COMPLETE
|
|
|
|
self._cond.notify_all()
|
|
|
|
|
|
|
|
if post_fn is not None:
|
|
|
|
# Release the condition lock before calling out to prevent
|
|
|
|
# deadlocks. Reacquire it immediately afterwards.
|
|
|
|
self._cond.release()
|
|
|
|
try:
|
|
|
|
post_fn()
|
|
|
|
finally:
|
|
|
|
self._cond.acquire()
|
|
|
|
elif self._state == self.RUNNING:
|
|
|
|
msg = ('%s is waiting for another thread to complete'
|
|
|
|
% self._name)
|
|
|
|
self._wait(lambda: self._state == self.RUNNING,
|
|
|
|
msg, log_after, timeout_timer)
|
|
|
|
|
|
|
|
|
|
|
|
class _OrderedTaskRunner(object):
|
|
|
|
"""Mixin for a class which executes ordered tasks."""
|
|
|
|
|
|
|
|
def __init__(self, *args, **kwargs):
|
|
|
|
super(_OrderedTaskRunner, self).__init__(*args, **kwargs)
|
|
|
|
|
|
|
|
# Get a list of methods on this object which have the _ordered
|
|
|
|
# attribute
|
|
|
|
self._tasks = [name
|
|
|
|
for (name, member) in inspect.getmembers(self)
|
|
|
|
if inspect.ismethod(member) and
|
|
|
|
getattr(member, '_ordered', False)]
|
|
|
|
self.reset_states()
|
|
|
|
|
|
|
|
self._reset_lock = threading.Lock()
|
|
|
|
|
|
|
|
def reset_states(self):
|
|
|
|
# Create new task states for tasks in reset
|
|
|
|
self._states = {task: _OrderedTask(task) for task in self._tasks}
|
|
|
|
|
|
|
|
@staticmethod
|
|
|
|
def decorate_ordered(fn, state, after, reset_after):
|
|
|
|
|
|
|
|
@functools.wraps(fn)
|
|
|
|
def wrapper(self, *args, **kwargs):
|
|
|
|
# If the reset_after state has already completed, reset state so
|
|
|
|
# we can run again.
|
|
|
|
# NOTE(mdbooth): This is ugly and requires external locking to be
|
|
|
|
# deterministic when using multiple threads. Consider a thread that
|
|
|
|
# does: server.stop(), server.wait(). If another thread causes a
|
|
|
|
# reset between stop() and wait(), this will not have the intended
|
|
|
|
# behaviour. It is safe without external locking, if the caller
|
|
|
|
# instantiates a new object.
|
|
|
|
with self._reset_lock:
|
|
|
|
if (reset_after is not None and
|
|
|
|
self._states[reset_after].complete):
|
|
|
|
self.reset_states()
|
|
|
|
|
|
|
|
# Store the states we started with in case the state wraps on us
|
|
|
|
# while we're sleeping. We must wait and run_once in the same
|
|
|
|
# epoch. If the epoch ended while we were sleeping, run_once will
|
|
|
|
# safely do nothing.
|
|
|
|
states = self._states
|
|
|
|
|
|
|
|
log_after = kwargs.pop('log_after', DEFAULT_LOG_AFTER)
|
|
|
|
timeout = kwargs.pop('timeout', None)
|
|
|
|
|
|
|
|
timeout_timer = None
|
|
|
|
if timeout is not None:
|
|
|
|
timeout_timer = timeutils.StopWatch(duration=timeout)
|
|
|
|
timeout_timer.start()
|
|
|
|
|
|
|
|
# Wait for the given preceding state to complete
|
|
|
|
if after is not None:
|
|
|
|
states[after].wait_for_completion(state,
|
|
|
|
log_after, timeout_timer)
|
|
|
|
|
|
|
|
# Run this state
|
|
|
|
states[state].run_once(lambda: fn(self, *args, **kwargs),
|
|
|
|
log_after, timeout_timer)
|
|
|
|
return wrapper
|
|
|
|
|
|
|
|
|
|
|
|
def ordered(after=None, reset_after=None):
|
|
|
|
"""A method which will be executed as an ordered task. The method will be
|
|
|
|
called exactly once, however many times it is called. If it is called
|
|
|
|
multiple times simultaneously it will only be called once, but all callers
|
|
|
|
will wait until execution is complete.
|
|
|
|
|
|
|
|
If `after` is given, this method will not run until `after` has completed.
|
|
|
|
|
|
|
|
If `reset_after` is given and the target method has completed, allow this
|
|
|
|
task to run again by resetting all task states.
|
|
|
|
|
|
|
|
:param after: Optionally, the name of another `ordered` method. Wait for
|
|
|
|
the completion of `after` before executing this method.
|
|
|
|
:param reset_after: Optionally, the name of another `ordered` method. Reset
|
|
|
|
all states when calling this method if `reset_after`
|
|
|
|
has completed.
|
|
|
|
"""
|
|
|
|
def _ordered(fn):
|
|
|
|
# Set an attribute on the method so we can find it later
|
|
|
|
setattr(fn, '_ordered', True)
|
|
|
|
state = fn.__name__
|
|
|
|
|
|
|
|
return _OrderedTaskRunner.decorate_ordered(fn, state, after,
|
|
|
|
reset_after)
|
|
|
|
return _ordered
|
|
|
|
|
|
|
|
|
2020-05-11 10:17:47 +02:00
|
|
|
class MessageHandlingServer(service.ServiceBase, _OrderedTaskRunner,
|
|
|
|
metaclass=abc.ABCMeta):
|
2015-01-02 14:24:57 -05:00
|
|
|
"""Server for handling messages.
|
|
|
|
|
|
|
|
Connect a transport to a dispatcher that knows how to process the
|
|
|
|
message using an executor that knows how the app wants to create
|
|
|
|
new tasks.
|
|
|
|
"""
|
|
|
|
|
2020-01-17 16:24:29 +01:00
|
|
|
def __init__(self, transport, dispatcher, executor=None):
|
2015-01-02 14:24:57 -05:00
|
|
|
"""Construct a message handling server.
|
|
|
|
|
2016-04-15 19:57:16 +03:00
|
|
|
The dispatcher parameter is a DispatcherBase instance which is used
|
|
|
|
for routing request to endpoint for processing.
|
2015-01-02 14:24:57 -05:00
|
|
|
|
|
|
|
The executor parameter controls how incoming messages will be received
|
2020-01-17 16:24:29 +01:00
|
|
|
and dispatched. Executor is automatically detected from
|
|
|
|
execution environment.
|
|
|
|
It handles many message in parallel. If your application need
|
|
|
|
asynchronism then you need to consider to use the eventlet executor.
|
2015-01-02 14:24:57 -05:00
|
|
|
|
|
|
|
:param transport: the messaging transport
|
|
|
|
:type transport: Transport
|
2016-04-15 19:57:16 +03:00
|
|
|
:param dispatcher: has a dispatch() method which is invoked for each
|
|
|
|
incoming request
|
|
|
|
:type dispatcher: DispatcherBase
|
2017-03-20 17:24:32 +08:00
|
|
|
:param executor: name of message executor - available values are
|
2017-06-01 09:27:40 +02:00
|
|
|
'eventlet' and 'threading'
|
2015-01-02 14:24:57 -05:00
|
|
|
:type executor: str
|
|
|
|
"""
|
2020-01-17 16:24:29 +01:00
|
|
|
if executor and executor not in ("threading", "eventlet"):
|
|
|
|
raise ExecutorLoadFailure(
|
|
|
|
executor,
|
|
|
|
"Executor should be None or 'eventlet' and 'threading'")
|
|
|
|
if not executor:
|
|
|
|
executor = utils.get_executor_with_context()
|
|
|
|
|
2015-01-02 14:24:57 -05:00
|
|
|
self.conf = transport.conf
|
2016-02-20 11:31:15 -05:00
|
|
|
self.conf.register_opts(_pool_opts)
|
2015-01-02 14:24:57 -05:00
|
|
|
|
|
|
|
self.transport = transport
|
|
|
|
self.dispatcher = dispatcher
|
2016-02-20 11:31:15 -05:00
|
|
|
self.executor_type = executor
|
2020-01-17 16:24:29 +01:00
|
|
|
if self.executor_type == "eventlet":
|
2017-03-21 11:34:21 +08:00
|
|
|
eventletutils.warn_eventlet_not_patched(
|
|
|
|
expected_patched_modules=['thread'],
|
|
|
|
what="the 'oslo.messaging eventlet executor'")
|
2016-02-20 11:31:15 -05:00
|
|
|
|
|
|
|
self.listener = None
|
2015-01-02 14:24:57 -05:00
|
|
|
|
|
|
|
try:
|
|
|
|
mgr = driver.DriverManager('oslo.messaging.executors',
|
2016-02-20 11:31:15 -05:00
|
|
|
self.executor_type)
|
2015-01-02 14:24:57 -05:00
|
|
|
except RuntimeError as ex:
|
2016-02-20 11:31:15 -05:00
|
|
|
raise ExecutorLoadFailure(self.executor_type, ex)
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
|
|
|
|
self._executor_cls = mgr.driver
|
2016-02-20 11:31:15 -05:00
|
|
|
|
|
|
|
self._work_executor = None
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
|
|
|
|
self._started = False
|
2015-01-02 14:24:57 -05:00
|
|
|
|
|
|
|
super(MessageHandlingServer, self).__init__()
|
|
|
|
|
2016-04-02 14:58:29 +03:00
|
|
|
def _on_incoming(self, incoming):
|
2016-04-15 19:57:16 +03:00
|
|
|
"""Handles on_incoming event
|
2016-04-02 14:58:29 +03:00
|
|
|
|
|
|
|
:param incoming: incoming request.
|
|
|
|
"""
|
|
|
|
self._work_executor.submit(self._process_incoming, incoming)
|
|
|
|
|
2016-03-25 13:39:53 +02:00
|
|
|
@abc.abstractmethod
|
|
|
|
def _process_incoming(self, incoming):
|
2016-04-02 14:58:29 +03:00
|
|
|
"""Perform processing incoming request
|
2016-03-25 13:39:53 +02:00
|
|
|
|
|
|
|
:param incoming: incoming request.
|
|
|
|
"""
|
|
|
|
|
|
|
|
@abc.abstractmethod
|
|
|
|
def _create_listener(self):
|
|
|
|
"""Creates listener object for polling requests
|
|
|
|
:return: MessageListenerAdapter
|
|
|
|
"""
|
2016-02-20 11:31:15 -05:00
|
|
|
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
@ordered(reset_after='stop')
|
2016-02-17 09:47:37 -05:00
|
|
|
def start(self, override_pool_size=None):
|
2015-01-02 14:24:57 -05:00
|
|
|
"""Start handling incoming messages.
|
|
|
|
|
|
|
|
This method causes the server to begin polling the transport for
|
|
|
|
incoming messages and passing them to the dispatcher. Message
|
|
|
|
processing will continue until the stop() method is called.
|
|
|
|
|
|
|
|
The executor controls how the server integrates with the applications
|
|
|
|
I/O handling strategy - it may choose to poll for messages in a new
|
|
|
|
process, thread or co-operatively scheduled coroutine or simply by
|
|
|
|
registering a callback with an event loop. Similarly, the executor may
|
|
|
|
choose to dispatch messages in a new thread, coroutine or simply the
|
|
|
|
current thread.
|
|
|
|
"""
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
if self._started:
|
2019-04-11 10:47:24 +02:00
|
|
|
LOG.warning('The server has already been started. Ignoring '
|
|
|
|
'the redundant call to start().')
|
2018-07-03 13:38:46 -04:00
|
|
|
return
|
|
|
|
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
self._started = True
|
|
|
|
|
2016-02-20 11:31:15 -05:00
|
|
|
executor_opts = {}
|
|
|
|
|
2020-01-17 16:24:29 +01:00
|
|
|
executor_opts["max_workers"] = (
|
|
|
|
override_pool_size or self.conf.executor_thread_pool_size
|
|
|
|
)
|
2016-02-20 11:31:15 -05:00
|
|
|
self._work_executor = self._executor_cls(**executor_opts)
|
|
|
|
|
2016-04-02 14:58:29 +03:00
|
|
|
try:
|
|
|
|
self.listener = self._create_listener()
|
|
|
|
except driver_base.TransportDriverError as ex:
|
|
|
|
raise ServerListenError(self.target, ex)
|
|
|
|
|
2016-04-15 19:57:16 +03:00
|
|
|
self.listener.start(self._on_incoming)
|
Fix a race calling blocking MessageHandlingServer.start()
This fixes a race due to the quirkiness of the blocking executor. The
blocking executor does not create a separate thread, but is instead
explicitly executed in the calling thread. Other threads will,
however, continue to interact with it.
In the non-blocking case, the executor will have done certain
initialisation in start() before starting a worker thread and
returning control to the caller. That is, the caller can be sure that
this initialisation has occurred when control is returned. However, in
the blocking case, control is never returned. We currently work round
this by setting self._running to True before executing executor.start,
and by not doing any locking whatsoever in MessageHandlingServer.
However, this current means there is a race whereby executor.stop()
can run before executor.start(). This is fragile and extremely
difficult to reason about robustly, if not currently broken.
The solution is to split the initialisation from the execution in the
blocking case. executor.start() is no longer a blocking operation for
the blocking executor. As for the non-blocking case, executor.start()
returns as soon as initialisation is complete, indicating that it is
safe to subsequently call stop(). Actual execution is done explicitly
via the new execute() method, which blocks.
In doing this, we also make FakeBlockingThread a more complete
implementation of threading.Thread. This fixes a related issue in
that, previously, calling server.wait() on a blocking executor from
another thread would not wait for the completion of the executor. This
has a knock-on effect in test_server's ServerSetupMixin. This mixin
created an endpoint with a stop method which called server.stop().
However, as this is executed by the executor, and also joins the
executor thread, which is now blocking, this results in a deadlock. I
am satisfied that, in general, this is not a sane thing to do.
However, it is useful for these tests. We fix the tests by making the
stop method non-blocking, and do the actual stop and wait calls from
the main thread.
Change-Id: I0d332f74c06c22b44179319432153e15b69f2f45
2015-10-19 13:04:37 +01:00
|
|
|
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
@ordered(after='start')
|
2015-01-02 14:24:57 -05:00
|
|
|
def stop(self):
|
|
|
|
"""Stop handling incoming messages.
|
|
|
|
|
|
|
|
Once this method returns, no new incoming messages will be handled by
|
|
|
|
the server. However, the server may still be in the process of handling
|
|
|
|
some messages, and underlying driver resources associated to this
|
|
|
|
server are still in use. See 'wait' for more details.
|
|
|
|
"""
|
2018-08-02 13:24:11 +02:00
|
|
|
if self.listener:
|
|
|
|
self.listener.stop()
|
2016-02-20 11:31:15 -05:00
|
|
|
self._started = False
|
|
|
|
|
Robustify locking in MessageHandlingServer
This change formalises locking in MessageHandlingServer, which closes
several bugs:
* It adds locking for internal state when using the blocking executor,
which closes a number of races.
* It does not hold a lock while executing server functions,
which removes a potential cause of deadlock if the server does its
own locking.
* It fixes a regression introduced in change
gI3cfbe1bf02d451e379b1dcc23dacb0139c03be76. If multiple threads
called wait() simultaneously, only 1 of them would wait and the
others would return immediately, despite message handling not having
completed. With this change only 1 will call the underlying wait,
but all will wait on its completion.
Additionally, it introduces some new functionality:
* It allows the user to make calls in any order and it will ensure,
with locking, that these will be reordered appropriately.
* The caller can pass a `timeout` argument to any server method, which
will cause it to raise an exception if it waits too long.
* The caller can pass a `log_after` argument to any server method,
which will cause it to raise a log message if it waits too long. It
can also be used to disable logging when waiting is intentional.
We remove DummyCondition as it no longer has any users.
This change was originally committed as change
I9d516b208446963dcd80b75e2d5a2cecb1187efa, but was reverted as it
caused a hang in a Nova test. This was caused by the locking behaviour
for handling restarting a previously stopped server. The original
patch caused the state to 'wrap' immediately after the user called
wait(). This caused a hang in tests which redundantly called stop()
and wait() multiple times. This new patch only wraps when the user
calls start() again. Callers who do not restart a server will
therefore not be affected by the wrapping behaviour. Callers who do
restart a server will be no worse than before. We add a deprecation
warning on restart, as this operation is inherently racy with this api
and there is a simple, safe alternative.
This new version has been successfully tested against the unit and
functional tests of nova, cinder, glance, and ceilometer.
Change-Id: Ic79f87e7b069c1f62d6121486fd6cafd732fdde7
2015-10-19 14:11:23 +01:00
|
|
|
@ordered(after='stop')
|
2015-01-02 14:24:57 -05:00
|
|
|
def wait(self):
|
|
|
|
"""Wait for message processing to complete.
|
|
|
|
|
2015-10-09 21:55:55 +09:00
|
|
|
After calling stop(), there may still be some existing messages
|
2015-01-02 14:24:57 -05:00
|
|
|
which have not been completely processed. The wait() method blocks
|
|
|
|
until all message processing has completed.
|
|
|
|
|
|
|
|
Once it's finished, the underlying driver resources associated to this
|
|
|
|
server are released (like closing useless network connections).
|
|
|
|
"""
|
2016-02-20 11:31:15 -05:00
|
|
|
self._work_executor.shutdown(wait=True)
|
|
|
|
|
|
|
|
# Close listener connection after processing all messages
|
2018-08-02 13:24:11 +02:00
|
|
|
if self.listener:
|
|
|
|
self.listener.cleanup()
|
2015-07-01 16:12:08 +03:00
|
|
|
|
|
|
|
def reset(self):
|
|
|
|
"""Reset service.
|
|
|
|
|
|
|
|
Called in case service running in daemon mode receives SIGHUP.
|
|
|
|
"""
|
|
|
|
# TODO(sergey.vilgelm): implement this method
|
|
|
|
pass
|