Merge "Maintenance Mode update"

2015-08-14 11:46:30 +00:00
parent ac3935502b 3433d5f0fb
commit 723e8d5e49
3 changed files with 145 additions and 136 deletions
--- a/_images/mm_bootflow.png
+++ b/_images/mm_bootflow.png
--- a/pages/operations/7450-mm-ops.rst
+++ b/pages/operations/7450-mm-ops.rst
@@ -19,35 +19,35 @@ parameter in one of the following ways:

 * by selecting the respective option in the boot menu in Ubuntu or CentOS;

-* by forcing the reboot into maintenance mode from shell with the ``umm on``
-  command;
+* by forcing the reboot into maintenance mode from shell with
+  the :command:`umm on` command;

 * automatically, by reaching an number of unclean-reboots specified in
-  REBOOT_COUNT parameters.
+  ``REBOOT_COUNT`` parameters.

-"Unclean reboot" means that system reboots unexpectedly without a
+`Unclean reboot` means that system reboots unexpectedly without a
 direct call from the user.

 You can also disable the maintenance mode functionality
-if you do not need it (e.g. you do not want to
+if you do not need it (for example, you do not want to
 be automatically booted into it every time).

 You can operate in maintenance mode through ssh or tty2.

-A return back into normal mode is issued with the *umm off* command.
+A return back into normal mode is issued with the :command:`umm off`
+command.

 .. Note ::

  If you manually start a service in the maintenance mode, it will not
  be automatically restarted when you put the system back in the normal
-  mode with the *umm off* command.
+  mode with the :command:`umm off` command.


+Using the :command:`umm` command
+--------------------------------

-Using the ``umm`` command
-------------------------
-
-There are several parameters to use with the *umm* command:
+There are several parameters to use with the :command:`umm` command:

 - ``umm on [cmd]`` - enter the maintenance mode, and execute cmd when MM is reached;

@@ -68,10 +68,10 @@ There are several parameters to use with the *umm* command:
 - ``umm disable`` - disable the maintenance mode functionality.


-Configuring the ``UMM.conf`` file
+Configuring the `UMM.conf` file
 ---------------------------------

-You can automate the maintenance mode start by editing the */etc/umm.conf* file.
+You can automate the maintenance mode start by editing the `/etc/umm.conf` file.

 The configuration options are:

@@ -82,20 +82,19 @@ The configuration options are:

 where:

- UMM
-   will tell the system to go into the maintenance mode based on
-   the REBOOT_COUNT and COUNTER_RESET_TIME values. If the value is
-   anything other than ``yes`` (or if the ``UMM.conf`` file is missing), the
+UMM
+  tells the system to go into the maintenance mode based on
+  the ``REBOOT_COUNT`` and ``COUNTER_RESET_TIME`` values. If the value is
+  anything other than ``yes`` (or if the `UMM.conf` file is missing), the
  system will go into the native Ubuntu recovery mode.

- REBOOT_COUNT
-   determines the number of unclean reboots that will
-   trigger the system to go into the maintenance mode;
-
- COUNTER_RESET_TIME
-   this is a time value in minutes after the system reboot when
-   "Unclean reboot" counter will be resetted.
+REBOOT_COUNT
+  determines the number of unclean reboots that trigger the system to go
+  into the maintenance mode.

+COUNTER_RESET_TIME
+  determines the period of time (in minutes) before the `Unclean reboot`
+  counter reset.


 Example of using MM on one node
@@ -103,7 +102,8 @@ Example of using MM on one node

 - Switching node into MM:

-  ::
+  .. code-block:: bash
+     :linenos:

     root@node-1:~#umm on
     umm-gr start/running, process 6657
@@ -125,11 +125,12 @@ Example of using MM on one node
     root@node-1:~#ps -Af


-  We can see only small set of working process.
+  We can see only small set of working processes.

 - Start the service:

-  ::
+  .. code-block:: bash
+     :linenos:

     root@node-1:~# /etc/init.d/apache2 start
     root@node-1:~# /etc/init.d/apache2 status
@@ -138,13 +139,15 @@ Example of using MM on one node

 - Switch back to the working mode:

-  ::
+  .. code-block:: bash
+     :linenos:

     root@node-1:~#umm off

 - Continue booting into working mode:

-  ::
+  .. code-block:: bash
+     :linenos:

     root@node-1:~#umm status
     runlevel N 2
@@ -157,14 +160,16 @@ Example of using MM on one node

 - Check the state of the OpenStack services:

-  ::
+  .. code-block:: bash
+     :linenos:

     root@node-1:~#crm status

 - If you want to reach working mode by reboot, you should use the following
  command:

-  ::
+  .. code-block:: bash
+     :linenos:

     root@node-1:~# umm off reboot umm-gr start/running, process 2825

@@ -180,14 +185,15 @@ Example of using MM on one node
 Example of putting all nodes into the maintenance mode at the same time
 -----------------------------------------------------------------------

-The following maintenance mode sequence is called "Last input First out".
+The following maintenance mode sequence is called `Last input First out`.
 This guarantees that there is going to be the most recent data on
 the Cloud Infrastructure Controller (CIC) that comes back first.


- Determine what nodes have Controller (CIC) role:
+- Determine which nodes have Controller (CIC) role:

-  ::
+  .. code-block:: bash
+     :linenos:

     [root@fuel ~]# fuel nodes
     id | status | name             | cluster| ip        | mac               | roles      | pending_roles| online
@@ -197,9 +203,10 @@ the Cloud Infrastructure Controller (CIC) that comes back first.
     1  | ready  | Untitled (c0:01) | 1      | 10.20.0.3 | fa:a1:39:94:7f:4c | controller |              | True
     3  | ready  | Untitled (c0:03) | 1      | 10.20.0.5 | 82:cb:bb:50:40:47 | controller |              | True

- Copy id_rsa to the CICs for passwordless ssh authentification:
+- Copy ``id_rsa`` to the CICs for passwordless ssh authentification:

-  ::
+  .. code-block:: bash
+     :linenos:

     [root@fuel ~]# scp .ssh/id_rsa node-1:.ssh/id_rsa
     Warning: Permanently added 'node-1' (RSA) to the list of known hosts.
@@ -213,7 +220,8 @@ the Cloud Infrastructure Controller (CIC) that comes back first.

 - Enforce switching into MM mode on all nodes:

-  ::
+  .. code-block:: bash
+     :linenos:

     [root@fuel ~]# ssh node-1 umm on ssh node-2 umm on ssh node-3 umm on
     Warning: Permanently added 'node-1' (RSA) to the list of known hosts.
@@ -238,7 +246,8 @@ the Cloud Infrastructure Controller (CIC) that comes back first.

 - Wait until the last node reboots:

-  ::
+  .. code-block:: bash
+     :linenos:

     [root@fuel ~]# ssh node-3
     Warning: Permanently added 'node-3' (RSA) to the list of known hosts.
@@ -253,12 +262,13 @@ the Cloud Infrastructure Controller (CIC) that comes back first.
     Connection to node-3 closed.
     [root@fuel ~]#

- Perform all the steps, planned for MM.
+- Perform all the steps planned for MM.


 - Enforce a return back into normal mode in reverse state:

-  ::
+  .. code-block:: bash
+     :linenos:

     [root@fuel ~]# ssh node-3 umm off
     Warning: Permanently added 'node-3' (RSA) to the list of known hosts.
--- a/pages/terminology/m/maintenance-mode.rst
+++ b/pages/terminology/m/maintenance-mode.rst
@@ -6,21 +6,20 @@ Maintenance Mode

 Maintenance mode (MM) is a mode when the operating system on the node
 has only a critical set of working services that the system needs for
-basic network and disk operations. The purpose of the maintenance mode
-is to do a system repair or run other service operations on the system.
-The implementation of maintenance mode in 15B is based on the Ubuntu
-recovery mode. The system goes into a reboot and goes through the
-regular boot process until the system initialization stage (rc-sysinit).
-This is where the system enters the maintenance mode with the network
-and filesystem services started. In this moment we have already started
-network and filesystem. In MM stage are started sshd, tty2 and main MM
-service wait command for boot flow continue.
+basic network and disk operations. The purpose of MM is to perform a system
+repair or run other maintenance operations on the system.
+
+For switching to MM, the system shuts down and then goes through the regular
+boot process until the system initialization stage (rc-sysinit).
+At that moment the system enters MM, the network and filesystem services
+have already started. During the MM stage, the ``sshd`` and ``tty2``
+services start, and the main MM service waits for the command to continue the boot flow.
+
+See the :ref:`mm-ops` section of the Operations guide for the details.

 Here is a Cloud Infrastructure Controller boot flow scheme:

 .. image:: /_images/mm_bootflow.png

-For more information, see:

- :ref:`mm-ops`.