deb-sahara/doc/source/userdoc/guest-requirements.rst
luhuichun abffc5d1c5 Spark doc references vanilla diskimagebuilder page
"The Spark plugin has been developed and tested with the images
generated by the _Building Images for Vanilla Plugin_." rename
diskimagebuilder to avoid misunderstanding

Change-Id: Idb01707aa5d1982413560961d3a08aa10fdc7aa0
Closes-bug: 1462133
2015-06-16 13:55:35 +08:00

1.7 KiB

Requirements for Guests

Sahara manages guests of various platforms (for example Ubuntu, Fedora, RHEL, and CentOS) with various versions of the Hadoop ecosystem projects installed. There are common requirements for all guests, and additional requirements based on the plugin that is used for cluster deployment.

Common Requirements

  • The operating system must be Linux
  • cloud-init must be installed
  • ssh-server must be installed
    • if a firewall is active it must allow connections on port 22 to enable ssh

Vanilla Plugin Requirements

If the Vanilla Plugin is used for cluster deployment the guest is required to have

  • ssh-client installed
  • Java (version >= 6)
  • Apache Hadoop installed
  • 'hadoop' user created

See hadoop-swift for information on using Swift with your Sahara cluster (for EDP support Swift integration is currently required).

To support EDP, the following components must also be installed on the guest:

  • Oozie version 4 or higher
  • mysql
  • hive

See vanilla_imagebuilder for instructions on building images for this plugin.

HDP Plugin

This plugin does not have any additional requirements. Currently, only the CentOS Linux distribution is supported but other distributions will be supported in the future. To speed up provisioning, the HDP packages can be pre-installed on the image used. The packages' versions depend on the HDP version being used.

Cloudera Plugin Requirements

If the Cloudera Plugin is used for cluster deployment the guest is required to have

  • Cloudera Express installed

See cdh_imagebuilder for instructions on building images for this plugin.