Description of common configuration options
Configuration option = Default value Description
[DEFAULT]
cluster_remote_threshold = 70 (IntOpt) The same as global_remote_threshold, but for a single cluster.
compute_topology_file = etc/sahara/compute.topology (StrOpt) File describing the Nova compute topology. It should contain a mapping between Nova compute nodes and racks. File format: compute1 /rack1 compute2 /rack2 compute3 /rack2
control_exchange = openstack (StrOpt) The default exchange under which topics are scoped. May be overridden by an exchange name specified in the transport_url option.
detach_volume_timeout = 300 (IntOpt) Timeout for detaching volumes from instance (in seconds).
enable_data_locality = False (BoolOpt) Enables data locality for the Hadoop cluster, and for Swift storage used by Hadoop. If enabled, the compute_topology_file and swift_topology_file options should point to the OpenStack and Swift topology files, respectively.
enable_hypervisor_awareness = True (BoolOpt) Enables four-level topology for data locality. Works only if the corresponding plugin supports this mode.
global_remote_threshold = 100 (IntOpt) Maximum number of remote operations that will be running at the same time. Note that each remote operation requires its own process to run.
host = (StrOpt) Hostname or IP address that will be used to listen on.
infrastructure_engine = direct (StrOpt) The engine used to provision infrastructure for the Hadoop cluster.
job_binary_max_KB = 5120 (IntOpt) Maximum length of job binary data in kilobytes that may be stored or retrieved in a single operation.
job_workflow_postfix = (StrOpt) Postfix for storing jobs in HDFS. Will be appended to /user/hadoop/.
lock_path = None (StrOpt) Directory to use for lock files.
memcached_servers = None (ListOpt) Memcached servers, or None for an in-process cache.
min_transient_cluster_active_time = 30 (IntOpt) Minimal "lifetime" in seconds for a transient cluster. Cluster is guaranteed to be "alive" within this time period.
node_domain = novalocal (StrOpt) The suffix of the node's FQDN. In nova-network that is the dhcp_domain config parameter.
os_region_name = None (StrOpt) Region name used to get service endpoints.
periodic_enable = True (BoolOpt) Enable periodic tasks.
periodic_fuzzy_delay = 60 (IntOpt) Range in seconds to randomly delay when starting the periodic task scheduler to reduce stampeding. (Disable by setting to 0).
periodic_interval_max = 60 (IntOpt) Maximum interval in seconds between executions of periodic tasks.
plugins = vanilla, hdp (ListOpt) List of plugins to be loaded. Sahara preserves the order of the list when returning it.
port = 8386 (IntOpt) Port that will be used to listen on.
remote = ssh (StrOpt) A method for Sahara to execute commands on VMs.
run_external_periodic_tasks = True (BoolOpt) Some periodic tasks can be run in a separate process. Should we run them here?
swift_topology_file = etc/sahara/swift.topology (StrOpt) File describing the Swift topology. It should contain a mapping between Swift nodes and racks. File format: node1 /rack1 node2 /rack2 node3 /rack2
transport_url = None (StrOpt) A URL representing the messaging driver to use and its full configuration. If not set, we fall back to the rpc_backend option and driver specific configuration.
use_floating_ips = True (BoolOpt) If set to True, Sahara will use floating IPs to communicate with instances. To make sure that all instances have floating IPs assigned in Nova Network, set "auto_assign_floating_ip=True" in nova.conf. If Neutron is used for networking, make sure that all node groups have the "floating_ip_pool" parameter defined.
use_identity_api_v3 = True (BoolOpt) Enables Sahara to use Keystone API v3. If this flag is disabled, per-job clusters will not be terminated automatically.
use_namespaces = False (BoolOpt) Use network namespaces for communication (only valid to use in conjunction with use_neutron=True).
use_neutron = False (BoolOpt) Use Neutron Networking (False indicates the use of Nova networking).
[conductor]
use_local = True (BoolOpt) Perform sahara-conductor operations locally.
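
As an illustration of how the options above fit together, the following is a minimal sketch, not Sahara's own loader. It reads a sample configuration with Python's standard configparser module and parses a topology file in the format shown for compute_topology_file. The helper names (parse_topology, load_options, namespaces_setting_is_valid) are hypothetical, and the sketch assumes the topology file holds one "host /rack" pair per line.

```python
import configparser

# Sample configuration using option names from the table above.
SAMPLE_CONF = """
[DEFAULT]
enable_data_locality = True
compute_topology_file = etc/sahara/compute.topology
swift_topology_file = etc/sahara/swift.topology
use_neutron = True
use_namespaces = True

[conductor]
use_local = True
"""

# Assumed layout: one "host /rack" pair per line.
SAMPLE_TOPOLOGY = """
compute1 /rack1
compute2 /rack2
compute3 /rack2
"""


def parse_topology(text):
    """Return a dict mapping each host name to its rack path."""
    mapping = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        host, rack = line.split()
        mapping[host] = rack
    return mapping


def load_options(text):
    """Parse INI-style configuration text into a ConfigParser."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return parser


def namespaces_setting_is_valid(parser):
    """use_namespaces is only valid together with use_neutron=True."""
    use_ns = parser.getboolean("DEFAULT", "use_namespaces", fallback=False)
    use_neutron = parser.getboolean("DEFAULT", "use_neutron", fallback=False)
    return use_neutron or not use_ns


if __name__ == "__main__":
    options = load_options(SAMPLE_CONF)
    print(parse_topology(SAMPLE_TOPOLOGY))
    print(namespaces_setting_is_valid(options))
```

Sahara itself reads its configuration through its own option-handling code, so this sketch is only a convenient way to sanity-check a configuration file before deploying it.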