d3a6b6b68b
Includes: * Sahara Overview * Installation instructions * Basic verification steps Change-Id: Ic68c37527f99c72084c156487d9f9829af49def6
49 lines
2.1 KiB
XML
49 lines
2.1 KiB
XML
<?xml version="1.0" encoding="UTF-8"?>
|
|
<section xmlns="http://docbook.org/ns/docbook"
|
|
xmlns:xi="http://www.w3.org/2001/XInclude"
|
|
xmlns:xlink="http://www.w3.org/1999/xlink"
|
|
version="5.0"
|
|
xml:id="sahara-service">
|
|
<title>Data processing service</title>
|
|
<para>The Data processing service for OpenStack (sahara) aims to provide
|
|
users with simple means to provision data processing (Hadoop, Spark)
|
|
clusters by specifying several parameters like Hadoop version, cluster
|
|
topology, nodes hardware details and a few more. After user fills in
|
|
all the parameters, the Data processing service deploys the cluster in a
|
|
few minutes. Also sahara provides means to scale already provisioned
|
|
clusters by adding/removing worker nodes on demand.
|
|
</para>
|
|
|
|
<para>The solution addresses the following use cases:
|
|
<itemizedlist>
|
|
<listitem><para>Fast provisioning of Hadoop clusters on OpenStack for
|
|
development and QA.</para></listitem>
|
|
<listitem><para>Utilization of unused compute power from general
|
|
purpose OpenStack IaaS cloud.</para></listitem>
|
|
<listitem><para>Analytics-as-a-Service for ad-hoc or bursty analytic
|
|
workloads.</para></listitem>
|
|
</itemizedlist>
|
|
</para>
|
|
|
|
|
|
<para>Key features are:
|
|
<itemizedlist>
|
|
<listitem><para>Designed as an OpenStack component.</para></listitem>
|
|
<listitem><para>Managed through REST API with UI available as part
|
|
of OpenStack dashboard.</para></listitem>
|
|
<listitem><para>Support for different Hadoop distributions:
|
|
<itemizedlist>
|
|
<listitem><para>Pluggable system of Hadoop installation
|
|
engines.</para></listitem>
|
|
<listitem><para>Integration with vendor specific management tools,
|
|
such as Apache Ambari or Cloudera Management Console.</para></listitem>
|
|
</itemizedlist>
|
|
</para></listitem>
|
|
<listitem><para>Predefined templates of Hadoop configurations with
|
|
ability to modify parameters.</para></listitem>
|
|
<listitem><para>User-friendly UI for ad-hoc analytics queries based on
|
|
Hive or Pig.</para></listitem>
|
|
</itemizedlist>
|
|
</para>
|
|
</section>
|