Introduction

Welcome to the Cloudbreak on the Azure Marketplace Technical Preview documentation!

Cloudbreak on the Azure Marketplace allows you to provision HDP and HDF clusters on Azure using the Microsoft Azure infrastructure.

Cloudbreak is a tool that simplifies the provisioning, management, and monitoring of on-demand HDP clusters in virtual and cloud environments. It leverages cloud infrastructure to create host instances, and uses Apache Ambari via Ambari blueprints to provision and manage Hortonworks clusters.

Primary Use Cases

Cloudbreak allows you to create, manage, and monitor your clusters on your chosen cloud platform:

Architecture

The following graphic illustrates high-level architecture of Cloudbreak on the Azure Marketplace:

When you launch the Cloudbreak, a new resource group is created and the following Azure resources are provisioned within it:

If you chose to use an existing virtual network, the virtual network will not be added to the resource group.

For each created cluster, a new resource group is created and the following Azure resources are provisioned within it:

Supported Regions

You can launch Cloudbreak and provision your clusters in all regions supported by Microsoft Azure.

Default Cluster Configurations

Cloudbreak includes default cluster configurations (in the form of bleuprints) and supports using your own custom cluster configurations (in the form of custom blueprints).

The following default cluster configurations are available:

Platform Version: HDP 2.6

Cluster Type Main Services Description List of All Services Included
Data Science Spark 2,
Zeppelin
Useful for data science with Spark 2 and Zeppelin. HDFS, YARN, MapReduce2, Tez, Hive, Pig, Sqoop, ZooKeeper, Ambari Metrics, Spark 2, Zeppelin
EDW - Analytics Hive 2 LLAP,
Zeppelin
Useful for EDW analytics using Hive LLAP. HDFS, YARN, MapReduce2, Tez, Hive 2 LLAP, Druid, Pig, ZooKeeper, Ambari Metrics, Spark 2
EDW - ETL Hive,
Spark 2
Useful for ETL data processing with Hive and Spark 2. HDFS, YARN, MapReduce2, Tez, Hive, Pig, ZooKeeper, Ambari Metrics, Spark 2

Platform Version: HDF 3.1

Cluster type Main services Description List of all services included
Flow Management NiFi Useful for flow management with NiFi. NiFi, NiFi Registry, ZooKeeper, Ambari Metrics
Messaging Management Kafka Useful for messaging management with Kafka. Kafka, ZooKeeper, Ambari Metrics

The following configuration classification applies:

Get Started

To quickly get started with Cloudbreak on the Azure Marketplace:

  1. Meet the prerequisites
  2. Launch a Cloudbreak instance, log in to the Cloudbreak UI, and create a Cloudbreak credential
  3. Create a cluster

Note

The Cloudbreak software runs in your Azure environment. You are responsible for Azure Portal charges while running Cloudbreak and the clusters being managed by Cloudbreak. To learn more about Azure pricing, refer to Azure documentation.