Welcome to the documentation for DC/OS Apache HDFS. DC/OS Apache HDFS is a managed service that makes it easy to deploy and manage a highly available (HA) Apache HDFS cluster on Mesosphere DC/OS. Apache HDFS (Hadoop Distributed File System) is an open-source distributed file system based on Google’s GFS (Google File System) paper. It provides replicated, distributed storage for “big data” and “fast data” applications.
Benefits
DC/OS Apache HDFS offers the following benefits:
- Easy installation
- Multiple Apache HDFS clusters
- Elastic scaling of data nodes
- Integrated monitoring
Features
DC/OS Apache HDFS provides the following features:
- Single-command installation for rapid provisioning
- Persistent storage volumes for enhanced data durability
- Runtime configuration and software updates for high availability
- Health checks and metrics for monitoring
- Scale-out of distributed storage
- HA name service with Quorum Journaling and ZooKeeper failure detection
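As a rough illustration of the single-command installation listed above, the following sketch assembles the DC/OS CLI invocation in Python. It assumes the service is published under the package name `hdfs` and that the standard `dcos package install` flags `--options` and `--yes` apply; the helper `build_install_command` and the file name `options.json` are hypothetical and exist only for this example. Check the Getting Started page for the exact command for your release.

```python
import shlex
from typing import List, Optional


def build_install_command(
    package: str = "hdfs",               # assumed package name
    options_file: Optional[str] = None,  # optional JSON file with custom settings
    assume_yes: bool = True,             # skip the interactive confirmation prompt
) -> List[str]:
    """Build the argument list for a DC/OS package installation."""
    cmd = ["dcos", "package", "install", package]
    if options_file is not None:
        cmd.append(f"--options={options_file}")
    if assume_yes:
        cmd.append("--yes")
    return cmd


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # on a machine without the DC/OS CLI installed.
    print(shlex.join(build_install_command()))
```

To actually run the installation, the returned list can be passed to `subprocess.run(cmd, check=True)` on a machine where the DC/OS CLI is installed and attached to a cluster.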
Related Services
Release Notes
Discover the new features, updates, and known limitations in this release of the HDFS Service.
Getting Started
Getting started with DC/OS Apache HDFS
Overview
Advanced features of the DC/OS Apache HDFS service
Operations
Plan and pod operations in the DC/OS Apache HDFS service
API Reference
API reference for the DC/OS Apache HDFS service
Limitations
Known limitations of the DC/OS Apache HDFS service
Supported Versions
DC/OS and certified package version support policy