
Cluster Setup Guide

This guide details the deployment of a Ceph storage cluster using cephadm on Ubuntu 22.04.

1. Inventory & Architecture

| Role | Hostname | IP Address | Description |
| --- | --- | --- | --- |
| Admin | ceph-mgr-1 | 192.168.202.221 | Bootstrap node, Mgr #1, Dashboard |
| Standby | ceph-mgr-2 | 192.168.202.222 | Mgr #2 |
| OSD/Mon | ceph-osd-1 | 192.168.202.231 | OSD, Mon #1 |
| OSD/Mon | ceph-osd-2 | 192.168.202.232 | OSD, Mon #2 |
| OSD/Mon | ceph-osd-3 | 192.168.202.233 | OSD, Mon #3 |
| OSD | ceph-osd-4 | 192.168.202.234 | OSD (storage only) |

2. Prerequisites

Run on ALL Nodes

The following steps ensure environment consistency and time synchronization.

2.1 Set Hostnames

```bash
# Run on every node, substituting that node's hostname (example: ceph-mgr-2)
sudo hostnamectl set-hostname ceph-mgr-2
```

2.2 Configure /etc/hosts

Ensure all nodes can resolve each other by name.

```text
192.168.202.221 ceph-mgr-1
192.168.202.222 ceph-mgr-2
192.168.202.231 ceph-osd-1
192.168.202.232 ceph-osd-2
192.168.202.233 ceph-osd-3
192.168.202.234 ceph-osd-4
```
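Before moving on, it is worth confirming that every inventory name actually resolves. A minimal check loop (a sketch; `check_host` is just a helper name used here, not a Ceph tool):

```bash
# Verify that every inventory hostname resolves. getent consults /etc/hosts
# as well as DNS, which matches what Ceph's tooling will see.
check_host() {
  if getent hosts "$1" >/dev/null; then
    echo "OK: $1"
  else
    echo "MISSING: $1"
  fi
}

for h in ceph-mgr-1 ceph-mgr-2 ceph-osd-1 ceph-osd-2 ceph-osd-3 ceph-osd-4; do
  check_host "$h"
done
```

Any host reported as `MISSING` needs its /etc/hosts entry fixed before bootstrapping.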

2.3 Dependencies

We use Docker as the container runtime, lvm2 for disk management, and chrony for time sync.

```bash
sudo apt update
sudo apt install -y docker.io lvm2 chrony python3
```

2.4 Time Synchronization

WARNING

Ceph is extremely sensitive to clock drift. Ensure chrony is active.

```bash
sudo systemctl enable --now chrony
chronyc sources
```

3. Admin Node Setup

Admin Node Only

Run these steps strictly on ceph-mgr-1.

3.1 Install Cephadm

```bash
# Download the standalone cephadm binary (the el9 path is distro-independent)
curl --silent --remote-name --location https://download.ceph.com/rpm-squid/el9/noarch/cephadm
chmod +x cephadm

# Add the Squid release repository, then install cephadm via apt
sudo ./cephadm add-repo --release squid
sudo apt update
sudo apt install -y cephadm
```

3.2 Bootstrap the Cluster

```bash
sudo cephadm bootstrap --mon-ip 192.168.202.221 --allow-fqdn-hostname
```

Wait for the process to finish. On success, it prints the dashboard URL and the initial admin credentials; save them.

3.3 Enable Ceph CLI

The bootstrap process installs a minimal config at /etc/ceph/ceph.conf. To use the ceph command without typing sudo or entering the container every time:

```bash
# Install the common tools
sudo cephadm install ceph-common

# Verify access
sudo ceph -s
# Status should be HEALTH_WARN (OSD count 0)
```

3.4 Distribute SSH Keys

cephadm uses SSH to manage the other nodes. The bootstrap process generated an SSH key at /etc/ceph/ceph.pub. Copy this key to all other nodes (ceph-mgr-2 and ceph-osd-1 through ceph-osd-4).

```bash
# Copy the cluster's public key to every other node
ssh-copy-id -f -i /etc/ceph/ceph.pub root@ceph-mgr-2
ssh-copy-id -f -i /etc/ceph/ceph.pub root@ceph-osd-1
ssh-copy-id -f -i /etc/ceph/ceph.pub root@ceph-osd-2
ssh-copy-id -f -i /etc/ceph/ceph.pub root@ceph-osd-3
ssh-copy-id -f -i /etc/ceph/ceph.pub root@ceph-osd-4
```

Note: You must have root SSH access enabled on target nodes, or copy the key to a user with sudo privileges and configure cephadm to use that user.

4. Expanding the Cluster

Run all following commands from the Admin Node.

4.1 Add the Manager Node

Add the second manager node to the cluster.

```bash
sudo ceph orch host add ceph-mgr-2 192.168.202.222 --labels _admin
```

4.2 Add OSD Nodes

Add the four storage nodes. We label them `osd` (and the first three also `mon`) to help with service placement later.

```bash
sudo ceph orch host add ceph-osd-1 192.168.202.231 --labels=osd,mon
sudo ceph orch host add ceph-osd-2 192.168.202.232 --labels=osd,mon
sudo ceph orch host add ceph-osd-3 192.168.202.233 --labels=osd,mon
sudo ceph orch host add ceph-osd-4 192.168.202.234 --labels=osd
```
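Equivalently, hosts can be declared in a YAML spec file and applied in one step with `sudo ceph orch apply -i hosts.yaml` (the filename is illustrative). A sketch for two of the nodes, using cephadm's host service spec:

```yaml
service_type: host
hostname: ceph-osd-1
addr: 192.168.202.231
labels:
  - osd
  - mon
---
service_type: host
hostname: ceph-osd-4
addr: 192.168.202.234
labels:
  - osd
```

Keeping host specs in version control makes the inventory reproducible.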

4.3 Verify Host List

```bash
sudo ceph orch host ls
```

Output should show 6 hosts total.

5. Deploy Services

5.1 Finalize Monitor Placement

By default, the bootstrap process places the first Monitor on the bootstrap node (ceph-mgr-1). To run the Monitors exclusively on the storage nodes (ceph-osd-1, ceph-osd-2, ceph-osd-3), keeping quorum separate from the Manager nodes, apply an explicit placement:

CRITICAL: Always run an odd number of monitors (here, 3) so quorum can be maintained.

```bash
# Apply the monitor specification to place them explicitly on the 3 OSD nodes
sudo ceph orch apply mon --placement="ceph-osd-1,ceph-osd-2,ceph-osd-3"

# Verify the monitors have successfully migrated and are running
sudo ceph orch ps --daemon_type mon
```
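Since the three Monitor hosts already carry the `mon` label from section 4.2, the same placement can also be expressed as a label-based service spec (a sketch, applied with `sudo ceph orch apply -i mon.yaml`; the filename is illustrative):

```yaml
service_type: mon
placement:
  label: mon
```

With label-based placement, moving a Monitor later only requires changing host labels, not rewriting the spec.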

5.2 Finalize Manager Placement

By default, the Ceph bootstrap process only creates one Manager (mgr) daemon on the admin node (ceph-mgr-1). To ensure high availability of the Ceph dashboard and orchestrator, you should safely add a standby manager to your second manager node (ceph-mgr-2).

```bash
# Apply the manager specification to ensure it runs on both mgr nodes
sudo ceph orch apply mgr --placement="ceph-mgr-1,ceph-mgr-2"

# Verify that two mgr daemons are now running (one active, one standby)
sudo ceph orch ps --daemon_type mgr
```

5.3 Deploy OSDs (Storage)

We will tell Ceph to consume every unused raw disk in the cluster. Note that `--all-available-devices` applies to all hosts, not only those labeled `osd`; in this inventory only the OSD nodes have spare disks, so the effect is the same.

Ensure your OSD nodes have empty, unformatted secondary disks (e.g., /dev/sdb).

```bash
# Check available devices
sudo ceph orch device ls

# Provision OSDs on every available, unused device in the cluster
sudo ceph orch apply osd --all-available-devices
```
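If you want OSDs restricted to hosts labeled `osd` rather than the whole cluster, an OSD service spec gives that control. A minimal sketch, applied with `sudo ceph orch apply -i osd-spec.yaml` (the filename and `service_id` are illustrative):

```yaml
service_type: osd
service_id: default_drives
placement:
  label: osd
data_devices:
  all: true
```

The spec can later be narrowed further, e.g. by device size or model, without touching hosts individually.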

6. Verification

6.1 Check Cluster Status

```bash
sudo ceph -s
```

Healthy Output:

```text
health: HEALTH_OK
mon: 3 daemons (ceph-osd-1, ceph-osd-2, ceph-osd-3)
mgr: ceph-mgr-1(active, since X), standbys: ceph-mgr-2
osd: X osds: X up, X in (where X is number of disks)
```

6.2 Access Dashboard

Open the dashboard URL printed during bootstrap (by default, https://ceph-mgr-1:8443) and log in with the credentials from the bootstrap output.

Note: Your browser will warn about the self-signed certificate; this is expected.


Having issues? Check the 00-troubleshooting.md guide for solutions to common deployment errors, missing disks, or stuck orchestrator daemons!