Ceph Cluster Upgrade
Ceph upgrades are done in three steps.
0. Terminology
Trivial update:
* each Ceph component can be updated in any order
Non-trivial upgrade:
* each Ceph component must be updated in a fixed order. Unless the release notes specify otherwise: monitors always first, MDS always last, everything else in between.
* sometimes specific config settings are required prior to or during the update
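The default ordering for a non-trivial upgrade can be sketched in a few lines of Python. This is only an illustration of the rule stated above (monitors first, MDS last); the function name `upgrade_order` and the daemon type labels are assumptions made for the example, and the release notes always take precedence over this default.

```python
def upgrade_order(daemon_types):
    """Return daemon types with 'mon' first and 'mds' last.

    All other types keep their input order, because sorted() is stable.
    """
    def rank(name):
        if name == "mon":
            return 0  # monitors always first
        if name == "mds":
            return 2  # MDS always last
        return 1      # everything else in between

    return sorted(daemon_types, key=rank)


print(upgrade_order(["mds", "osd", "mon", "rgw", "mgr"]))
# prints ['mon', 'osd', 'rgw', 'mgr', 'mds']
```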
1. Prerequisites
- read the Ceph release notes: https://docs.ceph.com/docs/master/release-notes/
- cluster must report HEALTH_OK
- set the noout flag on the cluster
- set max backfills (osd_max_backfills) to a minimal value (if needed)
- check that all MDS (including standby!) are up and running
- keep a terminal with 'ceph -w' open during upgrades to monitor cluster messages
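The checks above could be scripted roughly as follows. This is a minimal sketch, not a tested procedure: the helper names (`ceph`, `health_status`, `prepare_cluster`) are made up for this example, the `health.status` field is assumed to be present in the JSON output of `ceph status` as it is in recent releases, and the backfill value is left to the operator. The MDS state is only printed for manual review, since its output format differs between releases.

```python
import json
import subprocess


def ceph(*args):
    """Run the ceph CLI and return its standard output as text."""
    result = subprocess.run(
        ["ceph", *args], check=True, capture_output=True, text=True
    )
    return result.stdout


def health_status(status_json):
    """Return the health string from `ceph status --format json` output."""
    return json.loads(status_json)["health"]["status"]


def prepare_cluster(max_backfills=None):
    """Abort unless the cluster is HEALTH_OK, then set noout (hypothetical helper)."""
    status = health_status(ceph("status", "--format", "json"))
    if status != "HEALTH_OK":
        raise RuntimeError(f"cluster is {status}, not HEALTH_OK; aborting")
    ceph("osd", "set", "noout")
    if max_backfills is not None:
        # Optional throttle; the value is an assumption, pick one that suits the cluster.
        ceph("tell", "osd.*", "injectargs", f"--osd-max-backfills {max_backfills}")
    # Printed for manual review: all MDS daemons, including standbys, should show up.
    print(ceph("mds", "stat"))
```

Nothing runs until `prepare_cluster()` is called, for example as `prepare_cluster(max_backfills=1)` on an admin host. The noout flag stays set until `ceph osd unset noout` is run after the upgrade.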
2. Host System Updates
- upgrade all admin*.ceph.bfh.ch hosts first
- then upgrade all storage*.ceph.bfh.ch hosts (as long as the ceph-osd daemons are not yet running in containers, install all system updates except the ceph packages)
- finally upgrade all gateway*.ceph.bfh.ch hosts
For every host upgrade:
- install all updates
- shut down all containers
- reboot host system
- remove obsolete kernels, if any
Check 'ceph -s' and confirm that everything is well before proceeding to the next host.
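When the host list is long, the admin, storage, gateway order can be derived from the host names instead of being tracked by hand. The sketch below assumes that every host name starts with one of these three prefixes; the function names and the example host names are hypothetical and only follow the patterns used in this section. Hosts within a group are sorted as plain strings, so storage10 would sort before storage2.

```python
import re

# Group order taken from the host list above; anything else is rejected.
GROUP_ORDER = ("admin", "storage", "gateway")


def host_group(hostname):
    """Return the leading alphabetic prefix of a host name, e.g. 'storage'."""
    match = re.match(r"[a-z]+", hostname)
    if match is None or match.group(0) not in GROUP_ORDER:
        raise ValueError(f"unexpected host name: {hostname}")
    return match.group(0)


def host_upgrade_order(hostnames):
    """Sort hosts by group (admin, storage, gateway), then by name."""
    return sorted(
        hostnames, key=lambda h: (GROUP_ORDER.index(host_group(h)), h)
    )


# Example host names are made up to match the admin*/storage*/gateway* patterns.
hosts = [
    "gateway1.ceph.bfh.ch",
    "storage2.ceph.bfh.ch",
    "admin1.ceph.bfh.ch",
    "storage1.ceph.bfh.ch",
]
for host in host_upgrade_order(hosts):
    print(host)
```

Rejecting unknown prefixes with an error is a deliberate choice here: a host that matches none of the three groups should be looked at by a person rather than silently placed somewhere in the sequence.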
3. Container System Updates
For every Ceph container:
- upgrade all packages (ceph package updates do not restart ceph services)
- restart all ceph services
Check 'ceph -s' and confirm that everything is well before proceeding to the next container.
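The "check before proceeding" step can be turned into a small gate that polls the cluster status. One detail matters here: while noout is set, the cluster reports HEALTH_WARN rather than HEALTH_OK, so the sketch below does not wait for HEALTH_OK but for the absence of any health check other than the flag warning. It assumes the `health.checks` map of `ceph status --format json` and the check code `OSDMAP_FLAGS` as used by recent releases; the function names and the polling defaults are hypothetical.

```python
import json
import time

# Health checks that are expected during the upgrade (noout is set on purpose).
ALLOWED_CHECKS = {"OSDMAP_FLAGS"}


def unexpected_checks(status_json, allowed=ALLOWED_CHECKS):
    """Return the sorted health check codes that are not in the allowed set."""
    checks = json.loads(status_json).get("health", {}).get("checks", {})
    return sorted(set(checks) - set(allowed))


def wait_until_settled(fetch_status, interval=10, attempts=60, sleep=time.sleep):
    """Poll until only allowed health checks remain.

    fetch_status is a callable returning the JSON text of `ceph status`.
    Returns True once the cluster has settled, False if attempts run out.
    """
    for _ in range(attempts):
        if not unexpected_checks(fetch_status()):
            return True
        sleep(interval)
    return False
```

In practice `fetch_status` would wrap a call to `ceph status --format json`; passing it in as a callable keeps the gate easy to try out with canned JSON before pointing it at a live cluster. If the function returns False, stop and investigate instead of moving on to the next container.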