Red Hat Bugzilla – Bug 1488149
[ceph-container] - dmcrypt - osds failed to start after node reboot
Last modified: 2018-05-30 11:28:13 EDT
Created attachment 1321841 [details] File contains log snippet of an OSD service Description of problem: After upgrading cluster (OSDs - dmcrypt & dedicated journal) from 2.3 to 2.4, OSDs are failing to get started on rebooting the OSD node, saying mount 'failed to open journal'. Not sure whether this issue is dependent on upgrade or not, as upgrade worked fine. OSD services started and was running fine after following upgrade procedure (service start). Version-Release number of selected component (if applicable): ceph version 10.2.7-32.el7cp brew-pulp-docker01.web.<-->:8888/rhceph:2.4 How reproducible: Always (2/2) Steps to Reproduce: 1. Upgrade a containerized ceph cluster with encrypted OSDs and dedicated journals from 2.3 to 2.4 2. Reboot an OSD Node Actual results: OSD services are not getting started after node reboot Log snippet - filestore(/var/lib/ceph/osd/bb-1) mount failed to open journal /var/lib/ceph/osd/bb-1/journal: (2) No such file or directory Expected results: OSD services must get started after node reboot
There is fix for this in 3.0. Not sure how I can do a backport for this. @Ken, which branch should I use to backport a fix? Thanks!
lgtm.
LGTM, thanks
Latest build of ceph2 container image is broken rhceph-rhel7-docker-2.4-3 see bug 1503598 The revert was not sufficient and complete to get us back to something that works. $ docker run -it brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/rhceph:2.4-3 version common_functions.sh: line 3: disk_list.sh: No such file or directory
please reopen if this is still an issue