.Encrypted containerized OSDs starts as expected after a reboot
Encrypted containerized OSD daemons failed to start after a
reboot. In addition, the following log message was added to the OSD log file:
----
filestore(/var/lib/ceph/osd/bb-1) mount failed to open journal /var/lib/ceph/osd/bb-1/journal: (2) No such file or directory
----
This bug has been fixed, and such OSDs start as expected in this situation.
Created attachment 1321841[details]
File contains log snippet of an OSD service
Description of problem:
After upgrading cluster (OSDs - dmcrypt & dedicated journal) from 2.3 to 2.4, OSDs are failing to get started on rebooting the OSD node, saying mount 'failed to open journal'.
Not sure whether this issue is dependent on upgrade or not, as upgrade worked fine. OSD services started and was running fine after following upgrade procedure (service start).
Version-Release number of selected component (if applicable):
ceph version 10.2.7-32.el7cp
brew-pulp-docker01.web.<-->:8888/rhceph:2.4
How reproducible:
Always (2/2)
Steps to Reproduce:
1. Upgrade a containerized ceph cluster with encrypted OSDs and dedicated journals from 2.3 to 2.4
2. Reboot an OSD Node
Actual results:
OSD services are not getting started after node reboot
Log snippet - filestore(/var/lib/ceph/osd/bb-1) mount failed to open journal /var/lib/ceph/osd/bb-1/journal: (2) No such file or directory
Expected results:
OSD services must get started after node reboot
Latest build of ceph2 container image is broken rhceph-rhel7-docker-2.4-3 see bug 1503598 The revert was not sufficient and complete to get us back to something that works.
$ docker run -it brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888/rhceph:2.4-3 version
common_functions.sh: line 3: disk_list.sh: No such file or directory
Comment 14Ken Dreyer (Red Hat)
2018-05-30 15:28:13 UTC
Created attachment 1321841 [details] File contains log snippet of an OSD service Description of problem: After upgrading cluster (OSDs - dmcrypt & dedicated journal) from 2.3 to 2.4, OSDs are failing to get started on rebooting the OSD node, saying mount 'failed to open journal'. Not sure whether this issue is dependent on upgrade or not, as upgrade worked fine. OSD services started and was running fine after following upgrade procedure (service start). Version-Release number of selected component (if applicable): ceph version 10.2.7-32.el7cp brew-pulp-docker01.web.<-->:8888/rhceph:2.4 How reproducible: Always (2/2) Steps to Reproduce: 1. Upgrade a containerized ceph cluster with encrypted OSDs and dedicated journals from 2.3 to 2.4 2. Reboot an OSD Node Actual results: OSD services are not getting started after node reboot Log snippet - filestore(/var/lib/ceph/osd/bb-1) mount failed to open journal /var/lib/ceph/osd/bb-1/journal: (2) No such file or directory Expected results: OSD services must get started after node reboot