Bug 1517303

Summary: mkfs tries to format the partition before it's ready
Product: Red Hat OpenStack Reporter: Gonéri Le Bouder <goneri>
Component: ceph-ansibleAssignee: Sébastien Han <shan>
Status: CLOSED DUPLICATE QA Contact: Yogev Rabl <yrabl>
Severity: unspecified Docs Contact:
Priority: unspecified    
Version: 12.0 (Pike)CC: gfidente, mburns
Target Milestone: ---   
Target Release: ---   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-11-28 13:00:55 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Attachments:
Description Flags
the error itself
none
ceph-install-workflow.log none

Description Gonéri Le Bouder 2017-11-24 15:06:50 UTC
Created attachment 1358683 [details]
the error itself

Description of problem:

I did 3 deployments, I faced the problem two times.

create_partition: Creating data partition num 1 size 0 on /dev/sdl
command_check_call: Running command: /usr/sbin/sgdisk --largest-new=1 --change-name=1:ceph data --partition-guid=1:04a127b4-7242-4219-8838-9827970a299f --typecode=1:89c57f98-2fe5-4dc0-89c1-f3ad0ceff2be --mbrtogpt -- /dev/sdl
update_partition: Calling partprobe on created device /dev/sdl
command_check_call: Running command: /usr/bin/udevadm settle --timeout=600
command: Running command: /usr/bin/flock -s /dev/sdl /usr/sbin/partprobe /dev/sdl
command_check_call: Running command: /usr/bin/udevadm settle --timeout=600
get_dm_uuid: get_dm_uuid /dev/sdl uuid path is /sys/dev/block/8:176/dm/uuid
get_dm_uuid: get_dm_uuid /dev/sdl uuid path is /sys/dev/block/8:176/dm/uuid
get_dm_uuid: get_dm_uuid /dev/sdl1 uuid path is /sys/dev/block/8:177/dm/uuid
populate_data_path_device: Creating xfs fs on /dev/sdl1
command_check_call: Running command: /usr/sbin/mkfs -t xfs -f -i size=2048 -f -- /dev/sdl1

mkfs is called before /dev/sdl1 is available.

Comment 1 Gonéri Le Bouder 2017-11-24 15:07:11 UTC
Created attachment 1358684 [details]
ceph-install-workflow.log

Comment 2 Gonéri Le Bouder 2017-11-24 17:22:36 UTC
ceph-ansible-3.0.14-1.el7cp.noarch
OSP puddle 2017-11-21.1

Comment 3 Sébastien Han 2017-11-27 16:56:47 UTC
The first partition gets created by this command: "Running command: /usr/sbin/sgdisk --largest-new=1 --change-name=1:ceph data --partition-guid=1:04a127b4-7242-4219-8838-9827970a299f --typecode=1:89c57f98-2fe5-4dc0-89c1-f3ad0ceff2be --mbrtogpt -- /dev/sdl"

If the partition is not available this means you're hitting "the" known race condition. Looks like a dub to me.

Comment 4 Gonéri Le Bouder 2017-11-27 17:06:22 UTC
Hi Sebastien,

Could you give the reference of this known race condition.

Comment 5 Sébastien Han 2017-11-28 13:00:55 UTC

*** This bug has been marked as a duplicate of bug 1480658 ***