Bug 1778046

Summary: 1 PG was undersized for more than 10 hours during upgrade
Product: [Red Hat Storage] Red Hat Ceph Storage Reporter: Uday kurundwade <ukurundw>
Component: RADOSAssignee: Neha Ojha <nojha>
Status: CLOSED INSUFFICIENT_DATA QA Contact: Manohar Murthy <mmurthy>
Severity: low Docs Contact:
Priority: low    
Version: 4.0CC: ceph-eng-bugs, dzafman, kchai, nojha, vumrao
Target Milestone: rc   
Target Release: 5.*   
Hardware: Unspecified   
OS: Unspecified   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2020-07-09 09:12:50 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:
Bug Depends On:    
Bug Blocks: 1750994    

Description Uday kurundwade 2019-11-29 06:40:56 UTC
Description of problem:
1 PG was undersized for more than 10 hours during upgrade from 3.3z1 to 4.0 on RHEL 7

Version-Release number of selected component (if applicable):
ceph-osd-14.2.4-16.el7cp.x86_64
ceph-base-14.2.4-16.el7cp.x86_64
ceph-common-14.2.4-16.el7cp.x86_64

How reproducible:


Steps to Reproduce:
1.Deploy ceph 3.3z1 cluster and fill it up to 50%
2.Upgrade cluster to ceph 4.0 while IOs are in progress

Actual results:
PG_DEGRADED Degraded data redundancy: 1 pg undersized
    pg 11.d0 is stuck undersized for 42078.884080, current state active+undersized+remapped+backfilling, last acting [130,201]

Expected results:
PG should not stuck undersized for so long

Additional info:

Comment 1 RHEL Program Management 2019-11-29 06:41:03 UTC
Please specify the severity of this bug. Severity is defined here:
https://bugzilla.redhat.com/page.cgi?id=fields.html#bug_severity.