Bug 1845853
Summary: | Pods consuming azuredisk volume are stuck in ContainerCreating status on RHEL78 node | |||
---|---|---|---|---|
Product: | OpenShift Container Platform | Reporter: | Qin Ping <piqin> | |
Component: | Storage | Assignee: | Jan Safranek <jsafrane> | |
Status: | CLOSED NOTABUG | QA Contact: | Qin Ping <piqin> | |
Severity: | urgent | Docs Contact: | ||
Priority: | urgent | |||
Version: | 4.3.z | CC: | aos-bugs, jsafrane, scuppett | |
Target Milestone: | --- | Keywords: | TestBlocker | |
Target Release: | 4.6.0 | |||
Hardware: | Unspecified | |||
OS: | Unspecified | |||
Whiteboard: | ||||
Fixed In Version: | Doc Type: | If docs needed, set a value | ||
Doc Text: | Story Points: | --- | ||
Clone Of: | ||||
: | 1847085 (view as bug list) | Environment: | ||
Last Closed: | 2020-06-15 18:45:18 UTC | Type: | Bug | |
Regression: | --- | Mount Type: | --- | |
Documentation: | --- | CRM: | ||
Verified Versions: | Category: | --- | ||
oVirt Team: | --- | RHEL 7.3 requirements from Atomic Host: | ||
Cloudforms Team: | --- | Target Upstream Version: | ||
Embargoed: | ||||
Bug Depends On: | ||||
Bug Blocks: | 1847085, 1847087 |
Description
Qin Ping
2020-06-10 09:13:17 UTC
This is not a blocker for 4.5.0 GA. Setting target release to current development branch (4.6.0). For fixes (if any) requested/required on earlier versions, clones will be created for 4.5.z or earlier as appropriate. The root cause is that udev rules in RHEL7 do not create symlinks in /dev/azure, only in /dev/azure/iscsi<n>. RHEL8, with a Azure Disk attached as /dev/sdc: /dev/disk/azure/resource -> ../../sdb /dev/disk/azure/resource-part1 -> ../../sdb1 /dev/disk/azure/root -> ../../sda /dev/disk/azure/root-part1 -> ../../sda1 /dev/disk/azure/root-part2 -> ../../sda2 /dev/disk/azure/root-part3 -> ../../sda3 /dev/disk/azure/root-part4 -> ../../sda4 /dev/disk/azure/scsi1 /dev/disk/azure/scsi1/lun0 -> ../../../sdc RHEL7, with a Azure Disk attached as /dev/sdd: /dev/disk/azure/scsi0 /dev/disk/azure/scsi0/lun0 -> ../../../sda /dev/disk/azure/scsi0/lun0-part1 -> ../../../sda1 /dev/disk/azure/scsi0/lun0-part2 -> ../../../sda2 /dev/disk/azure/scsi0/lun0-part3 -> ../../../sda3 /dev/disk/azure/scsi0/lun0-part4 -> ../../../sda4 /dev/disk/azure/scsi0/lun1 -> ../../../sdb /dev/disk/azure/scsi0/lun1-part1 -> ../../../sdb1 /dev/disk/azure/scsi1 /dev/disk/azure/scsi1/lun1 -> ../../../sdd Notice /dev/disk/azure/resource and root in RHEL8. On RHEL7, I can see this in logs because /dev/disk/azure is empty there (contains only subdirs): Jun 10 15:54:26.846816 qe-43debug-kjqrr-rhel-2 hyperkube[1716]: I0610 15:54:26.846388 1716 azure_common_linux.go:105] /dev/disk/azure is not populated, now try to parse 0:0:0:0 directly Jun 10 15:54:26.847285 qe-43debug-kjqrr-rhel-2 hyperkube[1716]: I0610 15:54:26.846433 1716 azure_common_linux.go:105] /dev/disk/azure is not populated, now try to parse 0:0:0:1 directly And kubelet can't find attached disks because of that. We can either fix the udev rules or backport https://github.com/kubernetes/kubernetes/pull/86612. It has been backported to Kubernetes 1.18-1.15, not sure if it was backported to OCP too. I checked that 4.4 - 4.6 already have the aforementioned patch and we need to fix only 4.3. According Eric, only 4.4. BZ is really needed |