Note: This bug is displayed in read-only format because the product is no longer active in Red Hat Bugzilla.

RHEL Engineering is moving the tracking of its product development work on RHEL 6 through RHEL 9 to Red Hat Jira (issues.redhat.com). If you're a Red Hat customer, please continue to file support cases via the Red Hat customer portal. If you're not, please head to the "RHEL project" in Red Hat Jira and file new tickets here. Individual Bugzilla bugs in the statuses "NEW", "ASSIGNED", and "POST" are being migrated throughout September 2023. Bugs of Red Hat partners with an assigned Engineering Partner Manager (EPM) are migrated in late September as per pre-agreed dates. Bugs against components "kernel", "kernel-rt", and "kpatch" are only migrated if still in "NEW" or "ASSIGNED". If you cannot log in to RH Jira, please consult article #7032570. That failing, please send an e-mail to the RH Jira admins at rh-issues@redhat.com to troubleshoot your issue as a user management inquiry. The email creates a ServiceNow ticket with Red Hat. Individual Bugzilla bugs that are migrated will be moved to status "CLOSED", resolution "MIGRATED", and set with "MigratedToJIRA" in "Keywords". The link to the successor Jira issue will be found under "Links", have a little "two-footprint" icon next to it, and direct you to the "RHEL project" in Red Hat Jira (issue links are of type "https://issues.redhat.com/browse/RHEL-XXXX", where "X" is a digit). This same link will be available in a blue banner at the top of the page informing you that that bug has been migrated.

Bug 1724345

Summary:	mkfs.xfs hangs issuing discards
Product:	Red Hat Enterprise Linux 7	Reporter:	John Pittman <jpittman>
Component:	kernel	Assignee:	Ming Lei <minlei>
kernel sub component:	NVMe	QA Contact:	Zhang Yi <yizhan>
Status:	CLOSED ERRATA	Docs Contact:
Severity:	urgent
Priority:	urgent	CC:	alex.wang, amityony, asanders, bubrown, dmilburn, emilne, gtiwari, linville, minlei, pdwyer, rgirase, yizhan, ysudarev
Version:	7.6	Keywords:	OtherQA
Target Milestone:	rc
Target Release:	---
Hardware:	All
OS:	Linux
Whiteboard:
Fixed In Version:	kernel-3.10.0-1145.el7	Doc Type:	If docs needed, set a value
Doc Text:		Story Points:	---
Clone Of:		Environment:
Last Closed:	2020-09-29 21:01:42 UTC	Type:	Bug
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:

Description John Pittman 2019-06-26 20:21:33 UTC

Description of problem:

mkfs.xfs hangs issuing discards to a lvm/nvme setup

Version-Release number of selected component (if applicable):

kernel-3.10.0-957.21.3.el7.x86_64
lvm2-2.02.180-10.el7_6.8.x86_64

How reproducible:

Every time by customer.  Issue can be worked around with 'mkfs.xfs -K'

Actual results:

[  119.516243] nvme nvme0: I/O 0 QID 2 timeout, aborting
[  119.516275] nvme nvme0: I/O 1 QID 2 timeout, aborting
[  119.516281] nvme nvme0: I/O 2 QID 2 timeout, aborting
[  119.516287] nvme nvme0: I/O 3 QID 2 timeout, aborting
...

[  119.675526] nvme nvme1: Abort status: 0x0
[  119.680136] nvme nvme1: Abort status: 0x0
[  120.298280] nvme nvme1: Abort status: 0x0
[  120.298287] nvme nvme1: Abort status: 0x0
...
[  149.451712] nvme nvme0: I/O 0 QID 2 timeout, reset controller
[  149.451833] nvme nvme1: I/O 417 QID 2 timeout, reset controller
[  152.195795] nvme 0000:d8:00.0: irq 39 for MSI/MSI-X
[  154.595054] nvme 0000:d8:00.0: irq 39 for MSI/MSI-X
[  154.595092] nvme 0000:d8:00.0: irq 92 for MSI/MSI-X
...
[  165.870974] nvme nvme0: Abort status: 0x7
[  165.870977] nvme nvme0: Abort status: 0x7
[  165.870978] nvme nvme0: Abort status: 0x7
[  165.870978] nvme nvme0: Abort status: 0x7
...
[  185.360665] nvme nvme1: I/O 417 QID 2 timeout, disable controller
[  198.329955] nvme nvme0: I/O 0 QID 2 timeout, disable controller
[  242.815720] INFO: task mkfs.xfs:39904 blocked for more than 120 seconds.
[  242.815772] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[  242.815822] mkfs.xfs        D ffff9317e1d730c0     0 39904  38384 0x00000000
[  242.815828] Call Trace:
[  242.815841]  [<ffffffff83568ed9>] schedule+0x29/0x70
[  242.815846]  [<ffffffff835669e1>] schedule_timeout+0x221/0x2d0
[  242.815853]  [<ffffffff83156cb1>] ? blk_mq_sched_insert_requests+0xb1/0xc0
[  242.815860]  [<ffffffff83151d1c>] ? blk_mq_flush_plug_list+0x19c/0x200
[  242.815867]  [<ffffffff82f01292>] ? ktime_get_ts64+0x52/0xf0
[  242.815871]  [<ffffffff835685ad>] io_schedule_timeout+0xad/0x130
[  242.815876]  [<ffffffff8356950d>] wait_for_completion_io+0xfd/0x140
[  242.815882]  [<ffffffff82ed6b60>] ? wake_up_state+0x20/0x20
[  242.815886]  [<ffffffff8314de3c>] blkdev_issue_discard+0x2ac/0x2d0
[  242.815890]  [<ffffffff83157335>] blk_ioctl_discard+0xe5/0x130
[  242.815893]  [<ffffffff83157e82>] blkdev_ioctl+0x632/0xa20
[  242.815899]  [<ffffffff83081c51>] ? __blkdev_put+0x141/0x1a0
[  242.815903]  [<ffffffff83080fb1>] block_ioctl+0x41/0x50
[  242.815909]  [<ffffffff830569d0>] do_vfs_ioctl+0x3a0/0x5a0
[  242.815916]  [<ffffffff83064924>] ? mntput+0x24/0x40
[  242.815921]  [<ffffffff83043c66>] ? __fput+0x186/0x260
[  242.815925]  [<ffffffff83056c71>] SyS_ioctl+0xa1/0xc0
[  242.815933]  [<ffffffff83575ddb>] system_call_fastpath+0x22/0x27

crash> epython /home/rdu/jpittman/nvme.py -l
Node             SN                   Model                          Namespace Usage                      Format           FW Rev  
---------------- -------------------- ------------------------------ --------- -------------------------- ---------------- --------
/dev/nvme0n1     SDM00000ED1E         UCSC-F-H16003                  1         <unsupported>              512   B + 0 B    KNCCD101
/dev/nvme1n1     SDM0000230CB         UCSC-F-H16003                  1         <unsupported>              512   B + 0 B    KNCCD101

crash> dev -d | grep nvme
  259 ffff9317e5ffe000   nvme0n1    ffff9317dc418000     294   294     0 N/A(MQ)
  259 ffff9317e41e6800   nvme1n1    ffff9317da3a8998     286   286     0 N/A(MQ)

crash> request_queue.limits ffff9317dc418000 | grep discard
    max_discard_sectors = 4294967295, 
    discard_granularity = 512, 
    discard_alignment = 512, 
    discard_misaligned = 0 '\000', 
    discard_zeroes_data = 0 '\000', 

crash> request_queue.limits ffff9317da3a8998 | grep discard
    max_discard_sectors = 4294967295, 
    discard_granularity = 512, 
    discard_alignment = 512, 
    discard_misaligned = 0 '\000', 
    discard_zeroes_data = 0 '\000',

Comment 4 Ming Lei 2019-06-27 03:45:13 UTC

The following script may show what the last bad request is, and could you please run it before starting the mkfs test,
then post the log after the timeout is triggered.


/usr/share/bcc/tools/trace -t -C \
    -I 'linux/blkdev.h' -I 'linux/blk-mq.h'  \
    'nvme_queue_rq(void *p, struct blk_mq_queue_data *bd) "%d + %d %x", bd->rq->__sector, bd->rq->__data_len >> 9, bd->rq->cmd_flags' \
    'blk_mq_complete_request(struct request *rq, int error) "%d %d + %d %x", error, rq->__sector, rq->__data_len >> 9, rq->cmd_flags'




BTW, 'bcc' package is required for the above script, also please run it in another standalone session.

Comment 5 Ming Lei 2019-06-27 03:59:23 UTC

(In reply to Ming Lei from comment #4)
> The following script may show what the last bad request is, and could you
> please run it before starting the mkfs test,
> then post the log after the timeout is triggered.
> 
> 
> /usr/share/bcc/tools/trace -t -C \
>     -I 'linux/blkdev.h' -I 'linux/blk-mq.h'  \
>     'nvme_queue_rq(void *p, struct blk_mq_queue_data *bd) "%d + %d %x",
> bd->rq->__sector, bd->rq->__data_len >> 9, bd->rq->cmd_flags' \
>     'blk_mq_complete_request(struct request *rq, int error) "%d %d + %d %x",
> error, rq->__sector, rq->__data_len >> 9, rq->cmd_flags'
> 
> 
> 
> 
> BTW, 'bcc' package is required for the above script, also please run it in
> another standalone session.

Maybe better to dump the rq->tag, so please use the following one:

/usr/share/bcc/tools/trace -t -C \
	-I 'linux/blkdev.h' -I 'linux/blk-mq.h'  \
	'scsi_queue_rq(void *p, struct blk_mq_queue_data *bd) "%d: %d + %d %x", bd->rq->tag, bd->rq->__sector, bd->rq->__data_len >> 9, bd->rq->cmd_flags' \
	'blk_mq_complete_request(struct request *rq, int error) "%d %d: %d + %d %x", rq->tag, error, rq->__sector, rq->__data_len >> 9, rq->cmd_flags'

Comment 6 John Pittman 2019-06-27 14:10:32 UTC

Created attachment 1585181 [details]
bcc output 1

Hi Ming.  Thanks a lot for looking.  I tried the command on one of our 7.6 test systems and the compile failed.  Could you please take a look?  Thanks!

Comment 7 Ming Lei 2019-06-28 00:06:21 UTC

Comment on attachment 1585181 [details]
bcc output 1


I didn't see such issue in my rhel7 VM, maybe I install kernel-devel or kernel
header package.

However, you can workaround the issue by changing the following line in
/lib/modules/3.10.0-957.21.3.el7.x86_64/build/include/linux/compiler-gcc.h


#define asm_volatile_goto(x...) do { asm goto(x); asm (""); } while (0)

into 

#define asm_volatile_goto(x...)


Thanks,

Comment 15 John Pittman 2019-07-09 18:38:43 UTC

Hi Ming.  The customer finally got back to us.  Unfortunately they were unable to change the discard_max_bytes value.

# echo 2147483647 > /sys/block/nvme0n1/queue/discard_max_bytes
-bash: echo: write error: Invalid argument
# cat /sys/block/nvme0n1/queue/discard_max_bytes
2199023255040

Additionally, I made a small error on providing your test kernel to the customer.  Of course your original packages are no longer available, and I forgot to give the customer kernel-devel, so the script failed with the below:

> # /usr/share/bcc/tools/trace -t -C \
> >     -I 'linux/blkdev.h' -I 'linux/blk-mq.h'  \
> >     'nvme_queue_rq(struct blk_mq_hw_ctx *hctx, struct blk_mq_queue_data *bd) "%d-%d: %u + %u %x", hctx->queue_num, bd->rq->tag, bd->rq->__sector, bd->rq->__data_len >> 9, bd->rq->cmd_flags' \
> >     'nvme_pci_complete_rq(struct request *rq) "%d: %u + %u %x", rq->tag, rq->__sector, rq->__data_len >> 9, rq->cmd_flags'
> chdir(/lib/modules/3.10.0-1059.el7.1724345.x86_64/build): No such file or directory
> Failed to compile BPF text

How can we fix the permission error on discard_max_bytes?  Can you provide another set of test kernels?

Comment 17 John Pittman 2019-07-22 14:30:20 UTC

Hi Ming, unfortunately the customer has moved to a different hardware profile and can no longer provide testing.  The details of the hardware is provided below.

NVMe(2x 1.6TB HHHL AIC HGST SN260 NVMe Extreme Perf. High Endurance pcie cards) using striped LVM in RHEL 7.6.

5e:00.0 Non-Volatile memory controller: HGST, Inc. Ultrastar SN200 Series NVMe SSD (rev 02)
d8:00.0 Non-Volatile memory controller: HGST, Inc. Ultrastar SN200 Series NVMe SSD (rev 02)

/dev/nvme0n1 SDM00000ED1E UCSC-F-H16003 1 1.60 TB / 1.60 TB 512 B + 0 B KNCCD101
/dev/nvme1n1 SDM0000230CB UCSC-F-H16003 1 1.60 TB / 1.60 TB 512 B + 0 B KNCCD101

Comment 18 John Pittman 2019-07-22 16:08:00 UTC

Ming, is there anyway we can fix this issue without that particular hardware?

Comment 20 Ming Lei 2019-07-23 00:56:53 UTC

(In reply to John Pittman from comment #18)
> Ming, is there anyway we can fix this issue without that particular hardware?

From IO trace, IO timeout is triggered on big or lots of discard request, which should be one issue wrt. the type of drive,
or even the specific disk.  We do run this kinds of tests on NVMe, and not see such report yet.

So I believe we need to run some test on the drive for figuring out one root cause.

Yeah, it is always helpful to loop our partner to take a look at it.

Comment 23 John Pittman 2019-08-23 18:58:00 UTC

Hi Ming, we are still trying to get our hands on the proper hardware.  Rupesh happened to find this thread:  https://patchwork.kernel.org/patch/10028457/.  Do you think this is what is happening here?

Comment 24 Ming Lei 2019-08-23 22:30:27 UTC

(In reply to John Pittman from comment #23)
> Hi Ming, we are still trying to get our hands on the proper hardware. 
> Rupesh happened to find this thread: 
> https://patchwork.kernel.org/patch/10028457/.  Do you think this is what is
> happening here?

From the IO trace shared, it is basically same with the issue Keith is trying to address, see
the following comments from Keith:

"
The block limit only specifies the maximum size of a *single* discard
request as seen by the end device. This single request is not a problem
for timeouts, as far as I know.

The timeouts occur when queueing many of them at the same time: the last
one in the queue will have very high latency compared to ones ahead of
it in the queue if the device processes discards serially (many do).

There's no such limit to say the maximum outstanding number discard
requests that can be dispatched at the same time; the max number of
dispatched commands are shard with read and write.
"

Comment 25 Ming Lei 2019-08-23 23:02:18 UTC

Hi John,

Could you ask our customer to test the following workaround?

rmmod nvme_core
modprobe nvme_core nvme_io_timeout=90


Thanks,

Comment 30 John Pittman 2019-08-26 12:48:20 UTC

Hi Ming, below are the stats for the disk as pulled from the vmcore.  It's 1 TiB large, and has a very high max hw sectors value.  The queue depth is 1024.

crash> epython /cores/crashext/epython/storage/nvme.py -l

Node             SN                   Model                                    Namespace Capacity(gendisk)  Format           FW Rev  
---------------- -------------------- ---------------------------------------- --------- ------------------ ---------------- --------
/dev/nvme0n1     SDM00000ED1E         UCSC-F-H16003                            1         1   TiB            512   B + 0 B    KNCCD101
/dev/nvme1n1     SDM0000230CB         UCSC-F-H16003                            1         1   TiB            512   B + 0 B    KNCCD101

=============================================
=============================================

crash> epython /cores/crashext/epython/storage/nvme.py -c

Name        Ctrl Addr         Namespaces(list_head)  AdminQ            ConnectQ          Subsystem         Ctrl Device     
----------  ----------------  ---------------------  ----------------  ----------------  ----------------  ----------------
nvme0       ffff9317e6b5c1b0  ffff9317e6b5c1f8       ffff9317e6bb8998  0                 <unavailable>     ffff9317e6b5c228

Quirks:		NVME_QUIRK_DELAY_BEFORE_CHK_RDY
NumQueues:	17
CtrlState:	NVME_CTRL_CONNECTING
MaxHWSectors:	4294967295  <====
MaxSegments:	<unavailable>
PageSize:	4096

Name        Ctrl Addr         Namespaces(list_head)  AdminQ            ConnectQ          Subsystem         Ctrl Device     
----------  ----------------  ---------------------  ----------------  ----------------  ----------------  ----------------
nvme1       ffff9317e6b581b0  ffff9317e6b581f8       ffff9317da3a8000  0                 <unavailable>     ffff9317e6b58228

Quirks:		NVME_QUIRK_DELAY_BEFORE_CHK_RDY
NumQueues:	17
CtrlState:	NVME_CTRL_CONNECTING
MaxHWSectors:	4294967295   <=====
MaxSegments:	<unavailable>
PageSize:	4096

=============================================
=============================================

crash> epython /cores/crashext/epython/storage/nvme.py -q nvme0 -i 1

pci:Ctrl[qid]  Queue Addr        DMA Dev           NVMe Dev          SQ Cmds           Completion        Tags            
-------------  ----------------  ----------------  ----------------  ----------------  ----------------  ----------------
nvme0[1]       ffff9317e0de0098  ffff931aef988098  ffff9317e6b5c000  ffff9317e72a0000  ffff9317e5f88000  ffff925bd9db7800

QDepth:		1024          	CQVector:	-1            
SQTail:		0             	LastSQTail:	<unavailable> 
CQHead:		0             	LastCQHead:	<unavailable> 
QiD:		1             	CQPhase:	1             
Flags:		<unavailable> 	QCount:		17

Comment 31 John Pittman 2019-08-27 14:20:09 UTC

Hi Ming.  It looks we have a second customer tripping over discards building in the queues.  They are hitting timeouts when issuing "many concurrent blkdiscard operations along with some dd write commands on specific drives"

[29335.840122] nvme 0000:db:00.0: irq 238 for MSI/MSI-X
[29335.840145] nvme 0000:db:00.0: irq 239 for MSI/MSI-X
[29335.840167] nvme 0000:db:00.0: irq 240 for MSI/MSI-X
....
[235416.159141]  nvme1n1: p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 p14 p15 p16
[235441.266349]  nvme2n1: p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 p14 p15 p16
[235448.816967]  nvme3n1: p1 p2 p3 p4 p5 p6 p7 p8 p9 p10 p11 p12 p13 p14 p15 p16
....
[259511.358607] nvme nvme2: I/O 83 QID 25 timeout, aborting
[259511.429656] nvme nvme2: I/O 84 QID 25 timeout, aborting
[259511.500677] nvme nvme2: I/O 122 QID 25 timeout, aborting
[259511.571018] nvme nvme2: I/O 978 QID 25 timeout, aborting
[259519.767856] nvme nvme2: Abort status: 0x0
[259519.817955] nvme nvme2: Abort status: 0x0
[259519.867994] nvme nvme2: Abort status: 0x0
[259519.918008] nvme nvme2: Abort status: 0x0
[260498.023600] nvme nvme2: I/O 592 QID 36 timeout, aborting
[260498.044565] nvme nvme1: I/O 30 QID 29 timeout, aborting
[260498.044576] nvme nvme1: I/O 33 QID 29 timeout, aborting
[260498.044579] nvme nvme1: I/O 34 QID 29 timeout, aborting
[260498.044582] nvme nvme1: I/O 35 QID 29 timeout, aborting
[260498.346809] nvme nvme2: I/O 594 QID 36 timeout, aborting
[260498.411955] nvme nvme2: I/O 597 QID 36 timeout, aborting
[260498.477086] nvme nvme2: I/O 599 QID 36 timeout, aborting
[262142.182133] nvme nvme1: I/O 69 QID 18 timeout, aborting
[262144.248473] nvme nvme1: Abort status: 0x0
[262144.297620] nvme nvme1: Abort status: 0x0
[262144.346745] nvme nvme1: Abort status: 0x0
[262144.395862] nvme nvme1: Abort status: 0x0
....
[262173.108846] nvme nvme2: I/O 334 QID 30 timeout, aborting
[262173.173862] nvme nvme2: I/O 335 QID 30 timeout, aborting
[262173.238739] nvme nvme2: I/O 336 QID 30 timeout, aborting
[262173.303647] nvme nvme2: I/O 337 QID 30 timeout, aborting
[262173.368660] nvme nvme1: I/O 69 QID 18 timeout, reset controller

I have suggested that the increase the io_timeout to 90 and I have also requested that they attempt the same test as the other customer, to see if they can reproduce just by doing a mkfs.  Unfortunately, our original customer had to move the systems into prod so they can no longer assist with testing, but I think the new customer may be able to.

Comment 33 John Pittman 2019-08-27 14:50:56 UTC

New customer's (comment 31) discard sysfs values from the sosreport:

3.10.0-862.el7

    ATTR{discard_zeroes_data}=="1"
    ATTR{discard_granularity}=="512"
    ATTR{discard_max_bytes}=="2199023255040"
    ATTRS{discard_alignment}=="512"

Comment 39 John Pittman 2019-08-29 19:59:31 UTC

The customer was able to reproduce the issue with the 90 second timeout.  They are trying 300 next.  This issue happened when they issued blkdiscard as an ioctl on their filesystem which is assembled on four nvme partitions with raid level 1 + 0.

Aug 29 11:19:16 user.warn kernel: [10586.454123] nvme nvme0: I/O 218 QID 25 timeout, aborting
Aug 29 11:19:16 user.warn kernel: [10586.517541] nvme nvme0: I/O 219 QID 25 timeout, aborting
Aug 29 11:19:16 user.warn kernel: [10586.580952] nvme nvme0: I/O 220 QID 25 timeout, aborting
Aug 29 11:19:16 user.warn kernel: [10586.644357] nvme nvme0: I/O 221 QID 25 timeout, aborting
Aug 29 11:20:17 user.warn kernel: [10647.307580] nvme nvme0: I/O 1 QID 0 timeout, reset controller
Aug 29 11:20:47 user.warn kernel: [10677.249119] nvme nvme0: I/O 218 QID 25 timeout, reset controller
Aug 29 11:22:17 user.err kernel: [10767.624454] nvme nvme0: Device not ready; aborting reset
Aug 29 11:22:17 user.warn kernel: [10767.688198] nvme nvme0: Abort status: 0x7
Aug 29 11:22:17 user.warn kernel: [10767.736038] nvme nvme0: Abort status: 0x7
Aug 29 11:22:17 user.warn kernel: [10767.783881] nvme nvme0: Abort status: 0x7
Aug 29 11:22:17 user.warn kernel: [10767.831736] nvme nvme0: Abort status: 0x7
Aug 29 11:22:17 user.debug kernel: [10767.832138] nvme 0000:d8:00.0: irq 406 for MSI/MSI-X
Aug 29 11:23:18 user.err kernel: [10828.221837] nvme nvme0: Device not ready; aborting reset
Aug 29 11:23:18 user.warn kernel: [10828.285244] nvme nvme0: Removing after probe failure status: -19
Aug 29 11:23:18 user.info kernel: [10828.356992] nvme0n1: detected capacity change from 3200631791616 to 0
Aug 29 11:23:20 user.err kernel: [10830.678618] blk_update_request: I/O error, dev nvme0n1, sector 1377623040
Aug 29 11:23:21 user.err kernel: [10830.764185] blk_update_request: I/O error, dev nvme0n1, sector 1386011136
Aug 29 11:23:21 user.err kernel: [10830.849618] blk_update_request: I/O error, dev nvme0n1, sector 1394399232
Aug 29 11:23:21 user.err kernel: [10830.935051] blk_update_request: I/O error, dev nvme0n1, sector 1402787328
Aug 29 11:23:21 user.err kernel: [10831.020473] blk_update_request: I/O error, dev nvme0n1, sector 1411175424
Aug 29 11:23:21 user.err kernel: [10831.105900] blk_update_request: I/O error, dev nvme0n1, sector 1419563520
Aug 29 11:23:21 user.err kernel: [10831.191311] blk_update_request: I/O error, dev nvme0n1, sector 1427951616
Aug 29 11:23:21 user.err kernel: [10831.276738] blk_update_request: I/O error, dev nvme0n1, sector 1436339712
Aug 29 11:23:21 user.err kernel: [10831.362159] blk_update_request: I/O error, dev nvme0n1, sector 1444727808
Aug 29 11:23:21 user.err kernel: [10831.447588] blk_update_request: I/O error, dev nvme0n1, sector 1453115904
Aug 29 11:23:22 user.err kernel: [10831.793476] md: super_written gets error=-5, uptodate=0
Aug 29 11:23:22 user.crit kernel: [10831.855850] md/raid10:md100: Disk failure on nvme0n1p1, disabling device.
Aug 29 11:23:22 user.crit kernel: [10831.855850] md/raid10:md100: Operation continuing on 3 devices.

I was talking to Dave Jeffery about this and he mentioned that upstream discard max is writable.  Is that something that's easily back-ported?  Would it potentially help here?

Comment 48 Ming Lei 2020-02-19 01:06:55 UTC

BTW, the issue may be addressed by backporting blk-wbt to rhel7.

However, that can be a new feature, and the change is a bit big.

Comment 58 Ming Lei 2020-05-07 04:06:45 UTC


Hi Guys,

Please test the following kernel build and see if the task hung issue can be fixed?

http://people.redhat.com/minlei/.1724345/

Comment 59 Andrew Sanders 2020-05-07 16:35:38 UTC

(In reply to Ming Lei from comment #58)
> 
> Hi Guys,
> 
> Please test the following kernel build and see if the task hung issue can be
> fixed?
> 
> http://people.redhat.com/minlei/.1724345/

Can you confirm the sha256sum's for me please?  I have the following:
e76e1f9f0af6dc3c621a8fe2117fa2a1228775b364d1a1f74933a6ad693677e8  kernel-3.10.0-1135.el7.1724345.x86_64.rpm
6f6ec452b79782f8c4354ea3f779ab96a4725a380c08c9b6464adc0fa24e3b58  kernel-devel-3.10.0-1135.el7.1724345.x86_64.rpm

Comment 60 Ming Lei 2020-05-08 07:31:31 UTC

(In reply to Andrew Sanders from comment #59)
> (In reply to Ming Lei from comment #58)
> > 
> > Hi Guys,
> > 
> > Please test the following kernel build and see if the task hung issue can be
> > fixed?
> > 
> > http://people.redhat.com/minlei/.1724345/
> 
> Can you confirm the sha256sum's for me please?  I have the following:
> e76e1f9f0af6dc3c621a8fe2117fa2a1228775b364d1a1f74933a6ad693677e8 
> kernel-3.10.0-1135.el7.1724345.x86_64.rpm
> 6f6ec452b79782f8c4354ea3f779ab96a4725a380c08c9b6464adc0fa24e3b58 
> kernel-devel-3.10.0-1135.el7.1724345.x86_64.rpm

Yeah, it is correct, or you may download from the brew build link directly.

[minlei@slayer temp]$ sha256sum *
e76e1f9f0af6dc3c621a8fe2117fa2a1228775b364d1a1f74933a6ad693677e8  kernel-3.10.0-1135.el7.1724345.x86_64.rpm
6f6ec452b79782f8c4354ea3f779ab96a4725a380c08c9b6464adc0fa24e3b58  kernel-devel-3.10.0-1135.el7.1724345.x86_64.rpm


Thanks,
Ming

Comment 61 John Pittman 2020-05-13 14:04:42 UTC

Thanks Ming.  As Andrew has the only case left open, moving the needsinfo to him for potential results.

Comment 62 Andrew Sanders 2020-05-21 15:12:50 UTC

By the time the customer got the test packages in hand they had already used the workaround to provision their servers and get them into production.  Since their servers affected by this are all in production they no longer have servers to test the fix against.  There's no reason to hold on me or my customer at this point for testing feedback.

Comment 64 Jan Stancek 2020-05-25 10:55:55 UTC

Patch(es) committed on kernel-3.10.0-1145.el7

Comment 79 errata-xmlrpc 2020-09-29 21:01:42 UTC

Since the problem described in this bug report should be
resolved in a recent advisory, it has been closed with a
resolution of ERRATA.

For information on the advisory (Important: kernel security, bug fix, and enhancement update), and where to find the updated
files, follow the link below.

If the solution does not work for you, open a new bug report.

https://access.redhat.com/errata/RHSA-2020:4060