Bug 1404098

Summary: makedumpfile with nr_cpus=16 and -d 15 freezes sometime
Product: Red Hat Enterprise Linux 7 Reporter: Pratyush Anand <panand>
Component: kexec-toolsAssignee: Pratyush Anand <panand>
Status: CLOSED WORKSFORME QA Contact: Qiao Zhao <qzhao>
Severity: medium Docs Contact:
Priority: unspecified    
Version: 7.3CC: bugproxy, cye, hannsj_uhl, qzhao, ruyang, xiawu
Target Milestone: rc   
Target Release: 7.4   
Hardware: x86_64   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: If docs needed, set a value
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2017-07-31 02:45:39 UTC Type: Bug
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Bug Depends On:    
Bug Blocks: 1299988, 1473055    
Attachments:
Description Flags
Console log of the machine where makedumpfile failed with -d 15 none

Description Pratyush Anand 2016-12-13 03:47:21 UTC
Created attachment 1231021 [details]
Console log of the machine where makedumpfile failed with -d 15

Description of problem:

(1) Machine used:  ibm-x3950x6-01.rhts.eng.pek2.redhat.com (4T memory and 192 cpus)
(2) RHEL version:  RHEL-7.4-20161123.n.0 Server x86_64
(3) Passed nr_cpus = 16 in /etc/sysconfig/kdump: KDUMP_COMMANDLINE_APPEND
and used makedumpfile as `makedumpfile -l --message-level 1 -d 15`  which did not complete. It freezed after 16% of dump copy. Machine could be recovered only after power cycle.
However, with nr_cpus = 1, same makedumpfile did succeed.

Teste beaker job is: https://beaker.engineering.redhat.com/jobs/1612054

Console log is: http://beaker-archive.app.eng.bos.redhat.com/beaker-logs/2016/11/16120/1612054/3291557/console.log

Version-Release number of selected component (if applicable):
RHEL-7.4-20161123.n.0 Server x86_64

How reproducible:
Random

Actual results:
makedumpfile freeze

Expected results:
makedumpfile should pass with -d 15 when nr_cpus = 16.