Bug 585266

Summary:	Add coordination between Kdump and Cluster Fencing for long kernel panic dumps
Product:	Red Hat Enterprise Linux 5	Reporter:	Lon Hohberger <lhh>
Component:	kexec-tools	Assignee:	Ryan O'Hara <rohara>
Status:	CLOSED WONTFIX	QA Contact:	Red Hat Kernel QE team <kernel-qe>
Severity:	high	Docs Contact:
Priority:	high
Version:	5.8	CC:	amwang, bernhard.furtmueller, bstevens, casmith, clasohm, cluster-maint, hlawatschek, jwilleford, kskmori, lwang, michael.hagmann, nhorman, qcai, rick.beldin, samuel.kielek, tao
Target Milestone:	rc	Keywords:	FutureFeature
Target Release:	---
Hardware:	All
OS:	Linux
Whiteboard:
Fixed In Version:		Doc Type:	Enhancement
Doc Text:		Story Points:	---
Clone Of:	309991
Clones:	585332 (view as bug list)		Environment:
Last Closed:	2011-07-28 16:25:15 UTC	Type:	---
Regression:	---	Mount Type:	---
Documentation:	---	CRM:
Verified Versions:		Category:	---
oVirt Team:	---	RHEL 7.3 requirements from Atomic Host:
Cloudforms Team:	---	Target Upstream Version:
Embargoed:
Bug Depends On:	309991, 461948
Bug Blocks:	585332

Description Lon Hohberger 2010-04-23 15:08:13 UTC

+++ This bug was initially created as a clone of Bug #309991 +++

With large memory configurations, some machines take a long time to dump state
when a panic occurs.  The cluster software may well force a reboot as a fence
operation before the dump completes.  This cause the loss of important data to
diagnose the root problem.

Cluster fencing needs a mechanism to hold off fencing until the dump completes
or assurance from the failed node that it will not re-awaken and cause data
corruption of shared information.

--- Additional comment from nhorman on 2007-09-28 07:56:30 EDT ---

I've added, as part of bz 269761, the ability to run an arbitrary script from
the kdump initrd prior to capturing a vmcore.  My thought was that we could use
this ability to fork a process that spoke to the cluster suite peer daemons in
such a way as to stall the fencing process.  This obviously requires that the
fencing suite contain some utility to drive the communication appropriately,
which can then be added to kdump via /etc/kdump.conf.  Thoughts Jim?



This is a clone to address the bits in kexec-tools which need to be modified / admended in order to provide the required functionality.

Comment 2 RHEL Program Management 2011-07-28 16:25:15 UTC

Development Management has reviewed and declined this request.  You may appeal
this decision by reopening this request.