Red Hat Bugzilla – Bug 428119
slab error in kmem_cache_destroy(): cache `dlm_conn': Can't free all objects
Last modified: 2009-09-03 12:51:29 EDT
Description of problem:
Systems often display this message at shutdown:
"slab error in kmem_cache_destroy(): cache `dlm_conn': Can't free all objects"
Version-Release number of selected component (if applicable):
Probably since 5.2
Steps to Reproduce:
1. Start up cluster with clvmd. 3 nodes for preference
2. Shutdown some nodes
one or more nodes will display the above message
it gets worse...
If you stop all DLM lockspaces and attempt to restart them you get that
"kmem_cache_create: duplicate cache dlm_conn"
and the DLM won't start. this needs a node reboot.
The DLM should shut down cleanly.
This is because the ->othercon structures are not being freed. With all the
othercon patching that went on a while ago it seems like a bit of the patch has
Created attachment 291245 [details]
Patch to fix
This patch has been sent to Steve for inclusion upstream.
Proposing for inclusion in 5.2 release. Patch available and will affect
customers who are trying to stop and start the clusters.
Closing this bug as the patch was only missing from upstream (and is now