Bug 807439

Summary: NFS snapshot directory issues with 2.6.18-308.el5
Product: Red Hat Enterprise Linux 5 Reporter: derek
Component: kernelAssignee: nfs-maint
Status: CLOSED DUPLICATE QA Contact: Red Hat Kernel QE team <kernel-qe>
Severity: high Docs Contact:
Priority: unspecified    
Version: 5.8CC: dwysocha, jlayton, nfs-maint, pasteur, rwheeler, steved, t.h.amundsen, vincent
Target Milestone: rcKeywords: Reopened
Target Release: ---   
Hardware: All   
OS: Linux   
Whiteboard:
Fixed In Version: Doc Type: Bug Fix
Doc Text:
Story Points: ---
Clone Of: Environment:
Last Closed: 2012-05-09 13:52:30 UTC Type: ---
Regression: --- Mount Type: ---
Documentation: --- CRM:
Verified Versions: Category: ---
oVirt Team: --- RHEL 7.3 requirements from Atomic Host:
Cloudforms Team: --- Target Upstream Version:
Embargoed:

Description derek 2012-03-27 19:23:46 UTC
Description of problem:
When running the new 5.8 kernel 2.6.18-308.el5 we can no longer get into special copy on write snapshot directories over NFS.

Version-Release number of selected component (if applicable):
2.6.18-308.el5

How reproducible:
Every time

Steps to Reproduce:
1. Run 2.6.18-308.el5 on RHEL5
2. Mount a ZFS file system over NFS (v3) to /mnt
3. cd /mnt/.zfs/snapshot/<name_of_snapshot>
  
Actual results:
You will get something like, 'snap-hourly-4-latest/: Not a directory'

Expected results:
Change directory would work as expected into the snapshot.

Additional info:
Downgrading to kernel 2.6.18-274.18.1.el5 or earlier fixes the issue.

Comment 1 Ric Wheeler 2012-03-27 19:29:03 UTC
We don't support ZFS in production in any system.

I suggest trying more recent versions of RHEL to see if this was resolved for you.

Comment 2 derek 2012-03-27 21:04:48 UTC
Specifically is a regression in the NFS _client_ in 2.6.18-308.el5.  I can confirm this behavior also using RHEL5 2.6.18-308.el5 client to access a ONStor NFS file share with a .snapshot directory.   This behavior is not ZFS related but it is an easy way to reproduce the problem.  RHEL6 does not exhibit this behavior (2.6.32-220.7.1.el6).

Comment 3 Tru Huynh 2012-03-28 14:30:48 UTC
I also see that on 2.6.18-308.1.1.el5 (mounting zfs from solaris 10 server), it used to work for the 5.7 kernel series.

Comment 4 Ric Wheeler 2012-03-28 14:53:53 UTC
If either of you have a Red Hat support account, please open a support call officially so the field can collect more information.

Opening a ticket directly in our bugzilla bypasses the support team who could work with you to find a supported test case that shows an issue.

Other than that, you can take this issue to the upstream NFS list or to the other vendor (Oracle for Solaris).

Thanks!

Comment 5 Vincent S. Cojot 2012-03-29 14:27:41 UTC
Could this be related to BUGID 798809?
@Derek: Are you using NFSv4?

Comment 6 Tru Huynh 2012-03-29 16:17:05 UTC
I am using nfsv3 and the issue does NOT occur on Hitachi HNAS (was bluearc) served snapshots.

Comment 7 Trond H. Amundsen 2012-04-02 16:33:55 UTC
This issue DOES occur with NetApp (NFSv3), but only if the volume is root-exported (equivalent to no_root_squash). For volumes not exported with root privileges this is not an issue. Is it the same for the other storage solutions mentioned?

Comment 8 Trond H. Amundsen 2012-04-02 16:36:37 UTC
(In reply to comment #7)
> For volumes not exported *with* root privileges this is not an issue.

For volumes not exported *without* root privileges this is not an issue.

Sorry about that.

Comment 9 Trond H. Amundsen 2012-04-02 16:42:32 UTC
(In reply to comment #8)
> (In reply to comment #7)
> > For volumes not exported *with* root privileges this is not an issue.
> 
> For volumes not exported *without* root privileges this is not an issue.

This is ridiculous.. I actually got it right in the first place.

For those of you with Solaris/ZFS and other storage solutions, please confirm if the issue occurs with and/or without no_root_squash.

Comment 10 derek 2012-04-03 01:08:27 UTC
My testing shows that under ZFS the defect is still apparent with or without no_root_squash exportation from the server.

Comment 11 Ric Wheeler 2012-04-03 19:00:12 UTC
Please do open a proper support ticket for this issue, the Red Hat support team is quite good at gathering data and helping debug.

Updating this bugzilla is not a replacement for proper support escalations.

Comment 13 derek 2012-04-19 19:08:55 UTC
I would hope that Red Hat would like to continue to have its paying customers that do not have the ability to log support cases to let Red Hat know about bugs within their products.

I just want to update this to say that RHSA-2012:0480-01 fixes the issues described here in the product which includes bug 801726.  

So the regression was not exclusive to NFSv4 but also NFSv3.  I have marked this bug as a duplicate of 801726.

Comment 14 Dave Wysochanski 2012-05-09 13:52:30 UTC

*** This bug has been marked as a duplicate of bug 801726 ***

Comment 15 Dave Wysochanski 2012-05-09 14:05:33 UTC
Hi Derek - thanks for reporting this.  I've marked it a dup as you said in comment #13.  And yes of course, we always want to hear about bugs.

Can you log into the portal (http://access.redhat.com) and search our kbase?
If so, you can often find common issues like this one documented.  Here's the article which documents this problem:
https://access.redhat.com/knowledge/solutions/75553