Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • Lustre 2.5.1
    • Lustre with ZFS for MDT and OSTs
    • 3
    • 14658

    Description

      When a directory was copied to the mount point for the lustre file system it does not show up directory listing from an ls or otherwise but the directory exists and is accessible if the path is given directly. Directory in question is called tftpboot below.

      [root@r01svr1 hydra60]# ls /lustre/hydra60/tftpboot
      memtest  pxelinux.0  pxelinux.cfg
      [root@r01svr1 hydra60]# stat /lustre/hydra60/tftpboot
        File: `/lustre/hydra60/tftpboot'
        Size: 5632            Blocks: 11         IO Block: 131072 directory
      Device: 680a27c4h/1745496004d   Inode: 144115205641631522  Links: 3
      Access: (0755/drwxr-xr-x)  Uid: (    0/    root)   Gid: (    0/    root)
      Access: 2014-06-10 22:06:07.000000000 +0000
      Modify: 2014-06-25 00:22:31.000000000 +0000
      Change: 2014-06-25 00:22:31.000000000 +0000
      [root@r01svr1 hydra60]# lfs path2fid /lustre/hydra60/tftpboot
      [0x200000417:0x7722:0x0]
      [root@r01svr1 hydra60]# ls /lustre/hydra60/ | grep tftp
      [root@r01svr1 hydra60]# ls /lustre/hydra60/ 
      af06            mnt      r01svr10  r01svr13  r01svr16  r01svr19  r01svr3  r01svr6  r01svr9  tmp
      centos_updates  notes    r01svr11  r01svr14  r01svr17  r01svr2   r01svr4  r01svr7  test2    tmp_striped
      misc            r01svr1  r01svr12  r01svr15  r01svr18  r01svr20  r01svr5  r01svr8  test3
      [root@r01svr1 hydra60]# 
      

      Attachments

        Issue Links

          Activity

            [LU-5254] readdir missing a directory

            Chris, I don't think that keeping this issue open with Jeremy's information would help diagnose whatever problem you are seeing with your system, since you are running quite different versions of the code. The client-side directory handling changed a bunch between 2.5.53 and 2.5.60 because of striped directories and there several bugs fixed during its development that was fixed in a later release.

            I think it is better to keep this bug separate from yours, since I suspect they have different root causes, though you can of course reference this one in your bug if you think that is helpful.

            adilger Andreas Dilger added a comment - Chris, I don't think that keeping this issue open with Jeremy's information would help diagnose whatever problem you are seeing with your system, since you are running quite different versions of the code. The client-side directory handling changed a bunch between 2.5.53 and 2.5.60 because of striped directories and there several bugs fixed during its development that was fixed in a later release. I think it is better to keep this bug separate from yours, since I suspect they have different root causes, though you can of course reference this one in your bug if you think that is helpful.

            I could not reproduce this with Lustre 2.5.1 (zfs 0.6.2) nor 2.4.3 (zfs 0.6.1) with a basic copy test.

            utopiabound Nathaniel Clark added a comment - I could not reproduce this with Lustre 2.5.1 (zfs 0.6.2) nor 2.4.3 (zfs 0.6.1) with a basic copy test.

            This issue in particular was consistent for the client. The directory entry was either there or not, so the problem wasn't a race condition. The information I posted doesn't likely pertain to what you are seeing at LLNL so a new bug may be appropriate but I'm fine either way.

            jfilizetti Jeremy Filizetti added a comment - This issue in particular was consistent for the client. The directory entry was either there or not, so the problem wasn't a race condition. The information I posted doesn't likely pertain to what you are seeing at LLNL so a new bug may be appropriate but I'm fine either way.

            I'm reopening. We have seen this in production. If you want to argue that the LLNL production instance is unrelated and needs a separate ticket, I suppose you can do that. But you need to convey in words what you are thinking in that case.

            morrone Christopher Morrone (Inactive) added a comment - - edited I'm reopening. We have seen this in production. If you want to argue that the LLNL production instance is unrelated and needs a separate ticket, I suppose you can do that. But you need to convey in words what you are thinking in that case.

            How do you know the problem is gone? Did you have a very reliable reproducer with 2.5.56?

            For us at least the problem is racy, and I don't know how to reproduce it on demand.

            morrone Christopher Morrone (Inactive) added a comment - How do you know the problem is gone? Did you have a very reliable reproducer with 2.5.56? For us at least the problem is racy, and I don't know how to reproduce it on demand.
            jfilizetti Jeremy Filizetti added a comment - - edited

            Looks like I was testing with 2.5.56 and not version 2.5.2 like I thought was installed there. 2.5.2 did not have the issue and I also tested 2.6.50 and it also worked so I think we can close this ticket.

            jfilizetti Jeremy Filizetti added a comment - - edited Looks like I was testing with 2.5.56 and not version 2.5.2 like I thought was installed there. 2.5.2 did not have the issue and I also tested 2.6.50 and it also worked so I think we can close this ticket.

            I believe that we have seen the same bug in production using our 2.4.0-Xchaos branch. In our case, the invisible directory was not at the top-level (mount point) of the Lustre tree, it was deeper in the directory tree. We too are using zfs obds for the filesystem where we saw this.

            morrone Christopher Morrone (Inactive) added a comment - - edited I believe that we have seen the same bug in production using our 2.4.0-Xchaos branch. In our case, the invisible directory was not at the top-level (mount point) of the Lustre tree, it was deeper in the directory tree. We too are using zfs obds for the filesystem where we saw this.

            This might have been fixed by a patch landed since 2.5.1. Please check git log before spending too much time digging into the code.

            adilger Andreas Dilger added a comment - This might have been fixed by a patch landed since 2.5.1. Please check git log before spending too much time digging into the code.
            pjones Peter Jones added a comment -

            Nathaniel could you please look into this one? Thanks peter

            pjones Peter Jones added a comment - Nathaniel could you please look into this one? Thanks peter

            Just tested this with a Lustre 1.8.9-wc1 client and the tftpboot directory shows up.

            [root@r01svr2 ~]# ls /lustre/hydra60
            af06            misc  notes    r01svr10  r01svr12  r01svr14  r01svr16  r01svr18  r01svr2   r01svr3  r01svr5  r01svr7  r01svr9  test3     tmp
            centos_updates  mnt   r01svr1  r01svr11  r01svr13  r01svr15  r01svr17  r01svr19  r01svr20  r01svr4  r01svr6  r01svr8  test2    tftpboot  tmp_striped
            [root@r01svr2 ~]# ls /lustre/hydra60/tftpboot/
            memtest  pxelinux.0  pxelinux.cfg
            [root@r01svr2 ~]# 
            
            jfilizetti Jeremy Filizetti added a comment - Just tested this with a Lustre 1.8.9-wc1 client and the tftpboot directory shows up. [root@r01svr2 ~]# ls /lustre/hydra60 af06 misc notes r01svr10 r01svr12 r01svr14 r01svr16 r01svr18 r01svr2 r01svr3 r01svr5 r01svr7 r01svr9 test3 tmp centos_updates mnt r01svr1 r01svr11 r01svr13 r01svr15 r01svr17 r01svr19 r01svr20 r01svr4 r01svr6 r01svr8 test2 tftpboot tmp_striped [root@r01svr2 ~]# ls /lustre/hydra60/tftpboot/ memtest pxelinux.0 pxelinux.cfg [root@r01svr2 ~]#

            People

              utopiabound Nathaniel Clark
              jfilizetti Jeremy Filizetti
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: