Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-430

Issues with mount.lustre and automounter

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Trivial Trivial
    • None
    • Lustre 1.8.6
    • None
    • Lustre, RHEL5.6, Automounter
    • 3
    • 10417

      We are using automounter to mount some of our lustre filesystems on workernodes around the cluster.
      These filesystems get unmounted after a long period of inactivity.

      The problem i'd like to point is not frequent but may affect other users as well.
      In some cases, when fs gets unmounted, the related entry is not removed from /etc/mtab file.
      This leads to the situation when automounter is unable to mount lustre again.

      Part of strace log from automount daemon:

      ...
      [pid 19154] execve("/sbin/mount.lustre", ["/sbin/mount.lustre", "10.8.1.101:/scratch", "/mnt/auto/scratch-lustre", "-f", "-o", "rw,nosuid,nodev,localflock"], [/* 14 vars */]) = 0

      ...
      [pid 19154] write(2, "mount.lustre: according to /etc/mtab 10.8.1.101:/scratch is already mounted on /mnt/auto/scratch-lustre\n", 104) = 104
      ...

      To make mounting possible again, the related entry needs to be removed from /etc/mtab
      I am not sure which part of the lustre-automount pair is mis-behaving here.
      Is it automounter not removing the entry from /etc/mtab or mount.lustre ifself not checking
      mount status in /proc/mounts?

      More details:

      [root@n2-1-1 ~]# grep -e lustre /etc/mtab
      10.8.1.101:/scratch /mnt/auto/scratch-lustre lustre rw,nosuid,nodev,localflock 0 0
      10.8.1.101:/storage /mnt/auto/storage-lustre lustre rw,nosuid,nodev,localflock 0 0
      172.16.193.1@o2ib:/scratch /mnt/lustre/scratch lustre rw,nosuid,nodev,user_xattr,flock,acl,user_xattr,flock,acl 0 0

      [root@n2-1-1 ~]# grep -e lustre /proc/mounts
      10.8.1.101@tcp:/storage /mnt/auto/storage-lustre lustre rw,nosuid,nodev,localflock,acl 0 0
      172.16.193.1@o2ib:/scratch /mnt/lustre/scratch lustre rw,nosuid,nodev,flock,acl 0 0

      Best Regards

      Lukasz Flis
      ACC Cyfronet

            wc-triage WC Triage
            lflis Lukasz Flis
            Votes:
            0 Vote for this issue
            Watchers:
            5 Start watching this issue

              Created:
              Updated:
              Resolved: