Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8469

Sanity test 54c: Unable to unmount loop device

    XMLWordPrintable

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • None
    • None
    • RHEL 7, master, VMs
    • 3
    • 9223372036854775807

    Description

      Ran into a lock-up of sanity test 133g when it tried to unmount the Lustre file system. It kept printing this to the console:

      /mnt/client is still busy, wait one second
      /mnt/client is still busy, wait one second
      /mnt/client is still busy, wait one second

      I stopped the test and tried to unmount manually. That failed with "file system busy" error.

      I found that a loop device was still mounted against the Lustre file system. This is a left over from test 54c.

      Looking back at the logs, I can see that 54c is unable to unmount the loop device:

      [ 1529.373768] Lustre: DEBUG MARKER: == sanity test 54c: block device works in lustre ===================================================== 21:31:35 (1469766695)
      [ 1529.543207] EXT4-fs (loop3): mounting ext2 file system using the ext4 subsystem
      [ 1529.550865] EXT4-fs (loop3): mounted filesystem without journal. Opts: (null)
      [ 1529.810716] Lustre: DEBUG MARKER: sanity test_54c: @@@@@@ FAIL: test_54c failed with 32

      I looked at other sanity runs which were successful, and found this:

      == sanity test 54c: block device works in lustre ===================================================== 17:27:05 (1469838425)
      make a loop file system with /mnt/client/f54c.sanity on /mnt/client/loop54c (3).
      1+0 records in
      1+0 records out
      4096 bytes (4.1 kB) copied, 0.00140237 s, 2.9 MB/s
      mke2fs 1.42.12.wc1 (15-Sep-2014)
      Creating filesystem with 4100 1k blocks and 1032 inodes

      Allocating group tables: done
      Writing inode tables: done
      Writing superblocks and filesystem accounting information: done

      30+0 records in
      30+0 records out
      122880 bytes (123 kB) copied, 0.000805954 s, 152 MB/s
      Filesystem 1K-blocks Used Available Use% Mounted on
      /mnt/client/loop54c 3966 151 3610 5% /mnt/client/d54c.sanity
      30+0 records in
      30+0 records out
      122880 bytes (123 kB) copied, 0.00015479 s, 794 MB/s
      losetup: /mnt/client/loop54c: detach failed: No such device or address
      losetup: /dev/loop3: detach failed: No such device or address
      Resetting fail_loc on all nodes...done.
      PASS 54c (1s)

      So, 54c is passing when it cannot use the loop device?!?

      This all seems very broken to me. 54c passes when we can't use the loop device (why can't we use it) and fail when we do mount the loop device because it cannot unmount it.

      Attachments

        Activity

          People

            wc-triage WC Triage
            doug Doug Oucharek (Inactive)
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: