  1. Lustre
  2. LU-3723

LBUG on MDS when unmounting file system after lfsck -t namespace on 2.4

Details

    • Type: Bug
    • Resolution: Duplicate
    • Priority: Critical
    • None
    • Affects Version/s: Lustre 2.4.0
    • None
    • 3
    • 9585

    Description

      After running lfsck -t namespace (specifically, lctl lfsck_start -M [fsname]-MDT0000 -t namespace) on a 2.4-formatted file system, the MDS LBUGs when the file system is unmounted.

      This was observed with master on CentOS 6 and with 2.4 on SLES11SP1. The dump I'll be making available is on SLES11SP1 with 2.4.

      This issue has been observed both during an upgrade from 1.8.6 to 2.4 and on a fresh 2.4 install.
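
      The reproduction sequence described above can be sketched as a small script. This is only a sketch under assumptions: the file system name (testfs) and the MDT mount point (/mnt/mdt) are hypothetical placeholders, and the report does not say whether the scan had finished before the unmount. The lctl command line is the one quoted in the description; the umount step is inferred from "comm: umount" in the trace. By default the script only prints the commands; set DRY_RUN=0 on a disposable test MDS to actually run them.

```shell
#!/bin/sh
# Reproduction sketch for the unmount LBUG described in this report.
# FSNAME and MDT_MOUNT are hypothetical placeholders, not values from the report.
FSNAME="${FSNAME:-testfs}"
MDT_MOUNT="${MDT_MOUNT:-/mnt/mdt}"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = execute them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

repro() {
    # Start the namespace LFSCK scan on the MDT (command quoted in the report).
    run lctl lfsck_start -M "${FSNAME}-MDT0000" -t namespace
    # Unmount the MDT; the LBUG was reported during this step.
    run umount "$MDT_MOUNT"
}

repro
```

      With DRY_RUN left at its default, the script is safe to run anywhere, since it never invokes lctl or umount.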

      Here's the stack trace:
      2013-08-07T10:30:11.921271-05:00 c0-0c1s5n0 LustreError: 20626:0:(lu_object.c:1141:lu_device_fini()) ASSERTION( cfs_atomic_read(&d->ld_ref) == 0 ) failed: Refcount is 1
      2013-08-07T10:30:11.921310-05:00 c0-0c1s5n0 LustreError: 20626:0:(lu_object.c:1141:lu_device_fini()) LBUG
      2013-08-07T10:30:11.921321-05:00 c0-0c1s5n0 Pid: 20626, comm: umount
      2013-08-07T10:30:11.921329-05:00 c0-0c1s5n0 Call Trace:
      2013-08-07T10:30:11.921338-05:00 c0-0c1s5n0 [<ffffffff81007e59>] try_stack_unwind+0x1a9/0x200
      2013-08-07T10:30:11.921347-05:00 c0-0c1s5n0 [<ffffffff81006625>] dump_trace+0x95/0x300
      2013-08-07T10:30:11.921356-05:00 c0-0c1s5n0 [<ffffffffa044c8d7>] libcfs_debug_dumpstack+0x57/0x80 [libcfs]
      2013-08-07T10:30:11.921365-05:00 c0-0c1s5n0 [<ffffffffa044ce27>] lbug_with_loc+0x47/0xb0 [libcfs]
      2013-08-07T10:30:11.921373-05:00 c0-0c1s5n0 [<ffffffffa05590c7>] lu_device_fini+0x87/0xc0 [obdclass]
      2013-08-07T10:30:11.921382-05:00 c0-0c1s5n0 [<ffffffffa053e4e9>] ls_device_put+0xa9/0x200 [obdclass]
      2013-08-07T10:30:11.921390-05:00 c0-0c1s5n0 [<ffffffffa053e74b>] local_oid_storage_fini+0x10b/0x210 [obdclass]
      2013-08-07T10:30:11.921398-05:00 c0-0c1s5n0 [<ffffffffa0251944>] mdd_process_config+0x274/0x610 [mdd]
      2013-08-07T10:30:11.921407-05:00 c0-0c1s5n0 [<ffffffffa0b7ed6b>] mdt_stack_fini+0x17b/0xbc0 [mdt]
      2013-08-07T10:30:11.921416-05:00 c0-0c1s5n0 [<ffffffffa0b7fe39>] mdt_device_fini+0x689/0xdd0 [mdt]
      2013-08-07T10:30:11.921424-05:00 c0-0c1s5n0 [<ffffffffa054b00f>] class_cleanup+0x65f/0xdb0 [obdclass]
      2013-08-07T10:30:11.921432-05:00 c0-0c1s5n0 [<ffffffffa054c874>] class_process_config+0x1114/0x1cb0 [obdclass]
      2013-08-07T10:30:11.921440-05:00 c0-0c1s5n0 [<ffffffffa054d587>] class_manual_cleanup+0x177/0x6f0 [obdclass]
      2013-08-07T10:30:11.921448-05:00 c0-0c1s5n0 [<ffffffffa05845ba>] server_put_super+0x5ba/0xf00 [obdclass]
      2013-08-07T10:30:11.921456-05:00 c0-0c1s5n0 [<ffffffff811159bd>] generic_shutdown_super+0x5d/0x110
      2013-08-07T10:30:11.921465-05:00 c0-0c1s5n0 [<ffffffff81115ad6>] kill_anon_super+0x16/0x60
      2013-08-07T10:30:11.921473-05:00 c0-0c1s5n0 [<ffffffffa054f2c6>] lustre_kill_super+0x36/0x50 [obdclass]
      2013-08-07T10:30:11.921481-05:00 c0-0c1s5n0 [<ffffffff81115f73>] deactivate_super+0x73/0x90
      2013-08-07T10:30:11.921490-05:00 c0-0c1s5n0 [<ffffffff8112e082>] mntput_no_expire+0xc2/0xf0
      2013-08-07T10:30:11.921498-05:00 c0-0c1s5n0 [<ffffffff8112e43c>] sys_umount+0x7c/0x360
      2013-08-07T10:30:11.921506-05:00 c0-0c1s5n0 [<ffffffff8100305b>] system_call_fastpath+0x16/0x1b
      2013-08-07T10:30:11.921514-05:00 c0-0c1s5n0 [<00007fa6f1b37d07>] 0x7fa6f1b37d07
      2013-08-07T10:30:11.921523-05:00 c0-0c1s5n0 Kernel panic - not syncing: LBUG
      2013-08-07T10:30:11.921531-05:00 c0-0c1s5n0 Pid: 20626, comm: umount Tainted: P 2.6.32.59-0.7.1_1.0000.7461-cray_gem_s #1
      2013-08-07T10:30:11.921539-05:00 c0-0c1s5n0 Call Trace:
      2013-08-07T10:30:11.921547-05:00 c0-0c1s5n0 [<ffffffff81007e59>] try_stack_unwind+0x1a9/0x200
      2013-08-07T10:30:11.921555-05:00 c0-0c1s5n0 [<ffffffff81006625>] dump_trace+0x95/0x300
      2013-08-07T10:30:11.921563-05:00 c0-0c1s5n0 [<ffffffff8100786c>] show_trace_log_lvl+0x5c/0x80
      2013-08-07T10:30:11.921572-05:00 c0-0c1s5n0 [<ffffffff810078a5>] show_trace+0x15/0x20
      2013-08-07T10:30:11.921580-05:00 c0-0c1s5n0 [<ffffffff814283c5>] dump_stack+0x77/0x82
      2013-08-07T10:30:11.921588-05:00 c0-0c1s5n0 [<ffffffff8142844a>] panic+0x7a/0x165
      2013-08-07T10:30:11.921597-05:00 c0-0c1s5n0 [<ffffffffa044ce7b>] lbug_with_loc+0x9b/0xb0 [libcfs]
      2013-08-07T10:30:11.921605-05:00 c0-0c1s5n0 [<ffffffffa05590c7>] lu_device_fini+0x87/0xc0 [obdclass]
      2013-08-07T10:30:11.921613-05:00 c0-0c1s5n0 [<ffffffffa053e4e9>] ls_device_put+0xa9/0x200 [obdclass]
      2013-08-07T10:30:11.921621-05:00 c0-0c1s5n0 [<ffffffffa053e74b>] local_oid_storage_fini+0x10b/0x210 [obdclass]
      2013-08-07T10:30:11.921630-05:00 c0-0c1s5n0 [<ffffffffa0251944>] mdd_process_config+0x274/0x610 [mdd]
      2013-08-07T10:30:11.921638-05:00 c0-0c1s5n0 [<ffffffffa0b7ed6b>] mdt_stack_fini+0x17b/0xbc0 [mdt]
      2013-08-07T10:30:11.921646-05:00 c0-0c1s5n0 [<ffffffffa0b7fe39>] mdt_device_fini+0x689/0xdd0 [mdt]
      2013-08-07T10:30:11.921654-05:00 c0-0c1s5n0 [<ffffffffa054b00f>] class_cleanup+0x65f/0xdb0 [obdclass]
      2013-08-07T10:30:11.921663-05:00 c0-0c1s5n0 [<ffffffffa054c874>] class_process_config+0x1114/0x1cb0 [obdclass]
      2013-08-07T10:30:11.921672-05:00 c0-0c1s5n0 [<ffffffffa054d587>] class_manual_cleanup+0x177/0x6f0 [obdclass]
      2013-08-07T10:30:11.921680-05:00 c0-0c1s5n0 [<ffffffffa05845ba>] server_put_super+0x5ba/0xf00 [obdclass]
      2013-08-07T10:30:11.921688-05:00 c0-0c1s5n0 [<ffffffff811159bd>] generic_shutdown_super+0x5d/0x110
      2013-08-07T10:30:11.921697-05:00 c0-0c1s5n0 [<ffffffff81115ad6>] kill_anon_super+0x16/0x60
      2013-08-07T10:30:11.921705-05:00 c0-0c1s5n0 [<ffffffffa054f2c6>] lustre_kill_super+0x36/0x50 [obdclass]
      2013-08-07T10:30:11.921713-05:00 c0-0c1s5n0 [<ffffffff81115f73>] deactivate_super+0x73/0x90
      2013-08-07T10:30:11.921725-05:00 c0-0c1s5n0 [<ffffffff8112e082>] mntput_no_expire+0xc2/0xf0
      2013-08-07T10:30:11.921751-05:00 c0-0c1s5n0 [<ffffffff8112e43c>] sys_umount+0x7c/0x360
      2013-08-07T10:30:11.921760-05:00 c0-0c1s5n0 [<ffffffff8100305b>] system_call_fastpath+0x16/0x1b
      2013-08-07T10:30:11.921768-05:00 c0-0c1s5n0 [<00007fa6f1b37d07>] 0x7fa6f1b37d07

          People

            yong.fan nasf (Inactive)
            paf Patrick Farrell (Inactive)