Details
-
Bug
-
Resolution: Fixed
-
Major
-
Lustre 1.8.6
-
None
-
RHEL 5.5 and Lustre 1.8.0.1 on J4400's
-
3
-
10266
Description
OST 10 /dev/md30 resident on OSS3
From /var/log/messages
LDisk-fs warning (device md30): ldisk_multi_mount_protect: fsck is running on filesystem
LDisk-fs warning (device md30): ldisk_multi_mount_protect: MMP failure info: <time in unix seconds>, last update node: OSS3, last update device /dev/md30
This is a scenario that keeps sending the customer in circles. They know for certain that an fsck is not running. Since they know that they can try to turn the mmp bit off vi the following commands:
To manually disable MMP, run:
tune2fs -O ^mmp <device>
To manually enable MMP, run:
tune2fs -O mmp <device>
These commands fail saying that valid superblock does not exist, but they can see their valid superblock (with mmp set) by running the following command:
Tune2fs -l /dev/md30
It is their understanding that a fix for this issue was released with a later version of Lustre, but aside from that, is there a way to do this?
Customer contact is tyler.s.wiegers@lmco.com
ost15 had a fairly large amount of filesystem corruption when running the e2fsck. We used a lustre restore from lost and found command to attempt to restore that data. ost11 did not have corruption I don't beleive.
The recovery status using lctl get_param obdfilter.*.recovery_status on the oss shows everything as COMPLETE, which is good.
Using lctl get_param osc.*.import (not state):
The mds shows state as FULL for all OSTs, which is good
The client shows state as NEW for OST 11 and 15, but FULL for all others. There are also 3 entries for OST11 and 15 in this listing
We're working on the log output for attempting to mount