[LU-1998] Can't fsck/mount mdadm device with external journal after journal was deleted and re-added. LDISKFS-fs: journal UUID does not match Created: 20/Sep/12 Updated: 20/Sep/12 Resolved: 20/Sep/12 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 1.8.x (1.8.0 - 1.8.5) |
| Fix Version/s: | None |
| Type: | Epic | Priority: | Major |
| Reporter: | Joe Mervini | Assignee: | WC Triage |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Sun Blade (redsky) hardware running TOSS-4.4.4-1 (RHEL 5.5) and Sun J4400 JBODs |
||
| Rank (Obsolete): | 10521 |
| Description |
|
We encountered a problem with one of our OSTs that would not start because of "[ 5437.653340] md2: bitmap superblock UUID mismatch". Also while we were able to run fsck.ldiskfs (livermore's renaming) on the other md device on the OSS, on the problem target fsck was refusing to run because of the following error: "external journal does not support filesystem". So following directions posted by Andreas in an earlier lustre discuss thread, we removed the journal, ran fsck and re-added the external journal. After running those steps and trying to mount -t lustre, it fails asking whether the ldiskfs module is available with "LDISKFS-fs: journal UUID does not match" in dmesg. In addition, fsck still reports the journal does not support filesystem. I need some help fixing this problem ASAP because this is a production system. dmesg output: |
| Comments |
| Comment by Johann Lombardi (Inactive) [ 20/Sep/12 ] |
|
The problem might be related to the major/minor of the journal block device which have changed (those numbers are hardcoded in the ldiskfs filesystem). Could you please provide us with: Thanks |
| Comment by Joe Mervini [ 20/Sep/12 ] |
|
Never mind. I decided to remake the journal file system (mke2fs -b 4096 -O journal_dev /dev/md12) and after doing that everything came back fine. |
| Comment by Johann Lombardi (Inactive) [ 20/Sep/12 ] |
|
ok |