[LU-1449] Lustre b2_2 head corrupts file system during install Created: 30/May/12  Updated: 31/May/12  Resolved: 30/May/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.2
Fix Version/s: None

Type: Bug Priority: Blocker
Reporter: Chris Gearing (Inactive) Assignee: Andreas Dilger
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
Severity: 3
Rank (Obsolete): 6389

 Description   

If you run loadjenkinsbuild with b2_2 head the file system is corrupted durring the install

loadjenkinsbuild -n client-32vm6 -j lustre-b2_2 -b 0 -t client -d el6 -a x86_64 --profile test --jenkins-uri http://build.lab.whamcloud.com/ --cobbleruri http://cobbler.lab.whamcloud.com/cobbler_api --cobbleruser remote --cobblerpass aib2Iegh -v --packages="expect,lsof,curl,gcc,make,cvs,bc,posix" --reboot --powerup

Gives =====
Checking filesystems
Checking all file systems.
[/sbin/fsck.ext3 (1) -- /] fsck.ext3 -a /dev/vda1
fsck.ext3: Device or resource busy while trying to open /dev/vda1
Filesystem mounted or opened exclusively by another program?
Welcome to CentOS
Starting udev: [ OK ]
Setting hostname client-32vm6.lab.whamcloud.com: [ OK ]
Setting up Logical Volume Management: No volume groups found
[ OK ]
Checking filesystems
Checking all file systems.
[/sbin/fsck.ext3 (1) -- /] fsck.ext3 -a /dev/vda1
fsck.ext3: Device or resource busy while trying to open /dev/vda1
Filesystem mounted or opened exclusively by another program?
[FAILED]

      • An error occurred during the file system check.
      • Dropping you to a shell; the system will reboot
      • when you leave the shell.
        Give root password for maintenance
        (or type Control-D to continue):
        ===========

Other branches work perfectly well.



 Comments   
Comment by Chris Gearing (Inactive) [ 30/May/12 ]

My guess is that actually it's really post install.

The loadjenkins install via chef installs el6 and then installs the lustre kernel it is after this reboot with the b2_2 kernel that the fs appears to be corrupted.

Comment by Sarah Liu [ 30/May/12 ]

this also affects 1.8.8 RHEL6 build

Comment by Peter Jones [ 30/May/12 ]

Andreas is working on this one

Comment by Andreas Dilger [ 30/May/12 ]

This was caused by a patch change when pushing the patch upstream.

A fix for the problem is in http://review.whamcloud.com/2985.

Comment by Andreas Dilger [ 30/May/12 ]

New fix in http://review.whamcloud.com/2986, which also passes the RHEL5 build.

Comment by Andreas Dilger [ 30/May/12 ]

This fix is landed for e2fsprogs-1.42.3.wc1.

Generated at Sat Feb 10 01:16:44 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.