[LU-8469] Sanity test 54c: Unable to unmount loop device Created: 02/Aug/16 Updated: 14/Dec/21 Resolved: 14/Dec/21 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Doug Oucharek (Inactive) | Assignee: | WC Triage |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
RHEL 7, master, VMs |
||
| Attachments: |
|
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
Ran into a lock-up of sanity test 133g when it tried to unmount the Lustre file system. It kept printing this to the console: /mnt/client is still busy, wait one second I stopped the test and tried to unmount manually. That failed with "file system busy" error. I found that a loop device was still mounted against the Lustre file system. This is a left over from test 54c. Looking back at the logs, I can see that 54c is unable to unmount the loop device: [ 1529.373768] Lustre: DEBUG MARKER: == sanity test 54c: block device works in lustre ===================================================== 21:31:35 (1469766695) I looked at other sanity runs which were successful, and found this: == sanity test 54c: block device works in lustre ===================================================== 17:27:05 (1469838425) Allocating group tables: done 30+0 records in So, 54c is passing when it cannot use the loop device?!? This all seems very broken to me. 54c passes when we can't use the loop device (why can't we use it) and fail when we do mount the loop device because it cannot unmount it. |
| Comments |
| Comment by Doug Oucharek (Inactive) [ 02/Aug/16 ] |
|
I'm attaching all the test 54c test logs for the failing case. |
| Comment by Oleg Drokin [ 02/Aug/16 ] |
122880 bytes (123 kB) copied, 0.00015479 s, 794 MB/s losetup: /mnt/client/loop54c: detach failed: No such device or address losetup: /dev/loop3: detach failed: No such device or address this is understandable: cleanup_54c() {
loopdev="$DIR/loop54c"
trap 0
$UMOUNT $DIR/$tdir || rc=$?
losetup -d $loopdev || true
losetup -d $LOOPDEV || true
So normally unmount when successful would detach loop device and the following losetups would error out which is what you have quoted in the ticket description- this is normal. Now the failure to unmount as in the attached log - that's a problem - something holds the mountpoint. when this happens would be great to see what was that. some background process got to it? |