[LU-7786] improve racer cleanup, racer timeout, /mnt/lustre2 is still busy ... Created: 17/Feb/16  Updated: 17/May/16  Resolved: 14/Mar/16

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: Lustre 2.9.0

Type: Bug Priority: Major
Reporter: Lokesh Nagappa Jaliminche (Inactive) Assignee: WC Triage
Resolution: Fixed Votes: 0
Labels: patch

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

racer times out; t-f (test-framework) gets stuck on umount /mnt/lustre2:

== racer test complete, duration 920 sec == 22:14:53 (1382220893)
Stopping clients: mft77,mft78 /mnt/lustre2 (opts:)
Stopping client mft78 /mnt/lustre2 opts:
COMMAND  PID USER   FD   TYPE      DEVICE SIZE/OFF               NODE NAME
dd      3567 root    1w   REG 1273,181606 80902144 144115205255725062 /mnt/lustre/racer/4
Stopping client mft77 /mnt/lustre2 opts:
COMMAND   PID USER   FD   TYPE      DEVICE  SIZE/OFF               NODE NAME
dd       7390 root    1w   REG 1273,181606  81574912 144115205255725062 /mnt/lustre/racer/8
dd       7395 root    1w   REG 1273,181606  81574912 144115205255725062 /mnt/lustre/racer/8
dd       7433 root    1w   REG 1273,181606  81592320 144115205255725062 /mnt/lustre2/racer/9
dd      31735 root    1w   REG 1273,181606 153261056 144115205255780734 /mnt/lustre/racer/2
/mnt/lustre2 is still busy, wait one second
/mnt/lustre2 is still busy, wait one second
/mnt/lustre2 is still busy, wait one second


 Comments   
Comment by Lokesh Nagappa Jaliminche (Inactive) [ 17/Feb/16 ]

On cleanup, racer terminates its child scripts (file_create.sh, dir_create.sh, etc.), but the children of those scripts are not
terminated that way. Long-running commands such as dd keep the mount point busy and cause the annoying warning "/mnt/lustre2 is still busy, wait one second"
on the attempt to umount $DIR2.
Added a trap to all child scripts so that they clean up on exit.
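
The trap-based cleanup described above can be sketched roughly as follows. This is an illustration only, not the merged patch: the names worker, child and pidfile are hypothetical, and sleep stands in for a long-running dd. The sketch assumes the long-running command is started in the background and waited on, so that a SIGTERM sent to the script is handled promptly.

```shell
#!/bin/bash
# Hypothetical stand-in for a racer child script such as file_create.sh.
# All names here (worker, child, pidfile) are illustrative assumptions.
worker() {
    child=""
    # On exit, terminate the tracked child and reap it so nothing is left running.
    trap '[ -n "$child" ] && kill "$child" 2>/dev/null; wait' EXIT
    # Convert SIGTERM into a normal exit so the EXIT trap above runs.
    trap 'exit 0' TERM
    while true; do
        sleep 300 &            # stands in for a long-running dd
        child=$!
        echo "$child" > "$1"   # publish the child's PID for the demo below
        wait "$child"
    done
}

# Demo: start the worker, then terminate only the worker itself.
pidfile=$(mktemp)
worker "$pidfile" &
worker_pid=$!

# Wait until the worker has started its long-running child.
while [ ! -s "$pidfile" ]; do sleep 1; done
child_pid=$(cat "$pidfile")

kill -TERM "$worker_pid"
wait "$worker_pid"

# Check whether the worker's child survived the worker.
if kill -0 "$child_pid" 2>/dev/null; then
    child_state="still running"
else
    child_state="terminated"
fi
echo "child $child_pid: $child_state"
rm -f "$pidfile"
```

Without the two trap lines, the sleep process would be expected to outlive the worker, which is the situation in which a leftover dd holds files open under the mount point and umount keeps reporting it as busy.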

Comment by Gerrit Updater [ 17/Feb/16 ]

lokesh.jaliminche (lokesh.jaliminche@seagate.com) uploaded a new patch: http://review.whamcloud.com/18475
Subject: LU-7786 tests: improve racer cleanup
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 2f30a5a04ffbdc7d176e4d859b43913e05fa51a3

Comment by Gerrit Updater [ 14/Mar/16 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/18475/
Subject: LU-7786 tests: improve racer cleanup
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: dd48eb801321ab87c89dfbe3dd89fa487cf874b8

Generated at Sat Feb 10 02:11:54 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.