Details
Type: Bug
Resolution: Unresolved
Priority: Minor
Fix Version/s: None
Affects Version/s: Lustre 2.11.0
Labels: None
Environment: trevis, full
servers: el7.4, ldiskfs, branch master, v2.10.53.1, b3642
clients: sles12.2, branch master, v2.10.53.1, b3642
Severity: 3
Rank (Obsolete): 9223372036854775807
Description
https://testing.hpdd.intel.com/test_sessions/bce14d16-91a3-441a-b3ec-8a1817b00025
Nothing relevant was found in any of the console or dmesg logs. The process of
stopping the clients appears never to finish. This might be SLES related.
From suite_log:
umount lustre on /mnt/lustre.....
CMD: trevis-3vm1 grep -c /mnt/lustre' ' /proc/mounts
Stopping client trevis-3vm1 /mnt/lustre (opts:)
CMD: trevis-3vm1 lsof -t /mnt/lustre
CMD: trevis-3vm1 umount /mnt/lustre 2>&1
stop ost1 service on trevis-3vm3
CMD: trevis-3vm3 grep -c /mnt/lustre-ost1' ' /proc/mounts
Stopping /mnt/lustre-ost1 (opts:-f) on trevis-3vm3
CMD: trevis-3vm3 umount -f /mnt/lustre-ost1
CMD: trevis-3vm3 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
stop mds service on trevis-3vm4
CMD: trevis-3vm4 grep -c /mnt/lustre-mds1' ' /proc/mounts
Stopping /mnt/lustre-mds1 (opts:-f) on trevis-3vm4
CMD: trevis-3vm4 umount -f /mnt/lustre-mds1
CMD: trevis-3vm4 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
CMD: trevis-3vm1 lsmod | grep lnet > /dev/null && lctl dl | grep ' ST '
modules unloaded.
Stopping clients: trevis-3vm1,trevis-3vm2 /mnt/lustre (opts:)
CMD: trevis-3vm1,trevis-3vm2 running=\$(grep -c /mnt/lustre' ' /proc/mounts);
if [ \$running -ne 0 ] ; then
echo Stopping client \$(hostname) /mnt/lustre opts:;
lsof /mnt/lustre || need_kill=no;
if [ x != x -a x\$need_kill != xno ]; then
pids=\$(lsof -t /mnt/lustre | sort -u);
if [ -n \"\$pids\" ]; then
kill -9 \$pids;
fi
fi;
while umount /mnt/lustre 2>&1 | grep -q busy; do
echo /mnt/lustre is still busy, wait one second && sleep 1;
done;
fi
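
A note on the last logged command: as logged, the kill branch is guarded by `[ x != x -a ... ]`, which is always false (presumably because opts is empty), and the `while umount ... | grep -q busy` loop has no upper bound. If the mount stayed busy, that loop would therefore never exit, although the log above does not show whether that is what happened here. The following is only a minimal sketch of a bounded variant of that loop, useful for telling "umount keeps reporting busy" apart from "umount itself never returns". The names `umount_bounded`, `UMOUNT_CMD` and `RETRY_DELAY` are hypothetical and are not part of the Lustre test framework.

```shell
# Hypothetical helper, not part of the Lustre test framework.
# UMOUNT_CMD and RETRY_DELAY are assumptions added so the loop can be dry-run.
UMOUNT_CMD=${UMOUNT_CMD:-umount}
RETRY_DELAY=${RETRY_DELAY:-1}

# usage: umount_bounded <mountpoint> [max_tries]
umount_bounded() {
    local mnt=$1 max_tries=${2:-30} tries=0 out
    while [ "$tries" -lt "$max_tries" ]; do
        # Capture umount's messages so they can be inspected and re-printed.
        out=$($UMOUNT_CMD "$mnt" 2>&1)
        if ! printf '%s\n' "$out" | grep -q busy; then
            # Same exit test as the logged loop: any result other than "busy" ends it.
            [ -n "$out" ] && printf '%s\n' "$out"
            return 0
        fi
        tries=$((tries + 1))
        echo "$mnt is still busy, attempt $tries of $max_tries"
        sleep "$RETRY_DELAY"
    done
    # Unlike the logged loop, stop after max_tries instead of waiting forever.
    echo "giving up on $mnt after $max_tries attempts" >&2
    return 1
}
```

For example, `umount_bounded /mnt/lustre 60` would retry for roughly a minute and then return 1. If a run with this variant still never returns, the wait is inside `umount` itself rather than in the retry loop.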