Details
-
Bug
-
Resolution: Won't Fix
-
Minor
-
None
-
Lustre 2.1.2
-
None
-
CentOS release 6.2 (Final)
Lustre 2.1.2
-
3
-
14003
Description
One of the OSS server (Oss3) had the following error. and it got rebooted.
MGC10.243.12.16@o2ib@10.243.12.17@o2ib:26/25 lens 368/512 e 0 to 1 dl 1399672339 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
May 9 23:52:28 oss03 kernel: : Lustre: scratch-OST0002 is waiting for obd_unlinked_exports more than 16 seconds. The obd refcount = 8. Is it stuck?
May 9 23:52:29 oss03 kernel: : Lustre: scratch-OST0003 is waiting for obd_unlinked_exports more than 16 seconds. The obd refcount = 16. Is it stuck?
May 9 23:52:49 oss03 kernel: : Lustre: 7036:0:(client.c:1780:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1399672358/real 1399672358] req@ffff8802ee5cac00 x1459215404517420/t0(0) o250->MGC10.243.12.16@o2ib@10.243.12.16@o2ib:26/25 lens 368/512 e 0 to 1 dl 1399672369 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
May 9 23:52:51 oss03 kernel: : LustreError: 137-5: home-OST0003: Not available for connect from 10.243.200.25@o2ib (no target)
May 9 23:52:51 oss03 kernel: : LustreError: Skipped 445 previous similar messages
May 9 23:53:00 oss03 kernel: : Lustre: scratch-OST0002 is waiting for obd_unlinked_exports more than 32 seconds. The obd refcount = 8. Is it stuck?
May 9 23:53:14 oss03 kernel: : Lustre: 7036:0:(client.c:1780:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1399672383/real 1399672383] req@ffff8802f2b72400 x1459215404517421/t0(0) o250->MGC10.243.12.16@o2ib@10.243.12.17@o2ib:26/25 lens 368/512 e 0 to 1 dl 1399672394 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
May 9 23:53:44 oss03 kernel: : Lustre: 7036:0:(client.c:1780:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1399672408/real 1399672408] req@ffff8802f2db2c00 x1459215404517422/t0(0) o250->MGC10.243.12.16@o2ib@10.243.12.16@o2ib:26/25 lens 368/512 e 0 to 1 dl 1399672424 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
May 9 23:53:55 oss03 kernel: : LustreError: 137-5: home-OST0002: Not available for connect from 10.243.201.4@o2ib (no target)
May 9 23:53:55 oss03 kernel: : LustreError: Skipped 1562 previous similar messages
May 9 23:54:04 oss03 kernel: : Lustre: scratch-OST0002 is waiting for obd_unlinked_exports more than 64 seconds. The obd refcount = 8. Is it stuck?
May 9 23:54:04 oss03 kernel: : Lustre: Skipped 1 previous similar message
May 9 23:54:09 oss03 kernel: : Lustre: 7036:0:(client.c:1780:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1399672433/real 1399672433] req@ffff8802ef55bc00 x1459215404517423/t0(0) o250->MGC10.243.12.16@o2ib@10.243.12.17@o2ib:26/25 lens 368/512 e 0 to 1 dl 1399672449 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
May 9 23:55:04 oss03 kernel: : Lustre: 7036:0:(client.c:1780:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1399672483/real 1399672483] req@ffff88007b818800 x1459215404517425/t0(0) o250->MGC10.243.12.16@o2ib@10.243.12.17@o2ib:26/25 lens 368/512 e 0 to 1 dl 1399672504 ref 1 fl Rpc:XN/0/ffffffff rc 0/-1
May 9 23:55:04 oss03 kernel: : Lustre: 7036:0:(client.c:1780:ptlrpc_expire_one_request()) Skipped 1 previous similar message
May 9 23:56:03 oss03 kernel: : LustreError: 137-5: home-OST0003: Not available for connect from 10.248.2.21@tcp1 (no target)
May 9 23:56:03 oss03 kernel: : LustreError: Skipped 3114 previous similar messages
May 9 23:56:12 oss03 kernel: : Lustre: scratch-OST0002 is waiting for obd_unlinked_exports more than 128 seconds. The obd refcount = 8. Is it stuck?
May 9 23:56:12 oss03 kernel: : Lustre: Skipped 1 previous similar message
Mark the issue as "Resolve/Won't Fix", and please reopen it if more work is needed.