Details
-
Bug
-
Resolution: Fixed
-
Blocker
-
Lustre 2.4.0
-
LLNL/Hyperion lustre-reviews 9573
-
3
-
4299
Description
<ConMan> Console [hyperion-rst6] log at 2012-09-30 16:00:00 PDT. 2012-09-30 16:20:23 [A[B[1;2B[1;2B[1;2B[1;2B[1;2B[B[B[B[B[B[BLustre: 4162:0:(client.c:1917:ptlrpc_expire_one_request()) @@@ Request sent has timed out for slow reply: [sent 1349048656/real 1349048656] req@ffff880176df8c00 x1414547178955781/t0(0) o400->MGC192.168.127.6@o2ib@0@lo:26/25 lens 224/224 e 0 to 1 dl 1349048762 ref 1 fl Rpc:RXN/0/ffffffff rc 0/-1 2012-09-30 16:46:02 LustreError: 166-1: MGC192.168.127.6@o2ib: Connection to MGS (at 0@lo) was lost; in progress operations using this service will fail 2012-09-30 16:46:02 Lustre: MGS: Client b825c79a-32bd-3ab8-c5fa-b66aeb9bb741 (at 0@lo) reconnecting 2012-09-30 16:46:02 LustreError: 4242:0:(obd_class.h:527:obd_set_info_async()) obd_set_info_async: dev 0 no operation 2012-09-30 16:47:09 Lustre: MGC192.168.127.6@o2ib: Reactivating import 2012-09-30 16:47:09 Lustre: MGC192.168.127.6@o2ib: Connection restored to MGS (at 0@lo) <ConMan> Console [hyperion-rst6] log at 2012-09-30 17:00:00 PDT. 2012-09-30 17:02:38 LustreError: 6069:0:(osd_handler.c:837:osd_trans_start()) ASSERTION( oti->oti_w_locks == 0 ) failed: 2012-09-30 17:02:38 LustreError: 6069:0:(osd_handler.c:837:osd_trans_start()) LBUG 2012-09-30 17:02:38 Pid: 6069, comm: mdt_rdpg03_002 2012-09-30 17:02:38
For some reason, stack did not dump, but vmcore obtained
crash> bt
PID: 6069 TASK: ffff880141836aa0 CPU: 7 COMMAND: "mdt_rdpg03_002"
#0 [ffff880156e759c8] machine_kexec at ffffffff8103281b
#1 [ffff880156e75a28] crash_kexec at ffffffff810ba792
#2 [ffff880156e75af8] panic at ffffffff814fd591
#3 [ffff880156e75b78] lbug_with_loc at ffffffffa0393f6b [libcfs]
#4 [ffff880156e75b98] osd_trans_start at ffffffffa0a8d2bc [osd_ldiskfs]
#5 [ffff880156e75bd8] mdd_trans_start at ffffffffa0f043a4 [mdd]
#6 [ffff880156e75be8] mdd_close at ffffffffa0edf4f6 [mdd]
#7 [ffff880156e75c58] cml_close at ffffffffa06baef6 [cmm]
#8 [ffff880156e75c88] mdt_mfd_close at ffffffffa0f8a18e [mdt]
#9 [ffff880156e75ce8] mdt_close at ffffffffa0f8ae0a [mdt]
#10 [ffff880156e75d38] mdt_handle_common at ffffffffa0f66802 [mdt]
#11 [ffff880156e75d88] mdt_readpage_handle at ffffffffa0f676d5 [mdt]
#12 [ffff880156e75d98] ptlrpc_server_handle_request at ffffffffa0966b3c [ptlrpc]
#13 [ffff880156e75e98] ptlrpc_main at ffffffffa0968111 [ptlrpc]
#14 [ffff880156e75f48] kernel_thread at ffffffff8100c14a