Details
Bug
Resolution: Not a Bug
Major
None
Lustre 2.1.1
None
3
4045
Description
The MDS started to report "Parent doesn't exist!". Load on the MDS became very high, and we ended up crash-dumping the server. A vmcore is available if needed. This could be a duplicate of LU-1350, but with a hung service thread.
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) header@ffff880c18139480[0x0, 1, [0x325698b32c:0x9:0x0] hash lru]{
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) ....mdt@ffff880c181394d8mdt-object@ffff880c18139480(ioepoch=0 flags=0x0, epochcount=0, writecount=0)
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) ....cmm@ffff880b36c71d40[local]
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) ....mdd@ffff8808ceb92a40mdd-object@ffff8808ceb92a40(open_count=0, valid=0, cltime=0, flags=0)
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) ....osd-ldiskfs@ffff8808ceb92980osd-ldiskfs-object@ffff8808ceb92980(i:(null):0/0)[plain]
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) } header@ffff880c18139480
Lustre: 4504:0:(mdt_handler.c:888:mdt_getattr_name_lock()) Parent doesn't exist!
Lustre: 4946:0:(mdt_xattr.c:375:mdt_reint_setxattr()) client miss to set OBD_MD_FLCTIME when setxattr: [object [0x2f00600666:0x44:0x0]] [valid 68719476736]
Lustre: Service thread pid 9153 was inactive for 200.00s. The thread might be hung, or it might only be slow and will resume later. Dumping the stack trace for debugging purposes:
Lustre: Skipped 2 previous similar messages
Pid: 9153, comm: mdt_mdss_153

Call Trace:
[<ffffffffa0a76785>] jbd2_log_wait_commit+0xc5/0x140 [jbd2]
[<ffffffff8108fff0>] ? autoremove_wake_function+0x0/0x40
[<ffffffff81012c69>] ? read_tsc+0x9/0x20
[<ffffffffa0a6eb4b>] jbd2_journal_stop+0x2cb/0x320 [jbd2]
[<ffffffffa0ac7048>] __ldiskfs_journal_stop+0x68/0xa0 [ldiskfs]
[<ffffffffa0c498f8>] osd_trans_stop+0xb8/0x290 [osd_ldiskfs]
[<ffffffffa089fb06>] ? seq_store_write+0xc6/0x2b0 [fid]
[<ffffffffa089f867>] seq_store_trans_stop+0x57/0xe0 [fid]
[<ffffffffa089fd8c>] seq_store_update+0x9c/0x1e0 [fid]
[<ffffffffa089e99a>] seq_server_alloc_meta+0x4aa/0x720 [fid]
[<ffffffffa0630800>] ? lustre_swab_lu_seq_range+0x0/0x30 [obdclass]
[<ffffffffa089efc8>] seq_query+0x3b8/0x680 [fid]
[<ffffffffa075e954>] ? lustre_msg_get_opc+0x94/0x100 [ptlrpc]
[<ffffffffa0be7e85>] mdt_handle_common+0x8d5/0x1810 [mdt]
[<ffffffffa075e954>] ? lustre_msg_get_opc+0x94/0x100 [ptlrpc]
[<ffffffffa0be8e35>] mdt_mdss_handle+0x15/0x20 [mdt]
[<ffffffffa076f42e>] ptlrpc_main+0xb7e/0x18f0 [ptlrpc]
[<ffffffffa076e8b0>] ? ptlrpc_main+0x0/0x18f0 [ptlrpc]
[<ffffffff8100c14a>] child_rip+0xa/0x20
[<ffffffffa076e8b0>] ? ptlrpc_main+0x0/0x18f0 [ptlrpc]
[<ffffffffa076e8b0>] ? ptlrpc_main+0x0/0x18f0 [ptlrpc]
[<ffffffff8100c140>] ? child_rip+0x0/0x20