Details
- Bug
- Resolution: Duplicate
- Major
- None
- Lustre 2.2.0, Lustre 2.3.0, Lustre 2.1.3, Lustre 2.5.0
- None
- 3
- 4394
Description
This issue was created by maloo for yujian <yujian@whamcloud.com>
This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/6fac1700-0e30-11e2-91a3-52540035b04c.
Lustre Client Build: http://build.whamcloud.com/job/lustre-b2_3/28
Lustre Server Build: http://build.whamcloud.com/job/lustre-b2_2/17
Distro/Arch: RHEL6.3/x86_64
The sub-test test_41b failed with the following error:
Starting mds1: -o user_xattr,acl -o nomgs,force /dev/lvm-MDS/P1 /mnt/mds1
CMD: client-29vm7 mkdir -p /mnt/mds1; mount -t lustre -o user_xattr,acl -o nomgs,force /dev/lvm-MDS/P1 /mnt/mds1
test failed to respond and timed out
Info required for matching: conf-sanity 41b
Console log on MDS showed that:
11:22:49:Lustre: DEBUG MARKER: mkdir -p /mnt/mds1; mount -t lustre -o user_xattr,acl -o nomgs,force /dev/lvm-MDS/P1 /mnt/mds1
11:22:50:LustreError: 166-1: MGC10.10.4.178@tcp: Connection to service MGS via nid 0@lo was lost; in progress operations using this service will fail.
11:22:50:Lustre: 3618:0:(ldlm_lib.c:633:target_handle_reconnect()) MGS: 4172abe8-fa59-d3f9-325d-4acb9d3d67d0 reconnecting
11:22:50:LustreError: 3618:0:(obd_class.h:521:obd_set_info_async()) obd_set_info_async: dev 0 no operation
11:22:50:LustreError: 3619:0:(ldlm_lock.c:818:ldlm_lock_decref_and_cancel()) ASSERTION( lock != ((void *)0) ) failed:
11:22:50:LustreError: 3619:0:(ldlm_lock.c:818:ldlm_lock_decref_and_cancel()) LBUG
11:22:50:Pid: 3619, comm: ll_mgs_02
11:22:50:
11:22:50:Call Trace:
11:22:50: [<ffffffffa04cb835>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]
11:22:50: [<ffffffffa04cbd67>] lbug_with_loc+0x47/0xb0 [libcfs]
11:22:50: [<ffffffffa071d2b1>] ldlm_lock_decref_and_cancel+0x111/0x120 [ptlrpc]
11:22:50: [<ffffffffa0ae95ab>] mgs_completion_ast_config+0xfb/0x110 [mgs]
11:22:50: [<ffffffffa0734540>] ldlm_cli_enqueue_local+0x1f0/0x4d0 [ptlrpc]
11:22:50: [<ffffffffa0ae94b0>] ? mgs_completion_ast_config+0x0/0x110 [mgs]
11:22:50: [<ffffffffa0733670>] ? ldlm_blocking_ast+0x0/0x130 [ptlrpc]
11:22:50: [<ffffffffa0ae92ac>] mgs_revoke_lock+0x13c/0x230 [mgs]
11:22:50: [<ffffffffa0733670>] ? ldlm_blocking_ast+0x0/0x130 [ptlrpc]
11:22:50: [<ffffffffa0ae94b0>] ? mgs_completion_ast_config+0x0/0x110 [mgs]
11:22:50: [<ffffffffa04d54f1>] ? libcfs_debug_msg+0x41/0x50 [libcfs]
11:22:50: [<ffffffffa0aebc8f>] mgs_handle+0xf0f/0x1820 [mgs]
11:22:50: [<ffffffffa0763011>] ptlrpc_server_handle_request+0x3c1/0xcb0 [ptlrpc]
11:22:50: [<ffffffffa04cc3ee>] ? cfs_timer_arm+0xe/0x10 [libcfs]
11:22:50: [<ffffffffa04d6e19>] ? lc_watchdog_touch+0x79/0x110 [libcfs]
11:22:50: [<ffffffffa075d0e2>] ? ptlrpc_wait_event+0xb2/0x2c0 [ptlrpc]
11:22:50: [<ffffffff8105e7f0>] ? default_wake_function+0x0/0x20
11:22:50: [<ffffffffa076401f>] ptlrpc_main+0x71f/0x1210 [ptlrpc]
11:22:50: [<ffffffffa0763900>] ? ptlrpc_main+0x0/0x1210 [ptlrpc]
11:22:50: [<ffffffff8100c14a>] child_rip+0xa/0x20
11:22:50: [<ffffffffa0763900>] ? ptlrpc_main+0x0/0x1210 [ptlrpc]
11:22:50: [<ffffffffa0763900>] ? ptlrpc_main+0x0/0x1210 [ptlrpc]
11:22:50: [<ffffffff8100c140>] ? child_rip+0x0/0x20
11:22:50:
11:22:50:Kernel panic - not syncing: LBUG
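For readers triaging similar console logs, the call chain leading to the LBUG can be pulled out of the trace mechanically. The following is only a minimal sketch and not part of the original report or of any Lustre tooling: the names `FRAME_RE` and `reliable_frames` are hypothetical, and the pattern assumes the frame layout seen in the log above (`[<address>] function+0xoff/0xsize [module]`, with an optional `?` marking frames the kernel printed as uncertain).

```python
import re

# Hypothetical helper, assuming the frame layout shown in the console log:
#   [<ffffffffa071d2b1>] ldlm_lock_decref_and_cancel+0x111/0x120 [ptlrpc]
# A "?" between the address and the function name marks an uncertain frame.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+(?P<uncertain>\?\s+)?"
    r"(?P<func>[A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def reliable_frames(log_text):
    """Return (function, module) pairs for frames not marked with '?'.

    module is None when the frame has no trailing [module] tag,
    as with core kernel symbols such as child_rip.
    """
    frames = []
    for match in FRAME_RE.finditer(log_text):
        if match.group("uncertain") is None:
            frames.append((match.group("func"), match.group("module")))
    return frames
```

Applied to the trace in this report, the frames that remain after filtering would run from `libcfs_debug_dumpstack` down through `ldlm_lock_decref_and_cancel`, `mgs_completion_ast_config`, `ldlm_cli_enqueue_local`, `mgs_revoke_lock` and `mgs_handle`, which is the path on which the `lock != NULL` assertion fired.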
Attachments
Issue Links
- is related to
- LU-3647 HSM _not only_ small fixes and to do list goes here
- Closed