[LU-852] Test failure on test suite conf-sanity, subtest test_53b Created: 16/Nov/11  Updated: 23/Feb/12  Resolved: 02/Feb/12

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Maloo Assignee: Liang Zhen (Inactive)
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-1048 Test failure on test suite conf-sanity Resolved
Severity: 3
Bugzilla ID: 1,048
Rank (Obsolete): 5210

 Description   

This issue was created by maloo for Chris Gearing <chris@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/cfce0b7e-0f45-11e1-9051-52540025f9af.

The sub-test test_53b failed with the following error:

$'Assertion 23 failed: (($tstarted && $tmin && $tmax)) (expanded: ((0 && 0 && 0)))
Insane MDT thread counts (PDSH problems?)'

Info required for matching: conf-sanity 53b



 Comments   
Comment by Di Wang [ 21/Nov/11 ]

This is the error running with my patch. And I will fix it in my patch.

Comment by Di Wang [ 21/Nov/11 ]

Will fix it in lu-718

Comment by Andreas Dilger [ 27/Jan/12 ]

Di, since the LU-718 patch will likely not be landed to master for 2.2, since it is more like a feature than a fix at this stage, is it still possible to fix this problem with a smaller patch?

This failure is being hit via autotest:

https://maloo.whamcloud.com/test_sets/8029de58-48b2-11e1-9c04-5254004bbbd3
https://maloo.whamcloud.com/test_sets/8c925a50-48d3-11e1-9c04-5254004bbbd3

Comment by Di Wang [ 27/Jan/12 ]

Andreas

I just checked the log carefully, and this seems a different problem.

The reason of this failure is that MDT can not be start, which seems related with Liang's pdir patch. I will reassign this to Liang.

17:36:06:kmem_cache_create: duplicate cache dynlock_cache
17:36:07:Pid: 10192, comm: modprobe Not tainted 2.6.32-220.el6_lustre.x86_64 #1
17:36:08:Call Trace:
17:36:08: [<ffffffff81161da8>] ? kmem_cache_create+0x538/0x5a0
17:36:08: [<ffffffff81161cc6>] ? kmem_cache_create+0x456/0x5a0
17:36:09: [<ffffffffa06e4eb0>] ? init_once+0x0/0x90 [ldiskfs]
17:36:09: [<ffffffffa046c30d>] ? dynlock_cache_init+0x1f/0x42 [ldiskfs]
17:36:09: [<ffffffffa046c2af>] ? init_ldiskfs_fs+0x1d7/0x1e2 [ldiskfs]
17:36:10: [<ffffffffa046c0d8>] ? init_ldiskfs_fs+0x0/0x1e2 [ldiskfs]
17:36:12: [<ffffffff8100204c>] ? do_one_initcall+0x3c/0x1d0
17:36:12: [<ffffffff810af641>] ? sys_init_module+0xe1/0x250
17:36:12: [<ffffffff8100b0f2>] ? system_call_fastpath+0x16/0x1b
17:36:13:Not able to create dynlock cache
17:36:14:LustreError: 10167:0:(obd_mount.c:1445:server_kernel_mount()) premount /dev/mapper/lvm--MDS-P0:0x0 ldiskfs failed: -19 Is the ldiskfs module available?
17:36:14:LustreError: 10167:0:(obd_mount.c:1770:server_fill_super()) Unable to mount device /dev/mapper/lvm--MDS-P0: -19
17:36:15:LustreError: 10167:0:(obd_mount.c:2316:lustre_fill_super()) Unable to mount (-19)
17:36:15:Lustre: DEBUG MARKER: conf-sanity test_53b: @@@@@@ FAIL: MDT start failed

Comment by Di Wang [ 02/Feb/12 ]

It seems this is a duplicate of LU-1048, not by pdir ops patch. Sorry.

Generated at Sat Feb 10 01:11:01 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.