[LU-10880] sanity-hsm test_40: mcmd: connect failed: No route to host Created: 04/Apr/18  Updated: 12/Aug/22  Resolved: 12/Aug/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Bob Glossman <bob.glossman@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/a75e6948-3853-11e8-b45c-52540065bddc

test_40 failed with the following error:

test_40 returned 254

lots of "mcmd: connect failed: No route to host" in the test log
I think this could be a DCO issue, not a lustre bug

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity-hsm test_40 - test_40 returned 254



 Comments   
Comment by Bob Glossman (Inactive) [ 04/Apr/18 ]

maybe not a DCO issue after all. console log for MDS shows this kernel panic:

[15481.021061] Kernel panic - not syncing: Pool 'lustre-mdt1' has encountered an uncorrectable I/O failure and the failure mode property for this pool is set to panic.
[15481.022037] CPU: 1 PID: 7200 Comm: mmp Tainted: P           OE  ------------   3.10.0-693.21.1.el7_lustre.x86_64 #1
[15481.022037] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
[15481.022037] Call Trace:
[15481.022037]  [<ffffffff816ae7c8>] dump_stack+0x19/0x1b
[15481.022037]  [<ffffffff816a8634>] panic+0xe8/0x21f
[15481.022037]  [<ffffffffc08c4256>] zio_suspend+0x106/0x110 [zfs]
[15481.022037]  [<ffffffffc084adda>] mmp_thread+0x70a/0x760 [zfs]
[15481.022037]  [<ffffffffc084a560>] ? mmp_random_leaf+0xb0/0xb0 [zfs]
[15481.022037]  [<ffffffffc084a6d0>] ? mmp_write_done+0x170/0x170 [zfs]
[15481.022037]  [<ffffffffc0714fc3>] thread_generic_wrapper+0x73/0x80 [spl]
[15481.022037]  [<ffffffffc0714f50>] ? __thread_exit+0x20/0x20 [spl]
[15481.022037]  [<ffffffff810b4031>] kthread+0xd1/0xe0
[15481.022037]  [<ffffffff810b3f60>] ? insert_kthread_work+0x40/0x40
[15481.022037]  [<ffffffff816c0577>] ret_from_fork+0x77/0xb0
[15481.022037]  [<ffffffff810b3f60>] ? insert_kthread_work+0x40/0x40
[15481.022037] Kernel Offset: disabled
Comment by James Nunez (Inactive) [ 05/Apr/18 ]

Is this a duplicate of LU-9845?

Generated at Sat Feb 10 02:38:59 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.