[LU-10832] sanity-flr test_200: Timeout occurred after 69 mins Created: 21/Mar/18  Updated: 21/Mar/18  Resolved: 21/Mar/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Duplicate
duplicates LU-9845 ost-pools test_22 hangs with ‘WARNING... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for liuying <emoly.liu@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/db7af22c-2c6a-11e8-b3c6-52540065bddc

test_200 failed with the following error:
on client

CMD: trevis-66vm5.trevis.hpdd.intel.com grep -c /mnt/lustre2' ' /proc/mounts
Stopping client trevis-66vm5.trevis.hpdd.intel.com /mnt/lustre2 (opts:)
CMD: trevis-66vm5.trevis.hpdd.intel.com lsof -t /mnt/lustre2
CMD: trevis-66vm5.trevis.hpdd.intel.com umount  /mnt/lustre2 2>&1
10.9.6.215@tcp:/lustre /mnt/lustre3 lustre rw,flock,user_xattr,lazystatfs 0 0
CMD: trevis-66vm5.trevis.hpdd.intel.com grep -c /mnt/lustre3' ' /proc/mounts
Stopping client trevis-66vm5.trevis.hpdd.intel.com /mnt/lustre3 (opts:)
CMD: trevis-66vm5.trevis.hpdd.intel.com lsof -t /mnt/lustre3
CMD: trevis-66vm5.trevis.hpdd.intel.com umount  /mnt/lustre3 2>&1

on MDS

Mar 20 16:43:39 trevis-66vm8 kernel: Lustre: *** cfs_fail_loc=1a03, val=0***
Mar 20 16:43:47 trevis-66vm8 kernel: Lustre: *** cfs_fail_loc=1a03, val=0***
Mar 20 16:43:53 trevis-66vm8 kernel: Lustre: *** cfs_fail_loc=1a03, val=0***
Mar 20 16:43:59 trevis-66vm8 kernel: Lustre: *** cfs_fail_loc=1a03, val=0***
Mar 20 16:44:04 trevis-66vm8 kernel: Lustre: *** cfs_fail_loc=1a03, val=0***
Mar 20 16:44:15 trevis-66vm8 kernel: Lustre: *** cfs_fail_loc=1a03, val=0***
Mar 20 16:44:15 trevis-66vm8 kernel: Lustre: Skipped 1 previous similar message
�������������������������������������������������������������������������������������������������������������������������������������������������������������
Mar 20 16:47:31 trevis-66vm8 kernel: Initializing cgroup subsys cpuset

<<Please provide additional information about the failure here>>

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity-flr test_200 - Timeout occurred after 69 mins, last suite running was sanity-flr, restarting cluster to continue tests



 Comments   
Comment by Jian Yu [ 21/Mar/18 ]

Console log on MDS showed that:

Kernel panic - not syncing: Pool 'lustre-mdt1' has encountered an uncorrectable I/O failure and the failure mode property for this pool is set to panic.

This is a duplicate of LU-9845.

Generated at Sat Feb 10 02:38:35 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.