[LU-11214] conf-sanity: test_91 failed: 'can't find 3529257c-20b1-c5b1-2d42-639643550592 10.9.5.228@tcp on OST' Created: 05/Aug/18  Updated: 13/Jun/19

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.13.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Lai Siyao <lai.siyao@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/59f5adec-983e-11e8-87f3-52540065bddc

trevis-40vm3: == rpc test complete, duration -o sec ================================================================ 21:54:24 (1533419664)
trevis-40vm3: trevis-40vm3.trevis.whamcloud.com: executing set_default_debug -1 all 4
CMD: trevis-40vm3 e2label /dev/mapper/ost1_flakey 				2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
CMD: trevis-40vm3 e2label /dev/mapper/ost1_flakey 				2>/dev/null | grep -E ':[a-zA-Z]{3}[0-9]{4}'
CMD: trevis-40vm3 e2label /dev/mapper/ost1_flakey 2>/dev/null
Started lustre-OST0000
mount lustre on /mnt/lustre.....
Starting client: trevis-40vm1.trevis.whamcloud.com:  -o user_xattr,flock trevis-40vm4@tcp:/lustre /mnt/lustre
CMD: trevis-40vm1.trevis.whamcloud.com mkdir -p /mnt/lustre
CMD: trevis-40vm1.trevis.whamcloud.com mount -t lustre -o user_xattr,flock trevis-40vm4@tcp:/lustre /mnt/lustre
CMD: trevis-40vm1.trevis.whamcloud.com cp /etc/passwd /mnt/lustre/a
CMD: trevis-40vm1.trevis.whamcloud.com rm /mnt/lustre/a
CMD: trevis-40vm1.trevis.whamcloud.com grep /mnt/lustre' ' /proc/mounts > /dev/null
setup single mount lustre success
list nids on mdt:
CMD: trevis-40vm4 /usr/sbin/lctl list_param mdt.lustre*.exports.*
mdt.lustre-MDT0000.exports.0@lo
mdt.lustre-MDT0000.exports.10.9.5.228@tcp
mdt.lustre-MDT0000.exports.10.9.5.230@tcp
mdt.lustre-MDT0000.exports.10.9.5.232@tcp
mdt.lustre-MDT0000.exports.clear
mdt.lustre-MDT0002.exports.0@lo
mdt.lustre-MDT0002.exports.10.9.5.228@tcp
mdt.lustre-MDT0002.exports.10.9.5.230@tcp
mdt.lustre-MDT0002.exports.10.9.5.232@tcp
mdt.lustre-MDT0002.exports.clear
uuid from 10\.9\.5\.228@tcp:
CMD: trevis-40vm4 /usr/sbin/lctl get_param mdt.lustre*.exports.'10\.9\.5\.228@tcp'.uuid
mdt.lustre-MDT0000.exports.10.9.5.228@tcp.uuid=
3529257c-20b1-c5b1-2d42-639643550592
mdt.lustre-MDT0002.exports.10.9.5.228@tcp.uuid=
3529257c-20b1-c5b1-2d42-639643550592
CMD: trevis-40vm4 /usr/sbin/lctl get_param mdt.lustre*.exports.'10\.9\.5\.228@tcp'.uuid
CMD: trevis-40vm3 /usr/sbin/lctl get_param obdfilter.lustre*.exports.'10\.9\.5\.228@tcp'.uuid
 conf-sanity test_91: @@@@@@ FAIL: can't find 3529257c-20b1-c5b1-2d42-639643550592 10\.9\.5\.228@tcp on OST 


 Comments   
Comment by James Nunez (Inactive) [ 19/Apr/19 ]

We've started seeing this test fail with this error message at a high rate starting on 19 April, 2019; failed eight times for review-dne and review-dne-zfs on the 19th.

Only four patches have landed to master in the past 24 hours:
LU-11213 uapi: reserve connect flag for plain layout
LU-10092 pcc: Reserve a new connection flag for PCC
LU-12021 lsom: Add an OBD_CONNECT2_LSOM connect flag
LU-12175 tests: Partial revert of LU-11636

Here are a few of the test failures:
https://testing.whamcloud.com/test_sets/4ab16cf0-6268-11e9-aeec-52540065bddc
https://testing.whamcloud.com/test_sets/a6ab75de-6270-11e9-8bb1-52540065bddc

Comment by Minh Diep [ 13/Jun/19 ]

+1 on b2_12 https://testing.whamcloud.com/test_sets/08455c0a-8d9c-11e9-9bb5-52540065bddc

Generated at Sat Feb 10 02:41:57 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.