Details
-
Bug
-
Resolution: Unresolved
-
Minor
-
None
-
None
-
None
-
3
-
9223372036854775807
Description
This issue was created by maloo for eaujames <eaujames@ddn.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/5539a96a-504f-43a2-ba1c-4726aac3881d
test_210 failed with the following error:
Expect 1 NIDs found:
Test session details:
clients: https://build.whamcloud.com/job/lustre-reviews/96624 - 4.18.0-425.10.1.el8_7.aarch64
servers: https://build.whamcloud.com/job/lustre-reviews/96624 - 4.18.0-477.15.1.el8_lustre.x86_64
Added drop rule 255.255.255.255@tcp->255.255.255.255@tcp (1/1)
Added drop rule 255.255.255.255@tcp1->255.255.255.255@tcp1 (1/1)
/usr/sbin/lnetctl discover 10.240.44.222@tcp
manage:
- discover:
errno: -1
descr: failed to discover 10.240.44.222@tcp: Input/output error
Check "-l" recovery queue
local NI recovery:
nid-0: 10.240.44.222@tcp
Check ping counts:
- nid: 0@lo
health value: 0
ping_count: 0
next_ping: 0
- nid: 10.240.44.222@tcp
health value: 900
ping_count: 2
next_ping: 19862
- nid: 10.240.44.222@tcp1
health value: 1000
ping_count: 0
next_ping: 0
Expect ping count "2" found "2"
Check "-l" recovery queue
sanity-lnet test_210: @@@@@@ FAIL: Expect 1 NIDs found: ""
Client dmesg:
[19842.009754] LNet: Added LNI 10.240.44.222@tcp [8/256/0/180] [19842.014821] LNet: Accept all, port 7988 [19842.228135] Lustre: DEBUG MARKER: /usr/sbin/lnetctl net add --net tcp1 --if eth0 [19842.240355] LNet: Added LNI 10.240.44.222@tcp1 [8/256/0/180] [19842.498271] Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover 10.240.44.222@tcp [19842.527407] Lustre: DEBUG MARKER: /usr/sbin/lnetctl set recovery_limit 10 [19842.562264] Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover 10.240.44.222@tcp [19842.573344] LNet: There was an unexpected network error while writing to 10.240.44.222: rc = -22 [19843.062883] LNet: 1 local NIs in recovery (showing 1): 10.240.44.222@tcp [19844.103514] LNet: There was an unexpected network error while writing to 10.240.44.222: rc = -22 [19846.182996] LNet: There was an unexpected network error while writing to 10.240.44.222: rc = -22 [19850.343220] LNet: There was an unexpected network error while writing to 10.240.44.222: rc = -22 [19852.707980] Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet unconfigure [19852.717866] LNet: 1066429:0:(lib-ptl.c:956:lnet_clear_lazy_portal()) Active lazy portal 0 on exit [19852.724243] LNet: Removed LNI 10.240.44.222@tcp [19853.782995] LNet: Removed LNI 10.240.44.222@tcp1 [19853.793794] Lustre: DEBUG MARKER: /usr/sbin/lnetctl lnet configure [19853.817743] Lustre: DEBUG MARKER: /usr/sbin/lnetctl net add --net tcp --if eth0 [19853.830270] LNet: Added LNI 10.240.44.222@tcp [8/256/0/180] [19853.834781] LNet: Accept all, port 7988 [19854.034172] Lustre: DEBUG MARKER: /usr/sbin/lnetctl net add --net tcp1 --if eth0 [19854.045933] LNet: Added LNI 10.240.44.222@tcp1 [8/256/0/180] [19854.282258] Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover 10.240.44.222@tcp [19854.300824] Lustre: DEBUG MARKER: /usr/sbin/lnetctl set recovery_limit 0 [19854.325394] Lustre: DEBUG MARKER: /usr/sbin/lnetctl set max_recovery_ping_interval 4 [19854.358236] Lustre: DEBUG MARKER: /usr/sbin/lnetctl discover 10.240.44.222@tcp [19854.368588] LNet: There was an unexpected network error while writing to 10.240.44.222: rc = -22 [19868.268929] Lustre: DEBUG MARKER: /usr/sbin/lctl mark sanity-lnet test_210: @@@@@@ FAIL: Expect 1 NIDs found: [19869.153720] Lustre: DEBUG MARKER: sanity-lnet test_210: @@@@@@ FAIL: Expect 1 NIDs found:
VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity-lnet test_210 - Expect 1 NIDs found: