[LU-9377] obdfilter-survey.sh test_3a case=network is not functional on setup with N oss servers Created: 21/Apr/17  Updated: 08/Sep/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Arshad Hussain Assignee: Emoly Liu
Resolution: Unresolved Votes: 0
Labels: None

Issue Links:
Duplicate
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

obdfilter-survey when case is 'network' is failing because multiple IP/hostname are supplied as input.

stdout.log
+ NETTYPE=tcp thrlo=8 nobjhi=1 thrhi=16 size=1024 case=network rslt_loc=/tmp targets="192.168.109.6 192.168.109.5" /usr/bin/obdfilter-survey
Permission denied, please try again.

Permission denied, please try again.


 Comments   
Comment by Gerrit Updater [ 21/Apr/17 ]

Arshad Hussain (arshad.hussain@seagate.com) uploaded a new patch: https://review.whamcloud.com/26768
Subject: LU-9377 tests: Fix obdfilter-survey/3a to take single IP
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 001d6baaae9695eda731a0fa86ccb368b3e6e1b2

Comment by Peter Jones [ 21/Apr/17 ]

Emoly

Could you please review this patch?

Thanks

Peter

Comment by Emoly Liu [ 24/Apr/17 ]

Actually, the "case='network'" does only support single IP/hostname.
If you want to support multiple IP/hostnames in test, you should improve some functions in files lustre-iokit/obdfilter-survey/{obdfilter-survey, iokit-libecho}, but not obdfilter-survey.sh.

BTW, I just tested your patch with two OSS nodes, it showed

[root@trevis-73vm7 tests]# mgs_HOST=trevis-73vm3 MGSDEV=/tmp/lustre-mgs MDSCOUNT=2 mds1_HOST=trevis-73vm3 mds2_HOST=trevis-73vm4 ost1_HOST=trevis-73vm5 ost2_HOST=trevis-73vm6 OSTCOUNT=2 PDSH="pdsh -S -Rrsh -w" ONLY=3 sh obdfilter-survey.sh
Logging to local directory: /tmp/test_logs/1493005947
excepting tests: 1b 2b
trevis-73vm7.trevis.hpdd.intel.com: Checking config lustre mounted on /mnt/lustre
Checking servers environments
Checking clients trevis-73vm7.trevis.hpdd.intel.com environments
Using TIMEOUT=20
disable quota as required
osd-ldiskfs.track_declares_assert=1
osd-ldiskfs.track_declares_assert=1
osd-ldiskfs.track_declares_assert=1
osd-ldiskfs.track_declares_assert=1
min kbytesavail: 309264 using size=251 MBytes per obd instance


== obdfilter-survey test 3a: Network survey ========================================================== 03:52:33 (1493005953)
Stopping clients: trevis-73vm7.trevis.hpdd.intel.com /mnt/lustre (opts:)
Stopping client trevis-73vm7.trevis.hpdd.intel.com /mnt/lustre opts:
Stopping clients: trevis-73vm7.trevis.hpdd.intel.com /mnt/lustre2 (opts:)
Stopping /mnt/mds1 (opts:-f) on trevis-73vm3
Stopping /mnt/mds2 (opts:-f) on trevis-73vm4
Stopping /mnt/ost1 (opts:-f) on trevis-73vm5
Stopping /mnt/ost2 (opts:-f) on trevis-73vm6
Stopping /mnt/mgs (opts:) on trevis-73vm3
modules unloaded.
+ NETTYPE=tcp thrlo=2 nobjhi=1 thrhi=4 size=251 case=network rslt_loc=/tmp targets="" /bin/obdfilter-survey

targets="", this is not correct.

Comment by Colin Faber [X] (Inactive) [ 07/Sep/21 ]

Hi, is this still active?

Comment by Arshad Hussain [ 08/Sep/21 ]

Hi Colin,

>Hi, is this still active?

Could not follow up on Emoly's review comment. This patch was not actively worked on.

The motivation of the patch was to accept single IP when parameter "case=network" is given and not fail. The failure seen was inability to correctly check "obdecho" module on server.

Currently, when multiple IP is provided, we "mess" up the parameters passed and end up having

192.168.50.95@192.168.50.95 root 'lsmod | grep obdecho'

instead of more correctly (I think so)

192.168.50.95 root 'lsmod | grep obdecho'

Based on Emoly's review. (this is where we stand today)
1. Current patch does not work

2. "case='network'" currently only support single IP/hostname. Therefore, easy fix would be to either not permit multiple ip when case=network. So it does not fail. (This is what this patch intended to do). Further, Update doc pdf.

3. Major improvement to make it accept multiple IP (improve iokit-libecho/setup_srv_obd), which this patch does not target.

Generated at Sat Feb 10 02:25:40 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.