[LU-9377] obdfilter-survey.sh test_3a case=network is not functional on setup with N oss servers Created: 21/Apr/17 Updated: 08/Sep/21 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | Arshad Hussain | Assignee: | Emoly Liu |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||
| Severity: | 3 | ||||
| Rank (Obsolete): | 9223372036854775807 | ||||
| Description |
|
obdfilter-survey fails with case=network because multiple IPs/hostnames are supplied as input.
stdout.log:
+ NETTYPE=tcp thrlo=8 nobjhi=1 thrhi=16 size=1024 case=network rslt_loc=/tmp targets="192.168.109.6 192.168.109.5" /usr/bin/obdfilter-survey
Permission denied, please try again.
Permission denied, please try again. |
| Comments |
| Comment by Gerrit Updater [ 21/Apr/17 ] |
|
Arshad Hussain (arshad.hussain@seagate.com) uploaded a new patch: https://review.whamcloud.com/26768 |
| Comment by Peter Jones [ 21/Apr/17 ] |
|
Emoly, could you please review this patch? Thanks, Peter |
| Comment by Emoly Liu [ 24/Apr/17 ] |
|
Actually, "case=network" supports only a single IP/hostname. BTW, I just tested your patch with two OSS nodes, and it showed:
[root@trevis-73vm7 tests]# mgs_HOST=trevis-73vm3 MGSDEV=/tmp/lustre-mgs MDSCOUNT=2 mds1_HOST=trevis-73vm3 mds2_HOST=trevis-73vm4 ost1_HOST=trevis-73vm5 ost2_HOST=trevis-73vm6 OSTCOUNT=2 PDSH="pdsh -S -Rrsh -w" ONLY=3 sh obdfilter-survey.sh
Logging to local directory: /tmp/test_logs/1493005947
excepting tests: 1b 2b
trevis-73vm7.trevis.hpdd.intel.com: Checking config lustre mounted on /mnt/lustre
Checking servers environments
Checking clients trevis-73vm7.trevis.hpdd.intel.com environments
Using TIMEOUT=20
disable quota as required
osd-ldiskfs.track_declares_assert=1
osd-ldiskfs.track_declares_assert=1
osd-ldiskfs.track_declares_assert=1
osd-ldiskfs.track_declares_assert=1
min kbytesavail: 309264
using size=251 MBytes per obd instance
== obdfilter-survey test 3a: Network survey ========================================================== 03:52:33 (1493005953)
Stopping clients: trevis-73vm7.trevis.hpdd.intel.com /mnt/lustre (opts:)
Stopping client trevis-73vm7.trevis.hpdd.intel.com /mnt/lustre opts:
Stopping clients: trevis-73vm7.trevis.hpdd.intel.com /mnt/lustre2 (opts:)
Stopping /mnt/mds1 (opts:-f) on trevis-73vm3
Stopping /mnt/mds2 (opts:-f) on trevis-73vm4
Stopping /mnt/ost1 (opts:-f) on trevis-73vm5
Stopping /mnt/ost2 (opts:-f) on trevis-73vm6
Stopping /mnt/mgs (opts:) on trevis-73vm3
modules unloaded.
+ NETTYPE=tcp thrlo=2 nobjhi=1 thrhi=4 size=251 case=network rslt_loc=/tmp targets="" /bin/obdfilter-survey
Here targets="", which is not correct. |
| Comment by Colin Faber [X] (Inactive) [ 07/Sep/21 ] |
|
Hi, is this still active? |
| Comment by Arshad Hussain [ 08/Sep/21 ] |
|
Hi Colin,
> Hi, is this still active?
I could not follow up on Emoly's review comment, and this patch was not actively worked on. The motivation for the patch was to accept a single IP when the parameter "case=network" is given, and not fail. The failure seen was an inability to correctly check for the "obdecho" module on the server. Currently, when multiple IPs are provided, we "mess" up the parameters passed and end up with
192.168.50.95@192.168.50.95 root 'lsmod | grep obdecho'
instead of the more correct (I think)
192.168.50.95 root 'lsmod | grep obdecho'
Based on Emoly's review, this is where we stand today:
2. "case=network" currently supports only a single IP/hostname. Therefore, the easy fix would be to not permit multiple IPs when case=network, so that it does not fail (this is what this patch intended to do). Further, update the doc PDF.
3. A major improvement would be to make it accept multiple IPs (improve iokit-libecho/setup_srv_obd), which this patch does not target. |
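The "easy fix" in item 2 could look roughly like the guard below. This is only a sketch under assumptions: the function name `check_network_targets` is hypothetical and does not exist in obdfilter-survey or iokit-libecho, and it assumes the target list is whitespace-separated, as in the `targets="192.168.109.6 192.168.109.5"` invocation from the description. It is not the content of the Gerrit patch.

```shell
#!/bin/bash
# Hypothetical helper (not part of lustre-iokit): refuse a target list
# that does not contain exactly one host when case=network.
check_network_targets() {
    local case_name=$1
    local targets=$2
    local -a hosts

    # Split on whitespace, the separator used in targets="a b".
    read -r -a hosts <<< "$targets"

    if [ "$case_name" = "network" ] && [ "${#hosts[@]}" -ne 1 ]; then
        echo "case=network supports exactly one target," \
             "got ${#hosts[@]}: '$targets'" >&2
        return 1
    fi
    return 0
}
```

With such a check, both the two-address invocation from the description and the empty `targets=""` seen in the review run would be rejected up front with a clear message, before any remote `lsmod | grep obdecho` check is attempted.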