[LU-2494] error: get_param: /proc/{fs,sys}/{lnet,lustre}/obdfilter/*/mds_sync: Found no match Created: 14/Dec/12  Updated: 19/Apr/13  Resolved: 11/Feb/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: Lustre 2.4.0

Type: Bug Priority: Blocker
Reporter: Jian Yu Assignee: Bob Glossman (Inactive)
Resolution: Fixed Votes: 0
Labels: HB
Environment:

Lustre Branch: master


Issue Links:
Related
is related to LU-2620 Failure on test suite replay-ost-sing... Resolved
Severity: 3
Rank (Obsolete): 5849

 Description   

The following error message occurred in many tests which call wait_mds_ost_sync:

CMD: fat-intel-3vm8 /usr/sbin/lctl get_param -n obdfilter.*.mds_sync
fat-intel-3vm8: error: get_param: /proc/{fs,sys}/{lnet,lustre}/obdfilter/*/mds_sync: Found no match

https://maloo.whamcloud.com/test_sets/b9a24768-44a8-11e2-8c8b-52540035b04c
https://maloo.whamcloud.com/test_sets/5419a010-44aa-11e2-8c8b-52540035b04c
https://maloo.whamcloud.com/test_sets/c923effc-44ad-11e2-8c8b-52540035b04c
https://maloo.whamcloud.com/test_sets/34f7f994-44ae-11e2-8c8b-52540035b04c
https://maloo.whamcloud.com/test_sets/d0434e5c-44af-11e2-8c8b-52540035b04c



 Comments   
Comment by Peter Jones [ 20/Dec/12 ]

Bob

Could you please look into this one?

Thanks

Peter

Comment by Bob Glossman (Inactive) [ 20/Dec/12 ]

This /proc var seems to have been entirely done away with in the conversion to osd. In previous revs there was an entry in lprocfs_filter_obd_vars[]

        { "mds_sync",     lprocfs_filter_rd_mds_sync, 0, 0},

This read out the value of obd->u.filter.fo_mds_ost_sync

In current code it looks like obd->u.filter.fo_mds_ost_sync still exists, but nobody ever sets or clears it.

There is no "mds_sync" entry in the closest equivalent I can find to lprocfs_filter_obd_vars[], namely lprocfs_osd_obd_vars[].

Is there some other proc var now someplace else in /proc that fetches similar info from some new layer? The function seems to be lost from obdfilter entirely.

Comment by Peter Jones [ 21/Dec/12 ]

Alex, could you please comment on this? Thanks, Peter

Comment by Alex Zhuravlev [ 21/Dec/12 ]

Right, because the OSS does not drive recovery any more. It's all done on the MDS, and from the OSS point of view this is a regular request.
So, instead of looking for signs on the OSS side, we should wait for all OSPs to report old_sync_processed=1.
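
As a rough illustration of the check described above (not the actual test-framework.sh change), a polling loop might look like the sketch below. The osp.*.old_sync_processed glob, the LCTL variable, and both function names are assumptions made for the example, and the sketch assumes it runs directly on the MDS node.

```shell
# Sketch only: the parameter glob and the function names are assumptions.
LCTL=${LCTL:-lctl}

# Succeed only if at least one OSP reports a value and every value is 1.
all_osps_synced() {
	local values value
	values=$($LCTL get_param -n "osp.*.old_sync_processed" 2>/dev/null) || return 1
	[ -n "$values" ] || return 1
	for value in $values; do
		[ "$value" = 1 ] || return 1
	done
	return 0
}

# Poll once per second until all OSPs are synced or the timeout
# (in seconds, default 60) expires.
wait_osps_synced() {
	local timeout waited
	timeout=${1:-60}
	waited=0
	until all_osps_synced; do
		[ "$waited" -lt "$timeout" ] || return 1
		sleep 1
		waited=$((waited + 1))
	done
}
```

Treating an empty result as "not synced" is a deliberate choice here, so that a missing parameter is never mistaken for a completed sync.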

Comment by Bob Glossman (Inactive) [ 21/Dec/12 ]

change framework to look at old_sync_processed
http://review.whamcloud.com/4885

Comment by Andreas Dilger [ 14/Jan/13 ]

Bob,
while it is good to fix test-framework.sh to check for the new parameter, this parameter does not exist on older servers. Since test-framework.sh runs from the client (which may be newer or older than the server), it needs to be able to work with the old-style /proc parameters as well.

Could you please make another patch that adds a check for the old mds_sync file if old_sync_processed does not exist?
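
A minimal sketch of the requested fallback, assuming the presence of the new parameter can be probed with lctl list_param. The osp.* glob, the LCTL variable, and the function name are illustrative rather than taken from the patch, and the "0 means done" reading of mds_sync is an assumption. Note that the new parameter lives on the MDS while the old one lives on the OSS, so a real implementation would have to run each command on the appropriate node.

```shell
# Sketch only: the globs and the function name are assumptions.
LCTL=${LCTL:-lctl}

# Print "<parameter glob> <value meaning done>" for whichever style
# of parameter the server exposes.
pick_sync_check() {
	if $LCTL list_param "osp.*.old_sync_processed" >/dev/null 2>&1; then
		# New style: 1 means the OSP has processed its old sync records.
		echo "osp.*.old_sync_processed 1"
	else
		# Old style (assumed semantics): 0 means no MDS-OST sync in progress.
		echo "obdfilter.*.mds_sync 0"
	fi
}
```

A caller could then poll $LCTL get_param -n on the returned glob until every reported value matches the second field.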

Comment by Bob Glossman (Inactive) [ 14/Jan/13 ]

interop support added
http://review.whamcloud.com/5021

Comment by Jodi Levi (Inactive) [ 11/Feb/13 ]

Patch landed to master.
