Details
-
Task
-
Resolution: Unresolved
-
Major
-
Lustre 2.4.0
-
None
-
8169
Description
Lustre's proc seems to have had a number of regressions. LMT's ltop is no longer able to find many of the values it used to display.
In particular, brw_stats from obdfilter is gone, and does not appear to have been replaced after the OSD work. At minimum, that was used by ltop to report the number of bulk rpcs handled.
The MDS display is also missing a number of values.
We don't necessarily need to put them back exactly how they were before, but we need to export them in some way that will make them usable for folks.
It would be best to decide on interfaces before 2.4.0 is locked in.
Attachments
Issue Links
- duplicates
-
LU-3106 create symlinks in procfs from ofd to osd
-
- Resolved
-
- is related to
-
LU-3355 proc regression testing
-
- Resolved
-
- is related to
-
LU-2396 ofd brw_stats not maintained
-
- Resolved
-
-
LU-3296 fs/lustre/mdt/*/md_stats not showing any stats after remount
-
- Resolved
-
-
LU-2096 name ofd device type "ofd" instead of "obdfilter"
-
- Open
-
-
LU-4259 No brw_stats on ZFS
-
- Resolved
-
1.
|
No brw_stats on ZFS |
![]() |
Resolved | Nathaniel Clark |
Chris, does John's proposal for using the obdfilter "stats" data address your needs for LMT? This would be a way for LMT to get aggregate IO and per-client IO stats that works on both 2.1 and 2.4.
It would also be possible to create the brw_stats for ZFS with just the RPC ("page") information to start with, but based on your comment I don't know if this is what you want. It isn't clear to me if it will be possible to add the ZFS block IO stats later or not. Would just having the "page" information in ZFS brw_stats be useful? This would allow the admins to at least see whether the clients are submitting poorly formed RPCs. I don't think it would be too hard to do just that part.