[LU-1996] Fine-grained job activity tracking using changelogs Created: 20/Sep/12  Updated: 15/Dec/21  Resolved: 22/Nov/14

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: Lustre 2.7.0

Type: New Feature Priority: Minor
Reporter: Henri Doreau (Inactive) Assignee: Niu Yawei (Inactive)
Resolution: Fixed Votes: 0
Labels: patch

Issue Links:
Related
is related to LU-5862 Jobstats tracking in changelogs doesn... Resolved
is related to LU-5631 Incorrect swabbing of struct llog_rec... Resolved
is related to LU-15372 add projid to changelog Open
Rank (Obsolete): 9063

 Description   

We propose a patch (gerrit will follow) to add a jobid field to the changelogs for job activity tracking. Unlike jobstats, changelog allow fine grained tracking as they contain the FIDs and the events.

Note that this patch introduces a new jobid field (32chars long) to the changelog record structures.



 Comments   
Comment by Henri Doreau (Inactive) [ 20/Sep/12 ]

Patch is at: http://review.whamcloud.com/#change,4060

Comment by Peter Jones [ 21/Sep/12 ]

Jodi please can someone review this as part of the HSM efforts

Comment by Henri Doreau (Inactive) [ 21/Sep/12 ]

I didn't mention it but actually this patch is not related to the HSM project.

Comment by Jodi Levi (Inactive) [ 21/Sep/12 ]

This seems to be related to the JobStats work that you did. Would you be able to take this one as well?

Comment by Henri Doreau (Inactive) [ 13/Mar/13 ]

I got no feedback on this patch so far.
Any update on this?

Comment by Andreas Dilger [ 11/Jul/13 ]

Presumably this patch would break compatibility with the existing ChangeLog on-disk and in-memory record format. Is there any attempt made at interoperability for old applications (e.g. some kind of flag on open to indicate if the new or old changelog structure is in use)? Looking at the patch appears that there is only one-way compatibility (i.e. new clients/apps with old records).

Comment by Henri Doreau (Inactive) [ 12/Jul/13 ]

Right, we did not focus on the second case. I'll update the patch with what you proposed on gerrit: opening the changelog w/ or w/o CHANGELOG_FLAG_JOBID determines whether the records get delivered w/ the jobid field.

Comment by Henri Doreau (Inactive) [ 12/Jul/13 ]

What is the LLAPI compatibility policy? Shall I preserve binary compatibility? If so, I'd store jobid in cr_name (the variable length field), even though cr_name already contains name and sname...

What do you think?

Comment by Andreas Dilger [ 14/Jul/13 ]

I'd rather not hack around with the changelog_ext_rec structure, which we'll have to carry around for a long time in the future. It should be possible to introduce a new changelog_rec_v2 or similar, and possibly a new CHANGELOG_REC_V2 type to distinguish it. We don't have to keep compatibility for the old record type forever, since Changelogs are relatively infrequently used, and it is possible to start printing a warning with 2.5 when opening the changelog without CHANGELOG_FLAG_JOBID, but we need to keep the compatibility around for at least a few releases.

Comment by John Fuchs-Chesney (Inactive) [ 08/Mar/14 ]

Henri,
Is there likely to be any further movement on this issue?
If not, may I mark it as resolved?
Thanks,
~ jfc.

Comment by Thomas LEIBOVICI - CEA (Inactive) [ 10/Mar/14 ]

Henri is still working on this feature
and we still want to have it in Lustre.
Please keep it opened.
Thanks,

  • Thomas
Comment by Michael MacDonald (Inactive) [ 04/Nov/14 ]

FYI, I believe I have found a bug introduced by this work. I am tracking it at LU-5862. I haven't dug into the cause yet, but when jobid_var is set to 'disable' (the default), the changelog records emitted by "lfs changelog" have the filename in the jobid field.

Comment by Andreas Dilger [ 22/Nov/14 ]

I'm closing this bug, since the feature patch to enable the jobid in ChangeLogs has landed. There is a bug in the patch that is being tracked via LU-5862.

Generated at Sat Feb 10 01:21:29 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.