Details
-
Bug
-
Resolution: Incomplete
-
Minor
-
None
-
IEEL 3.0 Lustre client: lustre 2.7.16.10
CentOS 7.2, kernel 3.10.0-327.36.2.el7.x86_64
Lemur
Interconnect: Intel Omni-Path
-
3
-
Description
1) I created 30k files of 10 KB each using dd. The lhsmd POSIX plugin is running in debug mode so that I can monitor the progress of the archive job.
2) When I run "lhsm archive *.bin" in the directory containing the 30k files, the debug logs show ALERTs saying that some handlers were unable to find files, although the files exist. Archival of other files by other handlers still proceeds.
3) At the end of the archive run, the agent counters on the MDT show that not all 30k files were archived:
lctl get_param -n mdt.*.hsm.agents
uuid=f9ee32b4-d8fa-821d-e19c-9b0700d1e276 archive_id=ANY requests=[current:0 ok:4241 *errors:25759*]
4) However, a set of 15k files of the same size was archived successfully. Runs from 30k up to 1M files end in errors, and our test case requires successful archival of 1M files. Counters for the successful 15k run:
lctl get_param -n mdt.*.hsm.agents
uuid=7fb34125-b8fd-bbc4-1632-007ceaa3df78 archive_id=ANY requests=[current:0 ok:15000 errors:0]
5) Attachments
a) agent conf file
b) lhsm posix conf file
c) Alerts seen in the Lemur archival logs
d) Lemur rpms installed
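
To compare the agent counters from steps 3 and 4 across runs, a small parser along these lines may help. It is a minimal sketch and not part of Lemur or Lustre: the names parse_agent_line and summarize are hypothetical, the line format is assumed from the two samples in this report only, and the asterisks around errors:25759 are assumed to be emphasis markup rather than real lctl output.

```python
import re

# Format inferred from the two sample lines in this report (an assumption,
# not a documented Lustre output format).
AGENT_RE = re.compile(
    r"uuid=(?P<uuid>\S+)\s+archive_id=(?P<archive_id>\S+)\s+"
    r"requests=\[current:(?P<current>\d+)\s+ok:(?P<ok>\d+)\s+errors:(?P<errors>\d+)\]"
)


def parse_agent_line(line):
    """Parse one line of `lctl get_param -n mdt.*.hsm.agents` output."""
    # Strip '*' characters, assumed here to be ticket emphasis markup.
    match = AGENT_RE.search(line.replace("*", ""))
    if match is None:
        raise ValueError("unrecognized agent line: %r" % line)
    fields = match.groupdict()
    return {
        "uuid": fields["uuid"],
        "archive_id": fields["archive_id"],
        "current": int(fields["current"]),
        "ok": int(fields["ok"]),
        "errors": int(fields["errors"]),
    }


def summarize(line, expected):
    """Compare the counters against the number of files submitted."""
    agent = parse_agent_line(line)
    finished = agent["ok"] + agent["errors"]
    return {
        "ok": agent["ok"],
        "errors": agent["errors"],
        # Requests neither counted as ok nor as errors, nor still running.
        "unaccounted": expected - finished - agent["current"],
        "error_rate": agent["errors"] / expected if expected else 0.0,
    }


if __name__ == "__main__":
    run_30k = ("uuid=f9ee32b4-d8fa-821d-e19c-9b0700d1e276 archive_id=ANY "
               "requests=[current:0 ok:4241 *errors:25759*]")
    run_15k = ("uuid=7fb34125-b8fd-bbc4-1632-007ceaa3df78 archive_id=ANY "
               "requests=[current:0 ok:15000 errors:0]")
    print(summarize(run_30k, 30000))
    print(summarize(run_15k, 15000))
```

For the 30k run, ok and errors add up to 30000 with current at 0, so every request reached the agent and finished; the failures are reported errors, not lost or pending requests.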
Please let us know if you need any more information.
Thanks