[LU-3439] User code creating multiple lockfiles with same name Created: 05/Jun/13  Updated: 10/Jun/13  Resolved: 10/Jun/13

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.5
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Karl W Schulz (Inactive) Assignee: Lai Siyao
Resolution: Duplicate Votes: 0
Labels: None
Environment:

Lustre: 2.1.5
OFED: 1.5.4.1
Kernel: 2.6.32-279.el6.x86_64


Attachments: File multiple_lockfile.tar.gz    
Issue Links:
Related
is related to LU-2901 Duplicate filename on the same ldiskf... Resolved
Severity: 3
Rank (Obsolete): 8568

 Description   

Not sure if this a known issue already, but we have a user who noticed they can create identical filenames within a single directory via their home-grown locking mechanism. We have worked with them to create a small reproducer which shows the issue on 6 Lustre clients (96 MPI tasks). The net effect of running this test is creation of duplicate filenames:

{{c558-801$ ls -l lockdir/
total 12
---------- 1 karl G-800747 52 Jun 5 08:39 Lockfile.lck
---------- 1 karl G-800747 52 Jun 5 08:39 Lockfile.lck
---------- 1 karl G-800747 52 Jun 5 08:39 Lockfile.lck}}

Tarball attached with user code and example output from our Stampede environment. We can confirm that we do not get the repeat filenames when running the reproducer on Lustre 1.8.6

Thanks.



 Comments   
Comment by Kit Westneat (Inactive) [ 05/Jun/13 ]

This looks like LU-2901, we have hit this recently as well but didn't have a reproducer.

Comment by Peter Jones [ 05/Jun/13 ]

Lai

Could you please confirm whether this is a duplicate of LU-2901? If so, are you now able to reproduce this issue and move forward with a fix?

Thanks

Peter

Comment by Lai Siyao [ 06/Jun/13 ]

Hi Kit, could you tell me which file has the duplicate name in the testlog in your tarball? BTW, the reproducer program doesn't close file, which is strange. And is there a way to know when it should stop and quit? I haven't been able to reproduce yet.

Comment by Jodi Levi (Inactive) [ 10/Jun/13 ]

Duplicate of LU-2901

Generated at Sat Feb 10 01:33:55 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.