[LU-13578] sanity test_39r: atime on client != ost Created: 18/May/20  Updated: 19/Dec/23  Resolved: 21/May/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.14.0
Fix Version/s: Lustre 2.14.0, Lustre 2.15.0

Type: Bug Priority: Minor
Reporter: Maloo Assignee: John Hammond
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Duplicate
is duplicated by LU-16421 sanity-flr test_61a: atime: old '1670... Resolved
is duplicated by LU-14091 sanity test_39r: 'atime on client 160... Closed
Related
is related to LU-17265 sanity test_39r: atime on client 1699... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for S Buisson <sbuisson@ddn.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/ec17386d-cefc-4749-9b51-1570a2c15729

test_39r failed with the following error:

atime on client 1589573245 != ost 0x5ebef67c

Very few additional information available:

OST atime:  atime: 0x5ebef67c:00000000 -- Fri May 15 20:07:24 2020
 sanity test_39r: @@@@@@ FAIL: atime on client 1589573245 != ost 0x5ebef67c 

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_39r - atime on client 1589573245 != ost 0x5ebef67c



 Comments   
Comment by Andreas Dilger [ 29/Jul/20 ]

Saw this again on master:

atime on client 1595930572 != ost 0x5f1ff7cb

Again the OST timestamp was out by 1s. It may be that there is some kind of race, or the 1s granularity of the clocks makes the test fail some small fraction of time?

This is failing about once every 3 days (9x in the past 4 weeks). Not critical, but it might be nice to understand it better:
https://testing.whamcloud.com/test_sets/40740ba8-bf93-4123-be3c-06965acb4214
https://testing.whamcloud.com/test_sets/c25091a4-bb82-476d-abba-77839eaa665a
https://testing.whamcloud.com/test_sets/bb7c9f0f-88bb-4743-8d09-ae869bcc8037
https://testing.whamcloud.com/test_sets/1bd52851-161f-4ced-818f-54e9f073298d
https://testing.whamcloud.com/test_sets/62c3553a-1bad-4c3c-847f-511531523230
https://testing.whamcloud.com/test_sets/a58fda9c-348e-48c6-9d9a-b4039e8cd333
https://testing.whamcloud.com/test_sets/aa7b32cb-737b-483b-90ee-563abb9a757b
https://testing.whamcloud.com/test_sets/259a006a-8e65-4ada-9a61-d27a067419b3
https://testing.whamcloud.com/test_sets/682470a0-36c2-4364-8f73-661e7f8e9f1d

Comment by Bruno Faccini (Inactive) [ 11/Oct/20 ]

+1 on recent master at https://testing.whamcloud.com/test_sets/999472ce-8888-4fcd-bc47-7210a11127f6

Comment by Andreas Dilger [ 14/Oct/20 ]

+1 on master https://testing.whamcloud.com/test_sets/c9dca577-ca19-4ebb-8275-17d37fcdf09a

Comment by John Hammond [ 02/Nov/20 ]

Copied over from LU-14091:

Note that 0x5f9a449d == 1603945629 and that since the test uses (( ... )) this is not due to a difference of base.

This may be from a final read() done by dd which returns no bytes and does not generate a BRW RPC to the OST. Even though it returns 0 bytes, it requested a non-zero number of bytes and is therefore required to update the file access time.

Comment by Gerrit Updater [ 15/Dec/20 ]

John L. Hammond (jhammond@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/40973
Subject: LU-13578 test: use a single read() in sanity test_39r
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: ed1f8d0a2d1a2c28ab72434c6ca4c9d031d9f729

Comment by Andreas Dilger [ 16/Jan/21 ]

+1 on master https://testing.whamcloud.com/test_sets/7f189347-052c-4c16-897f-cf26a3c17114

Comment by Gerrit Updater [ 27/Jan/21 ]

Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/40973/
Subject: LU-13578 test: use a single read() in sanity test_39r
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 842f8158a533b21d8c6e401881db5ed7013b7890

Comment by Peter Jones [ 27/Jan/21 ]

Landed for 2.14

Comment by Alexander Zarochentsev [ 28/May/21 ]

I am still seeing it:
https://testing.whamcloud.com/sub_tests/701c1ca0-8038-4a1e-b2fd-e908867b3a12

and there are more:
https://testing.whamcloud.com/sub_tests/88e41e63-7b1c-4428-a1d9-5fc12a1540c7
https://testing.whamcloud.com/sub_tests/eae6b8ed-cd53-428c-94e7-367fa7789efa
...
8 failures in last 4 weeks.

Reopen?

Comment by Serguei Smirnov [ 20/Jul/21 ]

+1 on master: https://testing.whamcloud.com/test_sets/c8e6816e-9641-4929-8ffd-d2505b6df743

Comment by Gerrit Updater [ 13/May/22 ]

"John L. Hammond <jhammond@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/47346
Subject: LU-13578 test: sleep longer in sanity test_39
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8cafb56be4f2baf93c76fb172adb8b44b460a28a

Comment by Gerrit Updater [ 18/May/22 ]

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/47346/
Subject: LU-13578 test: sleep longer in sanity test_39
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: be2525ffddb4bf55fde77e97b00d1c349119daed

Generated at Sat Feb 10 03:02:27 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.