Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-16380

conf-sanity test_108b: timeout at read, write and append

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.16.0
    • None
    • None
    • 3
    • 9223372036854775807

    Description

      This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

      This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/9a2819f9-34e0-4cd6-9930-78d2ee19929c

      test_108b failed with the following error:

      Timeout occurred after 682 minutes, last suite running was conf-sanity
      

      VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
      conf-sanity test_108b - Timeout occurred after 682 minutes, last suite running was conf-sanity

      Attachments

        Issue Links

          Activity

            [LU-16380] conf-sanity test_108b: timeout at read, write and append
            pjones Peter Jones added a comment -

            Landed for 2.16

            pjones Peter Jones added a comment - Landed for 2.16

            "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49514/
            Subject: LU-16380 osd-ldiskfs: race in OI mapping
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 43fe6e51804f8fb4cca4445be576233595e27b42

            gerrit Gerrit Updater added a comment - "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/c/fs/lustre-release/+/49514/ Subject: LU-16380 osd-ldiskfs: race in OI mapping Project: fs/lustre-release Branch: master Current Patch Set: Commit: 43fe6e51804f8fb4cca4445be576233595e27b42
            laisiyao Lai Siyao added a comment -

            Alex, thanks, patch uploaded.

            laisiyao Lai Siyao added a comment - Alex, thanks, patch uploaded.

            "Lai Siyao <lai.siyao@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49514
            Subject: LU-16380 osd-ldiskfs: race in OI mapping
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: 5d57c11758229071ca481f8ccb9cb6142c2b8993

            gerrit Gerrit Updater added a comment - "Lai Siyao <lai.siyao@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49514 Subject: LU-16380 osd-ldiskfs: race in OI mapping Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 5d57c11758229071ca481f8ccb9cb6142c2b8993

            attached, if this is not what you need, ping again.

            bzzz Alex Zhuravlev added a comment - attached, if this is not what you need, ping again.

            sure, will do

            bzzz Alex Zhuravlev added a comment - sure, will do
            laisiyao Lai Siyao added a comment - - edited

            It'll be great if you can capture debug logs with "trace lfsck inode info" enabled. I'm testing in my local system too, but haven't reproduced yet.

            laisiyao Lai Siyao added a comment - - edited It'll be great if you can capture debug logs with "trace lfsck inode info" enabled. I'm testing in my local system too, but haven't reproduced yet.

            laisiyao if needed I can try to reproduce with specific PTLDEBUG or a debugging patch, it's not frequent, but happens (~6% of runs)

            bzzz Alex Zhuravlev added a comment - laisiyao if needed I can try to reproduce with specific PTLDEBUG or a debugging patch, it's not frequent, but happens (~6% of runs)
            COMMIT          TESTED  PASSED  FAILED          COMMIT DESCRIPTION
            4c0c01e29c      28      27      1       BAD     LU-10391 lnet: change lnet_find_best_lpni to handle large NIDs
            558784caad      10      9       1       BAD     LU-15643 osd-ldiskfs: don't trigger scrub on irreparable FIDs
            c74c630ff7      30      30      0       GOOD    LU-16317 build: dkms build requires flex, bison and libmount-devel
            
            bzzz Alex Zhuravlev added a comment - COMMIT TESTED PASSED FAILED COMMIT DESCRIPTION 4c0c01e29c 28 27 1 BAD LU-10391 lnet: change lnet_find_best_lpni to handle large NIDs 558784caad 10 9 1 BAD LU-15643 osd-ldiskfs: don't trigger scrub on irreparable FIDs c74c630ff7 30 30 0 GOOD LU-16317 build: dkms build requires flex, bison and libmount-devel
            bzzz Alex Zhuravlev added a comment - - edited

            hitting this locally as well. the last activity in the log is:

            [ 7920.470684] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146795584/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806351 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0'
            [ 7971.670605] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146810496/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806403 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0'
            [ 8022.870656] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146826560/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806454 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0'
            [ 8125.270637] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146856064/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806556 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0'

            LU-15643 osd-ldiskfs: don't trigger scrub on irreparable FIDs – can be related?

            bzzz Alex Zhuravlev added a comment - - edited hitting this locally as well. the last activity in the log is: [ 7920.470684] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146795584/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806351 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0' [ 7971.670605] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146810496/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806403 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0' [ 8022.870656] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146826560/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806454 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0' [ 8125.270637] Lustre: 406340:0:(client.c:1485:after_reply()) @@@ resending request on EINPROGRESS req@000000009d60d042 x1751967146856064/t0(0) o101->lustre-MDT0000-mdc-ffff9f397275d000@0@lo:12/10 lens 576/224 e 0 to 0 dl 1670806556 ref 2 fl Rpc:RQU/2/0 rc 0/-115 job:'sha1sum.0' LU-15643 osd-ldiskfs: don't trigger scrub on irreparable FIDs – can be related?

            People

              laisiyao Lai Siyao
              maloo Maloo
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: