  1. Lustre
  2. LU-13804

LustreError (osd_index.c:1201:osd_dir_delete()) lquake-MDT0000: failed to destroy agent object (0) for the entry data049: rc = -22

Details

    • Type: Bug
    • Resolution: Not a Bug
    • Priority: Minor
    • Environment: Lustre 2.10.8 and 2.12.5, RHEL 7.8
    • Severity: 3

    Description

      jet1 console log reports the following during an I/O SWL (and a few other misc jobs) on Opal:

      Jul 15 09:51:54 jet1 kernel: LustreError: 1391:0:(osd_index.c:1201:osd_dir_delete()) lquake-MDT0000: failed to destroy agent object (0) for the entry data049: rc = -22
      Jul 15 09:51:54 jet1 kernel: LustreError: 1391:0:(osd_index.c:1201:osd_dir_delete()) Skipped 1 previous similar message
      Jul 15 09:51:54 jet1 kernel: LustreError: 17478:0:(osd_index.c:1201:osd_dir_delete()) lquake-MDT0000: failed to destroy agent object (0) for the entry data026: rc = -22
      Jul 15 09:51:54 jet1 kernel: LustreError: 17478:0:(osd_index.c:1201:osd_dir_delete()) Skipped 75 previous similar messages
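
      For triage, the four console lines above can be condensed with a short script. This is a minimal sketch: the regular expressions are an assumption derived only from the lines quoted in this ticket, not from any documented Lustre log format, and the function name summarize is hypothetical. It relies on the kernel convention that rc holds a negated errno value, so rc = -22 corresponds to EINVAL.

```python
import errno
import re

# Assumed shape of a LustreError console line, inferred from the four lines
# quoted in this ticket (not from a documented log format).
LINE_RE = re.compile(
    r"LustreError: (?P<pid>\d+):\d+:"
    r"\((?P<file>[\w.]+):(?P<line>\d+):(?P<func>\w+)\(\)\) "
    r"(?P<msg>.*)$"
)
RC_RE = re.compile(r"for the entry (?P<entry>\S+): rc = (?P<rc>-?\d+)")
SKIP_RE = re.compile(r"Skipped (?P<n>\d+) previous similar messages?")

SAMPLE = [
    "Jul 15 09:51:54 jet1 kernel: LustreError: 1391:0:(osd_index.c:1201:osd_dir_delete()) lquake-MDT0000: failed to destroy agent object (0) for the entry data049: rc = -22",
    "Jul 15 09:51:54 jet1 kernel: LustreError: 1391:0:(osd_index.c:1201:osd_dir_delete()) Skipped 1 previous similar message",
    "Jul 15 09:51:54 jet1 kernel: LustreError: 17478:0:(osd_index.c:1201:osd_dir_delete()) lquake-MDT0000: failed to destroy agent object (0) for the entry data026: rc = -22",
    "Jul 15 09:51:54 jet1 kernel: LustreError: 17478:0:(osd_index.c:1201:osd_dir_delete()) Skipped 75 previous similar messages",
]


def summarize(lines):
    """Return ([(entry, errno_name), ...], total_suppressed_messages)."""
    entries = []
    skipped = 0
    for text in lines:
        match = LINE_RE.search(text)
        if not match:
            continue  # not a LustreError line of the assumed shape
        msg = match.group("msg")
        rc = RC_RE.search(msg)
        if rc:
            # Kernel return codes are negated errno values: -22 -> errno 22.
            code = -int(rc.group("rc"))
            entries.append((rc.group("entry"), errno.errorcode.get(code, "unknown")))
            continue
        skip = SKIP_RE.search(msg)
        if skip:
            skipped += int(skip.group("n"))
    return entries, skipped


if __name__ == "__main__":
    entries, skipped = summarize(SAMPLE)
    for entry, name in entries:
        print(f"{entry}: {name}")
    print(f"suppressed similar messages: {skipped}")
```

      Run against the quoted lines, the sketch reports two named entries (data049 and data026), both with EINVAL, plus 76 suppressed similar messages, so the logged entries are only a sample of the failures.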
      

      There were no console log messages on Opal that appeared to correspond: nothing was logged at 09:51, and there were no unusual messages at all on Opal.

      Testing under both Lustre 2.10 and Lustre 2.12 included creating striped directories via lfs mkdir -i3 -c4 <target>. Some of these directories were likely created under 2.10 and deleted under 2.12.
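
      For anyone trying to reproduce the directory layout, the command above can be assembled programmatically. This is a minimal sketch: the helper name lfs_mkdir_argv and the target path are hypothetical, and the script only builds and prints the command rather than running it. It assumes the usual lfs mkdir meaning of the two flags, with -i giving the index of the MDT that holds the first stripe and -c giving the number of MDTs the directory is striped across.

```python
import shlex


def lfs_mkdir_argv(target, mdt_index, stripe_count):
    """Build the argv for 'lfs mkdir -i<mdt_index> -c<stripe_count> <target>'.

    Hypothetical helper: it only assembles the command, it does not run it.
    """
    if mdt_index < 0:
        raise ValueError("mdt_index must be non-negative")
    if stripe_count < 1:
        raise ValueError("stripe_count must be at least 1")
    return ["lfs", "mkdir", f"-i{mdt_index}", f"-c{stripe_count}", target]


if __name__ == "__main__":
    # The target path is a placeholder, not a path from the ticket.
    argv = lfs_mkdir_argv("/mnt/lquake/striped_dir", 3, 4)
    print(shlex.join(argv))
```

      Passing the resulting list to subprocess.run on a Lustre client would create the directory; that step is left out here so the sketch runs anywhere.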

      Before this occurred, the jet servers had been upgraded from Lustre 2.10 to Lustre 2.12, then downgraded to 2.10, and then upgraded to 2.12 again. Significant I/O was performed between each version change, and changelog users were deregistered and logs cleared.

        Activity

          [LU-13804] LustreError (osd_index.c:1201:osd_dir_delete()) lquake-MDT0000: failed to destroy agent object (0) for the entry data049: rc = -22
          ofaaland Olaf Faaland made changes -
          Resolution New: Not a Bug [ 6 ]
          Status Original: Open [ 1 ] New: Resolved [ 5 ]
          pjones Peter Jones made changes -
          Assignee Original: WC Triage [ wc-triage ] New: Lai Siyao [ laisiyao ]
          ofaaland Olaf Faaland made changes -
          Description: appended the sentence "Some of these directories were likely created under 2.10 and deleted under 2.12." to the paragraph on striped directory testing; the rest of the description is unchanged.
          ofaaland Olaf Faaland made changes -
          Description: added the paragraph "Testing both under Lustre 2.10 and Lustre 2.12 included creating striped directories via {{lfs mkdir -i3 -c4 <target>}}"; the rest of the description is unchanged.
          ofaaland Olaf Faaland created issue -

          People

            Assignee: laisiyao Lai Siyao
            Reporter: ofaaland Olaf Faaland