
osp_sync_interpret() ASSERTION( rc || req->rq_transno ) failed

Details

    • Type: Bug
    • Resolution: Duplicate
    • Priority: Critical
    • Fix Version/s: None
    • Affects Version/s: Lustre 2.6.0, Lustre 2.4.2, Lustre 2.5.3
    • Environment: Lustre 2.4.2-14chaos (see github.com/chaos/lustre)
    • Severity: 3
    • Rank: 15744

    Description

      One of our MDS nodes crashed today with the following assertion:

      client.c:304:ptlrpc_at_adj_net_latency()) Reported service time 548 > total measured time 165
      osp_sync.c:355:osp_sync_interpret())  ASSERTION( rc || req->rq_transno ) failed
      

      Note that the two messages above were printed in the same second (as reported by syslog) and by the same kernel thread. I don't know if the ptlrpc_at_adj_net_latency() message is actually related to the assertion or not, but the proximity makes it worth noting.
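
      For readers unfamiliar with the two conditions named in those messages, here is a minimal sketch of what each one checks. It is not the Lustre source: the struct and function names are hypothetical stand-ins, and it only assumes that rc is the request's completion status (0 on success) and that rq_transno is the transaction number carried by the reply.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the two values the assertion inspects. */
struct reply_sketch {
    int      rc;       /* completion status; 0 is assumed to mean success */
    uint64_t transno;  /* transaction number taken from the reply */
};

/* Same boolean shape as the failed assertion: rc || req->rq_transno.
 * A reply with rc == 0 and transno == 0 is the only failing combination. */
bool reply_is_consistent(const struct reply_sketch *r)
{
    return r->rc != 0 || r->transno != 0;
}

/* Stand-in for the assertion itself: aborts when the invariant is broken. */
void check_reply(const struct reply_sketch *r)
{
    assert(reply_is_consistent(r));
}

/* Condition behind the first message: the service time reported by the
 * server should not exceed the total time measured by the client. */
bool service_time_is_plausible(unsigned service_time, unsigned total_time)
{
    return service_time <= total_time;
}
```

      In this sketch, a reply with rc = 0 and transno = 0 is the case that would trip the assertion, and the values from the log (548 reported, 165 measured) fail the second check. Whether the two conditions share a cause in the real code is not established here.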

      A couple of minutes earlier in the log, the MDS lost and then reestablished its connection to a few OSTs.

      The backtrace was:

      panic
      lbug_with_loc
      osp_sync_interpret
      ptlrpc_check_set
      ptlrpcd_check
      ptlrpcd
      kernel_thread
      

      The node was running Lustre version 2.4.2-14chaos (see github.com/chaos/lustre).

      We cannot provide logs or crash dumps for this machine.

      Attachments

        1. LU-5629-syslog.bz2
          174 kB
        2. lbugmay2.zip
          53.38 MB

        Issue Links

          Activity

            [LU-5629] osp_sync_interpret() ASSERTION( rc || req->rq_transno ) failed
            pjones Peter Jones made changes -
            Resolution New: Duplicate [ 3 ]
            Status Original: Reopened [ 4 ] New: Resolved [ 5 ]
            dmiter Dmitry Eremin (Inactive) made changes -
            Link New: This issue is related to LU-9135 [ LU-9135 ]
            adilger Andreas Dilger made changes -
            Labels Original: MB llnl New: llnl
            apargal Alex Parga made changes -
            Attachment New: lbugmay2.zip [ 26607 ]
            ruth.klundt@gmail.com Ruth Klundt (Inactive) made changes -
            Attachment New: LU-5629-syslog.bz2 [ 22169 ]
            pjones Peter Jones made changes -
            End date New: 23/May/16
            Start date New: 16/Sep/14
            dmiter Dmitry Eremin (Inactive) made changes -
            Link Original: This issue is related to LU-7453 [ LU-7453 ]
            dmiter Dmitry Eremin (Inactive) made changes -
            Link New: This issue is related to LU-7453 [ LU-7453 ]
            dmiter Dmitry Eremin (Inactive) made changes -
            Link New: This issue is related to LU-7453 [ LU-7453 ]
            doug Doug Oucharek (Inactive) made changes -
            Assignee Original: WC Triage [ wc-triage ] New: Dmitry Eremin [ dmiter ]

            People

              Assignee: dmiter Dmitry Eremin (Inactive)
              Reporter: morrone Christopher Morrone (Inactive)
              Votes: 0
              Watchers: 15
