
client_bulk_callback() event type 1, status -5, desc ffff881fe5a8f800


    Description

      Repeated console log entries like this on a client while running mdtest:

      Lustre: lcy-OST000c-osc-ffff883ff7d0e800: Connection restored to 10.1.1.183@o2ib9 (at 10.1.1.183@o2ib9)
      Lustre: Skipped 17 previous similar messages
      mlx5_0:dump_cqe:262:(pid 5096): dump error cqe
      00000000 00000000 00000000 00000000
      00000000 00000000 00000000 00000000
      00000000 00000000 00000000 00000000
      00000000 08007806 25000230 001344d2
      LustreError: 5099:0:(events.c:203:client_bulk_callback()) event type 1, status -5, desc ffff881fe5a8b800
      LustreError: 5098:0:(events.c:203:client_bulk_callback()) event type 1, status -5, desc ffff881fd8390800
      mlx5_0:dump_cqe:262:(pid 5097): dump error cqe
      00000000 00000000 00000000 00000000
      00000000 00000000 00000000 00000000
      00000000 00000000 00000000 00000000
      00000000 08007806 25000231 00044ad2
      

      These messages are accompanied by reconnects to the servers.

      This issue occurs when running with the patches backported under LU-9932.
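For triage, it can help to count how often the callback error recurs in a captured console log. The following is a minimal sketch, assuming the log lines have exactly the shape quoted above; the function name `summarize_bulk_errors` and the regex group names are hypothetical labels, not part of Lustre. On Linux, status -5 corresponds to -EIO (`errno.EIO` is 5).

```python
import errno
import re
from collections import Counter

# Matches the LustreError lines quoted in the description.
# Group names (pid, type, status, desc) are illustrative labels only.
BULK_CB = re.compile(
    r"LustreError: (?P<pid>\d+):\d+:\(events\.c:\d+:client_bulk_callback\(\)\) "
    r"event type (?P<type>\d+), status (?P<status>-?\d+), desc (?P<desc>[0-9a-f]+)"
)


def summarize_bulk_errors(lines):
    """Count client_bulk_callback() errors keyed by (event type, status)."""
    counts = Counter()
    for line in lines:
        m = BULK_CB.search(line)
        if m:
            counts[(int(m["type"]), int(m["status"]))] += 1
    return counts


# Sample input copied from the console log in this ticket.
sample = [
    "LustreError: 5099:0:(events.c:203:client_bulk_callback()) event type 1, status -5, desc ffff881fe5a8b800",
    "LustreError: 5098:0:(events.c:203:client_bulk_callback()) event type 1, status -5, desc ffff881fd8390800",
    "mlx5_0:dump_cqe:262:(pid 5097): dump error cqe",
]

for (event_type, status), n in summarize_bulk_errors(sample).items():
    # Translate the negative status into its errno name where one exists.
    name = errno.errorcode.get(-status, "unknown")
    print(f"event type {event_type}, status {status} ({name}): {n} occurrence(s)")
```

In practice the `sample` list would be replaced by lines read from the saved console or dmesg output of the affected client.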


        Activity

          [LU-10002] client_bulk_callback() event type 1, status -5, desc ffff881fe5a8f800
          pjones Peter Jones added a comment -

          Seems that this issue can be closed


          ofaaland Olaf Faaland added a comment -

          That appears to have solved the issue. Thanks.
          ofaaland Olaf Faaland added a comment -

          Hi Sonia,
          Yes, we have nodes with mlx5 cards connecting to nodes with mlx4 cards, and no, we do not have LU-8752. We will get that patch added.
          Thanks

          sharmaso Sonia Sharma (Inactive) added a comment - - edited

          Hi Olaf,
          Do you have nodes with mlx5 cards talking to nodes with mlx4 cards in your setup?
          The dump_cqe error is seen on the nodes with mlx5 cards. You would need the LU-8752 patch on the nodes with mlx5 cards. Can you check whether LU-8752 is included on these nodes?

          Thanks
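A quick way to answer the first question on a given Linux node is to list the InfiniBand devices the kernel exposes. This is a minimal sketch, assuming the standard sysfs directory `/sys/class/infiniband` and device names of the form seen in the log (`mlx5_0`); the function name `hca_families` is hypothetical. It only shows which HCA driver families are present locally and says nothing about whether the LU-8752 patch is applied.

```python
import os


def hca_families(sysfs_dir="/sys/class/infiniband"):
    """Group InfiniBand device names (e.g. mlx5_0) by driver prefix (e.g. mlx5)."""
    families = {}
    if not os.path.isdir(sysfs_dir):
        # No InfiniBand devices registered on this node (or not Linux).
        return families
    for name in sorted(os.listdir(sysfs_dir)):
        # Device names are assumed to look like "<driver>_<index>".
        prefix = name.split("_", 1)[0]
        families.setdefault(prefix, []).append(name)
    return families


print(hca_families())
```

Running this on each class of node shows which ones carry mlx5 devices and which carry mlx4 devices.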

          pjones Peter Jones added a comment -

          Sonia

          Can you please advise?

          Thanks

          Peter


          ofaaland Olaf Faaland added a comment -

          I see the dump_cqe message is coming from the InfiniBand driver.

          People

            sharmaso Sonia Sharma (Inactive)
            ofaaland Olaf Faaland
