Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1065

High rate of obd_ping failure with client <-> OST evictions

Details

    • Bug
    • Resolution: Duplicate
    • Major
    • None
    • Lustre 1.8.x (1.8.0 - 1.8.5)
    • None
    • Clients: Lustre Version: 1.8.6.81
      Servers: 2.1.0 with ofed 1.5.3.1
    • 3
    • 6470

    Description

      We are seeing a large amount of obd_ping failures then client eviction from our 2.1 servers relative to our 1.8.6/5 servers.
      We have one 2.1 filesystem and six 1.8.x filesystems.
      Here you can see obd_ping counts for each data:
      ---- lustre-20120125 -----
      nbp5 1022
      allothers 24
      ---- lustre-20120126 -----
      nbp5 760
      allothers 33
      ---- lustre-20120127 -----
      nbp5 420
      allothers 6
      ---- lustre-20120128 -----
      nbp5 226
      allothers 97
      ---- lustre-20120129 -----
      nbp5 36
      allothers 7
      ---- lustre-20120130 -----
      nbp5 243
      allothers 19
      ---- lustre-20120131 -----
      nbp5 808
      allothers 17
      ---- lustre-20120201 -----
      nbp5 81
      allothers 2

      Attached are typical client log(r8610n14.lustre)
      and server logs(service151.lustre.feb1)

      Attachments

        1. r8610n14.lustre
          8 kB
          Mahmoud Hanafi
        2. service151.lustre.feb1
          68 kB
          Mahmoud Hanafi

        Issue Links

          Activity

            [LU-1065] High rate of obd_ping failure with client <-> OST evictions
            pjones Peter Jones added a comment -

            Assumed to be a duplicate of LU-874. We will reopen if this proves to not be the case

            pjones Peter Jones added a comment - Assumed to be a duplicate of LU-874 . We will reopen if this proves to not be the case

            Hi Mahmoud

            have you tested it with the patch in LU-874? what is the result?
            Thanks

            hongchao.zhang Hongchao Zhang added a comment - Hi Mahmoud have you tested it with the patch in LU-874 ? what is the result? Thanks
            pjones Peter Jones added a comment -

            Hi Hongchao

            Could you please look into this one?

            Thanks

            Peter

            pjones Peter Jones added a comment - Hi Hongchao Could you please look into this one? Thanks Peter

            This issue is already being tracked under LU-874, which has a number of patches scheduled to land for the 2.1.1 release.

            adilger Andreas Dilger added a comment - This issue is already being tracked under LU-874 , which has a number of patches scheduled to land for the 2.1.1 release.

            People

              hongchao.zhang Hongchao Zhang
              mhanafi Mahmoud Hanafi
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: