Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1065

High rate of obd_ping failure with client <-> OST evictions

    XMLWordPrintable

Details

    • Bug
    • Resolution: Duplicate
    • Major
    • None
    • Lustre 1.8.x (1.8.0 - 1.8.5)
    • None
    • Clients: Lustre Version: 1.8.6.81
      Servers: 2.1.0 with ofed 1.5.3.1
    • 3
    • 6470

    Description

      We are seeing a large amount of obd_ping failures then client eviction from our 2.1 servers relative to our 1.8.6/5 servers.
      We have one 2.1 filesystem and six 1.8.x filesystems.
      Here you can see obd_ping counts for each data:
      ---- lustre-20120125 -----
      nbp5 1022
      allothers 24
      ---- lustre-20120126 -----
      nbp5 760
      allothers 33
      ---- lustre-20120127 -----
      nbp5 420
      allothers 6
      ---- lustre-20120128 -----
      nbp5 226
      allothers 97
      ---- lustre-20120129 -----
      nbp5 36
      allothers 7
      ---- lustre-20120130 -----
      nbp5 243
      allothers 19
      ---- lustre-20120131 -----
      nbp5 808
      allothers 17
      ---- lustre-20120201 -----
      nbp5 81
      allothers 2

      Attached are typical client log(r8610n14.lustre)
      and server logs(service151.lustre.feb1)

      Attachments

        1. r8610n14.lustre
          8 kB
          Mahmoud Hanafi
        2. service151.lustre.feb1
          68 kB
          Mahmoud Hanafi

        Issue Links

          Activity

            People

              hongchao.zhang Hongchao Zhang
              mhanafi Mahmoud Hanafi
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: