Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1065

High rate of obd_ping failure with client <-> OST evictions

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Duplicate
    • Icon: Major Major
    • None
    • Lustre 1.8.x (1.8.0 - 1.8.5)
    • None
    • Clients: Lustre Version: 1.8.6.81
      Servers: 2.1.0 with ofed 1.5.3.1
    • 3
    • 6470

      We are seeing a large amount of obd_ping failures then client eviction from our 2.1 servers relative to our 1.8.6/5 servers.
      We have one 2.1 filesystem and six 1.8.x filesystems.
      Here you can see obd_ping counts for each data:
      ---- lustre-20120125 -----
      nbp5 1022
      allothers 24
      ---- lustre-20120126 -----
      nbp5 760
      allothers 33
      ---- lustre-20120127 -----
      nbp5 420
      allothers 6
      ---- lustre-20120128 -----
      nbp5 226
      allothers 97
      ---- lustre-20120129 -----
      nbp5 36
      allothers 7
      ---- lustre-20120130 -----
      nbp5 243
      allothers 19
      ---- lustre-20120131 -----
      nbp5 808
      allothers 17
      ---- lustre-20120201 -----
      nbp5 81
      allothers 2

      Attached are typical client log(r8610n14.lustre)
      and server logs(service151.lustre.feb1)

            hongchao.zhang Hongchao Zhang
            mhanafi Mahmoud Hanafi
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

              Created:
              Updated:
              Resolved: