[LU-12933] Evicted client doesn't reconnect Created: 04/Nov/19  Updated: 07/Dec/19

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.12.3
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Stephane Thiell Assignee: Peter Jones
Resolution: Unresolved Votes: 0
Labels: None
Environment:

CentOS 7.6, MOFED 4.7, Lustre 2.12.3


Attachments: Text File fir-io2-s1.log     Text File fir-md1-s3.log     Text File sh-103-53.log    
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Hi,

We are noticing a few clients (5) being evicted by MDTs or OSTs and not reconnecting. We're only seeing this since the upgrade to 2.12.3.

Example with sh-103-53 10.9.103.53@o2ib4:

[root@sh-103-53 ~]# lfs df -v /scratch/
UUID                   1K-blocks        Used   Available Use% Mounted on
fir-MDT0000_UUID     18287292984  8929938396  8420624916  52% /scratch[MDT:0] f
fir-MDT0001_UUID     18287292984  4272171644 13078803336  25% /scratch[MDT:1] f
MDT0002             : inactive device
fir-MDT0003_UUID     18287292984  3657699480 13693239252  22% /scratch[MDT:3] f
OST0000             : inactive device
OST0001             : inactive device
fir-OST0002_UUID     61986877596 30702519208 30658586164  51% /scratch[OST:2]
fir-OST0003_UUID     61986877596 30350994280 31010091948  50% /scratch[OST:3]
fir-OST0004_UUID     61986877596 30501193096 30860120040  50% /scratch[OST:4]
fir-OST0005_UUID     61986877596 30550135432 30811199588  50% /scratch[OST:5]
fir-OST0006_UUID     61986877596 30070489688 31290937284  50% /scratch[OST:6]
fir-OST0007_UUID     61986877596 30744050608 30617311348  51% /scratch[OST:7]
fir-OST0008_UUID     61986877596 30493910864 30867362968  50% /scratch[OST:8]
fir-OST0009_UUID     61986877596 30649739912 30711554208  50% /scratch[OST:9]
fir-OST000a_UUID     61986877596 29661667812 31699536896  49% /scratch[OST:10]
fir-OST000b_UUID     61986877596 29826938776 31534313376  49% /scratch[OST:11]
OST000c             : inactive device
<hung>

They stay in the evicted state.

  • MDT
[root@sh-103-53 ~]# cd /proc/fs/lustre/mdc/fir-MDT0002-mdc-ffff9781f2230800/
[root@sh-103-53 fir-MDT0002-mdc-ffff9781f2230800]# date +%s; cat state 
1572895124
current_state: EVICTED
state_history:
 - [ 1572839940, CONNECTING ]
 - [ 1572839995, DISCONN ]
 - [ 1572840015, CONNECTING ]
 - [ 1572840070, DISCONN ]
 - [ 1572840090, CONNECTING ]
 - [ 1572840145, DISCONN ]
 - [ 1572840165, CONNECTING ]
 - [ 1572840165, REPLAY ]
 - [ 1572840165, REPLAY_LOCKS ]
 - [ 1572841556, CONNECTING ]
 - [ 1572841556, EVICTED ]
 - [ 1572841556, RECOVER ]
 - [ 1572841556, FULL ]
 - [ 1572888266, DISCONN ]
 - [ 1572888266, CONNECTING ]
 - [ 1572888266, EVICTED ]

Logs from the MDS fir-md1-s3 or 10.0.10.53@o2ib7:

Nov 04 09:24:19 fir-md1-s3 kernel: LustreError: 40361:0:(ldlm_lockd.c:256:expired_lock_main()) ### lock callback timer expired after 150s: evicting client at 10.9.103.53@o2ib4  ns: mdt-fir-MDT0002_UUID lock: ffff9a5eea6057c0/0x3428b9d2e97b844b lrc: 3/0,0 mode: PW/PW res: [0x2c0032e40:0x1:0x0].0x0 bits 0x40/0x0 rrc: 5 type: IBT flags: 0x60200400000020 nid: 10.9.103.53@o2ib4 remote: 0x51ab3c4efd129db expref: 24 pid: 43285 timeout: 55884 lvb_type: 0
  • OST 
[root@sh-103-53 ~]# cd /proc/fs/lustre/osc/fir-OST000c-osc-ffff9781f2230800
[root@sh-103-53 fir-OST000c-osc-ffff9781f2230800]# date +%s; cat state 
1572895019
current_state: EVICTED
state_history:
 - [ 1572839919, DISCONN ]
 - [ 1572839940, CONNECTING ]
 - [ 1572839995, DISCONN ]
 - [ 1572840015, CONNECTING ]
 - [ 1572840070, DISCONN ]
 - [ 1572840090, CONNECTING ]
 - [ 1572840145, DISCONN ]
 - [ 1572840165, CONNECTING ]
 - [ 1572840165, REPLAY ]
 - [ 1572840165, REPLAY_LOCKS ]
 - [ 1572840165, REPLAY_WAIT ]
 - [ 1572840409, RECOVER ]
 - [ 1572840409, FULL ]
 - [ 1572888417, DISCONN ]
 - [ 1572888417, CONNECTING ]
 - [ 1572888417, EVICTED ]

Logs from the OSS of fir-OST000c (fir-io2-s1 or 10.0.10.103@o2ib7):

Nov 04 09:26:51 fir-io2-s1 kernel: LustreError: 53462:0:(ldlm_lockd.c:256:expired_lock_main()) ### lock callback timer expired after 149s: evicting client at 10.9.103.53@o2ib4  ns: filter-fir-OST000c_UUID lock: ffff966494a91f80/0xe2c2779bcc5329d5 lrc: 3/0,0 mode: PW/PW res: [0x3c0000400:0x1245ef1:0x0].0x0 rrc: 3 type: EXT [0->18446744073709551615] (req 8388608->536870911) flags: 0x60000400010020 nid: 10.9.103.53@o2ib4 remote: 0x51ab3c4efd129f0 expref: 6 pid: 56541 timeout: 237932 lvb_type: 0
Nov 04 09:30:44 fir-io2-s1 kernel: Lustre: fir-OST000c: haven't heard from client f169454a-d158-5ca7-0fb6-b3c51a09a392 (at 10.9.103.53@o2ib4) in 227 seconds. I think it's dead, and I am evicting it. exp ffff962e33420000, cur 1572888644 expire 1572888494 last 1572888417

Attaching kernel logs for the client sh-103-53 as sh-103-53.log
Attaching kernel logs for the MDS {

{fir-md1-s3}

as fir-md1-s3.log
Attaching kernel logs for the OSS fir-io2-s1 as fir-io2-s1.log

Thanks!
Stephane

 



 Comments   
Comment by Stephane Thiell [ 04/Nov/19 ]

More info on some of the other clients found in that state (we can see that it's not always the same targets that are impacted). The rest of the nodes on Sherlock (1000+) are apparently fine.

  • sh-31-09 10.8.31.9@o2ib6
    [root@sh-31-09 ~]# lfs df -v /scratch
    UUID                   1K-blocks        Used   Available Use% Mounted on
    fir-MDT0000_UUID     18287292984  8939092268  8411591900  52% /scratch[MDT:0] f
    fir-MDT0001_UUID     18287292984  4277026904 13073799168  25% /scratch[MDT:1] f
    fir-MDT0002_UUID     18287292984  7080879448 10270132756  41% /scratch[MDT:2] f
    fir-MDT0003_UUID     18287292984  3658330152 13692645740  22% /scratch[MDT:3] f
    fir-OST0000_UUID     61986877596 30949957496 30411312792  51% /scratch[OST:0]
    fir-OST0001_UUID     61986877596 30215977912 31145205600  50% /scratch[OST:1]
    fir-OST0002_UUID     61986877596 30709166112 30652406664  51% /scratch[OST:2]
    fir-OST0003_UUID     61986877596 30355727296 31005500224  50% /scratch[OST:3]
    fir-OST0004_UUID     61986877596 30509331704 30851976060  50% /scratch[OST:4]
    fir-OST0005_UUID     61986877596 30557864356 30803397248  50% /scratch[OST:5]
    fir-OST0006_UUID     61986877596 30077579320 31283927464  50% /scratch[OST:6]
    fir-OST0007_UUID     61986877596 30750589336 30610882152  51% /scratch[OST:7]
    fir-OST0008_UUID     61986877596 30497811536 30863485612  50% /scratch[OST:8]
    fir-OST0009_UUID     61986877596 30652300488 30709035008  50% /scratch[OST:9]
    fir-OST000a_UUID     61986877596 29670740024 31690554904  49% /scratch[OST:10]
    OST000b             : inactive device
    fir-OST000c_UUID     61986877596 29597506152 31763777956  49% /scratch[OST:12]
    OST000d             : inactive device
    fir-OST000e_UUID     61986877596 30748544848 30612822608  51% /scratch[OST:14]
    fir-OST000f_UUID     61986877596 29817693992 31543549068  49% /scratch[OST:15]
    fir-OST0010_UUID     61986877596 30348718584 31012342584  50% /scratch[OST:16]
    fir-OST0011_UUID     61986877596 30520160012 30841155676  50% /scratch[OST:17]
    fir-OST0012_UUID     61986877596 30373509384 30987808224  50% /scratch[OST:18]
    fir-OST0013_UUID     61986877596 30322939000 31038011344  50% /scratch[OST:19]
    fir-OST0014_UUID     61986877596 30114752004 31246434104  50% /scratch[OST:20]
    fir-OST0015_UUID     61986877596 31000512848 30360742432  51% /scratch[OST:21]
    fir-OST0016_UUID     61986877596 30500010868 30861224276  50% /scratch[OST:22]
    OST0017             : inactive device
    fir-OST0018_UUID     61986877596 30472950396 30888381476  50% /scratch[OST:24]
    fir-OST0019_UUID     61986877596 30111104796 31250328840  50% /scratch[OST:25]
    fir-OST001a_UUID     61986877596 30363747204 30997493896  50% /scratch[OST:26]
    fir-OST001b_UUID     61986877596 29733122628 31628229248  49% /scratch[OST:27]
    OST001c             : inactive device
    fir-OST001d_UUID     61986877596 30963470920 30397754836  51% /scratch[OST:29]
    OST001e             : inactive device
    fir-OST001f_UUID     61986877596 30339614880 31021547564  50% /scratch[OST:31]
    fir-OST0020_UUID     61986877596 30931807172 30429259448  51% /scratch[OST:32]
    fir-OST0021_UUID     61986877596 31373213420 29987923052  52% /scratch[OST:33]
    fir-OST0022_UUID     61986877596 30770502444 30590982164  51% /scratch[OST:34]
    fir-OST0023_UUID     61986877596 30702780792 30658609016  51% /scratch[OST:35]
    fir-OST0024_UUID     61986877596 30586817044 30774501748  50% /scratch[OST:36]
    fir-OST0025_UUID     61986877596 30231628608 31129708148  50% /scratch[OST:37]
    fir-OST0026_UUID     61986877596 30277966200 31083417480  50% /scratch[OST:38]
    fir-OST0027_UUID     61986877596 29900758936 31460310460  49% /scratch[OST:39]
    fir-OST0028_UUID     61986877596 30454670188 30906596984  50% /scratch[OST:40]
    fir-OST0029_UUID     61986877596 30640732804 30720696176  50% /scratch[OST:41]
    fir-OST002a_UUID     61986877596 30465828080 30895358200  50% /scratch[OST:42]
    fir-OST002b_UUID     61986877596 30265299116 31095790364  50% /scratch[OST:43]
    fir-OST002c_UUID     61986877596 31641994748 29719055956  52% /scratch[OST:44]
    fir-OST002d_UUID     61986877596 30225079240 31136143492  50% /scratch[OST:45]
    fir-OST002e_UUID     61986877596 29676602976 31684845376  49% /scratch[OST:46]
    fir-OST002f_UUID     61986877596 30111759352 31249700184  50% /scratch[OST:47]
    ^C
     
    [root@sh-31-09 ~]# lctl dl | grep fir
      1 UP lov fir-clilov-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 3
      2 UP lmv fir-clilmv-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      3 UP mdc fir-MDT0000-mdc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      4 UP mdc fir-MDT0001-mdc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      5 UP mdc fir-MDT0002-mdc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      6 UP mdc fir-MDT0003-mdc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      7 UP osc fir-OST0000-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      8 UP osc fir-OST0001-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
      9 UP osc fir-OST0002-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     10 UP osc fir-OST0003-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     11 UP osc fir-OST0004-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     12 UP osc fir-OST0005-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     13 UP osc fir-OST0006-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     14 UP osc fir-OST0007-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     15 UP osc fir-OST0008-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     16 UP osc fir-OST0009-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     17 UP osc fir-OST000a-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     18 IN osc fir-OST000b-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     19 UP osc fir-OST000c-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     20 IN osc fir-OST000d-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     21 UP osc fir-OST000e-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     22 UP osc fir-OST000f-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     23 UP osc fir-OST0010-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     24 UP osc fir-OST0011-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     25 UP osc fir-OST0012-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     26 UP osc fir-OST0013-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     27 UP osc fir-OST0014-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     28 UP osc fir-OST0015-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     29 UP osc fir-OST0016-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     30 IN osc fir-OST0017-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     31 UP osc fir-OST0018-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     32 UP osc fir-OST0019-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     33 UP osc fir-OST001a-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     34 UP osc fir-OST001b-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     35 IN osc fir-OST001c-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     36 UP osc fir-OST001d-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     37 IN osc fir-OST001e-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     38 UP osc fir-OST001f-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     39 UP osc fir-OST0020-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     40 UP osc fir-OST0021-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     41 UP osc fir-OST0022-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     42 UP osc fir-OST0023-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     43 UP osc fir-OST0024-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     44 UP osc fir-OST0025-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     45 UP osc fir-OST0026-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     46 UP osc fir-OST0027-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     47 UP osc fir-OST0028-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     48 UP osc fir-OST0029-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     49 UP osc fir-OST002a-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     50 UP osc fir-OST002b-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     51 UP osc fir-OST002c-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     52 UP osc fir-OST002d-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     53 UP osc fir-OST002e-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     54 UP osc fir-OST002f-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     55 UP osc fir-OST0030-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     56 UP osc fir-OST0031-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     57 UP osc fir-OST0032-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     58 UP osc fir-OST0033-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     59 UP osc fir-OST0034-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     60 UP osc fir-OST0035-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     61 UP osc fir-OST0036-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     62 UP osc fir-OST0037-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     63 UP osc fir-OST0038-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     64 UP osc fir-OST0039-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     65 UP osc fir-OST003a-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     66 UP osc fir-OST003b-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     67 UP osc fir-OST003c-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     68 UP osc fir-OST003d-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     69 UP osc fir-OST003e-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     70 UP osc fir-OST003f-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     71 UP osc fir-OST0040-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     72 UP osc fir-OST0041-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     73 UP osc fir-OST0042-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     74 UP osc fir-OST0043-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     75 UP osc fir-OST0044-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     76 UP osc fir-OST0045-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     77 UP osc fir-OST0046-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     78 UP osc fir-OST0047-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     79 UP osc fir-OST0048-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     80 UP osc fir-OST0049-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     81 UP osc fir-OST004a-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     82 UP osc fir-OST004b-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     83 UP osc fir-OST004c-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     84 UP osc fir-OST004d-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     85 UP osc fir-OST004e-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     86 UP osc fir-OST004f-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     87 UP osc fir-OST0050-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     88 UP osc fir-OST0051-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     89 UP osc fir-OST0052-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     90 UP osc fir-OST0053-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     91 UP osc fir-OST0054-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     92 UP osc fir-OST0055-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     93 UP osc fir-OST0056-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     94 UP osc fir-OST0057-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     95 UP osc fir-OST0058-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     96 UP osc fir-OST0059-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     97 UP osc fir-OST005a-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     98 UP osc fir-OST005b-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
     99 UP osc fir-OST005c-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
    100 UP osc fir-OST005d-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
    101 UP osc fir-OST005e-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
    102 UP osc fir-OST005f-osc-ffff9faa8978a800 3765f0e1-8b9f-c9bd-15c5-1326950110f9 4
    

     

  • sh-07-13 10.8.7.13@o2ib6
    [root@sh-07-13 ~]# lfs df -v /scratch
    UUID                   1K-blocks        Used   Available Use% Mounted on
    fir-MDT0000_UUID     18287292984  8939631940  8411027920  52% /scratch[MDT:0] f
    MDT0001             : inactive device
    fir-MDT0002_UUID     18287292984  7080858324 10270158788  41% /scratch[MDT:2] f
    fir-MDT0003_UUID     18287292984  3658379376 13692600816  22% /scratch[MDT:3] f
    fir-OST0000_UUID     61986877596 30950524024 30410748772  51% /scratch[OST:0]
    fir-OST0001_UUID     61986877596 30216446884 31144824708  50% /scratch[OST:1]
    fir-OST0002_UUID     61986877596 30708442648 30652995932  51% /scratch[OST:2]
    fir-OST0003_UUID     61986877596 30358659584 31002689780  50% /scratch[OST:3]
    fir-OST0004_UUID     61986877596 30510265848 30851049420  50% /scratch[OST:4]
    fir-OST0005_UUID     61986877596 30558835640 30802448684  50% /scratch[OST:5]
    fir-OST0006_UUID     61986877596 30077941048 31283596728  50% /scratch[OST:6]
    fir-OST0007_UUID     61986877596 30751074652 30610370076  51% /scratch[OST:7]
    fir-OST0008_UUID     61986877596 30498183648 30863067520  50% /scratch[OST:8]
    fir-OST0009_UUID     61986877596 30652427468 30708879776  50% /scratch[OST:9]
    fir-OST000a_UUID     61986877596 29671105172 31690193220  49% /scratch[OST:10]
    fir-OST000b_UUID     61986877596 29838310432 31522943840  49% /scratch[OST:11]
    fir-OST000c_UUID     61986877596 29597981568 31763376016  49% /scratch[OST:12]
    fir-OST000d_UUID     61986877596 30086491724 31274838296  50% /scratch[OST:13]
    fir-OST000e_UUID     61986877596 30749480900 30611916020  51% /scratch[OST:14]
    fir-OST000f_UUID     61986877596 29818145704 31542983876  49% /scratch[OST:15]
    fir-OST0010_UUID     61986877596 30349236812 31011795604  50% /scratch[OST:16]
    fir-OST0011_UUID     61986877596 30520504868 30840821676  50% /scratch[OST:17]
    fir-OST0012_UUID     61986877596 30373853832 30987511120  50% /scratch[OST:18]
    fir-OST0013_UUID     61986877596 30325330096 31035949288  50% /scratch[OST:19]
    fir-OST0014_UUID     61986877596 30114932684 31246391948  50% /scratch[OST:20]
    fir-OST0015_UUID     61986877596 31000968752 30360330336  51% /scratch[OST:21]
    fir-OST0016_UUID     61986877596 30500671836 30860592848  50% /scratch[OST:22]
    fir-OST0017_UUID     61986877596 30038865812 31322727020  49% /scratch[OST:23]
    fir-OST0018_UUID     61986877596 30473121860 30888170896  50% /scratch[OST:24]
    fir-OST0019_UUID     61986877596 30081927088 31279522992  50% /scratch[OST:25]
    fir-OST001a_UUID     61986877596 30364483800 30996714244  50% /scratch[OST:26]
    fir-OST001b_UUID     61986877596 29733478820 31627825708  49% /scratch[OST:27]
    fir-OST001c_UUID     61986877596 31203094848 30158089048  51% /scratch[OST:28]
    fir-OST001d_UUID     61986877596 30964160192 30397130396  51% /scratch[OST:29]
    fir-OST001e_UUID     61986877596 29980976316 31380369756  49% /scratch[OST:30]
    fir-OST001f_UUID     61986877596 30339848284 31021319936  50% /scratch[OST:31]
    fir-OST0020_UUID     61986877596 30932449068 30428590864  51% /scratch[OST:32]
    fir-OST0021_UUID     61986877596 31373885068 29987384752  52% /scratch[OST:33]
    ^C
    
  • sh-103-33 10.9.103.33@o2ib4 with inactive MDT0
    [root@sh-103-33 ~]# lfs df -v /scratch
    error: invalid path '/scratch': Cannot send after transport endpoint shutdown
    [root@sh-103-33 ~]# lctl dl | grep fir
      1 UP lov fir-clilov-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 3
      2 UP lmv fir-clilmv-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      3 IN mdc fir-MDT0000-mdc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      4 UP mdc fir-MDT0001-mdc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      5 UP mdc fir-MDT0002-mdc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      6 UP mdc fir-MDT0003-mdc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      7 UP osc fir-OST0000-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      8 UP osc fir-OST0001-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
      9 UP osc fir-OST0002-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     10 IN osc fir-OST0003-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     11 UP osc fir-OST0004-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     12 UP osc fir-OST0005-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     13 UP osc fir-OST0006-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     14 UP osc fir-OST0007-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     15 UP osc fir-OST0008-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     16 UP osc fir-OST0009-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     17 UP osc fir-OST000a-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     18 UP osc fir-OST000b-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     19 UP osc fir-OST000c-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     20 UP osc fir-OST000d-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     21 UP osc fir-OST000e-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     22 UP osc fir-OST000f-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     23 UP osc fir-OST0010-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     24 UP osc fir-OST0011-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     25 UP osc fir-OST0012-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     26 UP osc fir-OST0013-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     27 UP osc fir-OST0014-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     28 UP osc fir-OST0015-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     29 UP osc fir-OST0016-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     30 UP osc fir-OST0017-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     31 UP osc fir-OST0018-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     32 UP osc fir-OST0019-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     33 UP osc fir-OST001a-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     34 UP osc fir-OST001b-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     35 IN osc fir-OST001c-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     36 UP osc fir-OST001d-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     37 UP osc fir-OST001e-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     38 UP osc fir-OST001f-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     39 UP osc fir-OST0020-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     40 IN osc fir-OST0021-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     41 UP osc fir-OST0022-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     42 UP osc fir-OST0023-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     43 UP osc fir-OST0024-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     44 UP osc fir-OST0025-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     45 UP osc fir-OST0026-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     46 UP osc fir-OST0027-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     47 IN osc fir-OST0028-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     48 UP osc fir-OST0029-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     49 UP osc fir-OST002a-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     50 UP osc fir-OST002b-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     51 UP osc fir-OST002c-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     52 UP osc fir-OST002d-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     53 UP osc fir-OST002e-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     54 UP osc fir-OST002f-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     55 UP osc fir-OST0030-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     56 UP osc fir-OST0031-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     57 UP osc fir-OST0032-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     58 UP osc fir-OST0033-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     59 IN osc fir-OST0034-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     60 UP osc fir-OST0035-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     61 IN osc fir-OST0036-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     62 UP osc fir-OST0037-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     63 UP osc fir-OST0038-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     64 UP osc fir-OST0039-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     65 UP osc fir-OST003a-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     66 IN osc fir-OST003b-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     67 UP osc fir-OST003c-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     68 IN osc fir-OST003d-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     69 UP osc fir-OST003e-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     70 IN osc fir-OST003f-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     71 UP osc fir-OST0040-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     72 UP osc fir-OST0041-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     73 UP osc fir-OST0042-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     74 UP osc fir-OST0043-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     75 UP osc fir-OST0044-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     76 UP osc fir-OST0045-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     77 UP osc fir-OST0046-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     78 IN osc fir-OST0047-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     79 UP osc fir-OST0048-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     80 UP osc fir-OST0049-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     81 UP osc fir-OST004a-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     82 UP osc fir-OST004b-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     83 IN osc fir-OST004c-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     84 UP osc fir-OST004d-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     85 UP osc fir-OST004e-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     86 IN osc fir-OST004f-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     87 IN osc fir-OST0050-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     88 UP osc fir-OST0051-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     89 UP osc fir-OST0052-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     90 IN osc fir-OST0053-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     91 UP osc fir-OST0054-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     92 UP osc fir-OST0055-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     93 UP osc fir-OST0056-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     94 UP osc fir-OST0057-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     95 UP osc fir-OST0058-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     96 IN osc fir-OST0059-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     97 IN osc fir-OST005a-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     98 IN osc fir-OST005b-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
     99 UP osc fir-OST005c-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
    100 UP osc fir-OST005d-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
    101 UP osc fir-OST005e-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
    102 IN osc fir-OST005f-osc-ffff9b9fe1a9e800 535246f0-9862-c25f-7814-3ac6259d8dbc 4
    
Comment by Stephane Thiell [ 04/Nov/19 ]
  • sh-103-68 10.9.103.68@o2ib4
    [root@sh-103-68 ~]# lfs df -v /scratch
    error: invalid path '/scratch': Cannot send after transport endpoint shutdown
    [root@sh-103-68 ~]# lctl dl | grep fir
      1 UP lov fir-clilov-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 3
      2 UP lmv fir-clilmv-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      3 IN mdc fir-MDT0000-mdc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      4 UP mdc fir-MDT0001-mdc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      5 UP mdc fir-MDT0002-mdc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      6 UP mdc fir-MDT0003-mdc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      7 UP osc fir-OST0000-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      8 IN osc fir-OST0001-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
      9 IN osc fir-OST0002-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     10 UP osc fir-OST0003-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     11 UP osc fir-OST0004-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     12 UP osc fir-OST0005-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     13 UP osc fir-OST0006-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     14 UP osc fir-OST0007-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     15 UP osc fir-OST0008-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     16 UP osc fir-OST0009-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     17 UP osc fir-OST000a-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     18 IN osc fir-OST000b-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     19 IN osc fir-OST000c-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     20 UP osc fir-OST000d-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     21 IN osc fir-OST000e-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     22 UP osc fir-OST000f-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     23 UP osc fir-OST0010-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     24 UP osc fir-OST0011-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     25 UP osc fir-OST0012-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     26 UP osc fir-OST0013-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     27 UP osc fir-OST0014-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     28 UP osc fir-OST0015-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     29 UP osc fir-OST0016-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     30 UP osc fir-OST0017-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     31 UP osc fir-OST0018-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     32 UP osc fir-OST0019-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     33 IN osc fir-OST001a-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     34 IN osc fir-OST001b-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     35 UP osc fir-OST001c-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     36 IN osc fir-OST001d-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     37 UP osc fir-OST001e-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     38 UP osc fir-OST001f-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     39 UP osc fir-OST0020-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     40 UP osc fir-OST0021-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     41 IN osc fir-OST0022-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     42 UP osc fir-OST0023-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     43 UP osc fir-OST0024-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     44 IN osc fir-OST0025-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     45 UP osc fir-OST0026-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     46 UP osc fir-OST0027-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     47 UP osc fir-OST0028-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     48 UP osc fir-OST0029-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     49 UP osc fir-OST002a-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     50 UP osc fir-OST002b-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     51 UP osc fir-OST002c-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     52 UP osc fir-OST002d-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     53 UP osc fir-OST002e-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     54 IN osc fir-OST002f-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     55 IN osc fir-OST0030-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     56 UP osc fir-OST0031-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     57 UP osc fir-OST0032-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     58 UP osc fir-OST0033-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     59 UP osc fir-OST0034-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     60 IN osc fir-OST0035-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     61 UP osc fir-OST0036-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     62 IN osc fir-OST0037-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     63 IN osc fir-OST0038-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     64 UP osc fir-OST0039-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     65 UP osc fir-OST003a-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     66 UP osc fir-OST003b-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     67 IN osc fir-OST003c-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     68 IN osc fir-OST003d-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     69 UP osc fir-OST003e-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     70 UP osc fir-OST003f-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     71 IN osc fir-OST0040-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     72 UP osc fir-OST0041-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     73 UP osc fir-OST0042-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     74 UP osc fir-OST0043-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     75 IN osc fir-OST0044-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     76 UP osc fir-OST0045-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     77 UP osc fir-OST0046-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     78 UP osc fir-OST0047-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     79 UP osc fir-OST0048-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     80 UP osc fir-OST0049-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     81 IN osc fir-OST004a-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     82 IN osc fir-OST004b-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     83 UP osc fir-OST004c-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     84 IN osc fir-OST004d-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     85 UP osc fir-OST004e-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     86 UP osc fir-OST004f-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     87 IN osc fir-OST0050-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     88 IN osc fir-OST0051-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     89 UP osc fir-OST0052-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     90 UP osc fir-OST0053-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     91 UP osc fir-OST0054-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     92 UP osc fir-OST0055-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     93 IN osc fir-OST0056-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     94 UP osc fir-OST0057-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     95 UP osc fir-OST0058-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     96 UP osc fir-OST0059-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     97 IN osc fir-OST005a-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     98 UP osc fir-OST005b-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
     99 UP osc fir-OST005c-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
    100 UP osc fir-OST005d-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
    101 UP osc fir-OST005e-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
    102 UP osc fir-OST005f-osc-ffff916185740800 0793d310-98b7-3c69-61fc-4fc3ff0f31a5 4
    
Comment by Andreas Dilger [ 05/Nov/19 ]

Do the OSC/MDC connections that are inactive on a particular node change over time? For example, if you checked sh-103-68 and sh-31-09 again, do they still have the same inactive connections, or do they have different inactive connections?

The clients that are showing fir-MDT0000 as being disconnected might be fallout from LU-12935 when MDT0000 was having problems, but that doesn't explain the OST problems.

You could try running on e.g. sh-103-68 "lctl --device 8 activate" to see if the client can reconnect to fir-OST0001 manually?

Comment by Stephane Thiell [ 05/Nov/19 ]

Thanks! At this point, we don't have any client in that state anymore, those have been rebooted since then. We have now a test in NHC (Node Health Check) on Sherlock that will detect any new occurrence of inactive target connections.

Generated at Sat Feb 10 02:56:55 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.