Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-10429

soak, LBUG lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.11.0
    • Lustre 2.11.0
    • Soak test cluster, tip of master plus patch for LU-10321
    • 3
    • 9223372036854775807

    Description

      MDS completed a failover/failback sequence, then LBUGged.Bogus recovery timer

      [  573.543826] Lustre: soaked-MDT0002: Connection restored to soaked-MDT0002-lwp-OST0000_UUID (at 192.168.1.102@o2ib)
      [  573.580891] Lustre: Skipped 11 previous similar messages
      [  620.036501] Lustre: soaked-MDT0002: Denying connection for new client b3e244d6-b85b-a278-a032-0b483389bc28(at 192.168.1.116@o2ib), waiting for 31 known clients (3 recovered, 0 in progress, and 0 evicted) to recover in 71579:58
      [  620.590408] Lustre: soaked-MDT0002: Denying connection for new client b894efcc-1695-2739-719d-dce1f0686406(at 192.168.1.136@o2ib), waiting for 31 known clients (3 recovered, 0 in progress, and 0 evicted) to recover in 71579:57
      [  621.816145] Lustre: soaked-MDT0002: Denying connection for new client 62ac41a6-bf69-92f4-befc-577b5fee6b59(at 192.168.1.138@o2ib), waiting for 31 known clients (3 recovered, 0 in progress, and 0 evicted) to recover in 71579:56
      [  621.888798] Lustre: Skipped 1 previous similar message
      [  624.206173] Lustre: soaked-MDT0002: Denying connection for new client 3f82d520-6a51-5083-afbc-e5ea95b116e5(at 192.168.1.142@o2ib), waiting for 31 known clients (3 recovered, 0 in progress, and 0 evicted) to recover in 71579:54
      [  624.279133] Lustre: Skipped 3 previous similar messages
      [  628.413382] Lustre: soaked-MDT0002: Denying connection for new client 470fa3e8-31ff-e6f2-850c-c32c2fccf087(at 192.168.1.120@o2ib), waiting for 31 known clients (3 recovered, 0 in progress, and 0 evicted) to recover in 71579:49
      [  628.486832] Lustre: Skipped 4 previous similar messages
      [  630.186023] Lustre: 2476:0:(ldlm_lib.c:2029:target_recovery_overseer()) recovery is aborted, evict exports in recovery
      [  630.248533] Lustre: soaked-MDT0002: disconnecting 28 stale clients
      [  630.275436] Lustre: soaked-MDT0002: Recovery over after 3:00, of 31 clients 3 recovered and 28 were evicted.
      [  637.790744] Lustre: soaked-MDT0002: Connection restored to a4e0fd52-cac9-d111-1b57-d9203252395d (at 192.168.1.135@o2ib)
      [  637.829810] Lustre: Skipped 31 previous similar messages
      [  738.737321] LNet: 2239:0:(o2iblnd_cb.c:3198:kiblnd_check_conns()) Timed out tx for 192.168.1.115@o2ib: 0 seconds
      [  738.862656] LNet: 2239:0:(o2iblnd_cb.c:3198:kiblnd_check_conns()) Skipped 1 previous similar message
      [  834.963313] Lustre: soaked-OST0004-osc-MDT0002: Connection restored to 192.168.1.106@o2ib (at 192.168.1.106@o2ib)
      [  834.978267] Lustre: Skipped 16 previous similar messages
      [  930.946444] LustreError: 2580:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946461] LustreError: 2578:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946465] LustreError: 2578:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946466] Pid: 2578, comm: mdt01_012
      [  930.946467]
      Call Trace:
      [  930.946474] LustreError: 2539:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946480] LustreError: 2539:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946482] Pid: 2539, comm: mdt00_011
      [  930.946483]
        930.946465] LustreError: 2578:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946466] Pid: 2578, comm: mdt01_012
      [  930.946467]
      Call Trace:
      [  930.946474] LustreError: 2539:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946480] LustreError: 2539:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946482] Pid: 2539, comm: mdt00_011
      [  930.946483]
      Call Trace:
      [  930.946488] LustreError: 2538:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946493] LustreError: 2538:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946494] Pid: 2538, comm: mdt00_010
      [  930.946495]
      Call Trace:   
      [  930.946498]  [<ffffffffc09737ae>] libcfs_call_trace+0x4e/0x60 [libcfs]
      [  930.946504] LustreError: 2536:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946508] LustreError: 2536:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946510] Pid: 2536, comm: mdt00_009
      [  930.946510]
      Call Trace:   
      [  930.946512]  [<ffffffffc097383c>] lbug_with_loc+0x4c/0xb0 [libcfs]
      [  930.946523]  [<ffffffffc09737ae>] libcfs_call_trace+0x4e/0x60 [libcfs]
      [  930.946525]  [<ffffffffc09737ae>] libcfs_call_trace+0x4e/0x60 [libcfs]
      [  930.946530] LustreError: 2586:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946531]  [<ffffffffc09737ae>] libcfs_call_trace+0x4e/0x60 [libcfs]
      [  930.946538] LustreError: 2530:0:(lod_qos.c:858:lod_comp_ost_in_use()) ASSERTION( inuse->op_count * sizeof(inuse->op_array[0]) < inuse->op_size ) failed:
      [  930.946540] LustreError: 2586:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946543]  [<ffffffffc185e858>] lod_env_info.part.10+0x0/0x36 [lod]
      [  930.946545]  [<ffffffffc097383c>] lbug_with_loc+0x4c/0xb0 [libcfs]
      [  930.946549] LustreError: 2530:0:(lod_qos.c:858:lod_comp_ost_in_use()) LBUG
      [  930.946551] Pid:
      

      System crash dumped, dump available on soak.

      Attachments

        Issue Links

          Activity

            People

              bobijam Zhenyu Xu
              cliffw Cliff White (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: