Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-11654

kernel crash on client eviction with CL_OPEN changelog enabled

    Details

    • Type: Bug
    • Status: Resolved
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: Lustre 2.12.0
    • Fix Version/s: Lustre 2.12.0
    • Labels:
      None
    • Severity:
      1
    • Rank (Obsolete):
      9223372036854775807

      Description

      Encounter this issue in my test:

      [5996825.601469] Hardware name: Quanta Cloud Technology Inc. QuantaGrid D51PH-1ULH/S2PH-MB, BIOS S2P_3B04 03/04/2016
      {{ [5996825.611713] task: ffff8fbf68309fa0 ti: ffff8fbf6cc9c000 task.ti: ffff8fbf6cc9c000}}
      {{ [5996825.619358] RIP: 0010:[<ffffffffc17c2b41>] [<ffffffffc17c2b41>] mdd_close+0x241/0xf40 [mdd]}}
      {{ [5996825.627991] RSP: 0018:ffff8fbf6cc9fcb0 EFLAGS: 00010246}}
      {{ [5996825.633468] RAX: 0000000000000000 RBX: ffff8fa7fda20ba0 RCX: 00000000000000c3}}
      {{ [5996825.640766] RDX: 0000000000000001 RSI: ffffffffc0b3fec0 RDI: ffff8fa7fda20ba0}}
      {{ [5996825.648064] RBP: ffff8fbf6cc9fd08 R08: 0000000000000001 R09: ffffffffc1026a71}}
      {{ [5996825.655362] R10: 0000000000000000 R11: ffffd41edd731800 R12: ffff8fbf6cc9fd90}}
      {{ [5996825.662659] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000001}}
      {{ [5996825.669958] FS: 0000000000000000(0000) GS:ffff8fafffbc0000(0000) knlGS:0000000000000000}}
      {{ [5996825.678218] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033}}
      {{ [5996825.684128] CR2: 0000000000000018 CR3: 0000000f9fe0e000 CR4: 00000000003607e0}}
      {{ [5996825.691427] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000}}
      {{ [5996825.698732] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400}}
      {{ [5996825.706030] Call Trace:}}
      {{ [5996825.708679] [<ffffffffc163165b>] mdt_mfd_close+0x3cb/0x810 [mdt]}}
      {{ [5996825.714951] [<ffffffffc15fe55e>] mdt_obd_disconnect+0x36e/0x670 [mdt]}}
      {{ [5996825.721689] [<ffffffffc0ff9914>] class_fail_export+0x224/0x520 [obdclass]}}
      {{ [5996825.728794] [<ffffffffc11e5a38>] expired_lock_main+0x5f8/0xc00 [ptlrpc]}}
      {{ [5996825.735665] [<ffffffff9dccf670>] ? wake_up_state+0x20/0x20}}
      {{ [5996825.741449] [<ffffffffc11e5440>] ? ldlm_server_glimpse_ast+0x3f0/0x3f0 [ptlrpc]}}
      {{ [5996825.749012] [<ffffffff9dcbb621>] kthread+0xd1/0xe0}}
      {{ [5996825.754065] [<ffffffff9dcbb550>] ? insert_kthread_work+0x40/0x40}}
      {{ [5996825.760333] [<ffffffff9e3205f7>] ret_from_fork_nospec_begin+0x21/0x21}}
      {{ [5996825.767030] [<ffffffff9dcbb550>] ? insert_kthread_work+0x40/0x40}}
      {{ [5996825.773285] Code: 74 50 4c 89 e7 44 89 45 c0 e8 bc f5 87 ff 48 85 c0 44 8b 45 c0 74 06 80 78 70 00 74 35 48 8b 45 b8 8b 4d c4 48 89 df 44 89 45 c0 <8b> 50 18 8b 70 14 e8 34 ae ff ff 48 85 c0 48 89 c2 44 8b 45 c0}}
      {{ [5996825.793410] RIP [<ffffffffc17c2b41>] mdd_close+0x241/0xf40 [mdd]}}
      {{ [5996825.799703] RSP <ffff8fbf6cc9fcb0>}}
      {{ [5996825.803360] CR2: 0000000000000018}}

      }}

      The root cause of this issue is that mdd_close() assumes that lu_ucred is always initialized.

       

      Patch is coming soon

        Attachments

          Activity

            People

            • Assignee:
              Jinshan Jinshan Xiong
              Reporter:
              Jinshan Jinshan Xiong
            • Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: