Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-17754

lustre crash on copper50 general protection fault, probably for non-canonical address 0x3760003237332428: 0000 [#1] SMP PTI

    XMLWordPrintable

Details

    • Bug
    • Resolution: Not a Bug
    • Minor
    • None
    • Lustre 2.15.4
    • None
    • 3
    • 9223372036854775807

    Description

      An OST server node (copper50) crashed and failed over.

      The stack trace is

      2024-04-17 23:18:22 [556577.979144] general protection fault, probably for non-canonical address 0x3760003237332428: 0000 [#1] SMP PTI
      2024-04-17 23:18:22 [556577.990410] CPU: 3 PID: 220137 Comm: ll_ost00_020 Kdump: loaded Tainted: P        W  OE KX --------- -  - 4.18.0-513.18.1.2toss.t4.x86_64 #1
      2024-04-17 23:18:22 [556578.004602] Hardware name: Intel Corporation S2600WTTR/S2600WTTR, BIOS SE5C610.86B.01.01.0024.021320181901 02/13/2018
      2024-04-17 23:18:22 [556578.016541] RIP: 0010:strcmp+0xc/0x30
      2024-04-17 23:18:22 [556578.020730] Code: 75 f7 31 d2 44 0f b6 04 16 44 88 04 11 48 83 c2 01 45 84 c0 75 ee c3 cc cc cc cc 0f 1f 00 31 c0 eb 08 48 83 c0 01 84 d2 74 13 <0f> b6 14 07 3a 14 06 74 ef 19 c0 83 c8 01 c3 cc cc cc cc 31 c0 c3
      2024-04-17 23:18:22 [556578.041782] RSP: 0018:ffffbc9d2debbb58 EFLAGS: 00010246
      2024-04-17 23:18:22 [556578.047712] RAX: 0000000000000000 RBX: 3760003237332400 RCX: 000000000000000f
      2024-04-17 23:18:22 [556578.055772] RDX: 0000000000000032 RSI: ffffffffc18f2c7d RDI: 3760003237332428
      2024-04-17 23:18:22 [556578.063829] RBP: ffffffffc18f2c7d R08: 00000000c063529e R09: ffffffffc18f2c8c
      2024-04-17 23:18:22 [556578.071888] R10: ffff970622b19400 R11: ffff97115b98a000 R12: 000000000000000a
      2024-04-17 23:18:22 [556578.079959] R13: ffffbc9d2debbc04 R14: ffff97022a6e6c80 R15: ffff9701d08a0e00
      2024-04-17 23:18:22 [556578.088019] FS:  0000000000000000(0000) GS:ffff970fffac0000(0000) knlGS:0000000000000000
      2024-04-17 23:18:22 [556578.097144] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
      2024-04-17 23:18:22 [556578.103651] CR2: 00007fffea61b1a0 CR3: 0000000d9ae10005 CR4: 00000000003706e0
      2024-04-17 23:18:22 [556578.111710] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      2024-04-17 23:18:22 [556578.119768] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
      2024-04-17 23:18:22 [556578.127826] Call Trace:
      2024-04-17 23:18:22 [556578.130651]  ? __die_body+0x1a/0x60
      2024-04-17 23:18:22 [556578.134643]  ? die_addr+0x38/0x51
      2024-04-17 23:18:22 [556578.138438]  ? do_general_protection+0x14f/0x2a0
      2024-04-17 23:18:22 [556578.143687]  ? general_protection+0x1e/0x30
      2024-04-17 23:18:22 [556578.148454]  ? strcmp+0xc/0x30
      2024-04-17 23:18:22 [556578.151962]  nvt_lookup_name_type.isra.53+0x72/0xb0 [znvpair]
      2024-04-17 23:18:22 [556578.158479]  nvlist_lookup_common+0x32/0x80 [znvpair]
      2024-04-17 23:18:22 [556578.164219]  __osd_sa_xattr_get.isra.10.part.11+0x37/0xc0 [osd_zfs]
      2024-04-17 23:18:22 [556578.171324]  osd_xattr_get_internal+0x5b/0x150 [osd_zfs]
      2024-04-17 23:18:22 [556578.177357]  osd_xattr_get+0x35b/0x5f0 [osd_zfs]
      2024-04-17 23:18:22 [556578.182613]  dt_version_get+0x75/0x240 [obdclass]
      2024-04-17 23:18:22 [556578.188016]  ofd_version_get_check+0x20/0x1f0 [ofd]
      2024-04-17 23:18:22 [556578.193571]  ofd_attr_set+0x77/0x1090 [ofd]
      2024-04-17 23:18:22 [556578.198340]  ofd_setattr_hdl+0x3db/0x6e0 [ofd]
      2024-04-17 23:18:22 [556578.203401]  tgt_request_handle+0xce0/0x1a40 [ptlrpc]
      2024-04-17 23:18:22 [556578.209235]  ? ptlrpc_nrs_req_get_nolock0+0xff/0x1f0 [ptlrpc]
      2024-04-17 23:18:22 [556578.215788]  ptlrpc_server_handle_request+0x323/0xbe0 [ptlrpc]
      2024-04-17 23:18:22 [556578.222458]  ptlrpc_main+0xc24/0x1580 [ptlrpc]
      2024-04-17 23:18:22 [556578.227566]  ? ptlrpc_wait_event+0x5d0/0x5d0 [ptlrpc]
      2024-04-17 23:18:22 [556578.233355]  kthread+0x14c/0x170
      2024-04-17 23:18:22 [556578.237053]  ? set_kthread_struct+0x50/0x50
      2024-04-17 23:18:22 [556578.241815]  ret_from_fork+0x35/0x40

      Attachments

        Activity

          People

            pjones Peter Jones
            defazio Gian-Carlo Defazio
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: