Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-16441

BUG: Bad page state in process socknal_* functions

    XMLWordPrintable

Details

    • Bug
    • Resolution: Unresolved
    • Critical
    • None
    • Lustre 2.16.0, Lustre 2.12.9, Lustre 2.15.1
    • Lustre 2.12.9 vanilla servers running in RHEL7.9 environment. Ethernet hardware is 200GiB cards.
    • 3
    • 9223372036854775807

    Description

      The following  crash hit our OSS servers for a production system.

      [3845523.531585] Call Trace:
      [3845523.535134]  [<ffffffffb7d865c9>] dump_stack+0x19/0x1b
      [3845523.541350]  [<ffffffffb7d81905>] bad_page.part.75+0xdc/0xf9
      [3845523.548082]  [<ffffffffb77c6fd2>] free_pages_prepare+0x1f2/0x210
      [3845523.555139]  [<ffffffffb77c7309>] __free_pages_ok+0x19/0xc0
      [3845523.561750]  [<ffffffffb77c73cb>] free_compound_page+0x1b/0x20
      [3845523.568605]  [<ffffffffb7d8236f>] __put_compound_page+0x25/0x28
      [3845523.575535]  [<ffffffffb7d824d8>] put_compound_page+0x166/0x174
      [3845523.582456]  [<ffffffffb77cce06>] put_page+0x56/0x60
      [3845523.588404]  [<ffffffffb7c4266f>] skb_release_data+0x8f/0x150
      [3845523.595114]  [<ffffffffb7c42754>] skb_release_all+0x24/0x30
      [3845523.601633]  [<ffffffffb7c42772>] __kfree_skb+0x12/0x20
      [3845523.607795]  [<ffffffffb7cbecad>] tcp_ack+0x60d/0x12f0
      [3845523.613856]  [<ffffffffb7cbff66>] tcp_rcv_established+0x1d6/0x7a0
      [3845523.620857]  [<ffffffffb7cc83e3>] ? tcp_v4_md5_lookup+0x13/0x20
      [3845523.627689]  [<ffffffffb7ccb04a>] tcp_v4_do_rcv+0x10a/0x350
      [3845523.634166]  [<ffffffffb7c3e986>] release_sock+0xa6/0x180
      [3845523.640454]  [<ffffffffb7cb609d>] tcp_sendpage+0xdd/0x5c0
      [3845523.646746]  [<ffffffffc0b47bab>] ksocknal_lib_send_kiov+0xdb/0x2e0 [ksocklnd]
      [3845523.654853]  [<ffffffffc0b48662>] ? ksocknal_lib_send_iov+0xd2/0x140 [ksocklnd]
      [3845523.663040]  [<ffffffffc0b4112e>] ksocknal_process_transmit+0x39e/0xc10 [ksocklnd]
      [3845523.671478]  [<ffffffffc0b45b80>] ksocknal_scheduler+0x320/0xd50 [ksocklnd]
      [3845523.679304]  [<ffffffffb76c7080>] ? wake_up_atomic_t+0x30/0x30
      [3845523.685989]  [<ffffffffc0b45860>] ? ksocknal_recv+0x2a0/0x2a0 [ksocklnd]
      [3845523.693527]  [<ffffffffb76c5f91>] kthread+0xd1/0xe0
      [3845523.699238]  [<ffffffffb76c5ec0>] ? insert_kthread_work+0x40/0x40
      [3845523.706156]  [<ffffffffb7d99ddd>] ret_from_fork_nospec_begin+0x7/0x21
      [3845523.713415]  [<ffffffffb76c5ec0>] ? insert_kthread_work+0x40/0x40
      [3845523.720321] BUG: Bad page state in process socknal_sd05_02  pfn:282c3e3
      [3845523.727747] page:ffffeb1aa0b0f8c0 count:0 mapcount:-1 mapping:          (null) index:0x0
      [3845523.736648] page flags: 0x6fffff00008000(tail)
      [3845523.741962] page dumped because: nonzero mapcount

      Attachments

        Activity

          People

            simmonsja James A Simmons
            simmonsja James A Simmons
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated: