Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1423

16K pagesize clients error during ls

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.3.0, Lustre 2.1.3
    • Lustre 2.1.1, Lustre 1.8.6
    • None
    • clients = ia64 (16Kpagesize) lustre-1.8.6.81-5.1nas
      server = x86(4kpagesize) lustre-2.1.1
    • 2
    • 4558

    Description

      Mounting a lustre2.1.1 filesystem on a 1.8.6 client doing a ls will fail with the following error on the client logs.

      LustreError: 2551436:0:(mdc_request.c:983:mdc_readpage()) Unexpected # bytes transferred: 4096 (16384 expected)
      LustreError: 2556239:0:(mdc_request.c:983:mdc_readpage()) Unexpected # bytes transferred: 4096 (16384 expected)
      LustreError: 2556239:0:(dir.c:949:ll_readdir_20()) error reading dir [0x12480001:0xc1bc1b0c:0x0] at 0: rc -71

      Attachments

        Issue Links

          Activity

            [LU-1423] 16K pagesize clients error during ls
            ys Yang Sheng added a comment -

            Patch landed. close bug.

            ys Yang Sheng added a comment - Patch landed. close bug.

            The patch was landed to b2_1 tree 3 days ago. So, please close this ticket.

            jaylan Jay Lan (Inactive) added a comment - The patch was landed to b2_1 tree 3 days ago. So, please close this ticket.
            ys Yang Sheng added a comment -

            Hi, Jay, Cloud we close this ticket? Please advice.

            ys Yang Sheng added a comment - Hi, Jay, Cloud we close this ticket? Please advice.

            The latest patch seemed working for us. Mahmoud tested it.

            jaylan Jay Lan (Inactive) added a comment - The latest patch seemed working for us. Mahmoud tested it.
            ys Yang Sheng added a comment -

            Patch update, Please retry, sorry for the inconvenient.

            ys Yang Sheng added a comment - Patch update, Please retry, sorry for the inconvenient.

            We installed the patch. Got a LBUG!

            Lustre: 3692:0:(sec.c:1474:sptlrpc_import_sec_adapt()) import MGS->NET_0x500000a9719cf_UUID netid 50000: select flavor null^M
            Lustre: 3692:0:(sec.c:1474:sptlrpc_import_sec_adapt()) Skipped 37 previous similar messages^M
            LustreError: 3761:0:(mdt_handler.c:1247:mdt_sendpage()) ASSERTION(desc->bd_nob == nob) failed^M
            LustreError: 3761:0:(mdt_handler.c:1247:mdt_sendpage()) LBUG^M
            Pid: 3761, comm: mdt_rdpg_01^M
            ^M
            Call Trace:^M
            [<ffffffffa05e3855>] libcfs_debug_dumpstack+0x55/0x80 [libcfs]^M
            [<ffffffffa05e3e95>] lbug_with_loc+0x75/0xe0 [libcfs]^M
            ^M
            Entering kdb (current=0xffff880c1f4fa0c0, pid 3761) on processor 0 Oops: (null)^M

            mhanafi Mahmoud Hanafi added a comment - We installed the patch. Got a LBUG! Lustre: 3692:0:(sec.c:1474:sptlrpc_import_sec_adapt()) import MGS->NET_0x500000a9719cf_UUID netid 50000: select flavor null^M Lustre: 3692:0:(sec.c:1474:sptlrpc_import_sec_adapt()) Skipped 37 previous similar messages^M LustreError: 3761:0:(mdt_handler.c:1247:mdt_sendpage()) ASSERTION(desc->bd_nob == nob) failed^M LustreError: 3761:0:(mdt_handler.c:1247:mdt_sendpage()) LBUG^M Pid: 3761, comm: mdt_rdpg_01^M ^M Call Trace:^M [<ffffffffa05e3855>] libcfs_debug_dumpstack+0x55/0x80 [libcfs] ^M [<ffffffffa05e3e95>] lbug_with_loc+0x75/0xe0 [libcfs] ^M ^M Entering kdb (current=0xffff880c1f4fa0c0, pid 3761) on processor 0 Oops: (null)^M
            ys Yang Sheng added a comment - Patch unload to: http://review.whamcloud.com/3014
            ys Yang Sheng added a comment -

            This issue related to change: http://review.whamcloud.com/#change,604

            For the fix, We need to check OBD_CONNECT_BRW_SIZE on server and return single page for old client. So looks like we need restore some codes that removed from above change.

            ys Yang Sheng added a comment - This issue related to change: http://review.whamcloud.com/#change,604 For the fix, We need to check OBD_CONNECT_BRW_SIZE on server and return single page for old client. So looks like we need restore some codes that removed from above change.

            We use IB. ipoib on ib0 and rdma on ib1.

            jaylan Jay Lan (Inactive) added a comment - We use IB. ipoib on ib0 and rdma on ib1.
            ys Yang Sheng added a comment -

            Hi, Jay, Could you tell me what net type is used between client&server.

            ys Yang Sheng added a comment - Hi, Jay, Could you tell me what net type is used between client&server.

            People

              ys Yang Sheng
              mhanafi Mahmoud Hanafi
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: