Lustre / LU-5345

PPC mdc_read_page(): Page-wide hash collision: 0xfeffffffffffffff

Details

    • Bug
    • Resolution: Cannot Reproduce
    • Major
    • None
    • Lustre 2.6.0
    • 3
    • 14906

    Description

      Reading a normal (non-DNE) directory on a PPC node lists the correct entries but also prints an error: the final readdir() call, which should return 0, returns -EIO instead.

      ## Create and ls on x86_64 node:
      [root@shadow-36vm8 lustre]# mkdir d0
      [root@shadow-36vm8 lustre]# touch d0/f{0..7}
      [root@shadow-36vm8 lustre]# ls d0
      f0  f1  f2  f3  f4  f5  f6  f7
      
      ## ls on PPC node:
      shadow-31:lustre# ls d0
      ls: reading directory d0: Input/output error
      f0  f1  f2  f3  f4  f5  f6  f7
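
The symptom above (all entries listed, then an error from the final readdir()) can also be checked from user space without ls. The following is a minimal sketch, assuming Python 3 is available on the client; the helper name list_until_error is hypothetical and not part of Lustre. It reads the directory one entry at a time and keeps whatever was returned before any error.

```python
import errno
import os
import sys


def list_until_error(path):
    """Return (names, error): entries read before any OSError, and that error or None."""
    names = []
    error = None
    try:
        # os.scandir() yields entries lazily, so a failure on a later
        # readdir() call still leaves the earlier entries in `names`.
        with os.scandir(path) as it:
            for entry in it:
                names.append(entry.name)
    except OSError as exc:
        error = exc
    return sorted(names), error


if __name__ == "__main__":
    names, error = list_until_error(sys.argv[1] if len(sys.argv) > 1 else ".")
    print("  ".join(names))
    if error is not None:
        code = errno.errorcode.get(error.errno, "?")
        print("readdir failed: %s (%s)" % (code, error.strerror))
```

On a node showing the problem described here, one would expect the eight names followed by an EIO report; on an unaffected node the error should be None.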
      

      Here are the traces from the two readdir() calls:

      00000080:00200000:5.0:1405368875.648730:1552:4391:0:(dir.c:318:ll_readdir()) VFS Op:inode=[0x280000409:0xa:0x0](c000000772e69cf8) pos/size0/4096 32bit_api 0
      00000002:00010000:5.0:1405368875.648735:3088:4391:0:(mdc_locks.c:1155:mdc_intent_lock()) (name: ,[0x280000409:0xa:0x0]) in obj [0x280000409:0xa:0x0], intent: readdir flags 00
      00010000:00010000:5.0:1405368875.648739:4464:4391:0:(ldlm_lock.c:758:ldlm_lock_addref_internal_nolock()) ### ldlm_lock_addref(PR) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 3 type: IBT flags: 0x0 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00010000:00010000:5.0:1405368875.648745:4320:4391:0:(ldlm_lock.c:1392:ldlm_lock_match()) ### matched (0 0) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x0 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00000100:00100000:5.0:1405368875.648770:4176:4391:0:(client.c:1480:ptlrpc_send_new_req()) Sending RPC pname:cluuid:pid:xid:nid:opc ls:4986df11-5f39-84aa-b8fd-928839f5b18a:4391:1473632680299424:10.1.5.199@tcp:37
      00000100:00100000:5.0:1405368875.648785:3952:4391:0:(client.c:2146:ptlrpc_set_wait()) set c000000798c90b80 going to sleep for 6 seconds
      00000100:00100000:5.0:1405368875.650204:4368:4391:0:(client.c:1863:ptlrpc_check_set()) Completed RPC pname:cluuid:pid:xid:nid:opc ls:4986df11-5f39-84aa-b8fd-928839f5b18a:4391:1473632680299424:10.1.5.199@tcp:37
      00010000:00010000:5.0:1405368875.650219:3680:4391:0:(ldlm_lock.c:811:ldlm_lock_decref_internal_nolock()) ### ldlm_lock_decref(PR) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 3/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x10000000000000 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00010000:00010000:5.0:1405368875.650226:3504:4391:0:(ldlm_lock.c:879:ldlm_lock_decref_internal()) ### add lock into lru list ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/0,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x10000000000000 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00000002:00010000:5.0:1405368875.650237:3088:4391:0:(mdc_locks.c:1155:mdc_intent_lock()) (name: ,[0x280000409:0xa:0x0]) in obj [0x280000409:0xa:0x0], intent: readdir flags 00
      00010000:00010000:5.0:1405368875.650242:4464:4391:0:(ldlm_lock.c:758:ldlm_lock_addref_internal_nolock()) ### ldlm_lock_addref(PR) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 3 type: IBT flags: 0x0 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00010000:00010000:5.0:1405368875.650248:4320:4391:0:(ldlm_lock.c:1392:ldlm_lock_match()) ### matched (0 0) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x0 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00000002:00200000:5.0:1405368875.650259:2720:4391:0:(mdc_request.c:1217:mdc_page_locate()) offset 100000000000000 [0x0 0xfeffffffffffffff], hash 0xfeffffffffffffff
      00000100:00100000:5.0:1405368875.650281:4176:4391:0:(client.c:1480:ptlrpc_send_new_req()) Sending RPC pname:cluuid:pid:xid:nid:opc ls:4986df11-5f39-84aa-b8fd-928839f5b18a:4391:1473632680299428:10.1.5.199@tcp:37
      00000100:00100000:5.0:1405368875.650296:3952:4391:0:(client.c:2146:ptlrpc_set_wait()) set c000000798c90b80 going to sleep for 6 seconds
      00000100:00100000:5.0:1405368875.651660:4368:4391:0:(client.c:1863:ptlrpc_check_set()) Completed RPC pname:cluuid:pid:xid:nid:opc ls:4986df11-5f39-84aa-b8fd-928839f5b18a:4391:1473632680299428:10.1.5.199@tcp:37
      00000002:00000400:5.0:1405368875.651674:2720:4391:0:(mdc_request.c:1585:mdc_read_page()) Page-wide hash collision: 0xfeffffffffffffff
      00010000:00010000:5.0:1405368875.651679:3680:4391:0:(ldlm_lock.c:811:ldlm_lock_decref_internal_nolock()) ### ldlm_lock_decref(PR) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 3/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x10000000000000 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00010000:00010000:5.0:1405368875.651686:3504:4391:0:(ldlm_lock.c:879:ldlm_lock_decref_internal()) ### add lock into lru list ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/0,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x10000000000000 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      
      00000080:00200000:5.0:1405368875.651701:1552:4391:0:(dir.c:318:ll_readdir()) VFS Op:inode=[0x280000409:0xa:0x0](c000000772e69cf8) pos/size18374686479671623679/4096 32bit_api 0
      00000002:00010000:5.0:1405368875.651707:3088:4391:0:(mdc_locks.c:1155:mdc_intent_lock()) (name: ,[0x280000409:0xa:0x0]) in obj [0x280000409:0xa:0x0], intent: readdir flags 00
      00010000:00010000:5.0:1405368875.651711:4464:4391:0:(ldlm_lock.c:758:ldlm_lock_addref_internal_nolock()) ### ldlm_lock_addref(PR) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 3 type: IBT flags: 0x0 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00010000:00010000:5.0:1405368875.651718:4320:4391:0:(ldlm_lock.c:1392:ldlm_lock_match()) ### matched (0 0) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x0 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00000002:00200000:5.0:1405368875.651728:2720:4391:0:(mdc_request.c:1217:mdc_page_locate()) offset 100000000000000 [0x0 0xfeffffffffffffff], hash 0xfeffffffffffffff
      00000100:00100000:5.0:1405368875.651750:4176:4391:0:(client.c:1480:ptlrpc_send_new_req()) Sending RPC pname:cluuid:pid:xid:nid:opc ls:4986df11-5f39-84aa-b8fd-928839f5b18a:4391:1473632680299432:10.1.5.199@tcp:37
      00000100:00100000:5.0:1405368875.651766:3952:4391:0:(client.c:2146:ptlrpc_set_wait()) set c000000798c90b80 going to sleep for 6 seconds
      00000100:00100000:5.0:1405368875.653407:4368:4391:0:(client.c:1863:ptlrpc_check_set()) Completed RPC pname:cluuid:pid:xid:nid:opc ls:4986df11-5f39-84aa-b8fd-928839f5b18a:4391:1473632680299432:10.1.5.199@tcp:37
      00000002:00000400:5.0:1405368875.653421:2720:4391:0:(mdc_request.c:1585:mdc_read_page()) Page-wide hash collision: 0xfeffffffffffffff
      00010000:00010000:5.0:1405368875.653426:3680:4391:0:(ldlm_lock.c:811:ldlm_lock_decref_internal_nolock()) ### ldlm_lock_decref(PR) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 3/1,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x10000000000000 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00010000:00010000:5.0:1405368875.653433:3504:4391:0:(ldlm_lock.c:879:ldlm_lock_decref_internal()) ### add lock into lru list ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a888a80/0xb20b39c134a927fd lrc: 2/0,0 mode: PR/PR res: [0x280000409:0xa:0x0].0 bits 0x13 rrc: 2 type: IBT flags: 0x10000000000000 nid: local remote: 0x6b92c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
      00000080:00200000:5.0:1405368875.653666:1888:4391:0:(file.c:365:ll_file_release()) VFS Op:inode=[0x280000409:0xa:0x0](c000000772e69cf8)
      00010000:00010000:5.0:1405368875.653672:3168:4391:0:(ldlm_lock.c:1392:ldlm_lock_match()) ### matched (0 0) ns: lustre-MDT0000-mdc-c00000079aee3f00 lock: c00000079a882880/0xb20b39c134a92804 lrc: 2/0,0 mode: CR/CR res: [0x280000409:0xa:0x0].0 bits 0x5 rrc: 2 type: IBT flags: 0x0 nid: local remote: 0x7992c6b1df4440e9 expref: -99 pid: 4391 timeout: 0 lvb_type: 0
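
Two numbers in the traces above are the same value in different bases: the pos printed by the second ll_readdir() line (18374686479671623679) is the decimal form of the hash 0xfeffffffffffffff reported by mdc_page_locate() and mdc_read_page(). The sketch below checks this and also prints the byte-reversed value, since PPC and x86_64 nodes commonly differ in byte order. Whether byte order is actually involved is an assumption; this ticket does not establish a cause.

```python
import struct

POS_FROM_TRACE = 18374686479671623679   # pos in the second ll_readdir() line
HASH_FROM_TRACE = 0xFEFFFFFFFFFFFFFF    # hash in the mdc_read_page() lines


def byteswap64(value):
    """Reverse the byte order of an unsigned 64-bit integer."""
    return struct.unpack("<Q", struct.pack(">Q", value))[0]


if __name__ == "__main__":
    print(POS_FROM_TRACE == HASH_FROM_TRACE)   # True
    print(hex(byteswap64(HASH_FROM_TRACE)))    # 0xfffffffffffffffe
```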
      

            People

              wc-triage WC Triage
              jhammond John Hammond