Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1864

Test failure on test suite sanity-benchmark, subtest test_bonnie

    Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: Lustre 2.3.0
    • Fix Version/s: Lustre 2.3.0
    • Labels:
    • Severity:
      3
    • Rank (Obsolete):
      4290

      Description

      This issue was created by maloo for Minh Diep <mdiep@whamcloud.com>

      This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/7cd1625a-f9b3-11e1-b8d8-52540035b04c.

      The sub-test test_bonnie failed with the following error:

      test failed to respond and timed out

      OSS crash
      20:39:50:Lustre: DEBUG MARKER: == sanity-benchmark test bonnie: bonnie++ ============================================================ 20:39:40 (1346989180)
      20:39:50:Lustre: DEBUG MARKER: /usr/sbin/lctl mark min OST has 7013632kB available, using 4117256kB file size
      20:39:51:Lustre: DEBUG MARKER: min OST has 7013632kB available, using 4117256kB file size
      21:05:01:LustreError: 11-0: an error occurred while communicating with 10.10.4.222@tcp. The obd_ping operation failed with -107
      21:05:01:LustreError: 166-1: MGC10.10.4.222@tcp: Connection to MGS (at 10.10.4.222@tcp) was lost; in progress operations using this service will fail
      21:05:01:LustreError: Skipped 5 previous similar messages
      21:05:01:Lustre: Evicted from MGS (at MGC10.10.4.222@tcp_0) after server handle changed from 0x8f431cdc616c1654 to 0x8f431cdc61bdc473
      21:05:01:Lustre: Skipped 1 previous similar message
      21:05:01:Lustre: MGC10.10.4.222@tcp: Reactivating import
      21:05:01:Lustre: Skipped 3 previous similar messages
      21:05:01:Lustre: MGC10.10.4.222@tcp: Connection restored to MGS (at 10.10.4.222@tcp)
      21:05:01:Lustre: Skipped 1 previous similar message
      21:08:24:LustreError: 11-0: an error occurred while communicating with 10.10.4.222@tcp. The obd_ping operation failed with -107
      21:08:24:LustreError: 166-1: MGC10.10.4.222@tcp: Connection to MGS (at 10.10.4.222@tcp) was lost; in progress operations using this service will fail
      21:08:24:Lustre: Evicted from MGS (at MGC10.10.4.222@tcp_0) after server handle changed from 0x8f431cdc61bdc473 to 0x8f431cdc61bdc529
      21:08:24:Lustre: MGC10.10.4.222@tcp: Reactivating import
      21:08:24:Lustre: MGC10.10.4.222@tcp: Connection restored to MGS (at 10.10.4.222@tcp)
      21:10:46:BUG: unable to handle kernel NULL pointer dereference at 0000000000000036
      21:10:46:IP: [<ffffffffa0701f67>] lu_site_purge+0x47/0x400 [obdclass]
      21:10:46:PGD 0
      21:10:46:Oops: 0000 1 SMP
      21:10:46:last sysfs file: /sys/devices/system/cpu/possible
      21:10:46:CPU 0
      21:10:46:Modules linked in: osd_zfs(U) lustre(U) ofd(U) ost(U) cmm(U) mdt(U) mdd(U) mds(U) mgs(U) jbd2 obdecho(U) mgc(U) lquota(U) lov(U) osc(U) mdc(U) lmv(U) fid(U) fld(U) ptlrpc(U) obdclass(U) lvfs(U) ksocklnd(U) lnet(U) sha512_generic sha256_generic libcfs(U) nfsd lockd nfs_acl auth_rpcgss exportfs autofs4 sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core zfs(P)(U) zcommon(P)(U) znvpair(P)(U) zavl(P)(U) zunicode(P)(U) spl(U) zlib_deflate microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk pata_acpi ata_generic ata_piix virtio_pci virtio_ring virtio dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
      21:10:46:
      21:10:47:Pid: 915, comm: arc_adapt Tainted: P --------------- 2.6.32-279.5.1.el6_lustre.g293c36b.x86_64 #1 Red Hat KVM
      21:10:47:RIP: 0010:[<ffffffffa0701f67>] [<ffffffffa0701f67>] lu_site_purge+0x47/0x400 [obdclass]
      21:10:47:RSP: 0018:ffff880037cabd40 EFLAGS: 00010213
      21:10:47:RAX: 0000000000000000 RBX: 0000000000000400 RCX: 0000000000000002
      21:10:47:RDX: 0000000000000400 RSI: ffff88004e7532e0 RDI: ffff880037cabdf0
      21:10:47:RBP: ffff880037cabde0 R08: 0000000000000246 R09: 00000000000000ce
      21:10:47:R10: 0000000000000000 R11: 000000000000000a R12: ffff88004e753000
      21:10:47:R13: ffff88004e7532e0 R14: ffffffffa0e9a110 R15: ffffffffffffffff
      21:10:47:FS: 0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
      21:10:47:CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
      21:10:47:CR2: 0000000000000036 CR3: 0000000037f6a000 CR4: 00000000000006f0
      21:10:47:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
      21:10:47:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
      21:10:48:Process arc_adapt (pid: 915, threadinfo ffff880037caa000, task ffff880037cda080)
      21:10:48:Stack:
      21:10:48: ffff880037cabd90 ffff880037cabd90 ffff880037cabdf0 ffffffffffffffff
      21:10:48:<d> ffff880000000000 ffff880037cabdf0 ffff88004e753000 0000000000100000
      21:10:48:<d> ffffffffa0e9a110 ffffffffffffffff ffff880037cabd90 ffff880037cabd90
      21:10:48:Call Trace:
      21:10:48: [<ffffffffa0e9a110>] ? arc_prune_func+0x0/0xe0 [osd_zfs]
      21:10:48: [<ffffffffa0e9a110>] ? arc_prune_func+0x0/0xe0 [osd_zfs]
      21:10:49: [<ffffffffa0e9a15b>] arc_prune_func+0x4b/0xe0 [osd_zfs]
      21:10:49: [<ffffffffa023a290>] arc_adjust_meta+0x120/0x1e0 [zfs]
      21:10:49: [<ffffffffa023a350>] ? arc_adapt_thread+0x0/0xd0 [zfs]
      21:10:49: [<ffffffffa023a350>] ? arc_adapt_thread+0x0/0xd0 [zfs]
      21:10:49: [<ffffffffa023a3ba>] arc_adapt_thread+0x6a/0xd0 [zfs]
      21:10:49: [<ffffffffa01647f8>] thread_generic_wrapper+0x68/0x80 [spl]
      21:10:49: [<ffffffffa0164790>] ? thread_generic_wrapper+0x0/0x80 [spl]
      21:10:49: [<ffffffff81091d66>] kthread+0x96/0xa0
      21:10:49: [<ffffffff8100c14a>] child_rip+0xa/0x20
      21:10:50: [<ffffffff81091cd0>] ? kthread+0x0/0xa0
      21:10:50: [<ffffffff8100c140>] ? child_rip+0x0/0x20
      21:10:51:Code: bd 70 ff ff ff 89 d3 83 fb ff 49 89 f5 48 89 85 68 ff ff ff 48 89 45 b0 48 89 45 b8 8b 46 08 89 45 80 0f 84 fa 02 00 00 48 8b 06 <0f> b6 50 36 0f b6 48 32 29 d1 89 da d3 ea 89 d1 83 c1 01 89 8d
      21:10:51:RIP [<ffffffffa0701f67>] lu_site_purge+0x47/0x400 [obdclass]
      21:10:51: RSP <ffff880037cabd40>
      21:10:51:CR2: 0000000000000036
      21:10:51:Initializing cgroup subsys cpuset
      21:10:51:Initializing cgroup subsys cpu

      Info required for matching: sanity-benchmark bonnie

        Attachments

          Activity

            People

            • Assignee:
              liwei Li Wei (Inactive)
              Reporter:
              maloo Maloo
            • Votes:
              0 Vote for this issue
              Watchers:
              0 Start watching this issue

              Dates

              • Created:
                Updated:
                Resolved: