[LU-7388] Interop 2.7.0<->master: sanity-hsm test_59: test failed to respond and timed out
Created: 05/Nov/15  Updated: 05/Nov/15  Resolved: 05/Nov/15

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Duplicate
Votes: 0
Labels: None
Environment:

Server: 2.7.0, b2_7/29
Client: master, build# 3227, RHEL 6.7


Severity: 3
Rank (Obsolete): 9223372036854775807

Description

This issue was created by maloo for Saurabh Tandan <saurabh.tandan@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/03958094-7eb5-11e5-965a-5254006e85c2.

The sub-test test_59 failed with the following error:

test failed to respond and timed out

mds console:

00:32:37:Lustre: DEBUG MARKER: == sanity-hsm test 59: Release stripeless file with non-zero size == 00:32:32 (1446078752)
00:32:37:Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200000407:0xd:0x0'.*action='ARCHIVE'/ {print $13}' | cut -f2 -d=
00:32:37:Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n mdt.lustre-MDT0000.hsm.actions | awk '/'0x200000407:0xd:0x0'.*action='ARCHIVE'/ {print $13}' | cut -f2 -d=
00:32:37:divide error: 0000 [#1] SMP 
00:32:37:last sysfs file: /sys/devices/system/cpu/online
00:32:37:CPU 1 
00:32:37:Modules linked in: osp(U) mdd(U) lod(U) mdt(U) lfsck(U) mgs(U) mgc(U) osd_ldiskfs(U) lquota(U) lustre(U) lov(U) mdc(U) fid(U) lmv(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) sha512_generic libcfs(U) ldiskfs(U) jbd2 nfsd exportfs nfs lockd fscache auth_rpcgss nfs_acl sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_sa ib_mad ib_core ib_addr ipv6 microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: speedstep_lib]
00:32:37:
00:32:37:Pid: 10697, comm: mdt_rdpg00_007 Not tainted 2.6.32-504.8.1.el6_lustre.x86_64 #1 Red Hat KVM
00:32:37:RIP: 0010:[<ffffffffa0f1376f>]  [<ffffffffa0f1376f>] lod_declare_striped_object+0x4ef/0x9b0 [lod]
00:32:37:RSP: 0018:ffff88007a8e99f0  EFLAGS: 00010246
00:32:37:RAX: 0000000000000000 RBX: ffff88007ae8c128 RCX: 0000000000010000
00:32:37:RDX: 0000000000000000 RSI: 000000000000002a RDI: 0000000000000000
00:32:37:RBP: ffff88007a8e9a40 R08: ffff880079381400 R09: ffff880078d38800
00:32:37:R10: ffffffffa0f21d2a R11: 0000000000000000 R12: ffff88005e1b2980
00:32:37:R13: ffff88007aab9000 R14: ffff880079381400 R15: ffff88007aab90d0
00:32:37:FS:  0000000000000000(0000) GS:ffff880002300000(0000) knlGS:0000000000000000
00:32:37:CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
00:32:37:CR2: 00000000020a3990 CR3: 000000007a70c000 CR4: 00000000000006e0
00:32:37:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
00:32:37:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
00:32:37:Process mdt_rdpg00_007 (pid: 10697, threadinfo ffff88007a8e8000, task ffff88007a8e7500)
00:32:37:Stack:
00:32:37: ffff880079381400 ffff88007aab90d0 ffff88007aab90d0 ffff88007c02b800
00:32:37:<d> fffffffffffffffe ffff88007ae8c128 ffff88005e1b2980 ffff880079381400
00:32:37:<d> ffff88007aab90d0 ffff88007aab9000 ffff88007a8e9aa0 ffffffffa0f16399
00:32:37:Call Trace:
00:32:37: [<ffffffffa0f16399>] lod_declare_xattr_set+0x269/0x340 [lod]
00:32:37: [<ffffffffa0f81f0f>] mdd_declare_xattr_set+0x8f/0x260 [mdd]
00:32:37: [<ffffffffa0f80efc>] ? mdo_xattr_get+0xac/0x1c0 [mdd]
00:32:37: [<ffffffffa0f84f09>] mdd_swap_layouts+0x939/0x12b0 [mdd]
00:32:37: [<ffffffffa0f8e3ca>] ? mdd_trans_stop+0x1a/0xac [mdd]
00:32:37: [<ffffffffa0e46d60>] mdt_hsm_release+0xe30/0x13d0 [mdt]
00:32:37: [<ffffffffa07e6684>] ? sptlrpc_svc_alloc_rs+0x74/0x360 [ptlrpc]
00:32:37: [<ffffffffa07bcebc>] ? lustre_msg_add_version+0x6c/0xc0 [ptlrpc]
00:32:37: [<ffffffffa0e49724>] mdt_mfd_close+0x314/0xac0 [mdt]
00:32:37: [<ffffffffa0582905>] ? class_handle2object+0x95/0x190 [obdclass]
00:32:37: [<ffffffffa0581b6d>] ? class_handle_unhash_nolock+0x2d/0x150 [obdclass]
00:32:37: [<ffffffffa0e4b313>] mdt_close+0x6f3/0xaa0 [mdt]
00:32:37: [<ffffffffa081e56e>] tgt_request_handle+0x8be/0x1000 [ptlrpc]
00:32:37: [<ffffffffa07ce5a1>] ptlrpc_main+0xe41/0x1960 [ptlrpc]
00:32:37: [<ffffffffa07cd760>] ? ptlrpc_main+0x0/0x1960 [ptlrpc]
00:32:37: [<ffffffff8109e66e>] kthread+0x9e/0xc0
00:32:37: [<ffffffff8100c20a>] child_rip+0xa/0x20
00:32:37: [<ffffffff8109e5d0>] ? kthread+0x0/0xc0
00:32:37: [<ffffffff8100c200>] ? child_rip+0x0/0x20


Comments
Comment by Alex Zhuravlev [ 05/Nov/15 ]

This is a duplicate of LU-7376.
