[LU-2746] Failure on test suite conf-sanity test_45 Created: 04/Feb/13  Updated: 21/Nov/13  Resolved: 21/Nov/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: None

Type: Bug Priority: Major
Reporter: Maloo Assignee: Emoly Liu
Resolution: Fixed Votes: 0
Labels: None
Environment:

lustre-master build #1214 SLES11 SP2 client


Severity: 3
Rank (Obsolete): 6670

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/e8bec500-6d14-11e2-9f92-52540035b04c.

The sub-test test_45 failed with the following error:

test failed to respond and timed out

OST console shows:

00:17:11:Lustre: DEBUG MARKER: == conf-sanity test 45: long unlink handling in ptlrpcd ============================================== 00:16:56 (1359706616)
00:17:11:Lustre: DEBUG MARKER: mkdir -p /mnt/ost1
00:17:11:Lustre: DEBUG MARKER: test -b /dev/lvm-OSS/P1
00:17:11:Lustre: DEBUG MARKER: mkdir -p /mnt/ost1; mount -t lustre   		                   /dev/lvm-OSS/P1 /mnt/ost1
00:17:11:LDISKFS-fs (dm-0): mounted filesystem with ordered data mode. quota=on. Opts: 
00:17:11:Lustre: DEBUG MARKER: PATH=/usr/lib64/lustre/tests:/usr/lib/lustre/tests:/usr/lib64/lustre/tests:/opt/iozone/bin:/usr/lib64/lustre/tests//usr/lib64/lustre/tests:/usr/lib64/lustre/tests:/usr/lib64/lustre/tests/../utils:/opt/iozone/bin:/usr/lib64/lustre/tests/mpi:/usr/lib64/lust
00:17:11:Lustre: DEBUG MARKER: e2label /dev/lvm-OSS/P1 2>/dev/null
00:17:11:LustreError: 28915:0:(ldlm_resource.c:1159:ldlm_resource_get()) lvbo_init failed for resource 100: rc -2
00:17:11:LustreError: 28915:0:(ldlm_resource.c:1159:ldlm_resource_get()) Skipped 95 previous similar messages

client dmesg shows:

[18516.678051] Lustre: DEBUG MARKER: == conf-sanity test 45: long unlink handling in ptlrpcd ============================================== 00:16:56 (1359706616)
[18518.861254] Lustre: DEBUG MARKER: mkdir -p /mnt/lustre
[18518.868876] Lustre: DEBUG MARKER: mount -t lustre -o user_xattr,acl,flock client-32vm7@tcp:/lustre /mnt/lustre
[18518.877485] LustreError: 152-6: Ignoring deprecated mount option 'acl'.
[18518.890992] LustreError: 16712:0:(obd_config.c:1303:class_process_proc_param()) lustre-client-ffff880079d6c800: unknown param some_wrong_param=10
[18518.895450] LustreError: 11-0: lustre-MDT0000-mdc-ffff880079d6c800: Communicating with 10.10.4.202@tcp, operation mds_connect failed with -11.
[18523.895444] Lustre: Layout lock feature supported.
[18523.904992] Lustre: Mounted lustre-client


 Comments   
Comment by Emoly Liu [ 20/Mar/13 ]

I can't reproduce this failure on my local VM. By searching Maloo, I find this failure was hit for 6 times in recent two weeks and most of the failure happened on sles11 sp2.

I will keeping looking into this one.

Comment by Di Wang [ 28/Mar/13 ]

Hit this error again on SLES11 SP2 client:

https://maloo.whamcloud.com/test_sets/5f046832-9696-11e2-9ec7-52540035b04c

Comment by Sarah Liu [ 13/May/13 ]

another failure seen in sles11 sp2 client:
https://maloo.whamcloud.com/test_sets/c1da2976-ba7f-11e2-b1a3-52540035b04c

Comment by Emoly Liu [ 21/Nov/13 ]

We haven't seen this issue since June, so close it, and we can reopen it if we see it again.

Generated at Sat Feb 10 01:27:51 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.