[LU-1968] 1.8.8<->2.3 Test failure on test suite parallel-scale-nfsv4, subtest test_compilebench Created: 17/Sep/12  Updated: 17/Sep/12  Resolved: 17/Sep/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None
Environment:

server: 2.3-tag-2.2.96 RHEL6
client: 1.8.8 RHEL6


Issue Links:
Duplicate
duplicates LU-1881 sanity test 116 soft lockup Resolved
Severity: 3
Rank (Obsolete): 4063

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/903b9cf4-fec5-11e1-a707-52540035b04c.

The sub-test test_compilebench failed with the following error:

test failed to respond and timed out

Before this failure, performance-sanity, parallel-scale, large-scale and parallel-scale-nfsv3 had all failed with a similar error:

Connection to MGS (at 10.10.4.160@tcp) was lost

I could not find any other useful information in those logs. The console log from the compilebench run shows:

14:14:11:Lustre: DEBUG MARKER: == parallel-scale-nfsv4 test compilebench: compilebench == 14:14:03 (1347657243)
14:14:11:Lustre: ctl-lustre-MDT0000: super-sequence allocation rc = 0 [0x0000000200000400-0x0000000240000400):0:mdt
14:14:11:Lustre: DEBUG MARKER: /usr/sbin/lctl mark .\/compilebench -D \/mnt\/lustre\/d0.compilebench -i 4         -r 4 --makej
14:14:11:Lustre: DEBUG MARKER: ./compilebench -D /mnt/lustre/d0.compilebench -i 4 -r 4 --makej
14:21:25:------------[ cut here ]------------
14:21:25:WARNING: at lib/list_debug.c:30 __list_add+0x8f/0xa0() (Not tainted)
14:21:25:Hardware name: KVM
14:21:25:list_add corruption. prev->next should be next (ffffffff81ea7450), but was 5f9c1e10ffffffff. (prev=ffff88005f9c1df0).
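
When triaging several console logs for the same signature, the warning line above can be picked apart mechanically. The following is a minimal sketch, not part of the Lustre test framework or Maloo: the function name `parse_list_add_corruption` and the regular expression are hypothetical, and the pattern is assumed to cover only the "prev->next should be next" form of the message exactly as it appears in this log.

```python
import re

# Hypothetical pattern: matches only the "prev->next should be next" form of
# the list_add corruption warning, as printed in the console log above.
LIST_ADD_RE = re.compile(
    r"list_add corruption\. prev->next should be next \((?P<expected>[0-9a-f]+)\), "
    r"but was (?P<actual>[0-9a-f]+)\. \(prev=(?P<prev>[0-9a-f]+)\)"
)


def parse_list_add_corruption(line):
    """Return the three pointer values from a matching log line, or None."""
    match = LIST_ADD_RE.search(line)
    if match is None:
        return None
    # Convert the hexadecimal strings to integers for easier comparison.
    return {name: int(value, 16) for name, value in match.groupdict().items()}


# The line is copied from the console log in this ticket.
sample = (
    "14:21:25:list_add corruption. prev->next should be next "
    "(ffffffff81ea7450), but was 5f9c1e10ffffffff. (prev=ffff88005f9c1df0)."
)
info = parse_list_add_corruption(sample)
if info is not None:
    print("expected next: %#x" % info["expected"])
    print("actual next:   %#x" % info["actual"])
    print("prev entry:    %#x" % info["prev"])
```

Lines that do not carry this warning return None, so the function can be applied to every line of a console log to check whether a timeout shows the same signature as LU-1881.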


 Comments   
Comment by Sarah Liu [ 17/Sep/12 ]

https://maloo.whamcloud.com/test_sets/88d67f94-fec3-11e1-a707-52540035b04c
https://maloo.whamcloud.com/test_sets/faa2bd9a-fec3-11e1-a707-52540035b04c
https://maloo.whamcloud.com/test_sets/56040b12-fec4-11e1-a707-52540035b04c
https://maloo.whamcloud.com/test_sets/3fed001c-fec5-11e1-a707-52540035b04c

Comment by Jian Yu [ 17/Sep/12 ]

Hi Sarah,
The Lustre b2_3 build in the above reports was http://build.whamcloud.com/job/lustre-b2_3/18, which was tagged 2.2.95 and did not contain the fix for LU-1881.

The issue has been fixed in Lustre b2_3 build #19 (tag 2.2.96), so I am closing this ticket as a duplicate of LU-1881.

Generated at Sat Feb 10 01:21:15 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.