[LU-6196] Interop 2.6.0<->2.7 sanity-scrub test_12: OSS reboot Created: 02/Feb/15  Updated: 17/Apr/17  Resolved: 17/Apr/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.7.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Incomplete Votes: 0
Labels: None
Environment:

server: 2.6.0
client: lustre-master build # 2835


Severity: 3
Rank (Obsolete): 17319

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/1b618dbc-a7c4-11e4-bc9c-5254006e85c2.

The sub-test test_12 failed with the following error:

test failed to respond and timed out

OST syslog shows that the node reboot, but I don't see any clear reason

Jan 28 06:13:26 onyx-65-ib kernel: LustreError: 42029:0:(ldlm_resource.c:1150:ldlm_resource_get()) Skipped 8 previous similar messages
Jan 28 06:17:31 onyx-65-ib kernel: imklog 5.8.10, log source = /proc/kmsg started.
Jan 28 06:17:31 onyx-65-ib rsyslogd: [origin software="rsyslogd" swVersion="5.8.10" x-pid="2958" x-info="http://www.rsyslog.com"] start
Jan 28 06:17:31 onyx-65-ib kernel: Initializing cgroup subsys cpuset


 Comments   
Comment by Oleg Drokin [ 02/Feb/15 ]

I see there's no console log, but I bet a crashdump was still generated?
Could this be found and dmesg there checked please?

Comment by Andreas Dilger [ 17/Apr/17 ]

Not enough info to debug.

Generated at Sat Feb 10 01:58:05 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.