Details
Bug
Resolution: Cannot Reproduce
Minor
None
Lustre 2.11.0
None
3
9223372036854775807
Description
recovery-mds-scale test_failover_ost - test_failover_ost returned 7
^^^^^^^^^^^^^ DO NOT REMOVE LINE ABOVE ^^^^^^^^^^^^^
This issue was created by maloo for sarah_lw <wei3.liu@intel.com>
This issue relates to the following test suite run:
https://testing.hpdd.intel.com/test_sets/b989acc6-ff51-11e7-a7cd-52540065bddc
test_failover_ost failed with the following error:
test_failover_ost returned 7
client dmesg
Server failover period: 1200 seconds
Exited after: 0 seconds
Number of failovers before exit:
mds1: 0 times
ost1: 0 times
ost2: 0 times
ost3: 0 times
ost4: 0 times
ost5: 0 times
ost6: 0 times
ost
[86211.448649] Lustre: DEBUG MARKER: Duration: 86400
[86211.613986] Lustre: DEBUG MARKER: test -f /tmp/client-load.pid && { kill -s TERM $(cat /tmp/client-load.pid); rm -f /tmp/client-load.pid; }
[86401.075681] INFO: task sync:23671 blocked for more than 120 seconds.
[86401.076443] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[86401.077287] sync D ffff880036be0000 0 23671 23636 0x00000080
[86401.078094] Call Trace:
[86401.078486] [<ffffffff816a9700>] ? bit_wait+0x50/0x50
[86401.079156] [<ffffffff816ab6d9>] schedule+0x29/0x70
[86401.079713] [<ffffffff816a90e9>] schedule_timeout+0x239/0x2c0
[86401.080472] [<ffffffff810cf98c>] ? dequeue_entity+0x11c/0x5d0
[86401.081134] [<ffffffff81062efe>] ? kvm_clock_get_cycles+0x1e/0x20
[86401.081802] [<ffffffff816a9700>] ? bit_wait+0x50/0x50
[86401.082360] [<ffffffff816aac5d>] io_schedule_timeout+0xad/0x130
[86401.083121] [<ffffffff816aacf8>] io_schedule+0x18/0x20
[86401.083693] [<ffffffff816a9711>] bit_wait_io+0x11/0x50
[86401.084315] [<ffffffff816a9235>] __wait_on_bit+0x65/0x90
[86401.084968] [<ffffffff811839b1>] wait_on_page_bit+0x81/0xa0
[86401.085676] [<ffffffff810b3570>] ? wake_bit_function+0x40/0x40
[86401.086327] [<ffffffff81183ae1>] __filemap_fdatawait_range+0x111/0x190
[86401.087055] [<ffffffff811868d7>] filemap_fdatawait_keep_errors+0x27/0x30
[86401.087913] [<ffffffff8122fdcd>] sync_inodes_sb+0x16d/0x1f0
[86401.088524] [<ffffffff812353e0>] ? generic_write_sync+0x60/0x60
[86401.089179] [<ffffffff812353f9>] sync_inodes_one_sb+0x19/0x20
[86401.089911] [<ffffffff81206c61>] iterate_supers+0xc1/0x120
[86401.090500] [<ffffffff812356c4>] sys_sync+0x44/0xb0
[86401.091065] [<ffffffff816b89fd>] system_call_fastpath+0x16/0x1b
[86401.091795] [<ffffffff816b889d>] ? system_call_after_swapgs+0xca/0x214
[86521.091648] INFO: task sync:23671 blocked for more than 120 seconds.
[86521.092460] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[86521.093362] sync D ffff880036be0000 0 23671 23636 0x00000080
[86521.094227] Call Trace:
[86521.094524] [<ffffffff816a9700>] ? bit_wait+0x50/0x50
[86521.095146] [<ffffffff816ab6d9>] schedule+0x29/0x70
[86521.095867] [<ffffffff816a90e9>] schedule_timeout+0x239/0x2c0
[86521.096549] [<ffffffff810cf98c>] ? dequeue_entity+0x11c/0x5d0
[86521.097238] [<ffffffff81062efe>] ? kvm_clock_get_cycles+0x1e/0x20
[86521.098039] [<ffffffff816a9700>] ? bit_wait+0x50/0x50
[86521.098641] [<ffffffff816aac5d>] io_schedule_timeout+0xad/0x130
[86521.099402] [<ffffffff816aacf8>] io_schedule+0x18/0x20
[86521.100042] [<ffffffff816a9711>] bit_wait_io+0x11/0x50
[86521.100724] [<ffffffff816a9235>] __wait_on_bit+0x65/0x90
[86521.101351] [<ffffffff811839b1>] wait_on_page_bit+0x81/0xa0
[86521.102021] [<ffffffff810b3570>] ? wake_bit_function+0x40/0x40
[86521.102790] [<ffffffff81183ae1>] __filemap_fdatawait_range+0x111/0x190
[86521.103543] [<ffffffff811868d7>] filemap_fdatawait_keep_errors+0x27/0x30
[86521.104338] [<ffffffff8122fdcd>] sync_inodes_sb+0x16d/0x1f0
[86521.105080] [<ffffffff812353e0>] ? generic_write_sync+0x60/0x60
[86521.105778] [<ffffffff812353f9>] sync_inodes_one_sb+0x19/0x20
[86521.106516] [<ffffffff81206c61>] iterate_supers+0xc1/0x120
[86521.107177] [<ffffffff812356c4>] sys_sync+0x44/0xb0
[86521.107837] [<ffffffff816b89fd>] system_call_fastpath+0x16/0x1b
[86521.108535] [<ffffffff816b889d>] ? system_call_after_swapgs+0xca/0x214
[86641.108765] INFO: task sync:23671 blocked for more than 120 seconds.
[86641.109585] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message
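
The hung-task reports above repeat roughly every 120 seconds for the same task (sync, pid 23671). As a rough triage aid, the sketch below pulls those report headers out of a dmesg capture so the repeats are easy to count. It is not part of the test suite; the function name find_hung_tasks and the SAMPLE text are hypothetical, and the regular expression assumes the message format seen in this log.

```python
import re

# Matches the kernel hung-task header as it appears in this log, e.g.
# "[86401.075681] INFO: task sync:23671 blocked for more than 120 seconds."
# Assumption: other kernels may word this message differently.
HUNG_TASK_RE = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+INFO: task (?P<comm>.+):(?P<pid>\d+) "
    r"blocked for more than (?P<secs>\d+) seconds"
)


def find_hung_tasks(dmesg_text):
    """Return one dict per hung-task report header found in dmesg_text."""
    reports = []
    for line in dmesg_text.splitlines():
        m = HUNG_TASK_RE.search(line)
        if m:
            reports.append({
                "timestamp": float(m.group("ts")),
                "comm": m.group("comm"),
                "pid": int(m.group("pid")),
                "threshold_secs": int(m.group("secs")),
            })
    return reports


# Header lines copied from the client dmesg in this ticket.
SAMPLE = """\
[86401.075681] INFO: task sync:23671 blocked for more than 120 seconds.
[86401.078094] Call Trace:
[86521.091648] INFO: task sync:23671 blocked for more than 120 seconds.
[86641.108765] INFO: task sync:23671 blocked for more than 120 seconds.
"""

if __name__ == "__main__":
    for report in find_hung_tasks(SAMPLE):
        print(report)
```

Run against a full dmesg capture, the same pid appearing in consecutive reports suggests a single stuck sync rather than several independent hangs.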