Details
-
Bug
-
Resolution: Duplicate
-
Blocker
-
None
-
lola
build: master, 2.7.64-81-g6fc8da4, 6fc8da41f2ff5156639e89f379adcdbb73ac8567
-
3
-
9223372036854775807
Description
Error happened during lfsck run of soak FS using build '20160108'. (see https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20160108)
DNE is enabled.
- lfsck started on MDS hosting mdt-0:
[root@lola-8 ~]# date; lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A ; date Wed Jan 13 04:42:28 PST 2016 Started LFSCK on the device soaked-MDT0000: scrub layout namespace Wed Jan 13 04:42:28 PST 2016
No soak test was running
- lfsck_namespace don't complete phase scanning-phase2
- MDSes lola-9,11 showed an increasing number of blocked mdt_out* - threads
- Triggering stack trace lead kernel panic on lola-11 (2016-01-13-08:15:22)
- All MDSes show only minimal utilization of system resources
Attached files:
- console, messages files of lola-9,11; containing stack trace information
- vmcore-dmesg.txt of lola-11
- lfsck status information of all MDTs
Crash file location see next comment.
Attachments
Issue Links
- duplicates
-
LU-6684 lctl lfsck_stop hangs
-
- Resolved
-
Activity
Resolution | New: Duplicate [ 3 ] | |
Status | Original: In Progress [ 3 ] | New: Resolved [ 5 ] |
Priority | Original: Critical [ 2 ] | New: Blocker [ 1 ] |
Fix Version/s | New: Lustre 2.8.0 [ 11113 ] |
Attachment | New: lu-7662-lola-11-1452785464.17420-lustre-log [ 20135 ] |
Status | Original: Open [ 1 ] | New: In Progress [ 3 ] |
Assignee | Original: WC Triage [ wc-triage ] | New: nasf [ yong.fan ] |
Attachment | New: console-lola-9.log.bz2 [ 20106 ] | |
Attachment | New: console-lola-11.log.bz2 [ 20107 ] | |
Attachment | New: lfsck-info.txt.bz2 [ 20108 ] | |
Attachment | New: messages-lola-9.log.bz2 [ 20109 ] | |
Attachment | New: messages-lola-11.log.bz2 [ 20110 ] | |
Attachment | New: vmcore-dmesg.txt.bz2 [ 20111 ] |
Description |
Original:
Error happened during lfsck run of soak FS using build '20160108'. (see https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20160108) DNE is enabled. * {{lfsck}} started on MDS hosting mdt-0: {noformat} [root@lola-8 ~]# date; lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A ; date Wed Jan 13 04:42:28 PST 2016 Started LFSCK on the device soaked-MDT0000: scrub layout namespace Wed Jan 13 04:42:28 PST 2016 {noformat} *No* soak test was running * lfsck_namespace don't complete phase _scanning-phase2_ * MDSes {{lola-9,11}} showed an increasing number of blocked {{mdt_out*}} - threads * Triggering stack trace lead kernel panic on {{lola-11}} (2016-01-13-08:15:22) * All MDSes don't show only minimal utilization of system resources Attached files: * console, messages files of lola-9,11; containing stack trace information * vmcore-dmesg.txt of lola-11 * {{lfsck}} status information of all MDTs Crash file location see next comment. |
New:
Error happened during lfsck run of soak FS using build '20160108'. (see https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20160108) DNE is enabled. * {{lfsck}} started on MDS hosting mdt-0: {noformat} [root@lola-8 ~]# date; lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A ; date Wed Jan 13 04:42:28 PST 2016 Started LFSCK on the device soaked-MDT0000: scrub layout namespace Wed Jan 13 04:42:28 PST 2016 {noformat} *No* soak test was running * lfsck_namespace don't complete phase _scanning-phase2_ * MDSes {{lola-9,11}} showed an increasing number of blocked {{mdt_out*}} - threads * Triggering stack trace lead kernel panic on {{lola-11}} (2016-01-13-08:15:22) * All MDSes show only minimal utilization of system resources Attached files: * console, messages files of lola-9,11; containing stack trace information * vmcore-dmesg.txt of lola-11 * {{lfsck}} status information of all MDTs Crash file location see next comment. |
Description |
Original:
Error happened during lfsck run of soak FS using build '20160108'. (see https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20160108) DNE is enabled. * {{lfsck}} started on MDS hosting mdt-0: {noformat} [root@lola-8 ~]# date; lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A ; date Wed Jan 13 04:42:28 PST 2016 Started LFSCK on the device soaked-MDT0000: scrub layout namespace Wed Jan 13 04:42:28 PST 2016 {noformat} *No* soak test was running * lfsck_namespace don't complete phase _scanning-phase2_ * MDSes {{lola-9,11}} showed an increasing number of blocked {{mdt_out*}} - threads * Triggering stack trace lead kernel panic on {{lola-11}} (2016-01-13-08:15:22) * All MDSes don't sho Attached files: * console, messages files of lola-9,11; containing stack trace information * vmcore-dmesg.txt of lola-11 * {{lfsck}} status information of all MDTs Crash file location see next comment. |
New:
Error happened during lfsck run of soak FS using build '20160108'. (see https://wiki.hpdd.intel.com/display/Releases/Soak+Testing+on+Lola#SoakTestingonLola-20160108) DNE is enabled. * {{lfsck}} started on MDS hosting mdt-0: {noformat} [root@lola-8 ~]# date; lctl lfsck_start -M soaked-MDT0000 -s 1000 -t all -A ; date Wed Jan 13 04:42:28 PST 2016 Started LFSCK on the device soaked-MDT0000: scrub layout namespace Wed Jan 13 04:42:28 PST 2016 {noformat} *No* soak test was running * lfsck_namespace don't complete phase _scanning-phase2_ * MDSes {{lola-9,11}} showed an increasing number of blocked {{mdt_out*}} - threads * Triggering stack trace lead kernel panic on {{lola-11}} (2016-01-13-08:15:22) * All MDSes don't show only minimal utilization of system resources Attached files: * console, messages files of lola-9,11; containing stack trace information * vmcore-dmesg.txt of lola-11 * {{lfsck}} status information of all MDTs Crash file location see next comment. |