Details
-
Technical task
-
Resolution: Unresolved
-
Critical
-
None
-
Lustre 2.10.5
-
9223372036854775807
Description
After a filesystem corruption event and e2fsck 1.44.3.wc1 running to repair it, the server keeps crashing with the following error:
Lustre: nbp13-OST0008: trigger OI scrub by RPC for the [0x100080000:0x217edd:0x0] with flags 0x4a, rc = 0 ------------[ cut here ]------------ kernel BUG at /tmp/rpmbuild-lustre-jlan-ItUrr9b3/BUILD/lustre-2.10.5/ldiskfs/ldiskfs.h:1907! invalid opcode: 0000 [#1] SMP CPU: 5 PID: 11348 Comm: lfsck Tainted: G OE ------------ 3.10.0-693.21.1.el7.20180508.x86_64.lustre2105 #1 RIP: 0010:[<ffffffffa10fbd04>] [<ffffffffa10fbd04>] ldiskfs_rec_len_to_disk.part.9+0x4/0x10 [ldiskfs] Call Trace: htree_inlinedir_to_tree+0x445/0x450 [ldiskfs] ldiskfs_htree_fill_tree+0x137/0x2f0 [ldiskfs] ldiskfs_readdir+0x61c/0x850 [ldiskfs] osd_ldiskfs_it_fill+0xbe/0x260 [osd_ldiskfs] osd_it_ea_load+0x37/0x100 [osd_ldiskfs] lfsck_open_dir+0x11c/0x3a0 [lfsck] lfsck_master_oit_engine+0x9a2/0x1190 [lfsck] lfsck_master_engine+0x8f6/0x1360 [lfsck] kthread+0xd1/0xe0