[LU-1110] MDS Oops in osd_xattr_get() during file open by FID Created: 16/Feb/12 Updated: 28/Nov/13 Resolved: 29/Mar/12 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.0 |
| Fix Version/s: | Lustre 2.3.0 |
| Type: | Bug | Priority: | Major |
| Reporter: | Alexandre Louvet | Assignee: | Lai Siyao |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | mn1 | ||
| Severity: | 3 |
| Rank (Obsolete): | 4677 |
| Description |
|
The MDS crashed repeatedly before we were able to identify the offending file/FID and resolve the situation by unlinking it. The panic stack looks like the following:

Modules linked in: cmm(U) osd_ldiskfs(U) mdt(U) mdd(U) mds(U) fsfilt_ldiskfs(U) exportfs mgc(U) ldiskfs(U) lustre(U) lov(U) osc(U) mdc(U) lquota(U) fid(U) fld(U) ko2iblnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ipmi_devintf ipmi_si ipmi_msghandler nfs lockd fscache(T) nfs_acl auth_rpcgss sunrpc acpi_cpufreq freq_table rdma_ucm(U) ib_sdp(U) rdma_cm(U) iw_cm(U) ib_addr(U) ib_ipoib(U) ib_cm(U) ib_sa(U) ipv6 ib_uverbs(U) ib_umad(U) mlx4_ib(U) mlx4_core(U) ib_mthca(U) ib_mad(U) ib_core(U) dm_round_robin dm_multipath usbhid hid ghes i2c_i801 i2c_core iTCO_wdt iTCO_vendor_support ehci_hcd uhci_hcd ioatdma hed sg lpfc scsi_transport_fc scsi_tgt igb dca ext4 jbd2 sd_mod crc_t10dif ahci megaraid_sas dm_mod [last unloaded: microcode]

The Oops occurs because osd_xattr_get() dereferences obj->oo_inode, which is NULL. This state appears to be inherited from osd_fid_lookup(), which returns 0 when osd_oi_lookup() returns ENOENT and does not initialize oo_inode.

This comes from the extensive direct "by-FID" operations used by the customer's tools, which may trigger file system inconsistencies (possibly causing a FID not to be resolved correctly) that are not handled in the "by-FID" access-method code.

The first question that comes to mind: is the "by-FID" feature/access method already available and safe for customer use? |
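The failure pattern described above can be illustrated with a small user-space model. This is only a sketch under stated assumptions: every `model_*` name, the toy index containing a single FID, and the guard in `model_xattr_get()` are hypothetical and are not the Lustre code or the fix that eventually landed. It shows a lookup that maps ENOENT to success while leaving `oo_inode` NULL, and a caller-side check that returns an error instead of dereferencing the NULL pointer.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>

/* Hypothetical stand-ins for illustration; not the real Lustre structures. */
struct model_inode {
    int ino;
};

struct model_object {
    struct model_inode *oo_inode;
};

/* Toy object index: only FID 1 resolves to an inode. */
static struct model_inode known_inode = { 42 };

static int model_oi_lookup(unsigned long fid, struct model_inode **out)
{
    if (fid != 1)
        return -ENOENT;
    *out = &known_inode;
    return 0;
}

/*
 * Models the behavior reported in this ticket: when the index lookup
 * fails with ENOENT, the function still returns 0 and oo_inode stays NULL.
 */
static int model_fid_lookup(struct model_object *obj, unsigned long fid)
{
    struct model_inode *inode = NULL;
    int rc = model_oi_lookup(fid, &inode);

    obj->oo_inode = NULL;
    if (rc == -ENOENT)
        return 0;           /* "success", but no inode attached */
    if (rc != 0)
        return rc;
    obj->oo_inode = inode;
    return 0;
}

/*
 * Guarded accessor (an assumption, not the landed patch): refuse to touch
 * an object that has no inode instead of dereferencing NULL.
 */
static int model_xattr_get(const struct model_object *obj, int *ino_out)
{
    assert(obj != NULL);
    if (obj->oo_inode == NULL)
        return -ENOENT;
    *ino_out = obj->oo_inode->ino;
    return 0;
}
```

With this model, calling `model_fid_lookup()` for an unknown FID returns 0, and the subsequent `model_xattr_get()` returns `-ENOENT` rather than crashing. Without the NULL check, the same call sequence would dereference a NULL pointer, which corresponds to the Oops reported here.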
| Comments |
| Comment by Peter Jones [ 16/Feb/12 ] |
|
Lai, could you please look into this one? Thanks, Peter |
| Comment by Lai Siyao [ 27/Feb/12 ] |
|
AFAIK, there is only an architecture design for the open-by-fid feature, and the security impact of this feature is not covered in the design. As for the code, the MDT support is incomplete and there is no client interface, which means no testing has been done. So basically this feature is not ready for customer usage. Oleg, if I missed anything, please correct me. |
| Comment by Lai Siyao [ 27/Feb/12 ] |
|
Ahh, I found that the open-by-fid code is almost there, but it lacks sanity tests. I'll add some to ensure normal operations won't crash. |
| Comment by Lai Siyao [ 29/Feb/12 ] |
|
Review is on http://review.whamcloud.com/#change,2224. |
| Comment by Build Master (Inactive) [ 29/Mar/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Peter Jones [ 29/Mar/12 ] |
|
Landed for 2.3 |
| Comment by Build Master (Inactive) [ 02/May/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Bob Glossman (Inactive) [ 16/May/12 ] |
|
http://review.whamcloud.com/#change,2807 |