Details
-
Bug
-
Resolution: Fixed
-
Minor
-
Lustre 2.6.0
-
None
-
sles11sp3 clients
-
3
-
13319
Description
every time I do a client unmount of a lustre filesystem I see a very verbose warning like
Mar 28 11:06:59 susesp3-3 kernel: [ 105.665817] Lustre: Unmounted lustre-client Mar 28 11:06:59 susesp3-3 kernel: [ 105.667023] ------------[ cut here ]------------ Mar 28 11:06:59 susesp3-3 kernel: [ 105.667031] WARNING: at fs/proc/generic.c:809 remove_proc_entry+0x22d/0x280() Mar 28 11:06:59 susesp3-3 kernel: [ 105.667032] Hardware name: VMware Virtual Platform Mar 28 11:06:59 susesp3-3 kernel: [ 105.667033] name 'nrs_tbf_quantum' Mar 28 11:06:59 susesp3-3 kernel: [ 105.667034] Modules linked in: osc(FN) mgc(FN) lustre(FN) lov(FN) mdc(FN) fid(FN) lmv(FN) fld(FN) ksocklnd(FN) ptlrpc(FN) obdclass(FN) lnet(FN) sha512_generic(FN) sha1_generic(FN) md5(FN) crc32c(FN) libcfs(FN) lp(FN) binfmt_misc(FN) snd_pcm_oss(FN) snd_mixer_oss(FN) snd_seq_midi(FN) snd_seq_midi_event(FN) snd_seq(FN) edd(FN) rdma_ucm(FN) rdma_cm(FN) iw_cm(FN) ib_addr(FN) ib_srp(FN) scsi_transport_srp(FN) scsi_tgt(FN) ib_ipoib(FN) ib_cm(FN) ib_uverbs(FN) ib_umad(FN) iw_cxgb3(FN) cxgb3(FN) mdio(FN) mlx4_en(FN) mlx4_ib(FN) ib_sa(FN) mlx4_core(FN) ib_mthca(FN) ib_mad(FN) ib_core(FN) mperf(FN) acpiphp(FN) microcode(FN) fuse(FN) loop(FN) dm_mod(FN) snd_ens1371(FN) gameport(FN) snd_rawmidi(FN) snd_seq_device(FN) ipv6(FN) snd_ac97_codec(FN) ipv6_lib(FN) btusb(FN) bluetooth(FN) ac97_bus(FN) snd_pcm(FN) snd_timer(FN) ppdev(FN) rfkill(FN) snd(FN) vmw_balloon(FN) usbhid(FN) parport_pc(FN) hid(FN) e1000(FN) floppy(FN) sr_mod(FN) soundcore(FN) rtc_cmos(FN) crc16(FN) parport(FN) i2c_piix4(FN) Mar 28 11:06:59 susesp3-3 kernel: sg(FN) shpchp(FN) pciehp(FN) pcspkr(FN) snd_page_alloc(FN) mptctl(FN) acpi_memhotplug(FN) intel_agp(FN) pci_hotplug(FN) cdrom(FN) i2c_core(FN) container(FN) button(FN) ac(FN) intel_gtt(FN) ext3(FN) jbd(FN) mbcache(FN) uhci_hcd(FN) ehci_hcd(FN) sd_mod(FN) crc_t10dif(FN) usbcore(FN) usb_common(FN) processor(FN) thermal_sys(FN) hwmon(FN) scsi_dh_hp_sw(FN) scsi_dh_rdac(FN) scsi_dh_alua(FN) scsi_dh_emc(FN) scsi_dh(FN) vmw_pvscsi(FN) vmxnet3(FN) ata_generic(FN) ata_piix(FN) ahci(FN) libahci(FN) libata(FN) mptspi(FN) mptscsih(FN) mptbase(FN) scsi_transport_spi(FN) scsi_mod(FN) Mar 28 11:06:59 susesp3-3 kernel: [ 105.667091] Supported: No, Unsupported modules are loaded Mar 28 11:06:59 susesp3-3 kernel: [ 105.667094] Pid: 4811, comm: obd_zombid Tainted: GF U N 3.0.101-0.18-default #1 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667095] Call Trace: Mar 28 11:06:59 susesp3-3 kernel: [ 105.667103] [<ffffffff81004935>] dump_trace+0x75/0x310 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667107] [<ffffffff8145fcd3>] dump_stack+0x69/0x6f Mar 28 11:06:59 susesp3-3 kernel: [ 105.667113] [<ffffffff8106063b>] warn_slowpath_common+0x7b/0xc0 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667115] [<ffffffff81060735>] warn_slowpath_fmt+0x45/0x50 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667118] [<ffffffff811bf14d>] remove_proc_entry+0x22d/0x280 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667159] [<ffffffffa0a8fdc7>] ptlrpc_service_nrs_cleanup+0x97/0xc0 [ptlrpc] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667224] [<ffffffffa0a57bcd>] ptlrpc_unregister_service+0xdd/0x1f0 [ptlrpc] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667264] [<ffffffffa0a22a09>] ldlm_cleanup+0x379/0x630 [ptlrpc] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667292] [<ffffffffa0a22de5>] ldlm_put_ref+0x125/0x1a0 [ptlrpc] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667320] [<ffffffffa0a123fa>] client_obd_cleanup+0xda/0x2e0 [ptlrpc] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667329] [<ffffffffa0df8938>] mgc_cleanup+0x38/0xe0 [mgc] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667357] [<ffffffffa07f4f2f>] class_decref+0x11f/0x550 [obdclass] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667387] [<ffffffffa07d67fe>] class_export_destroy+0xfe/0x480 [obdclass] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667408] [<ffffffffa07d6c4d>] obd_zombie_impexp_cull+0xcd/0x1e0 [obdclass] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667429] [<ffffffffa07d6db5>] obd_zombie_impexp_thread+0x55/0x1a0 [obdclass] Mar 28 11:06:59 susesp3-3 kernel: [ 105.667438] [<ffffffff810828a6>] kthread+0x96/0xa0 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667441] [<ffffffff8146bb64>] kernel_thread_helper+0x4/0x10 Mar 28 11:06:59 susesp3-3 kernel: [ 105.667444] ---[ end trace b99a6b85a455c4ed ]---
This is coming from the following lustre code in ptlrpc/nrs_tbf.c
void nrs_tbf_lprocfs_fini(struct ptlrpc_service *svc) { if (svc->srv_procroot == NULL) return; lprocfs_remove_proc_entry("nrs_tbf_quantum", svc->srv_procroot); }
I'm certain this error is only seen in SLES kernels because the SLES version of the kernel API remove_proc_entry() is much more verbose than the RHEL one.
In SLES11SP3 the error return in remove_proc_entry() is
if (!de) { WARN(1, "name '%s'\n", name); return; }
In RHEL6.5 it is
if (!de) return;
I'm pretty sure that bad call to lprocfs_remove_proc_entry() is always happening, but it's only noisy & logs warnings in SLES.