[LU-5961] Concurrent mount of ZFS targets fails when modules are not loaded Created: 28/Nov/14  Updated: 19/Feb/15  Resolved: 19/Feb/15

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.5.3
Fix Version/s: Lustre 2.7.0

Type: Bug Priority: Minor
Reporter: Oliver Mangold Assignee: Nathaniel Clark
Resolution: Fixed Votes: 0
Labels: None
Environment: CentOS 6.5


Issue Links:
Duplicate
duplicates LU-1279 failure trying to mount two targets a... Resolved
Severity: 3
Rank (Obsolete): 16653

 Description   

When I mount multiple ZFS OSTs concurrently, e.g. with a script like the one below, I get errors like this:


Unable to open /dev/zfs: No such file or directory.
Verify the ZFS module stack is loaded by running '/sbin/modprobe zfs'.
Failed to initialize ZFS library
Unable to open /dev/zfs: No such file or directory.
Verify the ZFS module stack is loaded by running '/sbin/modprobe zfs'.
Failed to initialize ZFS library
Unable to open /dev/zfs: No such file or directory.
Verify the ZFS module stack is loaded by running '/sbin/modprobe zfs'.
Failed to initialize ZFS library

Mounting the targets serially works. Mounting in parallel also works when the zfs and lustre modules are already loaded.


#!/bin/bash

mount -t lustre ost0/lnec /mnt/lustre/ost0 &
mount -t lustre ost1/lnec /mnt/lustre/ost1 &
mount -t lustre ost2/lnec /mnt/lustre/ost2 &
mount -t lustre ost3/lnec /mnt/lustre/ost3 &
mount -t lustre ost4/lnec /mnt/lustre/ost4 &
mount -t lustre ost5/lnec /mnt/lustre/ost5 &
wait
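
Since parallel mounts reportedly succeed once the zfs and lustre modules are loaded, one possible workaround is to load the modules serially before starting the mounts. The following is only a sketch under that assumption, not a confirmed fix: the function names preload_modules and mount_targets_parallel and the MODPROBE/MOUNT override variables are hypothetical, and the module names "zfs" and "lustre" are taken from the description above.

```shell
#!/bin/bash
# Sketch of a workaround: load kernel modules one at a time, then mount in parallel.
# MODPROBE and MOUNT are hypothetical override hooks (useful for dry runs).
MODPROBE=${MODPROBE:-/sbin/modprobe}
MOUNT=${MOUNT:-mount}

# Load each named module serially; stop at the first failure.
preload_modules() {
    local mod
    for mod in "$@"; do
        "$MODPROBE" "$mod" || return 1
    done
}

# Start one background mount per OST index, then wait for all of them.
# Returns non-zero if any mount failed.
mount_targets_parallel() {
    local i pid pids rc
    pids=""
    rc=0
    for i in "$@"; do
        "$MOUNT" -t lustre "ost$i/lnec" "/mnt/lustre/ost$i" &
        pids="$pids $!"
    done
    for pid in $pids; do
        wait "$pid" || rc=1
    done
    return "$rc"
}
```

It could be invoked as: preload_modules zfs lustre && mount_targets_parallel 0 1 2 3 4 5
Waiting on each PID individually, rather than using a bare wait, lets the script report whether any single mount failed.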

Syslog shows errors about unknown symbols:
> dmesg
...
ZFS: Loaded module v0.6.3-1, ZFS pool version 5000, ZFS filesystem version 5
LNet: HW CPU cores: 24, npartitions: 4
alg: No test for crc32 (crc32-table)
alg: No test for adler32 (adler32-zlib)
alg: No test for crc32 (crc32-pclmul)
Lustre: Lustre: Build Version: 2.5.3-RC1--PRISTINE-2.6.32-431.23.3.el6_lustre.x86_64
fld: gave up waiting for init of module ptlrpc.
fld: Unknown symbol RQF_FLD_QUERY
fld: gave up waiting for init of module ptlrpc.
fld: Unknown symbol req_capsule_client_get



 Comments   
Comment by Peter Jones [ 28/Nov/14 ]

Nathaniel

Can you help with this issue?

Thanks

Peter

Comment by Brian Murrell (Inactive) [ 28/Nov/14 ]

I wonder if this is the same problem as LU-1279. If anyone can confirm that it is more or less the same problem, a "related" link ought to be created from this ticket to LU-1279.

Comment by Nathaniel Clark [ 19/Feb/15 ]

I cannot reproduce this on master.
