[LU-3830] mount fails on targets configured with multiple mgs nids Created: 23/Aug/13 Updated: 23/Aug/13 Resolved: 23/Aug/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.4.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | Gregoire Pichon | Assignee: | WC Triage |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Epic: | mount | ||||||||
| Rank (Obsolete): | 9895 | ||||||||
| Description |
|
When a target is configured with multiple nids, the mount fails due to invalid mount options. # mkfs.lustre --mgs --fsname=localfs /dev/ram0 # mkfs.lustre --mdt --fsname=localfs --mgsnode=30.1.0.95@o2ib0 --index=0 /dev/ram1 # mkfs.lustre --ost --fsname=localfs --mgsnode=30.1.0.95@o2ib0 --index=0 /dev/ram2 # mkfs.lustre --ost --fsname=localfs --mgsnode=30.1.0.95@o2ib0,30.2.0.95@o2ib1 --index=1 /dev/ram3 # mount.lustre /dev/ram0 /mnt/localfs/mgt # mount.lustre /dev/ram1 /mnt/localfs/mdt # mount.lustre /dev/ram2 /mnt/localfs/ost0 # mount.lustre /dev/ram3 /mnt/localfs/ost1 mount.lustre: mount /dev/ram3 at /mnt/localfs/ost1 failed: Invalid argument This may have multiple causes. Are the mount options correct? Check the syslog for more info. # mount.lustre /dev/ram3 /mnt/localfs/ost1 -v arg[0] = /sbin/mount.lustre.orig arg[1] = -v arg[2] = /dev/ram3 arg[3] = /mnt/localfs/ost1 source = /dev/ram3 (/dev/ram3), target = /mnt/localfs/ost1 options = checking for existing Lustre data: found Reading CONFIGS/mountdata mounting device /dev/ram3 at /mnt/localfs/ost1, flags=0x1000000 options=osd=osd-ldiskfs,errors=remount-ro,mgsnode=30.1.0.95@o2ib,30.2.0.95@o2ib1,virgin,param=mgsnode=30.1.0.95@o2ib,30.2.0.95@o2ib1,svname=localfs-OST0001,device=/dev/ram3 mount.lustre: mount /dev/ram3 at /mnt/localfs/ost1 failed: Invalid argument retries left: 0 mount.lustre: mount /dev/ram3 at /mnt/localfs/ost1 failed: Invalid argument This may have multiple causes. Are the mount options correct? Check the syslog for more info. # tail -n 6 /var/log/syslog 1377250637 2013 Aug 23 11:37:17 mo95 kern err kernel LDISKFS-fs (ram3): Unrecognized mount option "30.2.0.95@o2ib1" or missing value 1377250637 2013 Aug 23 11:37:17 mo95 kern err kernel LustreError: 32322:0:(osd_handler.c:5349:osd_mount()) localfs-OST0001-osd: can't mount /dev/ram3: -22 1377250637 2013 Aug 23 11:37:17 mo95 kern err kernel LustreError: 32322:0:(obd_config.c:572:class_setup()) setup localfs-OST0001-osd failed (-22) 1377250637 2013 Aug 23 11:37:17 mo95 kern err kernel LustreError: 32322:0:(obd_mount.c:201:lustre_start_simple()) localfs-OST0001-osd setup error -22 1377250637 2013 Aug 23 11:37:17 mo95 kern err kernel LustreError: 32322:0:(obd_mount_server.c:1665:server_fill_super()) Unable to start osd on /dev/ram3: -22 1377250637 2013 Aug 23 11:37:17 mo95 kern err kernel LustreError: 32322:0:(obd_mount.c:1267:lustre_fill_super()) Unable to mount (-22) It seems that mountdata options are now passed to mount.lustre as additional options. Unfortunately comma is used as a separator of mount options and multiple mgs nids are also separated by a comma. This lead to have the second mgs nid "30.2.0.95@o2ib1" to be processed as a mount option, which fails |
| Comments |
| Comment by Niu Yawei (Inactive) [ 23/Aug/13 ] |
|
It should have been fixed in |
| Comment by Bruno Faccini (Inactive) [ 23/Aug/13 ] |
|
Yes, Niu you have been too fast and I definitely agree! Gregoire, can you give a try to changes/fixes for |
| Comment by Gregoire Pichon [ 23/Aug/13 ] |
|
Thanks for the notice. I missed it when looking for existing Jira tickets. |
| Comment by Bruno Faccini (Inactive) [ 23/Aug/13 ] |
|
Ticket is a duplicate of |