[LU-8694] ZFS format fails when /etc/hostid is set or unset. Created: 11/Oct/16  Updated: 13/Jul/17  Resolved: 25/Oct/16

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.9.0
Fix Version/s: Lustre 2.9.0

Type: Bug Priority: Critical
Reporter: Cliff White (Inactive) Assignee: Nathaniel Clark
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
is related to LU-8816 test-framework needs to reload SPL mo... Resolved
is related to LU-7314 In kiblnd_rejected(), NULL pointer 'c... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

Attempting to format ZFS OSTs we see the following error:
mkfs.lustre FATAL: spl_hostid not set. See mkfs.lustre(8)

We tried setting the hostid with /sbin/genhostid and confirmed that the ID exists: same failure. We then tried setting it with /usr/bin/hostid and confirmed that the ID exists: same failure.
At this time we cannot format OSTs; the system is dead.



 Comments   
Comment by Peter Jones [ 11/Oct/16 ]

Nathaniel

Can you please help improve the docs here?

Peter

Comment by Cliff White (Inactive) [ 11/Oct/16 ]

The fix is simple - reload the SPL module after creating /etc/hostid. Nathaniel will be updating the documents. We are able to format now.
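For reference, the reload described above could be scripted roughly as follows. This is only a sketch under assumptions not stated in this ticket: that genhostid is installed, that no ZFS pool is imported and no Lustre target is mounted (so the zfs module stack, which depends on spl, can be unloaded), and that the loaded value is visible at /sys/module/spl/parameters/spl_hostid. The function names, DRY_RUN and HOSTID_FILE are hypothetical helpers, not part of Lustre or SPL.

```shell
#!/bin/sh
# Hypothetical helper: print each command, and execute it unless DRY_RUN=1.
run() {
    echo "+ $*"
    [ "${DRY_RUN:-0}" = 1 ] || "$@"
}

# Create the hostid file if it does not exist, then reload spl so that the
# module picks up the new value.
# Assumption: nothing is using the zfs/spl modules (no imported pools, no
# mounted Lustre targets, Lustre modules already unloaded).
reload_spl_with_hostid() {
    # genhostid always writes /etc/hostid; HOSTID_FILE only changes which
    # path is checked, which is useful for dry runs.
    hostid_file=${HOSTID_FILE:-/etc/hostid}
    if [ ! -e "$hostid_file" ]; then
        run genhostid || return 1
    fi
    run modprobe -r zfs || return 1   # zfs depends on spl, so it goes first
    run modprobe -r spl || return 1
    run modprobe spl || return 1      # spl is loaded again with the hostid file in place
    run modprobe zfs || return 1
    # Assumed sysfs path for the value the module is using.
    run cat /sys/module/spl/parameters/spl_hostid
}
```

Running it as root with `DRY_RUN=1` set first prints the commands without executing them, which is a safe way to review the sequence before touching a live server.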

Comment by Andreas Dilger [ 11/Oct/16 ]

This mostly means that we should change the mkfs.lustre message to indicate that the spl.ko module needs to be reloaded (or, better yet, have mkfs.lustre try to unload and reload it internally once). We should also improve the mkfs.lustre.8 man page to prefer running genhostid over setting spl.ko module parameters, and to state that the spl.ko module needs to be reloaded after generating a new hostid.
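Until the message is improved, a stale module value can be spotted by comparing the hostid file with what the loaded module reports. The following is a rough diagnostic sketch, not what mkfs.lustre does internally. It assumes that /etc/hostid holds a 4-byte binary value in host byte order and that the module exposes a decimal value at /sys/module/spl/parameters/spl_hostid; the function and variable names are made up for illustration.

```shell
#!/bin/sh
# Assumed locations; both can be overridden for testing.
HOSTID_FILE=${HOSTID_FILE:-/etc/hostid}
SPL_PARAM=${SPL_PARAM:-/sys/module/spl/parameters/spl_hostid}

# Print the first 4 bytes of a hostid file as 8 hex digits (host byte order).
hostid_from_file() {
    od -An -t x4 -N4 "$1" | tr -d ' \n'
    echo
}

# Print the numeric value stored in a parameter file as 8 hex digits.
hostid_from_param() {
    printf '%08x\n' "$(cat "$1")"
}

# Return 0 if the file and the loaded module agree, 1 otherwise.
check_hostid() {
    if [ ! -r "$HOSTID_FILE" ]; then
        echo "$HOSTID_FILE not found: generate a hostid first"
        return 1
    fi
    if [ ! -r "$SPL_PARAM" ]; then
        echo "$SPL_PARAM not found: spl module is not loaded"
        return 1
    fi
    file_id=$(hostid_from_file "$HOSTID_FILE")
    mod_id=$(hostid_from_param "$SPL_PARAM")
    if [ "$file_id" = "$mod_id" ]; then
        echo "ok: spl_hostid matches $HOSTID_FILE ($file_id)"
    else
        echo "mismatch: file=$file_id module=$mod_id (reload spl)"
        return 1
    fi
}
```

Calling `check_hostid` before mkfs.lustre would report a mismatch in the situation described in this ticket, where /etc/hostid was created after spl had already been loaded.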

Comment by Gerrit Updater [ 12/Oct/16 ]

Nathaniel Clark (nathaniel.l.clark@intel.com) uploaded a new patch: http://review.whamcloud.com/23118
Subject: LU-8694 docs: ZFS hostid mkfs.lustre(8) man page update
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: b3e16b15a405d3993bdd8d0938977f294a670bc6

Comment by Gerrit Updater [ 25/Oct/16 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/23118/
Subject: LU-8694 docs: ZFS hostid mkfs.lustre(8) man page update
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 5de18d44e1074d18f72946b2a19cd1b3a0b749c8

Comment by Peter Jones [ 25/Oct/16 ]

Landed for 2.9

Comment by Abe [ 07/Jul/17 ]

Hi All,
I'm hitting the same problem with the master release under SLES12 SP2:

mkfs.lustre --reformat --mdt --mgs --servicenode= 211@o2ib --backfstype=zfs --fsname=tempAA --index=0 mgs/mdt

Permanent disk data:
Target: tempAA:MDT0000
Index: 0
Lustre FS: tempAA
Mount type: zfs
Flags: 0x1065
(MDT MGS first_time update no_primnode )
Persistent mount opts:
Parameters: failover.node=10.10.10.213@o2ib:10.10.10.211@o2ib

mkfs.lustre FATAL: spl_hostid not set. See mkfs.lustre(8)
mkfs.lustre FATAL: mkfs failed 22
mkfs.lustre: exiting with 22 (Invalid argument)

Can someone share the detailed workaround for this?

thanks,
Abe

Comment by Andreas Dilger [ 13/Jul/17 ]

Abe, I think there was another update in this area:

http://review.whamcloud.com/23804 "LU-8816 utils: Check /etc/hostid instead of failing for ZFS"
