Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8694

ZFS format fails when /etc/hostid is set or unset.

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.9.0
    • Lustre 2.9.0
    • None
    • 3
    • 9223372036854775807

    Description

      Attempting to format ZFS OSTs we see the following error:
      mkfs.lustre FATAL: spl_hostid not set. See mkfs.lustre(8)

      We attempt to set hostid by using /sbin/genhostid, we confirm the ID exists, same failure. We attempt to set hostid by /usr/bin/hostid, confirm the ID exists, same failure
      At this time we cannot format OSTS, system is dead.

      Attachments

        Issue Links

          Activity

            [LU-8694] ZFS format fails when /etc/hostid is set or unset.

            Abe, I think there was another update in this area:

            http://review.whamcloud.com/23804 "LU-8816 utils: Check /etc/hostid instead of failing for ZFS"

            adilger Andreas Dilger added a comment - Abe, I think there was another update in this area: http://review.whamcloud.com/23804 " LU-8816 utils: Check /etc/hostid instead of failing for ZFS"

            Hi All,
            I'm hitting the same problem with the master release under SLES12 SP2:

            mkfs.lustre --reformat --mdt --mgs --servicenode= 211@o2ib --backfstype=zfs --fsname=tempAA --index=0 mgs/mdt

            Permanent disk data:
            Target: tempAA:MDT0000
            Index: 0
            Lustre FS: tempAA
            Mount type: zfs
            Flags: 0x1065
            (MDT MGS first_time update no_primnode )
            Persistent mount opts:
            Parameters: failover.node=10.10.10.213@o2ib:10.10.10.211@o2ib

            mkfs.lustre FATAL: spl_hostid not set. See mkfs.lustre(8)
            mkfs.lustre FATAL: mkfs failed 22
            mkfs.lustre: exiting with 22 (Invalid argument)

            can someone share the detail workaround for this ?

            thanks,
            Abe

            abea@supermicro.com Abe (Inactive) added a comment - Hi All, I'm hitting the same problem with the master release under SLES12 SP2: mkfs.lustre --reformat --mdt --mgs --servicenode= 211@o2ib --backfstype=zfs --fsname=tempAA --index=0 mgs/mdt Permanent disk data: Target: tempAA:MDT0000 Index: 0 Lustre FS: tempAA Mount type: zfs Flags: 0x1065 (MDT MGS first_time update no_primnode ) Persistent mount opts: Parameters: failover.node=10.10.10.213@o2ib:10.10.10.211@o2ib mkfs.lustre FATAL: spl_hostid not set. See mkfs.lustre(8) mkfs.lustre FATAL: mkfs failed 22 mkfs.lustre: exiting with 22 (Invalid argument) can someone share the detail workaround for this ? thanks, Abe
            pjones Peter Jones added a comment -

            Landed for 2.9

            pjones Peter Jones added a comment - Landed for 2.9

            Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/23118/
            Subject: LU-8694 docs: ZFS hostid mkfs.lustre(8) man page update
            Project: fs/lustre-release
            Branch: master
            Current Patch Set:
            Commit: 5de18d44e1074d18f72946b2a19cd1b3a0b749c8

            gerrit Gerrit Updater added a comment - Oleg Drokin (oleg.drokin@intel.com) merged in patch http://review.whamcloud.com/23118/ Subject: LU-8694 docs: ZFS hostid mkfs.lustre(8) man page update Project: fs/lustre-release Branch: master Current Patch Set: Commit: 5de18d44e1074d18f72946b2a19cd1b3a0b749c8

            Nathaniel Clark (nathaniel.l.clark@intel.com) uploaded a new patch: http://review.whamcloud.com/23118
            Subject: LU-8694 docs: ZFS hostid mkfs.lustre(8) man page update
            Project: fs/lustre-release
            Branch: master
            Current Patch Set: 1
            Commit: b3e16b15a405d3993bdd8d0938977f294a670bc6

            gerrit Gerrit Updater added a comment - Nathaniel Clark (nathaniel.l.clark@intel.com) uploaded a new patch: http://review.whamcloud.com/23118 Subject: LU-8694 docs: ZFS hostid mkfs.lustre(8) man page update Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: b3e16b15a405d3993bdd8d0938977f294a670bc6

            This mostly means that we should change the mkfs.lustre message to indicate the spl.ko module needs to be reloaded (or better yet, try to unload it and reload it internally once), and improve the mkfs.lustre.8 man page to prefer running genhostid over spl.ko module parameters, and indicate that the spl.ko module needs to be reloaded after generating a new hostid.

            adilger Andreas Dilger added a comment - This mostly means that we should change the mkfs.lustre message to indicate the spl.ko module needs to be reloaded (or better yet, try to unload it and reload it internally once), and improve the mkfs.lustre.8 man page to prefer running genhostid over spl.ko module parameters, and indicate that the spl.ko module needs to be reloaded after generating a new hostid.

            The fix is simple - reload the SPL module after creating /etc/hostid. Nathaniel will be updating the documents. We are able to format now.

            cliffw Cliff White (Inactive) added a comment - The fix is simple - reload the SPL module after creating /etc/hostid. Nathaniel will be updating the documents. We are able to format now.
            pjones Peter Jones added a comment -

            Nathaniel

            Can you please help improve the docs here?

            Peter

            pjones Peter Jones added a comment - Nathaniel Can you please help improve the docs here? Peter

            People

              utopiabound Nathaniel Clark
              cliffw Cliff White (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              5 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: