<!-- 
RSS generated by JIRA (9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c) at Sat Feb 10 03:22:49 UTC 2024

It is possible to restrict the fields that are returned in this document by specifying the 'field' parameter in your request.
For example, to request only the issue key and summary append 'field=key&field=summary' to the URL of your request.
-->
<rss version="0.92" >
<channel>
    <title>Whamcloud Community JIRA</title>
    <link>https://jira.whamcloud.com</link>
    <description>This file is an XML representation of an issue</description>
    <language>en-us</language>    <build-info>
        <version>9.4.14</version>
        <build-number>940014</build-number>
        <build-date>05-12-2023</build-date>
    </build-info>


<item>
            <title>[LU-15965] Backup-Restore when one of the hard drive on the MDT failed.</title>
                <link>https://jira.whamcloud.com/browse/LU-15965</link>
                <project id="10000" key="LU">Lustre</project>
                    <description>&lt;p&gt;Hello,&lt;/p&gt;

&lt;p&gt;One of the hard drives on MDT will fail soon.&#160; We need to replace it.&#160;&#160;&lt;/p&gt;

&lt;p&gt;The question is after replace the failed hard drive, should the Metadata will be rebuilt automatically or do we have to run something to recover the metadata?&#160; We have IML ver 4.0.3.&#160; Should we&#160; backup the DB before replacing the failed HD?&#160; Please advice.&lt;/p&gt;

&lt;p&gt;Thanks.&#160; &#160;&lt;/p&gt;</description>
                <environment>Dell EMC Redhat 7.4, IML version 4.0.3. Consist of  IML server, 1 head node, 72 compute nodes, 1 MDT,  MDSs, OSSs, and OSTs.</environment>
        <key id="70865">LU-15965</key>
            <summary>Backup-Restore when one of the hard drive on the MDT failed.</summary>
                <type id="9" iconUrl="https://jira.whamcloud.com/images/icons/issuetypes/undefined.png">Question/Request</type>
                                            <priority id="2" iconUrl="https://jira.whamcloud.com/images/icons/priorities/critical.svg">Critical</priority>
                        <status id="5" iconUrl="https://jira.whamcloud.com/images/icons/statuses/resolved.png" description="A resolution has been taken, and it is awaiting verification by reporter. From here issues are either reopened, or are closed.">Resolved</status>
                    <statusCategory id="3" key="done" colorName="success"/>
                                    <resolution id="10000">Done</resolution>
                                        <assignee username="adilger">Andreas Dilger</assignee>
                                    <reporter username="NgocLoan.T.Thai">Loan Thai</reporter>
                        <labels>
                    </labels>
                <created>Thu, 23 Jun 2022 18:11:57 +0000</created>
                <updated>Tue, 5 Jul 2022 17:04:03 +0000</updated>
                            <resolved>Tue, 5 Jul 2022 17:04:03 +0000</resolved>
                                                                        <due></due>
                            <votes>0</votes>
                                    <watches>2</watches>
                                                                            <comments>
                            <comment id="338553" author="adilger" created="Thu, 23 Jun 2022 18:59:35 +0000"  >&lt;p&gt;Yes, definitely backup the MDT.  You should do that now, &lt;b&gt;before&lt;/b&gt; the drive is failing, and probably a second backup right before replacing the drive.  Please see &lt;a href=&quot;https://wiki.lustre.org/Backing_Up_a_Lustre_File_System&quot; class=&quot;external-link&quot; target=&quot;_blank&quot; rel=&quot;nofollow noopener&quot;&gt;https://wiki.lustre.org/Backing_Up_a_Lustre_File_System&lt;/a&gt;  for details.&lt;/p&gt;

&lt;p&gt;Whether the MDT volume is automatically rebuilt when the drive is replaced depends on the underlying RAID hardware on your system.  &lt;b&gt;Lustre itself does not provide any redundancy&lt;/b&gt;, so there must be some kind of RAID at the storage level to protect from individual drive failures.  That is not something that we support, please contact your storage vendor.&lt;/p&gt;</comment>
                            <comment id="338687" author="JIRAUSER17201" created="Fri, 24 Jun 2022 15:39:37 +0000"  >&lt;p&gt;Hi Andreas,&lt;/p&gt;

&lt;p&gt;Thanks for working on my issue.&#160; To do the Device-Level backup, it requires to unmount the target?&#160; I am afraid&#160; the drive will fail after unmount and also we can not unmount the FS.&#160;&#160;Is there other method to backup MDT without unmount the luster FS?&lt;/p&gt;

&lt;p&gt;For the Device-level backup,&#160; i need to use dd command: &#160;&lt;tt&gt;dd if=/dev/{original} of=/dev/{newdev} bs=4M&lt;/tt&gt;.&#160; &#160;&lt;/p&gt;

&lt;p&gt;We have 2 MDS, should i run the dd command on both or just one MDS?&#160;&lt;/p&gt;

&lt;p&gt;Disks layout on the MDS is below.&#160; Should i backup all /dev/sdx or only /dev/sda ?&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;/dev/sda (299 GB)&#160; mount /boot , swap, /, etc.&lt;/li&gt;
	&lt;li&gt;/dev/sdb (11 GB)&#160; &#160;mount /mnt/MGS&lt;/li&gt;
	&lt;li&gt;/dev/sdc (6 TB)&lt;/li&gt;
	&lt;li&gt;/dev/sdd (11 GB) &#160; mount /mnt/MGS&lt;/li&gt;
	&lt;li&gt;/dev/sde (6 TB)&#160;&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;Thank you.&lt;/p&gt;</comment>
                            <comment id="338698" author="adilger" created="Fri, 24 Jun 2022 16:55:54 +0000"  >&lt;p&gt;The &quot;dd&quot; mechanism can be used with a mounted MDT.  Though it may not be a perfect backup (possibly some recently created/deleted files may be missed), it will be much better than having no backup at all.  The less activity on the filesystem, the better the backup will be.&lt;/p&gt;

&lt;p&gt;While you may have multiple MDS, I suspect you only have one MDT on a system this old.  I would recommend backing up at least the MGT and the MGS.  It isn&apos;t clear from your comment which device is the MDT, it should be clearly listed:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;mds# mount | grep lustre
/dev/nvme0n1p1 on /mnt/myth/mgs type lustre (ro,svname=MGS,nosvc,mgs,osd=osd-ldiskfs,user_xattr,errors=remount-ro,_netdev)
/dev/sda on /mnt/myth/ost0000 type lustre (ro,svname=myth-OST0000,mgsnode=192.168.20.1@tcp,osd=osd-ldiskfs,errors=remount-ro,extents,mballoc,_netdev)
/dev/sdb on /mnt/myth/ost0001 type lustre (ro,svname=myth-OST0001,mgsnode=192.168.20.1@tcp,osd=osd-ldiskfs,errors=remount-ro,extents,mballoc,_netdev)
/dev/sdc on /mnt/myth/ost0002 type lustre (ro,svname=myth-OST0002,mgsnode=192.168.20.1@tcp,osd=osd-ldiskfs,errors=remount-ro,extents,mballoc,_netdev)
/dev/sdd on /mnt/myth/ost0003 type lustre (ro,svname=myth-OST0003,mgsnode=192.168.20.1@tcp,osd=osd-ldiskfs,errors=remount-ro,_netdev)
/dev/sde on /mnt/myth/ost0004 type lustre (ro,svname=myth-OST0004,mgsnode=192.168.20.1@tcp,osd=osd-ldiskfs,errors=remount-ro,_netdev)
/dev/nvme0n1p2 on /mnt/myth/mdt0000 type lustre (ro,svname=myth-MDT0000,mgsnode=192.168.20.1@tcp,osd=osd-ldiskfs,errors=remount-ro,iopen_nopriv,user_xattr,_netdev)
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;In this example, the MDT is on &lt;tt&gt;/dev/nvme0n1p2&lt;/tt&gt;.&lt;/p&gt;

&lt;p&gt;That said, having a backup of the Linux OS disk is also very useful, because it would take a lot of time and effort to reinstall and rebuild this system from scratch (if even possible, given the age), and since the OS disk doesn&apos;t change very often then even a single OS backup is probably enough.  The lowest-price 4TB drive here is about $100, so backing up 300GB is about $7 worth of storage (probably less with a larger drive), and it will definitely take &lt;b&gt;much&lt;/b&gt; more time and effort to rebuild the OS drive than $7.&lt;/p&gt;</comment>
                            <comment id="338714" author="JIRAUSER17201" created="Fri, 24 Jun 2022 18:13:58 +0000"  >&lt;p&gt;Sorry for the confusion.&lt;/p&gt;

&lt;p&gt;Our system was built by a vendor so i dont quite understand the setup.&#160; &lt;br/&gt;
We only have 1 MDT (combine MDT/MGS). From the mount output, i got:&lt;/p&gt;
&lt;ul&gt;
	&lt;li&gt;MDS0-0: (please see the attached)&lt;br/&gt;
/dev/mapper/mpathb on /mnt/MGS type lustre (ro)&lt;/li&gt;
&lt;/ul&gt;



&lt;ul&gt;
	&lt;li&gt;MDS0-1: (please see the attached)&lt;br/&gt;
/dev/mapper/mpatha on /mnt/MGS type lustre (ro)&lt;/li&gt;
&lt;/ul&gt;


&lt;p&gt;Is the MDT on /dev/mapper/mpatha&#160; or /dev/mapper/mpathb ?&#160;&#160;&lt;br/&gt;
I will backup both mpatha , mpathb with this command on MDS0-0:&lt;/p&gt;

&lt;p&gt;MDS0-0# &lt;tt&gt;dd if=/dev/mapper/mpatha of=/mnt/my_NFS_mounted bs=4M&lt;/tt&gt;.&#160; &#160; &#160;#-- is it correct? --#&lt;/p&gt;

&lt;p&gt;You are right about the HD for backup.&#160; It is not money but permission.&#160; Our system is standalone/isolated in a secured area.&#160; It is very hard to get an approval for add in devices.&#160; But i will take your advice seriously and work on it.&lt;/p&gt;</comment>
                            <comment id="338718" author="adilger" created="Fri, 24 Jun 2022 18:26:18 +0000"  >&lt;p&gt;Sorry, it isn&apos;t clear at all that the &lt;tt&gt;/mnt/MGS&lt;/tt&gt; device is the right one.  Check the output from &quot;&lt;tt&gt;mount | grep lustre&lt;/tt&gt;&quot; on both MDS servers to see which device is mounting MDT0000, as was shown in my example output above.&lt;/p&gt;</comment>
                            <comment id="338719" author="adilger" created="Fri, 24 Jun 2022 18:28:26 +0000"  >&lt;p&gt;PS: I&apos;d assume you know which RAID device is holding the failed drive?  If yes, that is the device that should be backed up.&lt;/p&gt;</comment>
                            <comment id="339012" author="JIRAUSER17201" created="Tue, 28 Jun 2022 14:14:13 +0000"  >&lt;p&gt;GM Andrea, i am sorry for a late response as i was out on TDY yesterday.&lt;/p&gt;

&lt;p&gt;From the MDS0-0, the output of &quot;mount | grep lustre&quot; is:&lt;br/&gt;
&#160; &#160; /dev/mapper/mpathb&#160; &#160; /mnt/lustre-MDT0000&lt;/p&gt;

&lt;p&gt;From the MDS0-1, the output of &quot;mount | grep lustre&quot; is::&lt;br/&gt;
&#160; &#160;/dev/mapper/mpatha&#160; &#160;/mnt/MGS&lt;/p&gt;

&lt;p&gt;So i will login MDS0-0 and backup the MDT with this command: &#160;&lt;br/&gt;
&lt;tt&gt;&#160; dd if=/dev/mapper/mpathb of=/mnt/backupMDT0000_onNFSmounted bs=4M&lt;/tt&gt;. &#160;&lt;/p&gt;

&lt;p&gt;&#160;&lt;/p&gt;

&lt;p&gt;and&#160; also it is still good to backup the MGS right? &lt;tt&gt;dd if=/dev/mapper/mpatha of=/mnt/backupMGS_onNFSmounted bs=4M&lt;/tt&gt;. &#160;&lt;/p&gt;

&lt;p&gt;Thanks.&lt;/p&gt;</comment>
                            <comment id="339036" author="adilger" created="Tue, 28 Jun 2022 16:30:12 +0000"  >&lt;p&gt;Yes, making a backup of both is a good idea. &lt;/p&gt;</comment>
                            <comment id="339042" author="JIRAUSER17201" created="Tue, 28 Jun 2022 17:09:02 +0000"  >&lt;p&gt;Hi Andreas,&lt;/p&gt;

&lt;p&gt;I ran dd command to backup the MDT to my NFS but had to break it out because i dont have enough space on my nfs.&lt;br/&gt;
Total space of MDT is 4.4TB, used space is 418GB.&#160; My nfs only has about 1TB.&#160; The dd command seems to backup the entire 4.4TB of MDT.&lt;/p&gt;

&lt;p&gt;Is there another way just to back up the used space on MDT?&#160; Thanks.&lt;/p&gt;</comment>
                            <comment id="339063" author="adilger" created="Tue, 28 Jun 2022 19:49:06 +0000"  >&lt;p&gt;Using &apos;dd&apos; is by far the fastest and most reliable way to do a backup and restore, and would be my strong recommendation for you to use.  Yes, it does a full backup of the entire MDT device, but it also ensures that if it needs to  be restored it will be exactly the same as before, and it can nominally be done while the system is in use (though I would recommend to make a second backup when you are able to stop the MDT, or immediately before the drive is replaced). &lt;/p&gt;

&lt;p&gt;You could consider piping the &quot;dd&quot; output through bzip2 to try and compress it, but I don&apos;t know how much compression you will get as this depends heavily on how full the MDT is and the lifetime of the system.  this is not difficult to try, something like the following, but of course compressing and decompressing the backup will make that process take significantly longer:&lt;/p&gt;
&lt;div class=&quot;preformatted panel&quot; style=&quot;border-width: 1px;&quot;&gt;&lt;div class=&quot;preformattedContent panelContent&quot;&gt;
&lt;pre&gt;dd if=/dev/mapper/mpathb bs=4M | bzip2 -9 &amp;gt; /mnt/backupMDT0000_onNFSmounted.img.bz2
&lt;/pre&gt;
&lt;/div&gt;&lt;/div&gt;
&lt;p&gt;That said, it makes sense to start such a backup now while other options are considered.&lt;/p&gt;

&lt;p&gt;There are measures that could be taken to improve the compression of the dd image, but that would involve unmounting the MDT and writing a lot of zeroes to it, and this is not advisable if a drive is already near failure.&lt;/p&gt;

&lt;p&gt;The user manual also describes how to use &quot;tar&quot; to do a backup/restore, and this &lt;em&gt;may&lt;/em&gt; take less space (depends on how full the MDT is), but will take much longer and put a lot more load on the MDT drives.  This would also require reformatting the MDT before restoring the backup, and will not produce an &quot;exact&quot; backup and restore process.  It also isn&apos;t clear to me what state your configuration is, how the MDT was initially formatted, the software versions, and your experience level in order to use that approach, and given the secure nature of the system it would be extremely difficult to assist you.  This kind of system administration task is really outside the scope of the Lustre support contract.&lt;/p&gt;

&lt;p&gt;You really need to reconsider the relatively low cost of installing a suitable backup drive, not just for the MDT, but also for the OS images.  Given the lack of familiarity with the system, restoring a failed OS drive might prove to be very difficult and time consuming.  This is very worthwhile given the risk of potentially of losing all of the data in the filesystem if the MDT is lost.   If there is some administration overhead to install a new drive there, then consider making it a large one so that it will last a long time and can hold multiple MDT backups.  Most secure sites I&apos;ve worked with have less objection to bringing in new equipment, and more objection to removing equipment, but even a few hundred dollar 14TB drive in a USB enclosure would be sufficient for this use and could be left onsite afterward, since it will be doing linear writes during the backup and could sustain full bandwidth, so would finish a full MDT backup in about 48h (I&apos;m assuming based on the age of the system, USB2.0 ~= 25MB/s).&lt;/p&gt;</comment>
                            <comment id="339131" author="JIRAUSER17201" created="Wed, 29 Jun 2022 13:38:19 +0000"  >&lt;p&gt;I will do the dd backup and compress the data as you suggested.&#160; I have a couple big drives, 10 and 14tb.&#160; I will figure out and get approval to add them in.&#160; I will post the update.&#160; Thanks for your assistant.&lt;/p&gt;</comment>
                            <comment id="339589" author="JIRAUSER17201" created="Tue, 5 Jul 2022 16:14:13 +0000"  >&lt;p&gt;I followed your instruction to backup (with the compression) the MDT and MGS.&#160; Using Dell MDSM tool to manage the MDT and replacing the failed disk.&#160; Data is rebuilding now.&#160; &#160;Please close this ticket as resolved.&#160; Thanks so much for your help Andreas,&lt;/p&gt;</comment>
                    </comments>
                    <attachments>
                    </attachments>
                <subtasks>
                    </subtasks>
                <customfields>
                                                                                                                                                                <customfield id="customfield_10890" key="com.atlassian.jira.plugins.jira-development-integration-plugin:devsummary">
                        <customfieldname>Development</customfieldname>
                        <customfieldvalues>
                            
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                        <customfield id="customfield_10390" key="com.pyxis.greenhopper.jira:gh-lexo-rank">
                        <customfieldname>Rank</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>1|i02svb:</customfieldvalue>

                        </customfieldvalues>
                    </customfield>
                                                                <customfield id="customfield_10090" key="com.pyxis.greenhopper.jira:gh-global-rank">
                        <customfieldname>Rank (Obsolete)</customfieldname>
                        <customfieldvalues>
                            <customfieldvalue>9223372036854775807</customfieldvalue>
                        </customfieldvalues>
                    </customfield>
                                                                                                                                                                                                                                                                                                                                                                                                                </customfields>
    </item>
</channel>
</rss>