Lustre / LU-966

post-fsck MDS LBUG during recovery due to missing FID


    • Type: Bug
    • Resolution: Fixed
    • Priority: Minor
    • Fix Version/s: Lustre 2.2.0, Lustre 2.1.1
    • Affects Version/s: Lustre 2.0.0
    • None
    • 3
    • 4274

      We have already hit this LBUG twice:
      ================================
      ASSERTION(mdd_object_exists(obj)) failed: FID is [0x20001a604:0xc9:0x0]
      LustreError: 34372:0:(mdd_object.c:91:mdd_la_get()) LBUG
      ================================

      It always occurred after an MDS crash followed by shine fsck and shine start, during the client recovery window.
      Each time we managed to restart the MDT/MDS using abort_recovery.

      Suppose the missing FID was destroyed by the fsck run on the MDT after the MDS had crashed for some unrelated reason. In my opinion, no scenario other than such an "external" action can lead to this situation (but what do you think?). If so, could this assertion/LBUG be replaced with just a warning message, at least during the client recovery phase?
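To make the proposal concrete, here is a minimal standalone sketch in C. It is not Lustre code and not the fix that was eventually applied: `sketch_fid`, `sketch_obj`, `sketch_la_get` and the `in_recovery` flag are all hypothetical names invented for illustration, and a plain `assert()` stands in for the LASSERT/LBUG. It only shows the suggested behaviour: warn and return an error when the object is missing during recovery, and keep the assertion otherwise.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical stand-in for a FID; not the real Lustre structure. */
struct sketch_fid {
    uint64_t seq;
    uint32_t oid;
    uint32_t ver;
};

/* Hypothetical stand-in for an MDD object. */
struct sketch_obj {
    struct sketch_fid fid;
    bool exists; /* false if, e.g., fsck removed the on-disk object */
};

/*
 * Attribute lookup sketch (assumption: the caller knows whether the
 * server is still in client recovery and passes that as in_recovery).
 * Returns 0 on success, or -ENOENT if the object is missing while in
 * recovery. Outside recovery the original assertion is kept.
 */
int sketch_la_get(const struct sketch_obj *obj, bool in_recovery)
{
    if (!obj->exists && in_recovery) {
        /* Proposed behaviour: warn instead of crashing the server. */
        fprintf(stderr,
                "warning: FID [0x%llx:0x%x:0x%x] missing during recovery\n",
                (unsigned long long)obj->fid.seq,
                (unsigned)obj->fid.oid, (unsigned)obj->fid.ver);
        return -ENOENT;
    }

    /* Stand-in for the existing assertion; aborts if the object is missing. */
    assert(obj->exists);
    return 0;
}
```

With the FID from the trace above, a replayed request against the missing object would get `-ENOENT` back and a warning would be logged, instead of the whole MDS going down. Whether every caller can cope with that error during replay is exactly the open question.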

            bobijam Zhenyu Xu
            louveta Alexandre Louvet (Inactive)