Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-1570

llog_cat.c:428:llog_cat_process_flags() catlog 0x27500007 crosses index zero

Details

    • 3
    • 10435

    Description

      This message shows up periodically on the MDS of our test filesystem. What needs to be done about it?

      Attachments

        Issue Links

          Activity

            [LU-1570] llog_cat.c:428:llog_cat_process_flags() catlog 0x27500007 crosses index zero

            I believe this was fixed with LU-7934.

            adilger Andreas Dilger added a comment - I believe this was fixed with LU-7934 .

            Linking to LU-1586, since this is where the "no free catalog slots for log" message is being discussed.

            adilger Andreas Dilger added a comment - Linking to LU-1586 , since this is where the "no free catalog slots for log" message is being discussed.
            nedbass Ned Bass (Inactive) added a comment - - edited

            I wonder if this is related? On the same filesystem, we started getting 'Bad address' errors from the client:

            On the client:

            # hype318 /p/lcrater2/bass6 > dd if=/dev/zero of=foo count=1
            dd: opening `baz': Bad address
            

            On the MDS:

            Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_cat.c:81:llog_cat_new_log()) no free catalog slots for log...
            Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_cat.c:81:llog_cat_new_log()) Skipped 3 previous similar messages
            Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_obd.c:454:llog_obd_origin_add()) write one catalog record failed: -28
            Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_obd.c:454:llog_obd_origin_add()) Skipped 3 previous similar messages
            Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(mdd_object.c:1330:mdd_changelog_data_store()) changelog failed: rc=-28 op17 t[0x200de60af:0x17913:0x0]
            
            nedbass Ned Bass (Inactive) added a comment - - edited I wonder if this is related? On the same filesystem, we started getting 'Bad address' errors from the client: On the client: # hype318 /p/lcrater2/bass6 > dd if=/dev/zero of=foo count=1 dd: opening `baz': Bad address On the MDS: Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_cat.c:81:llog_cat_new_log()) no free catalog slots for log... Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_cat.c:81:llog_cat_new_log()) Skipped 3 previous similar messages Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_obd.c:454:llog_obd_origin_add()) write one catalog record failed: -28 Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(llog_obd.c:454:llog_obd_origin_add()) Skipped 3 previous similar messages Jun 28 12:08:45 zwicky-mds2 kernel: LustreError: 11841:0:(mdd_object.c:1330:mdd_changelog_data_store()) changelog failed: rc=-28 op17 t[0x200de60af:0x17913:0x0]

            There's other "bad" stuff going on in the logs due to recovery testing I was performing. The watchdog timers for mount.lustre and llog_process_th probably need a separate issue opened.

            nedbass Ned Bass (Inactive) added a comment - There's other "bad" stuff going on in the logs due to recovery testing I was performing. The watchdog timers for mount.lustre and llog_process_th probably need a separate issue opened.

            First look at the code suggests this is just a warning, not a serious error. Can we get a syslog from the MDS? We would like to see if this message is associated with any other behavior we can identify as bad.

            bogl Bob Glossman (Inactive) added a comment - First look at the code suggests this is just a warning, not a serious error. Can we get a syslog from the MDS? We would like to see if this message is associated with any other behavior we can identify as bad.

            People

              bogl Bob Glossman (Inactive)
              nedbass Ned Bass (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: