Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-2877

sanity test_34h failed Multiop blocked on ftruncate, pid=

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Critical
    • Lustre 2.4.0
    • Lustre 2.4.1
    • 3
    • 6951

    Description

      Happens pretty frequently in maloo since April 2012.
      Sample failures:
      https://maloo.whamcloud.com/test_sets/33b06c22-8065-11e2-9b82-52540035b04c

      https://maloo.whamcloud.com/sub_tests/9ccebafe-807b-11e2-b777-52540035b04c

      I think the underlying problem is that multiop as written can block on not just truncate lock, but also at group lock getting time it can block on cache flush.

              dd if=/dev/zero of=$DIR/$tfile bs=1M count=10 || error
              $MULTIOP $DIR/$tfile OG${gid}T${sz}g${gid}c &
              MULTIPID=$!
              sleep 2
      

      Co perhaps we need to add sync or other way of cache flush after dd before doing multiop.

      Attachments

        Activity

          People

            green Oleg Drokin
            green Oleg Drokin
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: