Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-8135

sanity test_101g fails with 'not all RPCs are 16 MiB BRW rpcs'

    XMLWordPrintable

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.9.0
    • Lustre 2.9.0
    • None
    • autotest review-zfs
    • 3
    • 9223372036854775807

    Description

      sanity test 101g fails with

      'not all RPCs are 16 MiB BRW rpcs' 
      

      In the test log on the client, we can see the ‘File too large’ error returned when dd tries to write to the file system

      == sanity test 101g: Big bulk(4/16 MiB) readahead ==================================================== 11:11:42 (1462903902)
      CMD: onyx-50vm8 /usr/sbin/lctl get_param obdfilter.*.brw_size |
      			while read s; do echo ost1 \$s; done
      CMD: onyx-50vm8 /usr/sbin/lctl get_param obdfilter.*.brw_size |
      			while read s; do echo ost2 \$s; done
      CMD: onyx-50vm8 /usr/sbin/lctl set_param -n obdfilter.lustre-OST*.brw_size=16M 		osd-*.lustre-OST*.brw_size=16M 2>&1
      remount client to enable large RPC size
      CMD: onyx-50vm1.onyx.hpdd.intel.com grep -c /mnt/lustre' ' /proc/mounts
      Stopping client onyx-50vm1.onyx.hpdd.intel.com /mnt/lustre (opts:)
      CMD: onyx-50vm1.onyx.hpdd.intel.com lsof -t /mnt/lustre
      CMD: onyx-50vm1.onyx.hpdd.intel.com umount  /mnt/lustre 2>&1
      Starting client: onyx-50vm1.onyx.hpdd.intel.com:  -o user_xattr,flock onyx-50vm7@tcp:/lustre /mnt/lustre
      CMD: onyx-50vm1.onyx.hpdd.intel.com mkdir -p /mnt/lustre
      CMD: onyx-50vm1.onyx.hpdd.intel.com mount -t lustre -o user_xattr,flock onyx-50vm7@tcp:/lustre /mnt/lustre
      dd: error writing '/mnt/lustre/f101g.sanity': File too large
      2+0 records in
      1+0 records out
      16777216 bytes (17 MB) copied, 0.978339 s, 17.1 MB/s
      0+0 records in
      0+0 records out
      0 bytes (0 B) copied, 0.00599718 s, 0.0 kB/s
      0 RPCs
       sanity test_101g: @@@@@@ FAIL: not all RPCs are 16 MiB BRW rpcs 
      

      We’ve had three occurrences of this failure since the test was introduced by http://review.whamcloud.com/19368 a little over a week ago and, so far, only seen on review-zfs-part-1:

      2016-05-09 - https://testing.hpdd.intel.com/test_sets/145be8f2-1661-11e6-b5f1-5254006e85c2
      2016-05-10 - https://testing.hpdd.intel.com/test_sets/f2b9b50c-1701-11e6-b5f1-5254006e85c2
      2016-05-11 - https://testing.hpdd.intel.com/test_sets/73c7a9a4-179f-11e6-b5f1-5254006e85c2

      Attachments

        Issue Links

          Activity

            People

              jay Jinshan Xiong (Inactive)
              jamesanunez James Nunez (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: