[LU-1696] Test failure on test suite sanity, subtest test_49 Created: 01/Aug/12  Updated: 07/Aug/12  Resolved: 07/Aug/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.3.0
Fix Version/s: Lustre 2.3.0

Type: Bug Priority: Blocker
Reporter: Maloo Assignee: Yang Sheng
Resolution: Fixed Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 4506

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/79266140-dba0-11e1-81e3-52540035b04c.

The sub-test test_49 failed with the following error:

test failed to respond and timed out



 Comments   
Comment by Peter Jones [ 06/Aug/12 ]

yangsheng will look into this one

Comment by Yang Sheng [ 06/Aug/12 ]

Looks like 'dd' has hang and cause below part infinite loop:

        # loop until dd process exits
        while ps ax -opid | grep -q $dd_pid; do
                $LCTL set_param $osc1_mppc=$((RANDOM % 256 + 1))
                sleep $((RANDOM % 5 + 1))
        done

But i cannot found 'dd' process in stacktrace file. So i suspect it just because the script use 'grep -q $dd_pid' to match the pid, It maybe match partial number.

Comment by Yang Sheng [ 06/Aug/12 ]

Patch upload to:http://review.whamcloud.com/3544

Comment by Peter Jones [ 07/Aug/12 ]

Landed for 2.3

Generated at Sat Feb 10 01:18:53 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.