[LU-3701] Failure on test suite posix subtest test_1: fcntl.18/fcntl.35 Unresolved Created: 05/Aug/13 Updated: 22/Nov/13 Resolved: 09/Sep/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.5.0 |
| Fix Version/s: | Lustre 2.5.0, Lustre 2.4.2 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Maloo | Assignee: | Bruno Faccini (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | mn1 | ||
| Environment: |
server and client: lustre-master build: 1952 |
||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9549 | ||||||||
| Description |
|
This issue was created by maloo for sarah <sarah@whamcloud.com> This issue relates to the following test suite run: http://maloo.whamcloud.com/test_sets/261745ba-fb5b-11e2-8c6e-52540035b04c. The sub-test test_1 failed with the following error:
test log SUCCESS SUMMARY:
News POSIX successes: 1
Test Name Baseline Lustre Report
read.15 Failed Succeeded
FAILURE SUMMARY:
POSIX failures: 2
Test Name Baseline Lustre Report
fcntl.18 Succeeded Unresolved
fcntl.35 Succeeded Unresolved
FAILURE DESCRIPTIONS:
####################################################
Test Name: fcntl.18 Unresolved
Test Description:
For the XNFS specification:
If the implementation supports file locking for files residing on
a remote file system: On a call to fcntl(fildes, F_SETLKW, arg)
when the lock specified by arg can not be set, waits until the
lock can be set.
For the XSH specification:
On a call to fcntl(fildes, F_SETLKW, arg) when the lock specified
by arg can not be set, waits until the lock can be set.
Posix Ref: Component FCNTL Assertion 6.5.2.2-23(A)
Test Information:
deletion reason: External error - waitsync failed
deletion reason: External error - waitsync failed
####################################################
Test Name: fcntl.35 Unresolved
Test Description:
For the XNFS specification:
If the implementation supports file locking for files residing on
a remote file system: EINTR in errno and -1 returned by fcntl() if
the operation is interrupted by a signal.
For the XSH specification:
EINTR in errno and -1 returned by fcntl() if the operation is
interrupted by a signal.
Posix Ref: Component FCNTL Assertion 6.5.2.4-40(A)
Test Information:
child process timed out
|
| Comments |
| Comment by Andreas Dilger [ 08/Aug/13 ] |
|
Is the client being mounted with "-o flock"? |
| Comment by Oleg Drokin [ 08/Aug/13 ] |
|
in client dmesg: Lustre: DEBUG MARKER: Run POSIX test against lustre filesystem LustreError: 11-0: lustre-MDT0000-mdc-ffff880331591c00: Communicating with 192.168.4.20@o2ib, operation ldlm_enqueue failed with -11. LustreError: 11-0: lustre-MDT0000-mdc-ffff880331591c00: Communicating with 192.168.4.20@o2ib, operation ldlm_enqueue failed with -11. LustreError: 11-0: lustre-MDT0000-mdc-ffff880331591c00: Communicating with 192.168.4.20@o2ib, operation ldlm_enqueue failed with -11. LustreError: 11-0: lustre-MDT0000-mdc-ffff880331591c00: Communicating with 192.168.4.20@o2ib, operation ldlm_enqueue failed with -11. LustreError: Skipped 8 previous similar messages LustreError: 11-0: lustre-MDT0000-mdc-ffff880331591c00: Communicating with 192.168.4.20@o2ib, operation ldlm_enqueue failed with -35. LustreError: Skipped 2 previous similar messages But nothing like that on MDS |
| Comment by Jian Yu [ 12/Aug/13 ] |
|
Hi Oleg, On Lustre b2_4 branch, this is a regression issue introduced by the patch in build http://build.whamcloud.com/job/lustre-b2_4/28/ : https://maloo.whamcloud.com/test_sets/23814618-02b1-11e3-a4b4-52540035b04c FYI, the posix test passed on Lustre b2_4 build #27. |
| Comment by Jian Yu [ 12/Aug/13 ] |
|
Hi Oleg,
On master branch, posix test passed on build #1560. However, build #1561 and #1562 were not tested. The test failed on build #1563. Here are the patches in those builds: Build #1561: Only " |
| Comment by Peter Jones [ 12/Aug/13 ] |
|
Oleg What do you suggest here? Peter |
| Comment by Peter Jones [ 20/Aug/13 ] |
|
Reverted |
| Comment by Sebastien Buisson (Inactive) [ 20/Aug/13 ] |
|
Hi, If the patch http://review.whamcloud.com/6415 from Thanks, |
| Comment by Peter Jones [ 20/Aug/13 ] |
|
Bruno will look into this |
| Comment by Bruno Faccini (Inactive) [ 21/Aug/13 ] |
|
According to the failing Posix tests description and BTW, where can I find the Posix test suite ? It does not appear to be part of lustre-tests. |
| Comment by Jian Yu [ 22/Aug/13 ] |
After installing the above package on test node, we can perform lustre/tests/posix.sh to install, build and run LSB-VSX POSIX test suite on $BASELINE_FS and Lustre, then compare the test results. |
| Comment by Bruno Faccini (Inactive) [ 26/Aug/13 ] |
|
Hello Jian, Enter the root password:^M
Password: ^M
losetup: /dev/loop0: device is busy^M
Aborting installation^M
mv: cannot stat `/usr/src/posix/ext4/tet/test_sets/results/0002e': No such file or directory
child process exited abnormally
while executing
"system "mv $results_dir/0002e $results_dir/lustre_baseline""
(file "build-posix.exp" line 161)^M
failed to build POSIX test suite.
posix test_1: @@@@@@ FAIL: Setup POSIX test suite failed
Trace dump:
= /usr/lib64/lustre/tests/test-framework.sh:4200:error_noexit()
= /usr/lib64/lustre/tests/test-framework.sh:4227:error()
= ./posix.sh:106:test_1()
= /usr/lib64/lustre/tests/test-framework.sh:4466:run_one()
= /usr/lib64/lustre/tests/test-framework.sh:4499:run_one_logged()
= /usr/lib64/lustre/tests/test-framework.sh:4369:run_test()
= ./posix.sh:118:main()
Dumping lctl log to /tmp/test_logs/1377521275/posix.test_1.*.1377521325.log
Dumping logs only on local client.
and this looks like some odd loop-device configuration issue.I am trying to debug+fix this, but any other help and hint are welcome. On the other hand, I think that original test/patch for |
| Comment by Jian Yu [ 27/Aug/13 ] |
Please add the following test parameter into the commit message to see whether posix test suite can pass or not: Test-Parameters: testlist=posix |
| Comment by Bruno Faccini (Inactive) [ 28/Aug/13 ] |
|
I did, but seems that only "posix" test ran, is it expected behavior ? I thought that Test-Parameters will run tests in addition to the default set, unless "fortestonly" is specified ... On the other hand "posix" test has been successful, so I need to check now that |
| Comment by Bruno Faccini (Inactive) [ 29/Aug/13 ] |
|
In fact auto-tests default set finally ran against build/patch. Also, I checked successfully that new patch/change http://review.whamcloud.com/7453 also preserves correct behavior against Will ask for reviews now and if ok, need to provide at least a b2_1 version and also push it for patch-less Client Kernel integration (in addition of patch for |
| Comment by Peter Jones [ 09/Sep/13 ] |
|
Landed to 2.5. |
| Comment by Bruno Faccini (Inactive) [ 09/Sep/13 ] |
|
b2_1 patch version is at http://review.whamcloud.com/7586. Patch-less Client Kernel integration will occur automatically now that master (http://review.whamcloud.com/7453) patch landed. |
| Comment by Jian Yu [ 22/Nov/13 ] |
|
Patch http://review.whamcloud.com/7453 was cherry-picked to Lustre b2_4 branch. |