[LU-12661] sanity test_817: FAIL: failed to execute 'true' command Created: 13/Aug/19 Updated: 23/Jan/23 |
|
| Status: | Open |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.13.0, Lustre 2.12.4, Lustre 2.12.5, Lustre 2.12.6, Lustre 2.12.7, Lustre 2.12.8 |
| Fix Version/s: | Upstream |
| Type: | Bug | Priority: | Minor |
| Reporter: | Maloo | Assignee: | Dongyang Li |
| Resolution: | Unresolved | Votes: | 0 |
| Labels: | always_except, rhel8, sles12, sles15 |
| Severity: | 3 |
| Rank (Obsolete): | 9223372036854775807 |
| Description |
|
This issue was created by maloo for jianyu <yujian@whamcloud.com>
This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/60697574-bbc3-11e9-a25b-52540065bddc
test_817 failed with the following error:
== sanity test 817: nfsd won't cache write lock for exec file ======================================== 13:19:00 (1565381940)
/usr/lib64/lustre/tests/sanity.sh: line 21700: /mnt/lustre/nfsexp/true: Text file busy
sanity test_817: @@@@@@ FAIL: failed to execute 'true' command |
| Comments |
| Comment by James A Simmons [ 10/Sep/19 ] |
|
Is this an ARM-only bug? |
| Comment by James Nunez (Inactive) [ 16/Sep/19 ] |
|
We see this with RHEL8 clients: https://testing.whamcloud.com/test_sets/fff5bbfc-d70c-11e9-9fc9-52540065bddc |
| Comment by Peter Jones [ 18/Sep/19 ] |
|
Dongyang, could you please investigate? Thanks, Peter |
| Comment by Gerrit Updater [ 08/Nov/19 ] |
|
Li Dongyang (dongyangli@ddn.com) uploaded a new patch: https://review.whamcloud.com/36712 |
| Comment by Andreas Dilger [ 15/Nov/19 ] |
|
On new kernels, nfsd does not release the file immediately after a write, so executing it fails with ETXTBSY regardless of whether the NFS export is backed by a Lustre mount or not. That is because newer kernels delay the fput() of the write file descriptor, which keeps the file open in write mode and prevents it from being opened in exec mode. |
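The ETXTBSY behaviour described above can be seen on a local filesystem, with no nfsd or Lustre involved. The following is a minimal sketch, not the code of test_817: it copies `true` into a temporary directory (the directory and variable names are illustrative assumptions) and tries to execute the copy while the shell still holds a write-mode descriptor on it. On kernels that enforce the write/exec exclusion, the first attempt is expected to fail with "Text file busy" and the second to succeed.

```shell
#!/bin/sh
# Sketch only: reproduces "Text file busy" locally; paths and names are illustrative.
workdir=$(mktemp -d) || exit 1

# Find a 'true' binary to copy (its location varies between distributions).
src=
for candidate in /bin/true /usr/bin/true; do
    if [ -x "$candidate" ]; then src=$candidate; break; fi
done
[ -n "$src" ] || { echo "no 'true' binary found" >&2; exit 1; }
cp "$src" "$workdir/true"

# fd 3 keeps the copy open for writing while the command inside the braces runs.
# This stands in for the state described above, where the writer's file
# reference has not been dropped yet.
{ "$workdir/true"; } 2>/dev/null 3>>"$workdir/true"
rc_open=$?

# The group has ended, so fd 3 is closed and no writer remains.
"$workdir/true"
rc_closed=$?

echo "exec while open for write: exit status $rc_open"
echo "exec after close:          exit status $rc_closed"
rm -rf "$workdir"
```

The exact non-zero status of the first attempt depends on the shell, so only "zero or non-zero" is meaningful here.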
| Comment by Andreas Dilger [ 15/Nov/19 ] |
|
Oleg, isn't there a patch to drop the write mode lock more quickly, or similar, that would fix this? |
| Comment by Gerrit Updater [ 16/Dec/19 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/36712/ |
| Comment by Andreas Dilger [ 09/Jan/20 ] |
|
The patch https://review.whamcloud.com/32265 " |
| Comment by Dongyang Li [ 17/Jan/20 ] |
|
I just tested 32265 on a CentOS 8 box to be sure. The issue is still there. It can be reproduced on 4.14+ without Lustre, so again it is a kernel problem. |
| Comment by Jian Yu [ 10/Feb/20 ] |
|
The same failure also occurred on SLES15 SP1 client (Kernel version 4.12.14-197.29-default): |
| Comment by James A Simmons [ 10/Feb/20 ] |
|
I see this with the Linux Lustre client as well. |
| Comment by Gerrit Updater [ 16/Apr/20 ] |
|
Jian Yu (yujian@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/38259 |
| Comment by Gerrit Updater [ 01/May/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/38259/ |
| Comment by Hongchao Zhang [ 07/Aug/20 ] |
|
It also fails on SLES 12.5 (kernel version 4.12.14-122.20-default). |
| Comment by Andreas Dilger [ 09/Sep/20 ] |
|
Dongyang, I see comment-263035 and comment-276924 report failures for SLES 4.12.14 kernels, but the patch skips kernels 4.14 and later. Is that a typo in the patch? Should this test be skipped for kernels >= 4.12 instead? |
| Comment by Gerrit Updater [ 09/Sep/20 ] |
|
Andreas Dilger (adilger@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39838 |
| Comment by Dongyang Li [ 09/Sep/20 ] |
|
Andreas, the patch only skips kernels 4.14 and later because we were only seeing failures on RHEL8. I suspect we might need to skip kernels >= 4.10; we will see. |
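The question of which threshold covers the SLES kernels reported above comes down to a numeric version comparison. The sketch below illustrates that comparison only and is not the code from the patch: `kver_to_int` and `skip_needed` are hypothetical helper names, and the encoding (major*1000000 + minor*1000 + patch) is an assumption chosen for readability.

```shell
#!/bin/sh
# Hypothetical helper: turn "4.12.14-197.29-default" into the integer 4012014.
kver_to_int() {
    # Drop everything from the first '-' on, then split the remainder on dots.
    IFS=. read -r major minor patch rest <<EOF
${1%%-*}
EOF
    echo $(( ${major:-0} * 1000000 + ${minor:-0} * 1000 + ${patch:-0} ))
}

# Hypothetical helper: succeeds when kernel version $1 is at or above threshold $2.
skip_needed() {
    [ "$(kver_to_int "$1")" -ge "$(kver_to_int "$2")" ]
}

# Compare the two SLES kernel strings from this ticket against both thresholds.
for kernel in 4.12.14-197.29-default 4.12.14-122.20-default; do
    for threshold in 4.14 4.10; do
        if skip_needed "$kernel" "$threshold"; then
            verdict="skipped"
        else
            verdict="not skipped"
        fi
        echo "$kernel with threshold >= $threshold: $verdict"
    done
done
```

With these inputs, the 4.12.14 kernels fall below a 4.14 threshold but above a 4.10 one, which is the gap Andreas's question points at.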
| Comment by Gerrit Updater [ 10/Sep/20 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/39838/ |
| Comment by Gerrit Updater [ 10/Sep/20 ] |
|
Andreas Dilger (adilger@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/39863 |
| Comment by Alena Nikitenko [ 30/Nov/21 ] |
|
+1 in 2.12.8 tests: https://testing.whamcloud.com/test_sets/f95832ef-44a3-4376-8e8e-7b2bb408d560
== sanity test 817: nfsd won't cache write lock for exec file ======================================== 04:32:28 (1637382748)
/usr/lib64/lustre/tests/sanity.sh: line 22011: /mnt/lustre/nfsexp/true: Text file busy
sanity test_817: @@@@@@ FAIL: failed to execute 'true' command |
| Comment by Gerrit Updater [ 03/Dec/21 ] |
|
"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/39863/ |
| Comment by Gerrit Updater [ 23/Jan/23 ] |
|
"jsimmons <jsimmons@infradead.org>" uploaded a new patch: https://review.whamcloud.com/c/fs/lustre-release/+/49732 |