[LU-10094] sanity test_17f: 'ls' fails with "ls: reading directory *: Input/output error" Created: 06/Oct/17 Updated: 04/Sep/19 Resolved: 15/Aug/19 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.11.0, Lustre 2.12.0, Lustre 2.13.0, Lustre 2.10.7, Lustre 2.12.1 |
| Fix Version/s: | Lustre 2.13.0, Lustre 2.12.3 |
| Type: | Bug | Priority: | Critical |
| Reporter: | James Casper | Assignee: | Lai Siyao |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | ppc | ||
| Environment: |
trevis, full, x86_64 servers, ppc clients |
||
| Issue Links: |
|
||||||||||||||||||||||||
| Severity: | 3 | ||||||||||||||||||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||||||||||||||||||
| Description |
|
https://testing.whamcloud.com/test_sessions/ba995751-659c-4e63-9b5b-fbf101137b78 From test_log: ls: reading directory /mnt/lustre/d17f.sanity: Input/output error sanity test_17f: @@@@@@ FAIL: test_17f failed with 2 Trace dump: = /usr/lib64/lustre/tests/test-framework.sh:5289:error() = /usr/lib64/lustre/tests/test-framework.sh:5565:run_one() = /usr/lib64/lustre/tests/test-framework.sh:5604:run_one_logged() = /usr/lib64/lustre/tests/test-framework.sh:5451:run_test() = /usr/lib64/lustre/tests/sanity.sh:459:main() |
| Comments |
| Comment by James Nunez (Inactive) [ 02/May/18 ] |
|
Several sanity tests have 'ls' fail with the "reading directory ... Input/output error" for PPC architectures including test_17f, 18, 22, 24v, 24A, 32b, 32d, 32f, 32h, 48a, 48b, 48c, 51a, 51b, 56ab, and 154a. For full test group results, the first time we see these tests fail for PPC is on 2017-09-17 20:43:36 UTC for master build # 3642, version 2.10.53.1. |
| Comment by Jian Yu [ 27/Jun/19 ] |
|
On ppc64 client: # ls /mnt/lustre/ ls: reading directory /mnt/lustre/: Input/output error Dmesg: [53353.003090] Lustre: Mounted lustre-client [53354.406948] Lustre: DEBUG MARKER: Using TIMEOUT=20 [53372.104367] Lustre: 30604:0:(mdc_request.c:1549:mdc_read_page()) Page-wide hash collision: 0xfeffffffffffffff [53378.035937] Lustre: lustre-OST0000-osc-c0000000788f6800: disconnect after 24s idle [54485.730632] Lustre: 30675:0:(mdc_request.c:1549:mdc_read_page()) Page-wide hash collision: 0xfeffffffffffffff [54485.730769] Lustre: 30675:0:(mdc_request.c:1549:mdc_read_page()) Skipped 1 previous similar message |
| Comment by Lai Siyao [ 15/Jul/19 ] |
|
Can you run 'getconf PAGE_SIZE' on ppc64 client? |
| Comment by Gerrit Updater [ 15/Jul/19 ] |
|
Lai Siyao (lai.siyao@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35517 |
| Comment by Jian Yu [ 15/Jul/19 ] |
# uname -m ppc64 # getconf PAGE_SIZE 65536 |
| Comment by Lai Siyao [ 16/Jul/19 ] |
|
Mmm, the above patch should be able to fix this issue. |
| Comment by Gerrit Updater [ 15/Aug/19 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35517/ |
| Comment by Peter Jones [ 15/Aug/19 ] |
|
Landed for 2.13 |
| Comment by Gerrit Updater [ 18/Aug/19 ] |
|
Jian Yu (yujian@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/35812 |
| Comment by Gerrit Updater [ 04/Sep/19 ] |
|
Oleg Drokin (green@whamcloud.com) merged in patch https://review.whamcloud.com/35812/ |