Details
-
Bug
-
Resolution: Duplicate
-
Blocker
-
None
-
Lustre 2.7.0, Lustre 2.5.4
-
None
-
Lustre Build: https://build.hpdd.intel.com/job/lustre-b2_5/100/
Distro/Arch: RHEL6.5/x86_64
-
3
-
16486
Description
replay-dual test 11 failed as follows:
rm: cannot remove `/mnt/lustre/f11.replay-dual-[1-5]': No such file or directory replay-dual test_11: @@@@@@ FAIL: test_11 failed with 1
Dmesg on client node:
Lustre: 15249:0:(client.c:2752:ptlrpc_replay_interpret()) @@@ Version mismatch during replay req@ffff88006a414400 x1484130260889212/t515396075526(515396075526) o36->lustre-MDT0000-mdc-ffff88006aa09400@10.1.4.66@tcp:12/10 lens 520/416 e 1 to 0 dl 1415377944 ref 2 fl Interpret:R/4/0 rc -75/-75 LustreError: 15249:0:(client.c:2740:ptlrpc_replay_interpret()) request replay timed out, restarting recovery LustreError: 167-0: lustre-MDT0000-mdc-ffff880037fb4400: This client was evicted by lustre-MDT0000; in progress operations using this service will fail. LustreError: 1678:0:(mdc_locks.c:918:mdc_enqueue()) ldlm_cli_enqueue: -5 Lustre: lustre-MDT0000-mdc-ffff880037fb4400: Connection restored to lustre-MDT0000 (at 10.1.4.66@tcp) LustreError: 1678:0:(dir.c:378:ll_get_dir_page()) lock enqueue: [0x200000007:0x1:0x0] at 0: rc -5 LustreError: 1678:0:(dir.c:584:ll_dir_read()) error reading dir [0x200000007:0x1:0x0] at 0: rc -5
Dmesg on MDS node:
Lustre: lustre-MDT0000: recovery is timed out, evict stale exports Lustre: lustre-MDT0000: disconnecting 1 stale clients Lustre: 18536:0:(ldlm_lib.c:2092:target_recovery_thread()) too long recovery - read logs LustreError: dumping log to /tmp/lustre-log.1415377850.18536
Maloo report: https://testing.hpdd.intel.com/test_sets/a6c1b3de-68c5-11e4-a63a-5254006e85c2
Attachments
Issue Links
- is related to
-
LU-5079 conf-sanity test_47 timeout
- Resolved