[LU-16403] sanity test_300o: mdt_restripe_migrate() migrate [0x28000234d:0x3:0x0]/d375 failed: rc = -2 Created: 14/Dec/22  Updated: 02/Jun/23

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.15.2
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: WC Triage
Resolution: Unresolved Votes: 0
Labels: zfs

Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/7d131ae1-4440-4707-a856-456470ace699

test_300o failed with the following error:

Timeout occurred after 402 minutes, last suite running was sanity

Not sure if it is a dup of LU-15602 or MDS console

[Sun Dec 11 23:30:00 2022] Lustre: DEBUG MARKER: /usr/sbin/lctl mark == sanity test 300o: unlink big sub stripe\(\> 65000 subdirs\) ========================================================== 23:30:01 \(1670801401\)
[Sun Dec 11 23:30:00 2022] Lustre: DEBUG MARKER: == sanity test 300o: unlink big sub stripe(> 65000 subdirs) ========================================================== 23:30:01 (1670801401)
[Sun Dec 11 23:30:01 2022] Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug
[Sun Dec 11 23:30:02 2022] Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=0
[Sun Dec 11 23:35:54 2022] Lustre: DEBUG MARKER: /usr/sbin/lctl set_param debug="super ioctl neterror warning dlmtrace error emerg ha rpctrace vfstrace config console lfsck"
[Sun Dec 11 23:35:55 2022] Lustre: DEBUG MARKER: /usr/sbin/lctl get_param -n debug
[Sun Dec 11 23:35:56 2022] Lustre: DEBUG MARKER: /usr/sbin/lctl set_param -n debug=0
[Sun Dec 11 23:36:00 2022] LustreError: 537439:0:(mdt_restripe.c:711:mdt_restripe_migrate()) lustre-MDT0002: migrate [0x28000234d:0x3:0x0]/d375 failed: rc = -2

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
sanity test_300o - Timeout occurred after 402 minutes, last suite running was sanity



 Comments   
Comment by Andreas Dilger [ 20/Dec/22 ]

This has been failing intermittently over the past 6 months, but only for ZFS:
https://testing.whamcloud.com/search?horizon=15552000&status%5B%5D=TIMEOUT&test_set_script_id=f9516376-32bc-11e0-aaee-52540025f9ae&sub_test_script_id=6b93bb68-5f4b-11e5-b7c8-5254006e85c2&source=sub_tests#redirect

Generated at Sat Feb 10 03:26:43 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.