[LU-14458] racer ZFS test_1: crash in dbuf_free_range() with directory migration Created: 19/Feb/21  Updated: 16/Jul/21

Status: Open
Project: Lustre
Component/s: None
Affects Version/s: None
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Maloo
Assignee: WC Triage
Resolution: Unresolved
Votes: 0
Labels: ZFS

Severity: 3

Description

This issue was created by maloo for Andreas Dilger <adilger@whamcloud.com>

This issue relates to the following test suite run: https://testing.whamcloud.com/test_sets/c4b236de-1eec-4ec4-8339-e44679537d2d

test_1 crashed after running 300s with the following error:

[  646.115634] BUG: unable to handle kernel paging request at ffffffffc09292d3
[  646.117145] IP: [<ffffffff9e584025>] mutex_lock+0x15/0x2f
[  646.118208] PGD 70214067 PUD 70216067 PMD 79e77067 PTE 7a469061
[  646.119405] Oops: 0003 [#1] SMP 
[  646.138547] CPU: 0 PID: 25472 Comm: mdt00_012 Kdump: loaded 3.10.0-1127.19.1.el7_lustre.x86_64 #1
[  646.141421] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2011
[  646.157093] Call Trace:
[  646.157807]  [<ffffffffc09298e5>] dbuf_free_range+0xa5/0x5b0 [zfs]
[  646.162689]  [<ffffffffc0929e0b>] dbuf_rm_spill+0x1b/0x20 [zfs]
[  646.164411]  [<ffffffffc09346dc>] dmu_rm_spill+0x4c/0xc0 [zfs]
[  646.167577]  [<ffffffffc099e2e2>] sa_build_layouts+0x572/0x950 [zfs]
[  646.169126]  [<ffffffffc099e997>] sa_modify_attrs+0x2d7/0x430 [zfs]
[  646.171051]  [<ffffffffc099edfe>] sa_attr_op+0x30e/0x460 [zfs]
[  646.172856]  [<ffffffffc09a03a9>] sa_bulk_update_impl+0x69/0x130 [zfs]
[  646.174946]  [<ffffffffc09a0525>] sa_update+0xb5/0x100 [zfs]
[  646.176168]  [<ffffffffc1393e52>] __osd_sa_xattr_update+0x122/0x230 [osd_zfs]
[  646.177490]  [<ffffffffc138740f>] osd_object_sa_dirty_rele+0xbf/0x110 [osd_zfs]
[  646.178844]  [<ffffffffc137e8ab>] osd_trans_stop+0x3db/0x5f0 [osd_zfs]
[  646.180397]  [<ffffffffc11a8466>] dt_trans_stop+0x16/0x30 [ptlrpc]
[  646.181571]  [<ffffffffc11aaccd>] top_trans_stop+0x38d/0xbf0 [ptlrpc]
[  646.182807]  [<ffffffffc15cfafc>] lod_trans_stop+0x25c/0x340 [lod]
[  646.185294]  [<ffffffffc168f61e>] mdd_trans_stop+0x2e/0x174 [mdd]
[  646.186424]  [<ffffffffc167385a>] mdd_migrate_object+0x8ba/0x1860 [mdd]
[  646.189168]  [<ffffffffc1674b86>] mdd_migrate+0x386/0x800 [mdd]
[  646.190347]  [<ffffffffc15275b3>] mdo_migrate+0x4f/0x51 [mdt]
[  646.191426]  [<ffffffffc14eab26>] mdt_reint_migrate+0xe96/0xfb0 [mdt]
[  646.192605]  [<ffffffffc14eacc3>] mdt_reint_rec+0x83/0x210 [mdt]
[  646.193729]  [<ffffffffc14c2a30>] mdt_reint_internal+0x720/0xaf0 [mdt]
[  646.195021]  [<ffffffffc14ce5c7>] mdt_reint+0x67/0x140 [mdt]
[  646.196417]  [<ffffffffc119a6fa>] tgt_request_handle+0x7ea/0x1750 [ptlrpc]
[  646.198963]  [<ffffffffc113a1a6>] ptlrpc_server_handle_request+0x256/0xb10 [ptlrpc]
[  646.200388]  [<ffffffffc113ecfc>] ptlrpc_main+0xb3c/0x14e0 [ptlrpc]
[  646.204074]  [<ffffffff9dec6691>] kthread+0xd1/0xe0
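As a reading aid for the trace above, here is a minimal Python sketch (not part of the original report) that splits each frame into function and module, then collapses consecutive frames into a module-level call path. The regex, the helper names `parse_frames` and `module_path`, and the `kernel` label for frames without a module tag are all illustrative assumptions; the sample is a subset of the frames quoted above.

```python
import re

# Matches one call-trace frame, e.g.
# "[  646.157807]  [<ffffffffc09298e5>] dbuf_free_range+0xa5/0x5b0 [zfs]"
# The trailing "[module]" tag is optional (core kernel symbols have none).
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(?P<func>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(?P<module>\w+)\])?"
)


def parse_frames(trace):
    """Return (function, module) pairs in trace order (innermost frame first)."""
    frames = []
    for line in trace.splitlines():
        m = FRAME_RE.search(line)
        if m:
            # "kernel" is a label chosen here for frames with no module tag.
            frames.append((m.group("func"), m.group("module") or "kernel"))
    return frames


def module_path(frames):
    """Collapse consecutive same-module frames, outermost caller first."""
    path = []
    for _, module in reversed(frames):
        if not path or path[-1] != module:
            path.append(module)
    return path


# Subset of the frames from the trace in this ticket.
SAMPLE = """\
[  646.157807]  [<ffffffffc09298e5>] dbuf_free_range+0xa5/0x5b0 [zfs]
[  646.176168]  [<ffffffffc1393e52>] __osd_sa_xattr_update+0x122/0x230 [osd_zfs]
[  646.186424]  [<ffffffffc167385a>] mdd_migrate_object+0x8ba/0x1860 [mdd]
[  646.191426]  [<ffffffffc14eab26>] mdt_reint_migrate+0xe96/0xfb0 [mdt]
[  646.204074]  [<ffffffff9dec6691>] kthread+0xd1/0xe0
"""

frames = parse_frames(SAMPLE)
print(" -> ".join(module_path(frames)))
# prints: kernel -> mdt -> mdd -> osd_zfs -> zfs
```

Run against the full trace, the same grouping would give kernel -> ptlrpc -> mdt -> mdd -> lod -> ptlrpc -> osd_zfs -> zfs: the migrate request enters through mdt, and the crash occurs while stopping the transaction, in the ZFS SA update path reached from osd_zfs.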

VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
racer test_1 - onyx-39vm5 crashed during racer test_1



Comments
Comment by Andreas Dilger [ 22/Feb/21 ]

+1: hit again on master after 3 minutes with the directory migration patch:
https://testing.whamcloud.com/test_sets/94f65ed4-e618-4703-bee7-485056a236e5

Comment by Alex Zhuravlev [ 16/Jul/21 ]

The change at https://review.whamcloud.com/#/c/43233/ helps on my setup.
