[LU-526] sanityn test_40d failed for non PDO block issues Created: 21/Jul/11  Updated: 29/Apr/14  Resolved: 29/Apr/14

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: nasf (Inactive) Assignee: WC Triage
Resolution: Cannot Reproduce Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 7214

 Description   

https://maloo.whamcloud.com/test_sets/aa0c4c3c-b380-11e0-b33f-52540025f9af
== sanityn test 40d: pdirops: unlink and others ====================================================== 21:45:18 (1311223518)
fail_loc=0x80000145
No conflict
No conflict
No conflict
No conflict
Conflict
sanityn test_40d: @@@@@@ FAIL: getattr is blocked

In fact, it is the mkdir operation in sanityn test_40d is too slow, then caused the succedent getattr pdo conflict check failed. According to MDS side log, there is 3~4 sec idle time when MDS process such mkdir operation. We do not know what happened during such time.

=========================
00010000:00010000:0.0:1311223539.671871:0:3645:0:(ldlm_request.c:432:ldlm_cli_enqueue_local()) ### client-side local enqueue handler, new lock created ns: mdt-ffff81002e495000 lock: ffff81002712cd80/0x3e643e5ca108c3e6 lrc: 3/0,1 mode: PW/PW res: 2614401/391244433 bits 0x2 rrc: 1 type: IBT flags: 0x4004000 remote: 0x0 expref: -99 pid: 3645 timeout: 000000100:00100000:0.0:1311223540.829552:0:2796:0:(client.c:1392:ptlrpc_send_new_req()) Sending RPC pname:cluuid:pid:xid:nid:opc ptlrpcd:59c37739-b303-55e4-3143-54ec71d874a1:2796:1374939926001443:0@lo:400
00000100:00100000:0.0:1311223540.829602:0:2796:0:(events.c:286:request_in_callback()) peer: 12345-0@lo
00000100:00080000:0.0:1311223540.829627:0:3636:0:(service.c:772:ptlrpc_update_export_timer()) updating export 59c37739-b303-55e4-3143-54ec71d874a1 at 1311223540 exp ffff81004e707000
00000100:00100000:0.0:1311223540.829642:0:3636:0:(service.c:1705:ptlrpc_server_handle_request()) Handling RPC pname:cluuid+ref:pid:xid:nid:opc ll_mgs_02:59c37739-b303-55e4-3143-54ec71d874a1+7:2796:x1374939926001443:12345-0@lo:400
00000100:00100000:0.0:1311223540.829664:0:3636:0:(service.c:1752:ptlrpc_server_handle_request()) Handled RPC pname:cluuid+ref:pid:xid:nid:opc ll_mgs_02:59c37739-b303-55e4-3143-54ec71d874a1+6:2796:x1374939926001443:12345-0@lo:400 Request procesed in 30us (63us total) trans 0 rc 0/0
00000100:00100000:0.0:1311223540.829681:0:2796:0:(client.c:1726:ptlrpc_check_set()) Completed RPC pname:cluuid:pid:xid:nid:opc ptlrpcd:59c37739-b303-55e4-3143-54ec71d874a1:2796:1374939926001443:0@lo:400
00000100:00100000:0.0:1311223542.895675:0:2787:0:(events.c:286:request_in_callback()) peer: 12345-10.10.4.107@tcp
00000100:00080000:0.0:1311223542.895695:0:3636:0:(service.c:772:ptlrpc_update_export_timer()) updating export 2e47f11b-f9e6-151e-4510-1719d4dd1b5b at 1311223542 exp ffff81004e4a3400
00000100:00100000:0.0:1311223542.895712:0:3636:0:(service.c:1705:ptlrpc_server_handle_request()) Handling RPC pname:cluuid+ref:pid:xid:nid:opc ll_mgs_02:2e47f11b-f9e6-151e-4510-1719d4dd1b5b+13:4571:x1374939930211101:12345-10.10.4.107@tcp:400
00000100:00100000:0.0:1311223542.895737:0:3636:0:(service.c:1752:ptlrpc_server_handle_request()) Handled RPC pname:cluuid+ref:pid:xid:nid:opc ll_mgs_02:2e47f11b-f9e6-151e-4510-1719d4dd1b5b+12:4571:x1374939930211101:12345-10.10.4.107@tcp:400 Request procesed in 33us (64us total) trans 0 rc 0/0
00000004:00200000:0.0:1311223543.313443:0:3645:0:(mdt_handler.c:454:mdt_pack_attr2body()) [0x200001b71:0x273c:0x0]: returning size 4096
=========================



 Comments   
Comment by Sarah Liu [ 02/May/12 ]

Hit the same issue on 2.1.2 testing: https://maloo.whamcloud.com/test_sets/1f338df0-929b-11e1-9e8b-525400d2bfa6

Comment by Jodi Levi (Inactive) [ 29/Apr/14 ]

After discussion with Fan Yong, this has not been reproduced in over 2 years so we are closing ticket.

Generated at Sat Feb 10 01:07:54 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.