[LU-8200] sanityn test_33c: FAIL: Sync-Lock-Cancel not triggered Created: 24/May/16  Updated: 08/Oct/19  Resolved: 02/May/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0, Lustre 2.12.0, Lustre 2.10.4
Fix Version/s: Lustre 2.12.0, Lustre 2.10.6

Type: Bug Priority: Minor
Reporter: Jian Yu Assignee: Lai Siyao
Resolution: Fixed Votes: 0
Labels: DNE, zfs
Environment:

DNE


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

sanityn test 33c failed as follows:

== sanityn test 33c: Cancel cross-MDT lock should trigger Sync-Lock-Cancel == 13:45:22 (1461159922)
CMD: trevis-18vm4 lctl set_param -n mdt.*.sync_count=0
CMD: trevis-18vm4 lctl get_param -n mdt.*MDT0000.sync_count
 sanityn test_33c: @@@@@@ FAIL: Sync-Lock-Cancel not triggered 

https://testing.hpdd.intel.com/test_sets/68b87826-0729-11e6-9e5d-5254006e85c2
https://testing.hpdd.intel.com/test_sets/59057162-f7a8-11e5-a964-5254006e85c2



 Comments   
Comment by Minh Diep [ 26/Jan/18 ]

+1 on b2_10:
https://testing.hpdd.intel.com/test_sets/0252b3d4-0219-11e8-bd00-52540065bddc

Comment by Gerrit Updater [ 15/Mar/18 ]

Lai Siyao (lai.siyao@intel.com) uploaded a new patch: https://review.whamcloud.com/31655
Subject: LU-8200 test: improve sanityn.sh 33c
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 84d03889eb2a663d927936b443b84445cfdce35f

Comment by Lai Siyao [ 16/Mar/18 ]
00000004:00010000:1.0:1520923781.248558:0:27183:0:(mdt_handler.c:3061:mdt_save_lock()) ### save lock request ffff880057f29500 reply state ffff88004cc65400 transno 21474854652
 ns: mdt-lustre-MDT0001_UUID lock: ffff880056833440/0xe4e33dfd44c7d30a lrc: 3/0,1 mode: PW/PW res: [0x240002344:0x177d:0x0].0x0 bits 0x2/0x0 rrc: 2 type: IBT flags: 0x40210000000000 nid: local remote: 0x0 expref: -99 pid: 27183 timeout: 0 lvb_type: 0
00000004:00080000:1.0:1520923781.248569:0:27183:0:(osp_object.c:1304:osp_invalidate()) Invalidate osp_object [0x200002b14:0x53:0x0]
00010000:00010000:0.0:1520923781.358449:0:23490:0:(ldlm_request.c:1054:ldlm_cli_cancel_local()) ### client-side cancel ns: lustre-MDT0000-osp-MDT0001 lock: ffff88005906d8c0/0xe4e33dfd44c7d303 lrc: 3/0,0 mode: EX/EX res: [0x200002b14:0x53:0x0].0x0 bits 0x2/0x0 rrc: 2 type: IBT flags: 0x1008401000000 nid: local remote: 0xa40766a0045d15ef expref: -99 pid: 27183 timeout: 0 lvb_type: 0
00000004:00010000:0.0:1520923781.358460:0:23490:0:(mdt_handler.c:2692:mdt_remote_blocking_ast()) ### Revoke remote lock
 ns: lustre-MDT0000-osp-MDT0001 lock: ffff88005906d8c0/0xe4e33dfd44c7d303 lrc: 3/0,0 mode: EX/EX res: [0x200002b14:0x53:0x0].0x0 bits 0x2/0x0 rrc: 2 type: IBT flags: 0x1009401000000 nid: local remote: 0xa40766a0045d15ef expref: -99 pid: 27183 timeout: 0 lvb_type: 0

The log shows transaction was committed before unlock, so the remote lock is not saved, but put right away. That's why Sync-Lock-Cancel is not triggered.

Comment by Gerrit Updater [ 16/Mar/18 ]

Lai Siyao (lai.siyao@intel.com) uploaded a new patch: https://review.whamcloud.com/31673
Subject: LU-8200 test: improve sanityn.sh 33c
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: fa55e42097860d5f4e4f54fe3f9f112ce76234a2

Comment by Sarah Liu [ 22/Mar/18 ]

+1 on master https://testing.hpdd.intel.com/test_sets/16cdb996-2dbe-11e8-b3c6-52540065bddc

Comment by Gerrit Updater [ 02/May/18 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/31673/
Subject: LU-8200 test: improve sanityn.sh 33c
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 04c4b9c2a121e76881f8b387a4721069418c92f0

Comment by Peter Jones [ 02/May/18 ]

Landed for 2.12

Comment by Gerrit Updater [ 23/Aug/18 ]

Jian Yu (yujian@whamcloud.com) uploaded a new patch: https://review.whamcloud.com/33057
Subject: LU-8200 test: improve sanityn.sh 33c
Project: fs/lustre-release
Branch: b2_10
Current Patch Set: 1
Commit: 987f497ff0b593d550939c7f2d19dc0f0b3e750f

Comment by Jian Yu [ 23/Aug/18 ]

The same failure also occurred on Lustre b2_10 branch:
https://testing.whamcloud.com/test_sets/16e06700-a67a-11e8-8853-52540065bddc
https://testing.whamcloud.com/test_sets/2f409660-a5b9-11e8-8853-52540065bddc

Comment by Gerrit Updater [ 11/Sep/18 ]

John L. Hammond (jhammond@whamcloud.com) merged in patch https://review.whamcloud.com/33057/
Subject: LU-8200 test: improve sanityn.sh 33c
Project: fs/lustre-release
Branch: b2_10
Current Patch Set:
Commit: 1d570d64523cddb89683ea37acf2bab06ffc31be

Comment by Artem Blagodarenko (Inactive) [ 05/Dec/18 ]

Looks like the same failĀ https://testing.whamcloud.com/test_sets/6c09aaee-f804-11e8-bfe1-52540065bddc

Comment by Bruno Faccini (Inactive) [ 28/Mar/19 ]

Also looks like a +1 on recent master at https://testing.whamcloud.com/test_sets/f196a674-50ee-11e9-8e92-52540065bddc

Comment by Andreas Dilger [ 13/May/19 ]

+1 on master https://testing.whamcloud.com/test_sets/61ccb1fe-7390-11e9-a6f2-52540065bddc

Comment by Emoly Liu [ 08/Oct/19 ]

more on b2_12:
https://testing.whamcloud.com/test_sets/87abf4e2-e26d-11e9-a0ba-52540065bddc
https://testing.whamcloud.com/test_sets/a72e6de6-e267-11e9-b62b-52540065bddc

Generated at Sat Feb 10 02:15:30 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.