[LU-15020] OSP_DISCONNECT blocking MDT unmount - Whamcloud Community JIRA

Details

Type: Bug
Resolution: Fixed
Priority: Minor
Fix Version/s: Lustre 2.15.0
Affects Version/s: None
Labels:
None

Severity:
3
Rank (Obsolete):
9223372036854775807

Description

Send OSP_DISCONNECT only on health import. Otherwise, force local disconnect for unhealthy imports.

Attachments

Activity

[LU-15020] OSP_DISCONNECT blocking MDT unmount

Peter Jones made changes - 17/Jan/22 3:33 PM

Fix Version/s		New: Lustre 2.15.0 [ 14791 ]
Resolution		New: Fixed [ 1 ]
Status	Original: Open [ 1 ]	New: Resolved [ 5 ]

Peter Jones added a comment - 17/Jan/22 3:33 PM

Fix on master by https://review.whamcloud.com/#/c/44753/

Peter Jones added a comment - 17/Jan/22 3:33 PM Fix on master by https://review.whamcloud.com/#/c/44753/

Peter Jones made changes - 17/Sep/21 9:59 PM

Component/s	Original: Core Lustre [ 12687 ]
Key	Original: ~~EX-3687~~	New: ~~LU-15020~~
Workflow	Original: Software Simplified Workflow for Project EX [ 83010 ]	New: Sub-task Blocking [ 83398 ]
Project	Original: Exascaler [ 12911 ]	New: Lustre [ 10000 ]
Status	Original: To Do [ 10206 ]	New: Open [ 1 ]

Peter Jones made changes - 17/Sep/21 9:59 PM

Description

Original: This is based on observations during failover failback during testing and at customer sites. Often I would see MDT unmount hanging because a peer MDT was unavailable but the first was blocked waiting for a response to OSP_DISCONNECT.

John, 2:46 PM
I have asked about this but haven't gotten a clear answer. It seems bad to have MDT umount block on an RPC (from osp_disconnect()). What it the point of the MDT/OST_DISCONNECT RPC here and can it be eliminated entirely here?

Andreas, 2:53 PM
it definitely shouldn't block forever. I know clients can/should be able to unmount even if the server is down, but newer unmount binaries (for whatever reason) do a 'stat(mountpoint)' and hang there before the unmount even is called in the kernel

Andreas, 2:54 PM
there is https://review.whamcloud.com/43706 which may also fix this issue for regular unmounts. yes, it is horrific, but I didn't have a better suggestion. I guess we could replace the "/sbin/umount" binary to not do "stat"

John, 3:07 PM
it definitely shouldn't block forever. I know clients can/should be able to unmount even if the server is down, but newer unmount binaries (for whatever reason) do a 'stat(mountpoint)' and hang there before the unmount even is called in the kernel
But my question is "what is the point of osp_disconnect() sending a MDS/OST_DISCONNECT RPC?"

Andreas, 3:07 PM
so that the server can nicely clean up its resources instead of reporting the server being evicted?

Umount --force avoids waiting on the disconnect but causes client eviction issues described in EX-3429.

What resources are being released in server to server disconnect?
What happens if we don't send server to server disconnect?
What is the best way to avoid the hang described above?

New: Send OSP_DISCONNECT only on health import. Otherwise, force local disconnect for unhealthy imports.

Gerrit Updater added a comment - 17/Sep/21 2:10 PM

"Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/44753/
Subject: ~~EX-3687~~ osp: do force disconnect if import is not ready
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 8203c0f7a043aad9d087018119e278e4279ca8bc

Gerrit Updater added a comment - 17/Sep/21 2:10 PM "Oleg Drokin <green@whamcloud.com>" merged in patch https://review.whamcloud.com/44753/ Subject: EX-3687 osp: do force disconnect if import is not ready Project: fs/lustre-release Branch: master Current Patch Set: Commit: 8203c0f7a043aad9d087018119e278e4279ca8bc

John Hammond made changes - 14/Sep/21 12:07 PM

Link

New: This issue is related to EX-3792 [ EX-3792 ]

Gerrit Updater added a comment - 26/Aug/21 7:38 AM

"Mike Pershin <mpershin@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/44753
Subject: ~~EX-3687~~ osp: do force disconnect if import is not ready
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 1a5d067b340e0b62f5577a20779401427ca0adca

Gerrit Updater added a comment - 26/Aug/21 7:38 AM "Mike Pershin <mpershin@whamcloud.com>" uploaded a new patch: https://review.whamcloud.com/44753 Subject: EX-3687 osp: do force disconnect if import is not ready Project: fs/lustre-release Branch: master Current Patch Set: 1 Commit: 1a5d067b340e0b62f5577a20779401427ca0adca

Mikhail Pershin added a comment - 26/Aug/21 7:36 AM - edited

adilger, I am not sure about this hang while MDT unmounting is related to stat() call you've mentioned. That problem and related patch are for client unmount and client RPC, but here we have local mountpoint unmount on server, I doubt it causes inter MDT stat, though there can be some other RPC.

As for osp_disconnect() thing, simplest thing would be just to call ptlrpc_disconnect_import() with obd_force set if import is in recovery, so there will no waiting for import to recover and no disconnect RPC, if import is healthy then disconnect will be send, so other MDT could clean related resources.

Another my question is about whole situation as per description, it states that server hangs waiting for response to DISCONNECT RPC, at the same time this RPC is sent always with rq_no_resent flag, so it should fail after timeout but not hang forever. So was that hang observed by customers are just long in time or it never ends really?

Mikhail Pershin added a comment - 26/Aug/21 7:36 AM - edited adilger , I am not sure about this hang while MDT unmounting is related to stat() call you've mentioned. That problem and related patch are for client unmount and client RPC, but here we have local mountpoint unmount on server, I doubt it causes inter MDT stat, though there can be some other RPC. As for osp_disconnect() thing, simplest thing would be just to call ptlrpc_disconnect_import() with obd_force set if import is in recovery, so there will no waiting for import to recover and no disconnect RPC, if import is healthy then disconnect will be send, so other MDT could clean related resources. Another my question is about whole situation as per description, it states that server hangs waiting for response to DISCONNECT RPC, at the same time this RPC is sent always with rq_no_resent flag, so it should fail after timeout but not hang forever. So was that hang observed by customers are just long in time or it never ends really?

Minh Diep made changes - 24/Aug/21 5:10 PM

Component/s

New: Core Lustre [ 12687 ]

Mikhail Pershin added a comment - 23/Aug/21 6:18 PM - edited

While I am checking how to make server disconnect gracefully, possible way to go with --force umount is to set device read-only before that, in that case clients will be preserved on server I think.

Mikhail Pershin added a comment - 23/Aug/21 6:18 PM - edited While I am checking how to make server disconnect gracefully, possible way to go with --force umount is to set device read-only before that, in that case clients will be preserved on server I think.

People

Assignee:: Mikhail Pershin

Reporter:: John Hammond

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Dates

Created:: 20/Aug/21 3:55 PM

Updated:: 17/Jan/22 3:33 PM

Resolved:: 17/Jan/22 3:33 PM