[LU-2467] ABILITY TO DISABLE PINGING Created: 11/Dec/12  Updated: 13/Feb/19  Resolved: 13/Feb/19

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: Lustre 2.4.0, Lustre 1.8.9

Type: New Feature Priority: Minor
Reporter: Jodi Levi (Inactive) Assignee: Li Wei (Inactive)
Resolution: Fixed Votes: 0
Labels: Fujitsu

Issue Links:
Related
is related to LUDOC-111 Ability to Disable Pinging Documentation Closed
is related to LU-2898 More timely notification of clients i... Open
is related to LU-6391 Option for client not to stop pinging... Resolved
Sub-Tasks:
Key
Summary
Type
Status
Assignee
LU-2497 Create and attach test plan to ticket... Technical task Resolved Li Wei  
Rank (Obsolete): 5813

 Comments   
Comment by Li Wei (Inactive) [ 14/Jan/13 ]

http://review.whamcloud.com/5229
http://review.whamcloud.com/5232
http://review.whamcloud.com/5231
http://review.whamcloud.com/5009

Comment by Hiroya Nozaki [ 11/Feb/13 ]

Hi, all. I'm a developer in Fujitsu. I'd like to let you know one thing ralated to the issue that has bothered our development team for a long time. It is that ... clients don't have the way to confirm whether or not it's already been evicted besides actually sending an rpc request. So this behavior let it often happen that an already-evicted client returns -EIO to a user application when it's first used ... even when it's evicted long time ago. I know this ploblem is inevitable as far as we use ldlm. But disable pinging makes this problem more serious issue, especially in a large environment like K.

Now I've tried to implement one function which let the server-side, which has targets, notifies clients eviction-event via MGS. And I want to cooperate with you, I mean, I want to open my prototype patch for 1.8.8 and examine it together.

Thank you.

Comment by Li Wei (Inactive) [ 16/Feb/13 ]

Hi Nozaki-san,

The case you described is real. Ideally, an installation shall be tuned to a point where no evictions happen during normal operations (i.e., no clients miss recovery windows). But I guess that might be hard to achieve on realistic and large installations. Administratively triggering pings via procfs file "ping" before each job starts may help reduce the number of evictions seen by applications. Nevertheless, the case can not be completely eliminated this way.

If you have a patch already, please post a link here; some design thoughts would also do. Let's see if we can get some discussions going.

Comment by Hiroya Nozaki [ 17/Feb/13 ]

> Administratively triggering pings via procfs file "ping" before each job starts may help reduce the number of evictions seen by applications.
Yes, you're right. Actually We've workarounded this problem with some other new features of FEFS, running some scripts and "lfs df" on evicted clients which is triggerd by Fujitsu system management software. But I believe that it's more convenient if a ping can be driven and sent by Lustre itself.

> please post a link here
I'll appriciate it! now I have only a patch for FEFS. So I'll convert its version for Lustre-2.3.x, Please wait for a while.

Comment by Li Wei (Inactive) [ 17/Feb/13 ]

Generally, only master accepts new features. Also, I would post a minimum version that shows the design first, before spending too much time polishing the details.

Comment by Hiroya Nozaki [ 17/Feb/13 ]

Does you mean that you're OK if I upload a patch for FEFS ?

Comment by Li Wei (Inactive) [ 17/Feb/13 ]

Attaching an existing FEFS patch here would be more economical than porting it to b2_3, which no longer accepts any changes. If it depends too much on other FEFS-specific changes, master is where it should be ported to. (I was just trying to minimise your effort before we are sure whether it is the right way to go.)

Comment by Hiroya Nozaki [ 17/Feb/13 ]

I appriciate your consideration!
But I'm afraid to say that, as you said, my patch depends os some FEFS-specific changes, so I think I'd better collect some codes, change them and apply them for, at least, 1.8.8. I believe that you'll find it easer when I do it than when I don't.

I'll upload the patch within today in JST, please wait for a while.
and thank you for your consideration again !!

Comment by Hiroya Nozaki [ 18/Feb/13 ]

Hi, Li.
I uploaded the patch and here it is.

http://review.whamcloud.com/#change,5457

Comment by Oleg Drokin [ 04/Mar/13 ]

I opened a separate ticket LU-2898 to track the issue of client eviction notification, so please use it for further discussion on this topic and let's concentrate on the disabled pinging support in this ticket.

Generated at Sat Feb 10 01:25:27 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.