[LU-9665] LNET not shutting down properly Created: 14/Jun/17  Updated: 15/Jun/17  Resolved: 15/Jun/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Bhanu Assignee: WC Triage
Resolution: Not a Bug Votes: 0
Labels: None
Environment:

CentOS 7.2


Attachments: JPEG File reboot_shutdown.jpg    
Epic/Theme: Lustre-2.8.0
Severity: 3
Epic: client
Rank (Obsolete): 9223372036854775807

 Description   

When ever a reboot/shutdown command is issued from OS
It's failing because LNET didn't shutdown properly



 Comments   
Comment by Amir Shehata (Inactive) [ 14/Jun/17 ]

It appears that OFED is being shutdown before LNet. This would cause LNet to get stuck. So in your shutdown procedure LNet should be shutdown before OFED.

Comment by Bhanu [ 14/Jun/17 ]

Is there a recommended init script to shutdown lnet ??

Comment by Amir Shehata (Inactive) [ 15/Jun/17 ]

I believe the order when shutting down should be to
1. umount the FS
2. lustre_rmmod
3. reboot/shutdown

There should be an lnet script

/usr/lib/systemd/system/lnet.service 

That script should do the above steps.

It'll be a matter of making sure that this script is executed (IE lnet service stopped) on shutdown before the OFED modules are unloaded.

Comment by Bhanu [ 15/Jun/17 ]

Thanks,
we updated the script and now it's shutting down properly
You can close this ticket

Comment by Peter Jones [ 15/Jun/17 ]

Great - thanks

Comment by Amir Shehata (Inactive) [ 15/Jun/17 ]

Bhanu, Can you please share how you updated the script.

Generated at Sat Feb 10 02:28:09 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.