[LU-7035] conf-sanity test_32a failed with memory leak:(class_obd.c:633:cleanup_obdclass()) obd_memory max: *, leaked: * Created: 24/Aug/15  Updated: 12/Sep/16  Resolved: 12/Sep/16

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.8.0, Lustre 2.9.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: James Nunez (Inactive) Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None
Environment:

autotest review-dne-part-1


Issue Links:
Blocker
is blocked by LU-4828 conf-sanity test_32a: (class_obd.c:73... Closed
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

conf-sanity test_32a is failing with the error message:

'test_32a failed with 2' 

There are several error messages in the test_log, but the first comes from a memory leak reported when the modules are unloaded:

shadow-25vm8: LustreError: 1435:0:(class_obd.c:638:cleanup_obdclass()) obd_memory max: 184078833, leaked: 61680
shadow-25vm8: 
shadow-25vm8: Memory leaks detected
 conf-sanity test_32a: @@@@@@ FAIL: Reloading modules 
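
To pick the leak report out of a long test_log when triaging, a small script along these lines may help. This is only a sketch: the regular expression is modeled on the single LustreError line quoted above, the message format may differ between Lustre versions, and the names `LEAK_RE` and `find_leaks` are hypothetical helpers, not part of the Lustre test framework.

```python
import re

# Pattern modeled on the LustreError line quoted in this ticket.
# Assumption: other Lustre versions may format the message differently.
LEAK_RE = re.compile(
    r"^(?P<node>\S+): LustreError: .*cleanup_obdclass\(\)\) "
    r"obd_memory max: (?P<max>\d+), leaked: (?P<leaked>\d+)"
)


def find_leaks(lines):
    """Return (node, max, leaked) for every line reporting a nonzero leak."""
    leaks = []
    for line in lines:
        m = LEAK_RE.search(line)
        if m and int(m.group("leaked")) > 0:
            leaks.append(
                (m.group("node"), int(m.group("max")), int(m.group("leaked")))
            )
    return leaks


# Sample input copied from the log excerpt in this ticket.
sample = [
    "shadow-25vm8: LustreError: 1435:0:(class_obd.c:638:cleanup_obdclass()) "
    "obd_memory max: 184078833, leaked: 61680",
    "shadow-25vm8: Memory leaks detected",
]

for node, max_value, leaked in find_leaks(sample):
    print(f"{node}: leaked {leaked} (max {max_value})")
```

Run against the excerpt above, this reports one leak on shadow-25vm8; lines that do not match the pattern are ignored.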

After that, there are several other errors:

shadow-25vm8: open /proc/sys/lnet/dump_kernel failed: No such file or directory
shadow-25vm8: open(dump_kernel) failed: No such file or directory
CMD: shadow-25vm8 rm -rf /tmp/t32
CMD: shadow-25vm8 /usr/sbin/lctl list_nids
shadow-25vm8: opening /dev/lnet failed: No such device
shadow-25vm8: hint: the kernel modules may not be loaded
shadow-25vm8: IOC_LIBCFS_GET_NI error 19: No such device
...
shadow-25vm8: mount.lustre: mount /dev/loop1 at /tmp/t32/mnt/ost failed: Invalid argument
shadow-25vm8: This may have multiple causes.
shadow-25vm8: Are the mount options correct?
shadow-25vm8: Check the syslog for more info.
 conf-sanity test_32a: @@@@@@ FAIL: Mounting the OST 

Logs are at https://testing.hpdd.intel.com/test_sets/ef24fc34-4a66-11e5-bf45-5254006e85c2.

The same failure occurred in a full test session for 2.7.56 with logs at https://testing.hpdd.intel.com/test_sets/23cd8af0-2929-11e5-83b7-5254006e85c2.



 Comments   
Comment by James Nunez (Inactive) [ 17/Nov/15 ]

More failures on master:
2015-11-16 07:12:25 - https://testing.hpdd.intel.com/test_sets/21478bdc-8c5c-11e5-8d76-5254006e85c2
2015-12-02 06:34:08 - review-dne-part-1 - https://testing.hpdd.intel.com/test_sets/08d07642-98ee-11e5-802b-5254006e85c2
2015-12-02 10:56:55 - review-dne-part-1 - https://testing.hpdd.intel.com/test_sets/b348d45a-9913-11e5-aeec-5254006e85c2
2016-02-02 22:39:26 - https://testing.hpdd.intel.com/test_sets/58c84f88-ca28-11e5-910f-5254006e85c2

Comment by James Nunez (Inactive) [ 29/Apr/16 ]

It looks like this issue is causing other tests to fail. In this case, lnet-selftest fails with this memory leak error message:
2016-04-28 - https://testing.hpdd.intel.com/test_sets/7653da8a-0d09-11e6-855a-5254006e85c2

Comment by Andreas Dilger [ 12/Sep/16 ]

Close as a duplicate of LU-4828.
