[LU-638] conf-sanity test_55: @@@@@@ FAIL: client start failed Created: 25/Aug/11 Updated: 15/Dec/11 Resolved: 15/Dec/11 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Minor |
| Reporter: | James A Simmons | Assignee: | Minh Diep |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None | ||
| Severity: | 3 |
| Rank (Obsolete): | 4241 |
| Description |
|
ON client we get Writing CONFIGS/mountdata Client dmesgLustre: DEBUG MARKER: == conf-sanity test 56: check big indexes ============================================================ 09:59:58 (1314280798) MDS dmesg Lustre: DEBUG MARKER: == conf-sanity test 56: check big indexes ============================================================ 09:59:58 (1314280798) |
| Comments |
| Comment by James A Simmons [ 25/Aug/11 ] |
|
Sorry meant to label this as config-sanity test_56 failure |
| Comment by Andreas Dilger [ 27/Aug/11 ] |
|
This looks like you are trying to run with 2 MDTs in CMD mode? There shouldn't be an MDT0001 otherwise. |
| Comment by James A Simmons [ 29/Aug/11 ] |
|
Doesn't that require a mkfs.lustre parameter iam_dir. This what I'm formating the MDT with " --mgsnode=10.37.248.61@o2ib1 --mdt --fsname=lustre --param sys.timeout=20 --device-size=200000 --mountfsoptions=errors=remount-ro,user_xattr,acl --param lov.stripesize=1048576 --param lov.stripecount=0 --param mdt.identity_upcall=/usr/sbin/l_getidentity --mkfsoptions=\"-E lazy_itable_init\" |
| Comment by James A Simmons [ 29/Aug/11 ] |
|
After some tracking I discovered the problem was the mount option acl. Once I removed it from both the client mount string and the mds mount string the test past. I also tried conf-sanity test 55 and the same result. I'm looking to see what other test the mount option acl breaks. |
| Comment by Peter Jones [ 13/Dec/11 ] |
|
Minh What would your expectations be re using the mount option acl? Peter |
| Comment by Minh Diep [ 13/Dec/11 ] |
|
Hi James, Could you try the same (with and without acl) with 1 MDT? |
| Comment by James A Simmons [ 14/Dec/11 ] |
|
Okay I ran a bunch of test with different options. First the acl option doesn't cause the failure any more. It fails in either condition. Only MDT is being formated with Format mds1: /dev/md5 with --mdt --fsname=lustre --device-size=200000 --param sys.timeout=20 --mountfsoptions=errors=remount-ro,user_xattr,acl --param lov.st.... Now the error I get is... Lustre: DEBUG MARKER: == conf-sanity test 55: check lov_objid size ========================================================= 09:06:09 (1323871569) |
| Comment by James A Simmons [ 14/Dec/11 ] |
|
Yipes. The MGS is stopped but never restarted... |
| Comment by James A Simmons [ 15/Dec/11 ] |
|
Tracked down the problem. Its due to having separate MGS and MDS. This problem was reported in |
| Comment by Peter Jones [ 15/Dec/11 ] |
|
Duplicate of |