[LU-9958] Create striped directory fail in 2.10(with LU-9500 patch) Created: 08/Sep/17 Updated: 14/Oct/17 Resolved: 05/Oct/17 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | sebg-crd-pm (Inactive) | Assignee: | Lai Siyao |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Lustre 2.10 (lustre-release-58fd06e) + |
||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||
| Description |
|
Hi, //two mdts must in different servers |
| Comments |
| Comment by Brad Hoagland (Inactive) [ 08/Sep/17 ] |
|
Hello, Please attach the entire log for us to review. Thanks, Brad |
| Comment by sebg-crd-pm (Inactive) [ 11/Sep/17 ] |
|
FYI lfs mkdir -c 2 /mnt/client/dir3 see attached file mdt1.log |
| Comment by sebg-crd-pm (Inactive) [ 13/Sep/17 ] |
|
Hi Brad, Do you have any update after reviewing logs? Thanks! |
| Comment by Peter Jones [ 14/Sep/17 ] |
|
Lai Can you please advise on this one? Thanks Peter |
| Comment by Lai Siyao [ 15/Sep/17 ] |
|
in mdt1.log: 00010000:00000001:6.0:1505117481.059504:0:5182:0:(ldlm_lib.c:3268:target_bulk_io()) Process leaving (rc=18446744073709551506 : -110 : ffffffffffffff92) 00000020:00000001:6.0:1505117481.059508:0:5182:0:(out_handler.c:982:out_handle()) Process leaving via out_free (rc=18446744073709547410 : -4206 : 0xffffffffffffef92) which caused mdt0: 00000004:00000001:9.0:1505117529.647439:0:3485:0:(osp_trans.c:1204:osp_send_update_req()) Process leaving (rc=18446744073709551506 : -110 : ffffffffffffff92) ... 00000020:00000001:1.0:1505117529.647589:0:3442:0:(update_trans.c:1091:top_trans_stop()) Process leaving (rc=18446744073709551611 : -5 : fffffffffffffffb) ... 00000004:00000001:1.0:1505117529.647779:0:3442:0:(mdt_reint.c:526:mdt_create()) Process leaving via put_child (rc=18446744073709551611 : -5 : 0xfffffffffffffffb) It looks to be network IO on mdt1 timed out, could you verify the network on mdt1 is working correctly? |
| Comment by sebg-crd-pm (Inactive) [ 19/Sep/17 ] |
|
I test this bug again in 2.10.1-RC1(no add any patch). It looks to be network IO on mdt1 timed out, could you verify the network on mdt1 is working correctly? [/var/log/message in mdt0 server] [/var/log/messages in mdt1 server] |
| Comment by sebg-crd-pm (Inactive) [ 19/Sep/17 ] |
|
It looks to be network IO on mdt1 timed out, could you verify the network on mdt1 is working correctly? |
| Comment by sebg-crd-pm (Inactive) [ 28/Sep/17 ] |
|
Hi Lai, Do you need more detail log? or you have already reproduce it in your site. Thanks. |
| Comment by Lai Siyao [ 28/Sep/17 ] |
|
can you test 'lfs mkdir -i 1 dir1' to create a remote directory? |
| Comment by sebg-crd-pm (Inactive) [ 02/Oct/17 ] |
|
create a remote directory =>fail [root@robin client]# lfs mkdir -i 0 dir0 |
| Comment by sebg-crd-pm (Inactive) [ 03/Oct/17 ] |
|
I have also test create striped directory successed when the two mdts in the same server.(transfer message by loopback device) Any update ? Thanks. |
| Comment by sebg-crd-pm (Inactive) [ 05/Oct/17 ] |
|
This bug can not be reproduced in release 2.10.1 |
| Comment by Peter Jones [ 05/Oct/17 ] |
|
Good news - thanks |