[LU-8567] mdc_reint.c:57:mdc_reint()) error in handling -17 encountered on power8 node Created: 29/Aug/16 Updated: 30/Aug/16 Resolved: 30/Aug/16 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.9.0 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Major |
| Reporter: | James A Simmons | Assignee: | Zhenyu Xu |
| Resolution: | Not a Bug | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Power8 running RHEL7.2 with lustre version 2.8.56 |
||
| Attachments: |
|
||||||||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 9223372036854775807 | ||||||||
| Description |
|
During my latest testing after I updated my test file system to the latest as well I found when running a IOR single shared file on one client the job alway fails. Nothing shows up in the kernel logs but I have gathered lustre debug logs to show what the problem is. I have attached those logs here. |
| Comments |
| Comment by Peter Jones [ 30/Aug/16 ] |
|
Bobijam Could you please assist with this issue? Thanks Peter |
| Comment by Zhenyu Xu [ 30/Aug/16 ] |
|
What's the failed job output? Can you strace it? The log shows that mkdir of "jsimmons" under 0x200000410:0x4:0x0] failed, since it exists. 00000080:00000001:1.0:1472488456.442910:1344:93034:0:(namei.c:1235:ll_mkdir()) Process entered 00000080:00200000:1.0:1472488456.442910:1344:93034:0:(namei.c:1238:ll_mkdir()) VFS Op:name=jsimmons, dir=[0x200000410:0x4:0x0](c0000007eb866c90) 00000080:00000001:1.0:1472488456.442912:1696:93034:0:(namei.c:982:ll_new_node()) Process entered ... 00000080:00000001:1.0:1472488456.444020:1792:93034:0:(namei.c:1008:ll_new_node()) Process leaving via err_exit (rc=18446744073709551599 : -17 : 0xffffffffffffffef) ... 00000080:00000001:1.0:1472488456.444031:1456:93034:0:(namei.c:1249:ll_mkdir()) Process leaving (rc=18446744073709551599 : -17 : ffffffffffffffef) |
| Comment by James A Simmons [ 30/Aug/16 ] |
|
Sorry this doesn't appear to be the source of the bug. The true source is |