[LU-3987] LBUG ASSERTION( (!(rc < 0) || (lustre_msg_get_transno(req->rq_repmsg) == 0)) ) failed Created: 20/Sep/13 Updated: 14/Nov/13 Resolved: 14/Nov/13 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.4 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Critical |
| Reporter: | Mahmoud Hanafi | Assignee: | Bruno Faccini (Inactive) |
| Resolution: | Duplicate | Votes: | 0 |
| Labels: | None |
| Issue Links: |
|
| Severity: | 3 |
| Rank (Obsolete): | 10636 |
| Description |
|
This may be a dup of
<3>LustreError: 0:0:(ldlm_lockd.c:358:waiting_locks_callback()) ### lock callback timer expired after 151s: evicting client at 10.151.4.243@o2ib ns: mdt-ffff880c2fb68000 lock: ffff880bf1ba3480/0x8107ff778cb60d48 lrc: 3/0,0 mode: CW/CW res: 9011569394/588 bits 0x5 rrc: 512 type: IBT flags: 0x4000030 remote: 0x1cf563a7b3f9d855 expref: 40 pid: 70400 timeout: 4583890257 |
| Comments |
| Comment by Bruno Faccini (Inactive) [ 20/Sep/13 ] |
|
Hello Mahmoud, |
| Comment by Andreas Dilger [ 20/Sep/13 ] |
|
Looks like it is getting an error during replay, though that should never happen. If the error is -EREMOTE (which should only happen for DNE recovery) then this can be closed as a duplicate. |
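For reference, the condition in the failed assertion from the title can be read as "a reply with a negative return code must carry a transaction number of 0". The following is a minimal standalone sketch of that condition and of the -EREMOTE check discussed above. The helper names `reply_state_is_consistent` and `is_eremote` are hypothetical and are not Lustre functions; the `transno` argument stands in for the value returned by `lustre_msg_get_transno(req->rq_repmsg)`.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stdint.h>

/*
 * Hypothetical helper (not part of Lustre): evaluates the same boolean
 * expression as the assertion in the ticket title,
 *   !(rc < 0) || transno == 0
 * i.e. an error return is only acceptable when the reply transno is 0.
 */
static bool reply_state_is_consistent(int rc, uint64_t transno)
{
    return !(rc < 0) || transno == 0;
}

/*
 * Hypothetical helper: true when the return code is -EREMOTE, the case
 * that the comment above suggests checking for in a crash dump or debug log.
 */
static bool is_eremote(int rc)
{
    return rc == -EREMOTE;
}
```

Under this reading, an error return combined with a non-zero transno (for example `reply_state_is_consistent(-EREMOTE, 42)`) is the combination that makes the expression false and would trigger the LBUG.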
| Comment by Bruno Faccini (Inactive) [ 15/Oct/13 ] |
|
Hello Mahmoud, Also, as Andreas pointed out, a good way to confirm that it is related and likely to fix this would be to identify whether the error is -EREMOTE. Since you indicate that the problem is reproducible, is there any recent crash dump or Lustre debug log available? |
| Comment by Kit Westneat (Inactive) [ 30/Oct/13 ] |
|
Hi Bruno, we recently hit this bug at IU in 2.1.6. I have a core dump that I will have the customer upload to the FTP site. Did your patch for BTW, the kernel-debuginfo we have has a different name, but it is for the same kernel. |
| Comment by Mahmoud Hanafi [ 31/Oct/13 ] |
|
We applied patch#3 from |
| Comment by Bruno Faccini (Inactive) [ 31/Oct/13 ] |
|
Mahmoud, thanks for your feedback! Kit, the patch has not landed yet, but since at least the TGCC and now the NASA sites have successfully integrated it, I think it will land soon. |
| Comment by Peter Jones [ 14/Nov/13 ] |
|
duplicate of |