[LU-489] Hyperion-mds1 - swraid crash in mkfs.lustre Created: 05/Jul/11 Updated: 01/Jul/15 Resolved: 01/Jul/15 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 1.8.6 |
| Fix Version/s: | None |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Cliff White (Inactive) | Assignee: | Yang Sheng |
| Resolution: | Cannot Reproduce | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Hyperion chaos distribute Linux version 2.6.18-238.12.1.el5_lustre.g266a955 (jenkins@rhel5-64-build.lab.whamcloud.com) (gcc version 4.1.2 20080704 (Red Hat 4.1.2-50)) #1 SMP Fri Jun 10 16:39:27 PDT 2011 |
||
| Severity: | 4 |
| Rank (Obsolete): | 10607 |
| Description |
|
Ran command: Result: 2011-07-05 16:25:34 hyperion-mds1 login: ----------- [cut here ] --------- [please bite here ] --------- Has occurred now 6 times, easy to reproduce. |
| Comments |
| Comment by Cliff White (Inactive) [ 05/Jul/11 ] |
|
Also worth noting - the MDS is the only node using the mptbase and mptsas drivers. - The OSSs are HW (DDN) and |
| Comment by Cliff White (Inactive) [ 05/Jul/11 ] |
|
I built a new image, based on chaos 4.4-2 - Installed the same RPMS, had the same crash. I repeated the test with the image from last week, |
| Comment by Cliff White (Inactive) [ 05/Jul/11 ] |
|
sorry, pasted wrong version - the non-crashing kernel is vmlinuz-2.6.18-238.12.1.el5_lustre.g529529a |
| Comment by Peter Jones [ 06/Jul/11 ] |
|
Yang Sheng Do you see anything in the raid patches in our patch series for the latest rhel kernel that might explain this? Thanks Peter |
| Comment by Johann Lombardi (Inactive) [ 07/Jul/11 ] |
|
All those kernels should be the same. The version string has changed just because i enabled/disabled slab debugging. |
| Comment by Yang Sheng [ 07/Jul/11 ] |
|
I cannot make sure our patches whether cause this kind of issue. But i think we can test without our raid patches to ensure they aren't crash the kernel. |
| Comment by Cliff White (Inactive) [ 07/Jul/11 ] |
|
I don't know what I would test with a stock kernel - the issue is a failure triggered buy running mkfs.lustre, and I cannot do this with a stock kernel. mkfs -t ext3 and mkfs -t ext4 have been tested on all these kernels, and do not fail. Please explain what tests you wish run with a stock kernel, and I'll see about finding the bits. |
| Comment by Johann Lombardi (Inactive) [ 07/Jul/11 ] |
|
Have you tried with a simple dd? In any casse, mkfs.lustre does not require to load the kernel module, so you should be able to run it on an unpatched kernel. |
| Comment by Yang Sheng [ 01/Jul/15 ] |
|
Can we close this one? Looks like it just hit on rhel5 kernel. |
| Comment by Cliff White (Inactive) [ 01/Jul/15 ] |
|
Might as well close, we haven't hit it again |