[LU-699] replay-dual test_1 fails to remount mdt Created: 21/Sep/11 Updated: 03/Jun/16 Resolved: 03/Jun/16 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.1.0, Lustre 1.8.7 |
| Fix Version/s: | Lustre 2.2.0 |
| Type: | Bug | Priority: | Minor |
| Reporter: | Minh Diep | Assignee: | Jinshan Xiong (Inactive) |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Environment: |
Lustre Clients: Lustre Servers: |
||
| Issue Links: |
|
||||||||
| Severity: | 3 | ||||||||
| Rank (Obsolete): | 4884 | ||||||||
| Description |
|
v2_1_0_RC2 testing
Report: https://maloo.whamcloud.com/test_sets/396f9254-e440-11e0-9909-52540025f9af
== replay-dual test 1: |X| simple create == 06:21:32 (1316524892)
MDS console shows:
06:21:42:Lustre: server umount lustre-MDT0000 complete |
| Comments |
| Comment by Andreas Dilger [ 21/Sep/11 ] |
|
> 06:21:51:LDISKFS-fs warning (device dm-0): ldiskfs_fill_super: extents feature not enabled on this filesystem, use tune2fs.
FYI, this message is a complete red herring for the MDT (i.e. unrelated to this or any problem, in case there was any uncertainty), and we should remove it from our ldiskfs filesystems. I don't think that extents on the MDT can possibly help, and will likely hurt performance since directory blocks are allocated one-at-a-time and storing them as extents is less efficient. |
| Comment by Jian Yu [ 23/Sep/11 ] |
|
Lustre Clients: Lustre Servers: replay-dual test passed in manual run: https://maloo.whamcloud.com/test_sets/a60c9bd8-e5c5-11e0-9909-52540025f9af |
| Comment by nasf (Inactive) [ 18/Oct/11 ] |
|
Another failure on lustre-2.1: https://maloo.whamcloud.com/test_sets/3055c2c6-f6bd-11e0-a451-52540025f9af |
| Comment by Jinshan Xiong (Inactive) [ 19/Oct/11 ] |
|
I took a close look at this problem because it may block my IR landings. Obviously, the problem is due to corruption of lustre_disk_data. When we modify lustre_disk_data, we only wait for the log to be committed, NOT for the data to actually be written to disk. So in this test case, if the data has not been written to disk by the time we mark the device as read-only, we are in trouble because we lose the updated lustre_disk_data. I'm going to fix this problem by using O_SYNC to update lustre_disk_data if this is hit in the test. |
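For illustration only, the idea behind the proposed fix (open with O_SYNC so the write does not return until the data is on stable storage) can be sketched from user space. This is not the patch itself, which changes kernel code; the helper name `write_durably` and the file name used below are hypothetical.

```python
import os


def write_durably(path: str, payload: bytes) -> None:
    """Overwrite `path` and return only after the data has been written out.

    Hypothetical helper: it illustrates O_SYNC semantics and is not taken
    from the Lustre patch.
    """
    # O_SYNC makes each write() block until the data (and the metadata needed
    # to read it back) has been handed to the storage device, rather than
    # just sitting in the page cache.
    flags = os.O_WRONLY | os.O_CREAT | os.O_TRUNC | os.O_SYNC
    fd = os.open(path, flags, 0o644)
    try:
        view = memoryview(payload)
        while view:
            # os.write may write fewer bytes than requested, so loop.
            written = os.write(fd, view)
            view = view[written:]
    finally:
        os.close(fd)
```

A caller would use it as `write_durably("disk_data.bin", b"...")`. Without O_SYNC, the same write could still be sitting in memory when the device is switched to read-only, which is the loss scenario described above.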
| Comment by Jinshan Xiong (Inactive) [ 19/Oct/11 ] |
|
patch is at: http://review.whamcloud.com/1557 |
| Comment by Build Master (Inactive) [ 20/Oct/11 ] |
|
Integrated in Oleg Drokin : 886e67d4fe87d293952a11e7f41b98a8c3abeddd
|
| Comment by Build Master (Inactive) [ 21/Oct/11 ] |
|
Integrated in Oleg Drokin : 886e67d4fe87d293952a11e7f41b98a8c3abeddd
|
| Comment by Jay Lan (Inactive) [ 30/Apr/12 ] |
|
I seem to have hit the data corruption problem in REPLAY_DUAL tests 16 and 20. Why was this ticket not marked RESOLVED? |
| Comment by Jay Lan (Inactive) [ 01/May/12 ] |
|
I rebuilt the Lustre server with this patch. It still failed, but in a different way. |