[LU-998] Test failure on test suite sanity, subtest test_103 Created: 16/Jan/12 Updated: 19/Mar/12 Resolved: 02/Feb/12 |
|
| Status: | Closed |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | Lustre 2.2.0 |
| Fix Version/s: | Lustre 2.2.0 |
| Type: | Bug | Priority: | Blocker |
| Reporter: | Maloo | Assignee: | Lai Siyao |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None | ||
| Severity: | 3 |
| Rank (Obsolete): | 4187 |
| Description |
|
This issue was created by maloo for sarah <sarah@whamcloud.com> This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/9a6e6584-3f4f-11e1-990e-5254004bbbd3. The sub-test test_103 failed with the following error:
Info required for matching: sanity 103 |
| Comments |
| Comment by Peter Jones [ 16/Jan/12 ] |
|
Lai Could you please look into this one? Thanks Peter |
| Comment by Lai Siyao [ 16/Jan/12 ] |
|
MDS hit an ASSERT: 05:20:56:LustreError: 5199:0:(osd_handler.c:2416:osd_xattr_set()) ASSERTION((oh)->ot_declare_xattr_set > 0) failed |
| Comment by Lai Siyao [ 17/Jan/12 ] |
|
It looks like mdd_attr_set() may call mdd_acl_chmod(), but this operation is not declared in advance. |
| Comment by Lai Siyao [ 18/Jan/12 ] |
|
review is on http://review.whamcloud.com/#change,1984 |
| Comment by Peter Jones [ 02/Feb/12 ] |
|
Lsnded for 2.2 |
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Lai Siyao [ 02/Feb/12 ] |
|
no more work is left. |
| Comment by Build Master (Inactive) [ 02/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 03/Feb/12 ] |
|
Integrated in Result = SUCCESS
|
| Comment by Build Master (Inactive) [ 17/Feb/12 ] |
|
Integrated in Result = FAILURE
|
| Comment by Build Master (Inactive) [ 17/Feb/12 ] |
|
Integrated in Result = FAILURE
|
| Comment by Build Master (Inactive) [ 17/Feb/12 ] |
|
Integrated in Result = ABORTED
|
| Comment by Sarah Liu [ 19/Mar/12 ] |
|
hit this issue again when doing interop test between 2.2-RC1 server and 2.1.1 RHEL6 client: |
| Comment by Lai Siyao [ 19/Mar/12 ] |
|
It looks to be a different crash: 20:01:48:BUG: unable to handle kernel paging request at 0000000400000002 20:01:48:IP: [<0000000400000002>] 0x400000002 20:01:48:PGD 72aa2067 PUD 0 20:01:48:Oops: 0010 [#1] SMP 20:01:48:last sysfs file: /sys/module/obdclass/initstate 20:01:48:CPU 0 20:01:48:Modules linked in: nfs fscache cmm(U) osd_ldiskfs(U) mdt(U) mdd(U) mds(U) fsfilt_ldiskfs(U) mgs(U) mgc(U) lustre(U) lquota(U) lov(U) osc(U) mdc(U) fid(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ldiskfs(U) jbd2 nfsd lockd nfs_acl auth_rpcgss exportfs autofs4 sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: llog_test] 20:01:48: 20:01:48:Pid: 4547, comm: jbd2/dm-0-8 Not tainted 2.6.32-220.4.2.el6_lustre.x86_64 #1 Red Hat KVM 20:01:48:RIP: 0010:[<0000000400000002>] [<0000000400000002>] 0x400000002 20:01:48:RSP: 0018:ffff88003f6bfca8 EFLAGS: 00010246 20:01:49:RAX: ffff880040263dc0 RBX: ffff8800553d03c0 RCX: 0000000000000000 20:01:49:RDX: ffff880040263dc0 RSI: ffff8800553d03c0 RDI: 0000000000000000 20:01:49:RBP: ffff88003f6bfce0 R08: 00000000ffffff0a R09: 0000000000000000 20:01:49:R10: 000000000000000f R11: 0000000000000000 R12: 0000000000000000 20:01:49:R13: ffff880037c26200 R14: 0006000100000002 R15: ffff8800553d0430 20:01:49:FS: 0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000 20:01:49:CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b 20:01:49:CR2: 0000000400000002 CR3: 0000000072ab9000 CR4: 00000000000006f0 20:01:49:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 20:01:49:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 20:01:49:Process jbd2/dm-0-8 (pid: 4547, threadinfo ffff88003f6be000, task ffff880037f1a040) 20:01:49:Stack: 20:01:49: ffffffffa0b8f1c9 0000000000000d88 ffff880068e414d8 ffff88003ef5368c 20:01:49:<0> ffff88003ef78c00 ffff88004db21c90 0000000000000000 ffff88003f6bfd20 20:01:49:<0> ffffffffa03ee36a ffff88003f6bfd20 ffff880068ec4b9c ffff88004db21bc0 20:01:49:Call Trace: 20:01:50: [<ffffffffa0b8f1c9>] ? osd_trans_commit_cb+0x79/0x1e0 [osd_ldiskfs] 20:01:50: [<ffffffffa03ee36a>] ldiskfs_journal_commit_callback+0x8a/0xc0 [ldiskfs] 20:01:50: [<ffffffffa039a89f>] jbd2_journal_commit_transaction+0x110f/0x1530 [jbd2] 20:01:50: [<ffffffff810096f0>] ? __switch_to+0xd0/0x320 20:01:50: [<ffffffff8107ca1b>] ? try_to_del_timer_sync+0x7b/0xe0 20:01:50: [<ffffffffa039faf8>] kjournald2+0xb8/0x220 [jbd2] 20:01:50: [<ffffffff81090a90>] ? autoremove_wake_function+0x0/0x40 20:01:50: [<ffffffffa039fa40>] ? kjournald2+0x0/0x220 [jbd2] 20:01:50: [<ffffffff81090726>] kthread+0x96/0xa0 20:01:50: [<ffffffff8100c14a>] child_rip+0xa/0x20 20:01:50: [<ffffffff81090690>] ? kthread+0x0/0xa0 20:01:50: [<ffffffff8100c140>] ? child_rip+0x0/0x20 20:01:50:Code: Bad RIP value. 20:01:50:RIP [<0000000400000002>] 0x400000002 20:01:50: RSP <ffff88003f6bfca8> 20:01:50:CR2: 0000000400000002 20:01:50:---[ end trace e31e50250e65373c ]--- I'll check whether it fail on my local setup. |
| Comment by Sarah Liu [ 19/Mar/12 ] |
|
I open a new ticket for tracking this issue: |