[LU-998] Test failure on test suite sanity, subtest test_103 Created: 16/Jan/12  Updated: 19/Mar/12  Resolved: 02/Feb/12

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.2.0
Fix Version/s: Lustre 2.2.0

Type: Bug Priority: Blocker
Reporter: Maloo Assignee: Lai Siyao
Resolution: Fixed Votes: 0
Labels: None

Severity: 3
Rank (Obsolete): 4187

 Description   

This issue was created by maloo for sarah <sarah@whamcloud.com>

This issue relates to the following test suite run: https://maloo.whamcloud.com/test_sets/9a6e6584-3f4f-11e1-990e-5254004bbbd3.

The sub-test test_103 failed with the following error:

test failed to respond and timed out

Info required for matching: sanity 103



 Comments   
Comment by Peter Jones [ 16/Jan/12 ]

Lai

Could you please look into this one?

Thanks

Peter

Comment by Lai Siyao [ 16/Jan/12 ]

MDS hit an ASSERT:

05:20:56:LustreError: 5199:0:(osd_handler.c:2416:osd_xattr_set()) ASSERTION((oh)->ot_declare_xattr_set > 0) failed
Comment by Lai Siyao [ 17/Jan/12 ]

It looks like mdd_attr_set() may call mdd_acl_chmod(), but this operation is not declared in advance.
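
For readers unfamiliar with the declare-then-execute transaction pattern, the failure mode can be sketched in a few lines of standalone C. This is a minimal illustration under stated assumptions, not the actual Lustre code: every `sketch_*` name is hypothetical, and only the counter name `ot_declare_xattr_set` is taken from the assertion message above. The sketch returns -1 at the point where the real code would trip the ASSERTION.

```c
#include <stdbool.h>

/* Hypothetical stand-in for a transaction handle. Only the field name
 * ot_declare_xattr_set comes from the assertion in the MDS log. */
struct sketch_thandle {
    int  ot_declare_xattr_set; /* xattr updates declared so far */
    bool started;              /* no further declarations once started */
};

/* Declaration phase: announce one xattr update before the
 * transaction starts. */
static int sketch_declare_xattr_set(struct sketch_thandle *th)
{
    if (th->started)
        return -1; /* too late to declare */
    th->ot_declare_xattr_set++;
    return 0;
}

/* Close the declaration phase and begin the execution phase. */
static void sketch_trans_start(struct sketch_thandle *th)
{
    th->started = true;
}

/* Execution phase: succeeds only if the update was declared.
 * Returning -1 models the point where the real code would ASSERT.
 * Consuming one declaration per update is a choice made for this
 * sketch. */
static int sketch_xattr_set(struct sketch_thandle *th)
{
    if (!th->started || th->ot_declare_xattr_set <= 0)
        return -1;
    th->ot_declare_xattr_set--;
    return 0;
}

/* A setattr that also rewrites the ACL xattr when the mode changes.
 * declare_acl == false models the bug described above;
 * declare_acl == true models a fix that declares the ACL update. */
static int sketch_attr_set(bool mode_changed, bool declare_acl)
{
    struct sketch_thandle th = { 0, false };

    if (mode_changed && declare_acl)
        sketch_declare_xattr_set(&th);

    sketch_trans_start(&th);

    if (mode_changed)
        return sketch_xattr_set(&th); /* the ACL chmod path */
    return 0;
}
```

In this sketch, `sketch_attr_set(true, false)` fails because the ACL update was never declared, while `sketch_attr_set(true, true)` succeeds.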

Comment by Lai Siyao [ 18/Jan/12 ]

The review is at http://review.whamcloud.com/#change,1984

Comment by Peter Jones [ 02/Feb/12 ]

Landed for 2.2

Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,server,el5,ofa #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,client,el6,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » i686,server,el5,ofa #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,client,el5,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,client,ubuntu1004,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,client,el5,ofa #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,server,el5,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » i686,server,el5,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » i686,client,el5,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » i686,client,el6,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » x86_64,client,sles11,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » i686,client,el5,ofa #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Lai Siyao [ 02/Feb/12 ]

no more work is left.

Comment by Build Master (Inactive) [ 02/Feb/12 ]

Integrated in lustre-master » i686,server,el6,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 03/Feb/12 ]

Integrated in lustre-master » x86_64,server,el6,inkernel #451
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = SUCCESS
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 17/Feb/12 ]

Integrated in lustre-master » x86_64,server,el6,ofa #480
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = FAILURE
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 17/Feb/12 ]

Integrated in lustre-master » x86_64,client,el6,ofa #480
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = FAILURE
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Build Master (Inactive) [ 17/Feb/12 ]

Integrated in lustre-master » i686,client,el6,ofa #480
LU-998 acl: declare acl operation for setattr (Revision ce8821a05da59e00de43e7c2f504bdb3b9dab82c)

Result = ABORTED
Oleg Drokin : ce8821a05da59e00de43e7c2f504bdb3b9dab82c
Files :

  • lustre/mdd/mdd_object.c
Comment by Sarah Liu [ 19/Mar/12 ]

Hit this issue again during interop testing between a 2.2-RC1 server and a 2.1.1 RHEL6 client:
https://maloo.whamcloud.com/test_sets/6b0e0a92-714a-11e1-a89e-5254004bbbd3

Comment by Lai Siyao [ 19/Mar/12 ]

It looks to be a different crash:

20:01:48:BUG: unable to handle kernel paging request at 0000000400000002
20:01:48:IP: [<0000000400000002>] 0x400000002
20:01:48:PGD 72aa2067 PUD 0 
20:01:48:Oops: 0010 [#1] SMP 
20:01:48:last sysfs file: /sys/module/obdclass/initstate
20:01:48:CPU 0 
20:01:48:Modules linked in: nfs fscache cmm(U) osd_ldiskfs(U) mdt(U) mdd(U) mds(U) fsfilt_ldiskfs(U) mgs(U) mgc(U) lustre(U) lquota(U) lov(U) osc(U) mdc(U) fid(U) fld(U) ksocklnd(U) ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) ldiskfs(U) jbd2 nfsd lockd nfs_acl auth_rpcgss exportfs autofs4 sunrpc ib_ipoib rdma_ucm ib_ucm ib_uverbs ib_umad rdma_cm ib_cm iw_cm ib_addr ipv6 ib_sa ib_mad ib_core microcode virtio_balloon 8139too 8139cp mii i2c_piix4 i2c_core ext3 jbd mbcache virtio_blk virtio_pci virtio_ring virtio pata_acpi ata_generic ata_piix dm_mirror dm_region_hash dm_log dm_mod [last unloaded: llog_test]
20:01:48:
20:01:48:Pid: 4547, comm: jbd2/dm-0-8 Not tainted 2.6.32-220.4.2.el6_lustre.x86_64 #1 Red Hat KVM
20:01:48:RIP: 0010:[<0000000400000002>]  [<0000000400000002>] 0x400000002
20:01:48:RSP: 0018:ffff88003f6bfca8  EFLAGS: 00010246
20:01:49:RAX: ffff880040263dc0 RBX: ffff8800553d03c0 RCX: 0000000000000000
20:01:49:RDX: ffff880040263dc0 RSI: ffff8800553d03c0 RDI: 0000000000000000
20:01:49:RBP: ffff88003f6bfce0 R08: 00000000ffffff0a R09: 0000000000000000
20:01:49:R10: 000000000000000f R11: 0000000000000000 R12: 0000000000000000
20:01:49:R13: ffff880037c26200 R14: 0006000100000002 R15: ffff8800553d0430
20:01:49:FS:  0000000000000000(0000) GS:ffff880002200000(0000) knlGS:0000000000000000
20:01:49:CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b
20:01:49:CR2: 0000000400000002 CR3: 0000000072ab9000 CR4: 00000000000006f0
20:01:49:DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
20:01:49:DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
20:01:49:Process jbd2/dm-0-8 (pid: 4547, threadinfo ffff88003f6be000, task ffff880037f1a040)
20:01:49:Stack:
20:01:49: ffffffffa0b8f1c9 0000000000000d88 ffff880068e414d8 ffff88003ef5368c
20:01:49:<0> ffff88003ef78c00 ffff88004db21c90 0000000000000000 ffff88003f6bfd20
20:01:49:<0> ffffffffa03ee36a ffff88003f6bfd20 ffff880068ec4b9c ffff88004db21bc0
20:01:49:Call Trace:
20:01:50: [<ffffffffa0b8f1c9>] ? osd_trans_commit_cb+0x79/0x1e0 [osd_ldiskfs]
20:01:50: [<ffffffffa03ee36a>] ldiskfs_journal_commit_callback+0x8a/0xc0 [ldiskfs]
20:01:50: [<ffffffffa039a89f>] jbd2_journal_commit_transaction+0x110f/0x1530 [jbd2]
20:01:50: [<ffffffff810096f0>] ? __switch_to+0xd0/0x320
20:01:50: [<ffffffff8107ca1b>] ? try_to_del_timer_sync+0x7b/0xe0
20:01:50: [<ffffffffa039faf8>] kjournald2+0xb8/0x220 [jbd2]
20:01:50: [<ffffffff81090a90>] ? autoremove_wake_function+0x0/0x40
20:01:50: [<ffffffffa039fa40>] ? kjournald2+0x0/0x220 [jbd2]
20:01:50: [<ffffffff81090726>] kthread+0x96/0xa0
20:01:50: [<ffffffff8100c14a>] child_rip+0xa/0x20
20:01:50: [<ffffffff81090690>] ? kthread+0x0/0xa0
20:01:50: [<ffffffff8100c140>] ? child_rip+0x0/0x20
20:01:50:Code:  Bad RIP value.
20:01:50:RIP  [<0000000400000002>] 0x400000002
20:01:50: RSP <ffff88003f6bfca8>
20:01:50:CR2: 0000000400000002
20:01:50:---[ end trace e31e50250e65373c ]---

I'll check whether it fails on my local setup.

Comment by Sarah Liu [ 19/Mar/12 ]

I opened a new ticket to track this issue: LU-1235
