[LU-8870] conf-sanity test_32b: lov.: error writing proc entry 'stripesize': rc = -22 Created: 29/Nov/16  Updated: 16/Jan/19  Resolved: 16/Jan/19

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.9.0
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Maloo Assignee: Yang Sheng
Resolution: Cannot Reproduce Votes: 0
Labels: None
Environment:

Full - EL7.3 Server/EL7.3 Client
b2_9, build# 21


Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

This issue was created by maloo for Saurabh Tandan <saurabh.tandan@intel.com>

This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/4cd2a468-b32e-11e6-85c4-5254006e85c2.

The sub-test test_32b failed with the following error:

test failed to respond and timed out

client console logs:

17:11:42:[24917.250751] Lustre: DEBUG MARKER: == conf-sanity test 32b: Upgrade with writeconf ====================================================== 16:54:55 (1480035295)
17:11:42:[25068.948171] LustreError: 119721:0:(obd_config.c:1393:class_process_proc_param()) lov.: error writing proc entry 'stripesize': rc = -22
17:11:42:[25163.969833] LustreError: 11-0: t32fs-MDT0000-mdc-ffff880c09ec2800: operation mds_connect to node 192.168.5.144@o2ib failed: rc = -11
17:11:42:[25313.971651] LustreError: 11-0: t32fs-MDT0000-mdc-ffff880c09ec2800: operation mds_connect to node 192.168.5.144@o2ib failed: rc = -11
17:11:42:[25463.973132] LustreError: 11-0: t32fs-MDT0000-mdc-ffff880c09ec2800: operation mds_connect to node 192.168.5.144@o2ib failed: rc = -11
17:11:42:[25613.974891] LustreError: 11-0: t32fs-MDT0000-mdc-ffff880c09ec2800: operation mds_connect to node 192.168.5.144@o2ib failed: rc = -11
17:11:42:[25763.976465] LustreError: 11-0: t32fs-MDT0000-mdc-ffff880c09ec2800: operation mds_connect to node 192.168.5.144@o2ib failed: rc = -11
17:55:56:********** Timeout by autotest system **********

Might be related to LU-8338



 Comments   
Comment by Bob Glossman (Inactive) [ 01/Dec/16 ]

another on master:
https://testing.hpdd.intel.com/test_sets/cb2a99e6-b776-11e6-be4d-5254006e85c2

Comment by Yang Sheng [ 21/Jun/17 ]

The message 'lov.: error writing proc entry 'stripesize': rc = -22' is not a root cause. Many of test results marked as this ticket was cause by different issue.

https://testing.hpdd.intel.com/test_sets/9310f828-5228-11e7-a749-5254006e85c2

Jun 15 22:33:56 trevis-49vm1 kernel: sha1sum         D ffff88000d59fa90     0 28001  27999 0x00000080
Jun 15 22:33:56 trevis-49vm1 kernel: ffff88000d59f930 0000000000000082 ffff880079773ec0 ffff88000d59ffd8
Jun 15 22:33:56 trevis-49vm1 kernel: ffff88000d59ffd8 ffff88000d59ffd8 ffff880079773ec0 ffff88007fc16c40
Jun 15 22:33:56 trevis-49vm1 kernel: 0000000000000000 7fffffffffffffff ffffffff8168a630 ffff88000d59fa90
Jun 15 22:33:56 trevis-49vm1 kernel: Call Trace:
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168a630>] ? bit_wait+0x50/0x50
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168c5d9>] schedule+0x29/0x70
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168a019>] schedule_timeout+0x239/0x2c0
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff81060c1f>] ? kvm_clock_get_cycles+0x1f/0x30
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff810eb0dc>] ? ktime_get_ts64+0x4c/0xf0
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168a630>] ? bit_wait+0x50/0x50
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168bb7e>] io_schedule_timeout+0xae/0x130
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168bc18>] io_schedule+0x18/0x20
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168a641>] bit_wait_io+0x11/0x50
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8168a35f>] __wait_on_bit_lock+0x5f/0xc0
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff81180684>] __lock_page_killable+0x74/0x90
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff810b1be0>] ? wake_bit_function+0x40/0x40
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff81182dd8>] generic_file_aio_read+0x748/0x790
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffffa0cafb47>] vvp_io_read_start+0x4b7/0x600 [lustre]
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffffa0819b65>] cl_io_start+0x65/0x130 [obdclass]
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffffa081bf2e>] cl_io_loop+0x12e/0xc90 [obdclass]
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffffa0c5eb38>] ll_file_io_generic+0x498/0xc40 [lustre]
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff8121a1ff>] ? touch_atime+0x12f/0x160
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffffa0c5fbaa>] ll_file_aio_read+0x34a/0x3e0 [lustre]
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffffa0c5fd0e>] ll_file_read+0xce/0x1e0 [lustre]
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff811fe69e>] vfs_read+0x9e/0x170
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff811ff26f>] SyS_read+0x7f/0xe0
Jun 15 22:33:56 trevis-49vm1 kernel: [<ffffffff816975c9>] system_call_fastpath+0x16/0x1b

https://testing.hpdd.intel.com/test_sets/088ad91a-519e-11e7-a743-5254006e85c2

00:07:47:[ 2654.638556] Lustre: DEBUG MARKER: == conf-sanity test 32b: Upgrade with writeconf ====================================================== 00:07:13 (1497485233)
00:07:47:[ 2676.277933] LustreError: 16554:0:(obd_config.c:1429:class_process_proc_param()) lov.: error writing proc entry 'stripesize': rc = -22
00:07:47:[ 2676.355021] Lustre: Mounted t32fs-client
00:07:47:[ 2677.713520] LustreError: 16606:0:(layout.c:1882:req_capsule_filled_sizes()) ASSERTION( loc != RCL_SERVER ) failed: 
00:07:47:[ 2677.716394] LustreError: 16606:0:(layout.c:1882:req_capsule_filled_sizes()) LBUG
00:07:47:[ 2677.719036] Pid: 16606, comm: ls
00:07:47:[ 2677.721293] 
00:07:47:[ 2677.721293] Call Trace:
00:07:47:[ 2677.725433]  [<ffffffffa06a77ee>] libcfs_call_trace+0x4e/0x60 [libcfs]
00:07:47:[ 2677.727842]  [<ffffffffa06a787c>] lbug_with_loc+0x4c/0xb0 [libcfs]
00:07:47:[ 2677.730243]  [<ffffffffa09edb0c>] req_capsule_filled_sizes+0xcc/0xd0 [ptlrpc]
00:07:47:[ 2677.732593]  [<ffffffffa09c5dde>] ptlrpc_request_set_replen+0x1e/0x50 [ptlrpc]
00:07:47:[ 2677.734966]  [<ffffffffa0ae42b0>] mdc_enqueue_base+0xd20/0x1870 [mdc]
00:07:47:[ 2677.737208]  [<ffffffffa0ae568b>] mdc_intent_lock+0x26b/0x520 [mdc]
00:07:47:[ 2677.739479]  [<ffffffffa0c6e9b0>] ? ll_md_blocking_ast+0x0/0x730 [lustre]
00:07:47:[ 2677.741691]  [<ffffffffa099ce00>] ? ldlm_completion_ast+0x0/0x910 [ptlrpc]
00:07:47:[ 2677.743929]  [<ffffffffa095bf0f>] lmv_intent_lock+0x5cf/0x1b50 [lmv]
00:07:47:[ 2677.746034]  [<ffffffffa0c6e9b0>] ? ll_md_blocking_ast+0x0/0x730 [lustre]
00:07:47:[ 2677.748213]  [<ffffffffa0c7c6fb>] ll_xattr_cache_refill+0x5fb/0x1860 [lustre]
00:07:47:[ 2677.750320]  [<ffffffffa0c7db2b>] ll_xattr_cache_get+0x9b/0x4b0 [lustre]
00:07:47:[ 2677.752446]  [<ffffffffa0c79fe6>] ll_getxattr_common+0x196/0xca0 [lustre]
00:07:47:[ 2677.754525]  [<ffffffffa07cd319>] ? lprocfs_counter_add+0xf9/0x160 [obdclass]
00:07:47:[ 2677.756674]  [<ffffffffa0c7ac23>] ll_getxattr+0x133/0x1b0 [lustre]
00:07:47:[ 2677.758682]  [<ffffffff81223e98>] vfs_getxattr+0x88/0xb0
00:07:47:[ 2677.760656]  [<ffffffff81223fdb>] getxattr+0xab/0x1d0
00:07:47:[ 2677.762551]  [<ffffffff8120f24d>] ? putname+0x3d/0x60
00:07:47:[ 2677.764487]  [<ffffffff812103f2>] ? user_path_at_empty+0x72/0xc0
00:07:47:[ 2677.766459]  [<ffffffffa06af324>] ? libcfs_log_return+0x24/0x30 [libcfs]
00:07:47:[ 2677.768505]  [<ffffffffa0c2b6f8>] ? ll_ddelete+0x218/0x290 [lustre]
00:07:47:[ 2677.770439]  [<ffffffff8121eede>] ? mntput_no_expire+0x3e/0x120
00:07:47:[ 2677.772347]  [<ffffffff81224d04>] SyS_getxattr+0x64/0xc0
00:07:47:[ 2677.774125]  [<ffffffff816975c9>] system_call_fastpath+0x16/0x1b
00:07:47:[ 2677.775997] 
00:07:47:[ 2677.777471] Kernel panic - not syncing: LBUG
00:07:47:[ 2677.778463] CPU: 0 PID: 16606 Comm: ls Tainted: G        W  OE  ------------   3.10.0-514.21.1.el7.x86_64 #1
00:07:47:[ 2677.778463] Hardware name: Red Hat KVM, BIOS 0.5.1 01/01/2007
00:07:47:[ 2677.778463]  ffffffffa06c5e8b 00000000f8feb9fe ffff88007a49f870 ffffffff81686f13
00:07:47:[ 2677.778463]  ffff88007a49f8f0 ffffffff8168031a ffffffff00000008 ffff88007a49f900
00:07:47:[ 2677.778463]  ffff88007a49f8a0 00000000f8feb9fe 00000000f8feb9fe ffff88007fc0f838
00:07:47:[ 2677.778463] Call Trace:
00:07:47:[ 2677.778463]  [<ffffffff81686f13>] dump_stack+0x19/0x1b
00:07:47:[ 2677.778463]  [<ffffffff8168031a>] panic+0xe3/0x1f2
00:07:47:[ 2677.778463]  [<ffffffffa06a7894>] lbug_with_loc+0x64/0xb0 [libcfs]
00:07:47:[ 2677.778463]  [<ffffffffa09edb0c>] req_capsule_filled_sizes+0xcc/0xd0 [ptlrpc]
00:07:47:[ 2677.778463]  [<ffffffffa09c5dde>] ptlrpc_request_set_replen+0x1e/0x50 [ptlrpc]
00:07:47:[ 2677.778463]  [<ffffffffa0ae42b0>] mdc_enqueue_base+0xd20/0x1870 [mdc]
00:07:47:[ 2677.778463]  [<ffffffffa0ae568b>] mdc_intent_lock+0x26b/0x520 [mdc]
00:07:47:[ 2677.778463]  [<ffffffffa0c6e9b0>] ? ll_invalidate_negative_children+0x1d0/0x1d0 [lustre]
00:07:47:[ 2677.778463]  [<ffffffffa099ce00>] ? ldlm_expired_completion_wait+0x240/0x240 [ptlrpc]
00:07:47:[ 2677.778463]  [<ffffffffa095bf0f>] lmv_intent_lock+0x5cf/0x1b50 [lmv]
00:07:47:[ 2677.778463]  [<ffffffffa0c6e9b0>] ? ll_invalidate_negative_children+0x1d0/0x1d0 [lustre]
00:07:47:[ 2677.778463]  [<ffffffffa0c7c6fb>] ll_xattr_cache_refill+0x5fb/0x1860 [lustre]
00:07:47:[ 2677.778463]  [<ffffffffa0c7db2b>] ll_xattr_cache_get+0x9b/0x4b0 [lustre]
00:07:47:[ 2677.778463]  [<ffffffffa0c79fe6>] ll_getxattr_common+0x196/0xca0 [lustre]
00:07:47:[ 2677.778463]  [<ffffffffa07cd319>] ? lprocfs_counter_add+0xf9/0x160 [obdclass]
00:07:47:[ 2677.778463]  [<ffffffffa0c7ac23>] ll_getxattr+0x133/0x1b0 [lustre]
00:07:47:[ 2677.778463]  [<ffffffff81223e98>] vfs_getxattr+0x88/0xb0
00:07:47:[ 2677.778463]  [<ffffffff81223fdb>] getxattr+0xab/0x1d0
00:07:47:[ 2677.778463]  [<ffffffff8120f24d>] ? putname+0x3d/0x60
00:07:47:[ 2677.778463]  [<ffffffff812103f2>] ? user_path_at_empty+0x72/0xc0
00:07:47:[ 2677.778463]  [<ffffffffa06af324>] ? libcfs_log_return+0x24/0x30 [libcfs]
00:07:47:[ 2677.778463]  [<ffffffffa0c2b6f8>] ? ll_ddelete+0x218/0x290 [lustre]
00:07:47:[ 2677.778463]  [<ffffffff8121eede>] ? mntput_no_expire+0x3e/0x120
00:07:47:[ 2677.778463]  [<ffffffff81224d04>] SyS_getxattr+0x64/0xc0
00:07:47:[ 2677.778463]  [<ffffffff816975c9>] system_call_fastpath+0x16/0x1b
Comment by Yang Sheng [ 16/Jan/19 ]

Please feel free to reopen it.

Generated at Sat Feb 10 02:21:16 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.