Details
-
Bug
-
Resolution: Won't Fix
-
Major
-
None
-
Lustre 2.5.4
-
Lustre Build: https://build.hpdd.intel.com/job/lustre-b2_5/104/
MDSCOUNT=2
-
3
-
16671
Description
While running racer test with MDSCOUNT=2, mv operation hung on client node as follows:
LustreError: 29294:0:(xattr.c:510:ll_getxattr()) server bug: replied size 56 > 32 for 11 (trusted.lov) INFO: task mv:26147 blocked for more than 120 seconds. Not tainted 2.6.32-431.29.2.el6.x86_64 #1 "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. mv D 0000000000000000 0 26147 18764 0x00000080 ffff88006b603cd8 0000000000000086 ffff88005fd60dc0 0000000000000000 0000000000016840 ffff88005fd60dc0 ffffffff8100b9ce ffff88006b603cd8 ffff880064a0baf8 ffff88006b603fd8 000000000000fbc8 ffff880064a0baf8 Call Trace: [<ffffffff8100b9ce>] ? common_interrupt+0xe/0x13 [<ffffffff8105546b>] ? mutex_spin_on_owner+0x9b/0xc0 [<ffffffff8152a5be>] __mutex_lock_slowpath+0x13e/0x180 [<ffffffff8152a45b>] mutex_lock+0x2b/0x50 [<ffffffffa1cdf99e>] ll_setattr_raw+0x2ee/0x1070 [lustre] [<ffffffff81078fd7>] ? current_fs_time+0x27/0x30 [<ffffffffa1ce0785>] ll_setattr+0x65/0xd0 [lustre] [<ffffffff811a7ca8>] notify_change+0x168/0x340 [<ffffffff8119b502>] ? user_path_at+0x62/0xa0 [<ffffffff811862be>] chown_common+0x6e/0x90 [<ffffffff8118658f>] sys_fchownat+0xbf/0xe0 [<ffffffff811865d0>] sys_lchown+0x20/0x30 [<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
Maloo reports:
https://testing.hpdd.intel.com/test_sets/28604232-79df-11e4-807e-5254006e85c2
https://testing.hpdd.intel.com/test_sets/b9d50256-7a21-11e4-bb7c-5254006e85c2
The failure was previously reported in LU-4105, which was fixed by the patch for LU-5144. However, the failure still occurred after the patch for LU-5144 was landed on Lustre b2_5 branch. So I create this new ticket to track the issue.
Attachments
Issue Links
- is related to
-
LU-6085 racer stuck on mutex_lock in ll_setattr_raw()
- Resolved