[LU-8840] sanity-lfsck test_2e: @@@@@@ FAIL: (5) Fail to repair crashed linkEA: 0
Created: 16/Nov/16  Updated: 08/May/17  Resolved: 26/Apr/17

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.10.0
Fix Version/s: Lustre 2.10.0

Type: Bug
Priority: Major
Reporter: nasf (Inactive)
Assignee: nasf (Inactive)
Resolution: Fixed
Votes: 0
Labels: None

Issue Links:
Duplicate
Related
is related to LU-9045 conf-sanity test_32c: test failed to ... Resolved
is related to LU-9048 conf-sanity test_32c: test failed to ... Resolved
Severity: 3

 Description   

== sanity-lfsck test 2e: namespace LFSCK can verify remote object linkEA ============================= 12:41:48 (1479271308)
fail_loc=0x1603
fail_loc=0
Started LFSCK on the device lustre-MDT0000: scrub namespace
sanity-lfsck test_2e: @@@@@@ FAIL: (5) Fail to repair crashed linkEA: 0
Trace dump:
= /root/Work/Lustre/IEEL3_STX/lustre-release-ee-stx/lustre/tests/test-framework.sh:4970:error()
= sanity-lfsck.sh:424:test_2e()
= /root/Work/Lustre/IEEL3_STX/lustre-release-ee-stx/lustre/tests/test-framework.sh:5230:run_one()
= /root/Work/Lustre/IEEL3_STX/lustre-release-ee-stx/lustre/tests/test-framework.sh:5268:run_one_logged()
= /root/Work/Lustre/IEEL3_STX/lustre-release-ee-stx/lustre/tests/test-framework.sh:5120:run_test()
= sanity-lfsck.sh:431:main()
Dumping lctl log to /tmp/test_logs/1479271199/sanity-lfsck.test_2e.*.1479271309.log
Dumping logs only on local client.
Resetting fail_loc on all nodes...done.
FAIL 2e (1s)



 Comments   
Comment by Gerrit Updater [ 16/Nov/16 ]

Fan Yong (fan.yong@intel.com) uploaded a new patch: http://review.whamcloud.com/23782
Subject: LU-8840 osp: osp_xattr_get should return the EA size
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 2323cc74fa3f2617ec3b99311056be9e3370cbf2

Comment by Gerrit Updater [ 24/Jan/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/23782/
Subject: LU-8840 osp: handle EA cache properly
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 555d02f47401340182b47b3245a657b52fc3e68a

Comment by Joseph Gmitter (Inactive) [ 27/Jan/17 ]

The patch https://review.whamcloud.com/23782/ has been identified as the root cause of recent failures on master in conf-sanity test_32c. See LU-9045 for details.

A revert of the above patch is at https://review.whamcloud.com/#/c/25134/

Comment by Joseph Gmitter (Inactive) [ 27/Jan/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/25134/
Subject: LU-9045 osp: Revert "LU-8840 osp: handle EA cache properly"
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: db1ef0a322f41314abd37b5ec4ad153d63c9b405

Comment by Gerrit Updater [ 02/Feb/17 ]

Fan Yong (fan.yong@intel.com) uploaded a new patch: https://review.whamcloud.com/25207
Subject: LU-8840 osp: handle EA cache properly (2)
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: c2631e4dfcf052ec5c9fa717c45fca47fac2b9fa

Comment by nasf (Inactive) [ 03/Feb/17 ]

The reason the original patch 23782 caused LU-9045 and LU-9048 is now known: the function osp_oac_xattr_assignment() did not properly check the 'osp_xattr_entry' length, which led to a memory overflow. The new patch 25207 fixes that.
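
For readers unfamiliar with this class of bug, here is a minimal sketch of the kind of length check that was missing. It is not the Lustre code: the struct ea_cache_entry, its fields, and ea_cache_assign() are hypothetical names invented for illustration, and the real 'osp_xattr_entry' layout and the actual fix in patch 25207 may differ.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical cached-EA entry; NOT the real Lustre osp_xattr_entry layout. */
struct ea_cache_entry {
	size_t capacity;   /* usable bytes in value[], at most sizeof(value) */
	size_t value_len;  /* bytes of value[] that are currently valid */
	int    exists;     /* non-zero once a value has been cached */
	char   value[64];
};

/*
 * Copy a new EA value into the cache entry.
 * Returns 0 on success, -ERANGE if the value does not fit.
 * The length check has to come before memcpy(); without it, an
 * oversized value would be written past the end of value[].
 */
static int ea_cache_assign(struct ea_cache_entry *entry,
			   const void *buf, size_t len)
{
	assert(entry != NULL);
	assert(entry->capacity <= sizeof(entry->value));

	if (len > entry->capacity)
		return -ERANGE;

	if (len > 0)
		memcpy(entry->value, buf, len);
	entry->value_len = len;
	entry->exists = 1;
	return 0;
}
```

In this sketch a caller that gets -ERANGE would have to allocate a larger entry and retry, or skip caching; either way the existing cached value is left untouched.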

Comment by Gerrit Updater [ 26/Apr/17 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/25207/
Subject: LU-8840 osp: handle EA cache properly (2)
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 783fccdde2e947628c7ee0eaa0c6bf86fd65e2e8

Comment by nasf (Inactive) [ 26/Apr/17 ]

The patch has landed on master.
