[LU-10155] File creation with 'lfs setstripe' is not replayable Created: 24/Oct/17  Updated: 31/Jan/18  Resolved: 31/Jan/18

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.11.0
Fix Version/s: Lustre 2.11.0

Type: Bug Priority: Blocker
Reporter: Mikhail Pershin Assignee: Lai Siyao
Resolution: Fixed Votes: 0
Labels: None

Issue Links:
Related
is related to LU-9514 sanity test 51f times out due to out ... Resolved
Severity: 3
Rank (Obsolete): 9223372036854775807

 Description   

A file created with 'lfs setstripe' cannot be recovered after failover. A simple test like the one below fails:

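test_00() {
	# operations after the barrier are not committed on the MDS and must be replayed
	replay_barrier $SINGLEMDS
	$LFS setstripe -c 2 $DIR/$tfile
	# fail over the MDS; recovery should replay the create above
	fail $SINGLEMDS
	# the file must still exist as a regular file after recovery
	$CHECKSTAT -t file $DIR/$tfile ||
		error "$CHECKSTAT $DIR/$tfile check failed"
}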

At first sight, the file is not re-created because of a problem with the LOV EA supplied along with the replay: the LOV magic is reported as 0.
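
For anyone debugging a similar symptom, the magic check can be sketched outside Lustre as below. This is only an illustration, not the code path in the MDS: it assumes the LOV EA has been dumped to a raw file by some other means and that it starts with a 32-bit little-endian magic. The helper names lov_ea_magic and lov_ea_check are hypothetical, and the values 0x0BD10BD0 (LOV_MAGIC_V1) and 0x0BD30BD0 (LOV_MAGIC_V3) are quoted from the Lustre headers as I recall them, so verify them against your tree.

```shell
# Hypothetical helper: print the 32-bit little-endian magic found at the
# start of a raw LOV EA dump. Not part of test-framework.sh.
lov_ea_magic() {
	local blob=$1

	# Read the first four bytes one at a time so the result does not
	# depend on host endianness, then reuse the positional parameters.
	set -- $(od -An -tx1 -N4 "$blob")
	# A dump shorter than four bytes has no magic to report.
	[ $# -eq 4 ] || return 1
	# Byte 4 is the most significant byte of a little-endian value.
	echo "0x$4$3$2$1"
}

# Hypothetical helper: classify the magic. The V1/V3 constants are an
# assumption taken from the Lustre headers; an all-zero magic is the
# symptom described in this ticket.
lov_ea_check() {
	local magic

	magic=$(lov_ea_magic "$1") || { echo "short"; return 1; }
	case $magic in
	0x0bd10bd0) echo "v1" ;;
	0x0bd30bd0) echo "v3" ;;
	0x00000000) echo "zero"; return 1 ;;
	*) echo "unknown"; return 1 ;;
	esac
}
```

Calling lov_ea_check on a dump file prints the detected layout version and returns non-zero when the magic is zero, unknown, or missing, which makes it usable in an `||` chain like the error check in the test above.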



 Comments   
Comment by Andreas Dilger [ 24/Oct/17 ]

Mike, do you know when this problem was introduced? How is it possible that this wasn't detected by our testing?

Comment by Joseph Gmitter (Inactive) [ 24/Oct/17 ]

Hi Lai,

Can you please investigate?

Thanks.
Joe

Comment by Jinshan Xiong (Inactive) [ 24/Oct/17 ]

A recent change to this area of code is LU-9514, which tried to optimize memory usage by freeing the open reply buffer earlier.

Comment by Mikhail Pershin [ 24/Oct/17 ]

I have tried reverting that patch, but it doesn't help.

Comment by Lai Siyao [ 04/Jan/18 ]

I can't reproduce this on the latest master; maybe it was fixed by other tickets. In any case, I'll add this test to replay-single.sh.

Comment by Gerrit Updater [ 04/Jan/18 ]

Lai Siyao (lai.siyao@intel.com) uploaded a new patch: https://review.whamcloud.com/30704
Subject: LU-10155 test: add setstripe replay test
Project: fs/lustre-release
Branch: master
Current Patch Set: 1
Commit: 8341c5f2499aa22bd230c9c082cc7dc82b821bf7

Comment by Gerrit Updater [ 31/Jan/18 ]

Oleg Drokin (oleg.drokin@intel.com) merged in patch https://review.whamcloud.com/30704/
Subject: LU-10155 recovery: support setstripe replay
Project: fs/lustre-release
Branch: master
Current Patch Set:
Commit: 3a0a50f44f68066642a23365fa4a6f1924dd1108

Comment by Peter Jones [ 31/Jan/18 ]

Marking as resolved for now, I suppose. We will reopen if the test starts failing now that it has landed.
