[LU-12721] HPE MPT MPI and lustre2.12 clients causes write hangs Created: 31/Aug/19  Updated: 31/Aug/19  Resolved: 31/Aug/19

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.12.2
Fix Version/s: None

Type: Bug Priority: Critical
Reporter: Mahmoud Hanafi Assignee: Andreas Dilger
Resolution: Duplicate Votes: 0
Labels: None

Issue Links:
Related
is related to LU-12382 Deadloop on tiny write Resolved
Severity: 2
Rank (Obsolete): 9223372036854775807

 Description   

Using HPE MPT MPI with lustre2.12 clients write hang for 30 sec.

Here is strace that show the timing. We can only write(1,"",0) hang. Looks like a duplicate of  LU-12382.

 

1567214852.343460 write(1, " Compile options:\n", 18) = 18
1567214852.343873 write(1, "", 0)       = 0
1567214883.059446 write(1, "    MPIF77       = ifort\n", 25) = 25
1567214883.059938 write(1, "", 0)       = 0
1567214913.783442 write(1, "    FLINK        = $(MPIF77)\n", 29) = 29
1567214913.783822 write(1, "", 0)       = 0
1567214944.499451 write(1, "    FMPI_LIB     = -lmpi\n", 25) = 25
1567214944.499933 write(1, "", 0)       = 0
1567214975.219462 write(1, "    FMPI_INC     = (none)\n", 26) = 26
1567214975.219942 write(1, "", 0)       = 0
1567215005.943478 write(1, "    FFLAGS       = -O3 -ipo -axC"..., 66) = 66
1567215005.944041 write(1, "", 0)       = 0
1567215036.663466 write(1, "    FLINKFLAGS   = $(FFLAGS) -lc"..., 48) = 48
1567215036.664046 write(1, "", 0)       = 0
1567215067.383452 write(1, "    RAND         = randi8\n", 26) = 26
1567215067.383861 write(1, "", 0)       = 0


 Comments   
Comment by Andreas Dilger [ 31/Aug/19 ]

There is already a backported version of the LU-12382 patch https://review.whamcloud.com/35312/ that is landed to b2_12

Comment by Peter Jones [ 31/Aug/19 ]

Mahmoud

Sounds like this is a duplicate ticket that will be fixed in 2.12.3

Peter

Generated at Sat Feb 10 02:55:04 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.