Details
-
Bug
-
Resolution: Incomplete
-
Major
-
None
-
Lustre 2.4.3
-
None
-
3
-
17768
Description
We have a user when they try to read some restart files the will get this fortran error message. The error can move around to different files and is not always consistent.
I was able to capture lustre debugging during one of these failures.
this was the specific error and the FID of the file
Reading file unit: 1232 forrtl: severe (39): error during read, unit 1232, file /nobackupp9/pbalakum/TURBULENCE/3D_TURBULENCE/TURB1_10595_DNS_COMPACT_512_512_512/fort.1232 Image PC Routine Line Source read_file 000000000047C351 Unknown Unknown Unknown read_file 000000000047B325 Unknown Unknown Unknown read_file 000000000043687A Unknown Unknown Unknown read_file 0000000000408872 Unknown Unknown Unknown read_file 00000000004080A1 Unknown Unknown Unknown read_file 000000000041949F Unknown Unknown Unknown read_file 0000000000402F5D Unknown Unknown Unknown read_file 0000000000402BFC Unknown Unknown Unknown libc.so.6 00007FFFED0F5C36 Unknown Unknown Unknown read_file 0000000000402AF9 Unknown Unknown Unknown r401i2n10 /nobackupp9/pbalakum/TURBULENCE/3D_TURBULENCE/TURB1_10595_DNS_COMPACT_512_512_512 # lfs path2fid /nobackupp9/pbalakum/TURBULENCE/3D_TURBULENCE/TURB1_10595_DNS_COMPACT_512_512_512/fort.1232 [0x20009c845:0x14ddf:0x0]
I will upload the debug logs to ftp site and post the file