Details
-
Bug
-
Resolution: Cannot Reproduce
-
Minor
-
None
-
Lustre 2.1.1, Lustre 1.8.7
-
None
-
Lustre Configuration:
OSS1: version 2.2.0 (with 1000 OSTs)
OSS2: version 2.2.0 (with 1000 OSTs)
MGS/MDS: version 2.2.0 (with "large_xattr" enabled)
CLIENT1: version 2.2.0
CLIENT2: version 2.1.1 (Build: http://build.whamcloud.com/job/lustre-b2_1/44/)
CLIENT3: version 1.8.7-wc1 (Build: http://build.whamcloud.com/job/lustre-b1_8/171/)
Distro/Arch: RHEL6.2/x86_64
Lustre Configuration: OSS1: version 2.2.0 (with 1000 OSTs) OSS2: version 2.2.0 (with 1000 OSTs) MGS/MDS: version 2.2.0 (with "large_xattr" enabled) CLIENT1: version 2.2.0 CLIENT2: version 2.1.1 (Build: http://build.whamcloud.com/job/lustre-b2_1/44/) CLIENT3: version 1.8.7-wc1 (Build: http://build.whamcloud.com/job/lustre-b1_8/171/) Distro/Arch: RHEL6.2/x86_64
-
3
-
8295
Description
Test steps:
1) Mount the large stripe count (2000 OSTs in total) filesystem on Lustre 1.8.7-wc1, 2.1.1 and 2.2.0 clients.
2) Run "lfs setstripe" on CLIENT1 (version 2.2.0) to create a file with 2000 stripes.
3) Run "lfs getstripe" on CLIENT2 (version 2.1.1) and CLIENT3 (version 1.8.7-wc1) to get the striping information.
......
While running step 3), the striping information was gotten correctly. However, in the meantime, lfs hit the following issue on both 2.1.1 and 1.8.7-wc1 clients:
# lfs getstripe -c /mnt/lustre/testfile.client-1.2000 2000 *** glibc detected *** lfs: munmap_chunk(): invalid pointer: 0x0000000001b0bed0 *** ======= Backtrace: ========= /lib64/libc.so.6[0x398c0750c6] /lib64/libc.so.6(closedir+0xd)[0x398c0a690d] lfs[0x41c3a1] lfs[0x41c7f0] lfs[0x406370] lfs[0x4273a8] lfs[0x406937] /lib64/libc.so.6(__libc_start_main+0xfd)[0x398c01ecdd] lfs[0x4030b9] ======= Memory map: ======== 00400000-00464000 r-xp 00000000 08:01 928651 /usr/bin/lfs 00663000-00665000 rw-p 00063000 08:01 928651 /usr/bin/lfs 00665000-00680000 rw-p 00000000 00:00 0 00864000-00866000 rw-p 00064000 08:01 928651 /usr/bin/lfs 01b05000-01b26000 rw-p 00000000 00:00 0 [heap] 398b800000-398b820000 r-xp 00000000 08:01 541056 /lib64/ld-2.12.so 398ba1f000-398ba20000 r--p 0001f000 08:01 541056 /lib64/ld-2.12.so 398ba20000-398ba21000 rw-p 00020000 08:01 541056 /lib64/ld-2.12.so 398ba21000-398ba22000 rw-p 00000000 00:00 0 398bc00000-398bc02000 r-xp 00000000 08:01 541059 /lib64/libdl-2.12.so 398bc02000-398be02000 ---p 00002000 08:01 541059 /lib64/libdl-2.12.so 398be02000-398be03000 r--p 00002000 08:01 541059 /lib64/libdl-2.12.so 398be03000-398be04000 rw-p 00003000 08:01 541059 /lib64/libdl-2.12.so 398c000000-398c197000 r-xp 00000000 08:01 541057 /lib64/libc-2.12.so 398c197000-398c397000 ---p 00197000 08:01 541057 /lib64/libc-2.12.so 398c397000-398c39b000 r--p 00197000 08:01 541057 /lib64/libc-2.12.so 398c39b000-398c39c000 rw-p 0019b000 08:01 541057 /lib64/libc-2.12.so 398c39c000-398c3a1000 rw-p 00000000 00:00 0 398cc00000-398cc22000 r-xp 00000000 08:01 540770 /lib64/libncurses.so.5.7 398cc22000-398ce21000 ---p 00022000 08:01 540770 /lib64/libncurses.so.5.7 398ce21000-398ce22000 rw-p 00021000 08:01 540770 /lib64/libncurses.so.5.7 398d800000-398d81d000 r-xp 00000000 08:01 541091 /lib64/libtinfo.so.5.7 398d81d000-398da1d000 ---p 0001d000 08:01 541091 /lib64/libtinfo.so.5.7 398da1d000-398da21000 rw-p 0001d000 08:01 541091 /lib64/libtinfo.so.5.7 398dc00000-398dc3a000 r-xp 00000000 08:01 541092 /lib64/libreadline.so.6.0 398dc3a000-398de3a000 ---p 0003a000 08:01 541092 /lib64/libreadline.so.6.0 398de3a000-398de42000 rw-p 0003a000 08:01 541092 /lib64/libreadline.so.6.0 398de42000-398de43000 rw-p 00000000 00:00 0 7fe05e942000-7fe05e958000 r-xp 00000000 08:01 540760 /lib64/libgcc_s-4.4.6-20110824.so.1 7fe05e958000-7fe05eb57000 ---p 00016000 08:01 540760 /lib64/libgcc_s-4.4.6-20110824.so.1 7fe05eb57000-7fe05eb58000 rw-p 00015000 08:01 540760 /lib64/libgcc_s-4.4.6-20110824.so.1 7fe05eb60000-7fe05eb65000 rw-p 00000000 00:00 0 7fe05eb66000-7fe05eb68000 rw-p 00000000 00:00 0 7fe05eb68000-7fe05eb6d000 rw-s 00000000 00:04 720896 /SYSV00000000 (deleted) 7fe05eb6d000-7fe05eb6e000 rw-p 00000000 00:00 0 7fff8ca1d000-7fff8ca32000 rw-p 00000000 00:00 0 [stack] 7fff8cb12000-7fff8cb13000 r-xp 00000000 00:00 0 [vdso] ffffffffff600000-ffffffffff601000 r-xp 00000000 00:00 0 [vsyscall] Aborted (core dumped)
# gdb /usr/bin/lfs /tmp/lfs.2972.core
Core was generated by `lfs getstripe -c /mnt/lustre/testfile.client-1.2000'.
Program terminated with signal 6, Aborted.
#0 0x000000398c032885 in raise () from /lib64/libc.so.6
(gdb) bt
#0 0x000000398c032885 in raise () from /lib64/libc.so.6
#1 0x000000398c034065 in abort () from /lib64/libc.so.6
#2 0x000000398c06f7a7 in __libc_message () from /lib64/libc.so.6
#3 0x000000398c0750c6 in malloc_printerr () from /lib64/libc.so.6
#4 0x000000398c0a690d in closedir () from /lib64/libc.so.6
#5 0x000000000041c3a1 in llapi_semantic_traverse (path=0x1b05030 "/mnt/lustre/testfile.client-1.2000", parent=0x0, sem_init=0x41d2b0 <cb_getstripe>,
sem_fini=0x4177d0 <cb_common_fini>, data=0x7fff8ca2f600, de=0x0, size=4097) at liblustreapi.c:1132
#6 0x000000000041c7f0 in param_callback (path=0x7fff8ca3164e "/mnt/lustre/testfile.client-1.2000", sem_init=0x41d2b0 <cb_getstripe>, sem_fini=0x4177d0 <cb_common_fini>,
param=0x7fff8ca2f600) at liblustreapi.c:1159
#7 0x0000000000406370 in lfs_getstripe (argc=3, argv=0x7fff8ca2f840) at lfs.c:889
#8 0x00000000004273a8 in Parser_execarg (argc=3, argv=0x7fff8ca2f840, cmds=0x664400) at util/parser.c:104
#9 0x0000000000406937 in main (argc=4, argv=0x7fff8ca2f838) at lfs.c:2528
The core dump file lfs.2972.core on 2.1.1 client is attached.