Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-393

1.6<->1.8 interop: 1.8.5.56 client failed to connect to 1.6.7.2 server

    XMLWordPrintable

Details

    • Bug
    • Resolution: Won't Fix
    • Major
    • None
    • Lustre 1.8.6
    • None
    • 3
    • 7877

    Description

      While mounting Lustre 1.8.5.56 client on node client-12-ib, it failed to connect to the Lustre 1.6.7.2 MDS server (fat-amd-1-ib) as follows:

      [root@client-12-ib ~]# mount -t lustre -o user_xattr,acl,flock fat-amd-1-ib@o2ib:/lustre /mnt/lustre
      mount.lustre: mount fat-amd-1-ib@o2ib:/lustre at /mnt/lustre failed: Cannot send after transport endpoint shutdown
      

      Dmesg on client-12-ib showed that:

      [root@client-12-ib ~]# dmesg
      Lustre: OBD class driver, http://www.lustre.org/
      Lustre:     Lustre Version: 1.8.5.56
      Lustre:     Build Version: 1.8.5.56-20110528075626-PRISTINE-2.6.32-71.18.2.el6.x86_64
      Lustre: Listener bound to ib0:192.168.4.12:987:mlx4_0
      Lustre: Register global MR array, MR size: 0xffffffffffffffff, array size: 1
      Lustre: Added LNI 192.168.4.12@o2ib [8/64/0/180]
      Lustre: Lustre Client File System; http://www.lustre.org/
      LustreError: 152-6: Ignoring deprecated mount option 'acl'.
      Lustre: 27162:0:(client.c:1487:ptlrpc_expire_one_request()) @@@ Request x1370764878020609 sent from MGC192.168.4.132@o2ib to NID 192.168.4.132@o2ib 5s ago has timed out (5s prior to deadline).
        req@ffff88031fb81400 x1370764878020609/t0 o250->MGS@MGC192.168.4.132@o2ib_0:26/25 lens 368/584 e 0 to 1 dl 1307263263 ref 2 fl Rpc:N/0/0 rc 0/0
      LustreError: 27150:0:(client.c:858:ptlrpc_import_delay_req()) @@@ IMP_INVALID  req@ffff88031fb81000 x1370764878020611/t0 o501->MGS@MGC192.168.4.132@o2ib_0:26/25 lens 264/432 e 0 to 1 dl 0 ref 1 fl Rpc:/0/0 rc 0/0
      LustreError: 15c-8: MGC192.168.4.132@o2ib: The configuration from log 'lustre-client' failed (-108). This may be the result of communication errors between this node and the MGS, a bad configuration, or other errors. See the syslog for more information.
      LustreError: 27150:0:(llite_lib.c:1090:ll_fill_super()) Unable to process log: -108
      Lustre: client ffff880329446800 umount complete
      LustreError: 27150:0:(obd_mount.c:2065:lustre_fill_super()) Unable to mount  (-108)
      
      [root@client-12-ib ~]# ping fat-amd-1-ib
      PING fat-amd-1-ib (192.168.4.132) 56(84) bytes of data.
      64 bytes from fat-amd-1-ib (192.168.4.132): icmp_seq=1 ttl=64 time=0.730 ms
      64 bytes from fat-amd-1-ib (192.168.4.132): icmp_seq=2 ttl=64 time=0.119 ms
      64 bytes from fat-amd-1-ib (192.168.4.132): icmp_seq=3 ttl=64 time=0.122 ms
      64 bytes from fat-amd-1-ib (192.168.4.132): icmp_seq=4 ttl=64 time=0.117 ms
      ^C
      --- fat-amd-1-ib ping statistics ---
      4 packets transmitted, 4 received, 0% packet loss, time 3917ms
      rtt min/avg/max/mdev = 0.117/0.272/0.730/0.264 ms
      
      [root@client-12-ib ~]# lctl ping fat-amd-1-ib@o2ib
      Can't parse process id "fat-amd-1-ib@o2ib"
      
      [root@fat-amd-1-ib ~]# lctl list_nids
      192.168.4.132@o2ib
      
      [root@client-12-ib ~]# lctl ping 192.168.4.132@o2ib
      failed to ping 192.168.4.132@o2ib: Input/output error
      
      [root@client-12-ib ~]# lctl list_nids
      192.168.4.12@o2ib
      
      [root@fat-amd-1-ib ~]# ping client-12-ib
      PING client-12-ib (192.168.4.12) 56(84) bytes of data.
      64 bytes from client-12-ib (192.168.4.12): icmp_seq=1 ttl=64 time=1.66 ms
      64 bytes from client-12-ib (192.168.4.12): icmp_seq=2 ttl=64 time=0.119 ms
      64 bytes from client-12-ib (192.168.4.12): icmp_seq=3 ttl=64 time=0.124 ms
      
      --- client-12-ib ping statistics ---
      3 packets transmitted, 3 received, 0% packet loss, time 2002ms
      rtt min/avg/max/mdev = 0.119/0.635/1.662/0.726 ms
       
      [root@fat-amd-1-ib ~]# lctl ping 192.168.4.12@o2ib
      failed to ping 192.168.4.12@o2ib: Protocol error
      
      [root@fat-amd-1-ib ~]# dmesg
      LustreError: 14950:0:(api-ni.c:1732:lnet_ping()) 12345-192.168.4.12@o2ib: Unexpected version 0x2
      
      

      Attachments

        Activity

          People

            wc-triage WC Triage
            yujian Jian Yu
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: