
lustre-initialization-1 lustre-initialization fails for SLES12 SP2 and SP3

Details

    • Bug
    • Resolution: Fixed
    • Minor
    • Lustre 2.11.0

    Description

      This issue was created by maloo for James Nunez <james.a.nunez@intel.com>

      This issue relates to the following test suite run: https://testing.hpdd.intel.com/test_sets/199f17d0-3149-11e8-b74b-52540065bddc

      lustre-initialization failed with the following error:

      'lustre-initialization failed'
      

      Looking at the autotest log, we see that the Lustre tests are not installed:

      2018-03-26T21:56:49 trevis-18vm1: /usr/lib64/lustre/tests/cfg/: No such file or directory
      2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm1: ssh exited with exit code 1
      2018-03-26T21:56:49 trevis-18vm3: /usr/lib64/lustre/tests/cfg/: No such file or directory
      2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm3: ssh exited with exit code 1
      2018-03-26T21:56:49 trevis-18vm4: /usr/lib64/lustre/tests/cfg/: No such file or directory
      2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm4: ssh exited with exit code 1
      2018-03-26T21:56:49 trevis-18vm2: /usr/lib64/lustre/tests/cfg/: No such file or directory
      2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm2: ssh exited with exit code 1
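
To confirm that every node in the session reports the same missing directory, the log excerpt above can be summarized with a short script. This is only a sketch: the regular expression and the function name `nodes_missing_path` are assumptions made for illustration and are not part of autotest or Maloo.

```python
import re

# Excerpt of the autotest log quoted in this ticket.
LOG = """\
2018-03-26T21:56:49 trevis-18vm1: /usr/lib64/lustre/tests/cfg/: No such file or directory
2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm1: ssh exited with exit code 1
2018-03-26T21:56:49 trevis-18vm3: /usr/lib64/lustre/tests/cfg/: No such file or directory
2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm3: ssh exited with exit code 1
2018-03-26T21:56:49 trevis-18vm4: /usr/lib64/lustre/tests/cfg/: No such file or directory
2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm4: ssh exited with exit code 1
2018-03-26T21:56:49 trevis-18vm2: /usr/lib64/lustre/tests/cfg/: No such file or directory
2018-03-26T21:56:49 pdsh@trevis-18vm1: trevis-18vm2: ssh exited with exit code 1
"""

# Assumed line shape: "<timestamp> <node>: <path>: No such file or directory".
# The pdsh status lines do not match because of the "pdsh@" prefix.
MISSING = re.compile(
    r"^\S+ (?P<node>[\w-]+): (?P<path>/\S+): No such file or directory$"
)


def nodes_missing_path(log_text):
    """Map each missing path to the sorted list of nodes that reported it."""
    found = {}
    for line in log_text.splitlines():
        match = MISSING.match(line.strip())
        if match:
            found.setdefault(match.group("path"), set()).add(match.group("node"))
    return {path: sorted(nodes) for path, nodes in found.items()}


if __name__ == "__main__":
    for path, nodes in nodes_missing_path(LOG).items():
        print(path, "missing on", ", ".join(nodes))
```

All four nodes report the same path, which is consistent with the Lustre test package not being installed anywhere rather than a problem on a single node.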
      

      Yet the node console logs show no failures related to loading RPMs. The console logs for all the nodes end with:

      Welcome to SUSE Linux Enterprise Server 12 SP3  (x86_64) - Kernel 4.4.114-94.11-default (ttyS0).
      
      
      trevis-18vm2 login: [   80.209414] random: nonblocking pool is initialized
      
      <ConMan> Console [trevis-18vm2] disconnected from <trevis-18:6001> at 03-26 22:56.
      

      This failure started with master build #3731.

      Another test session that failed in this way is at
      https://testing.hpdd.intel.com/test_sessions/fb84aaa9-888e-4d17-9a76-1cfd67d415aa

      VVVVVVV DO NOT REMOVE LINES BELOW, Added by Maloo for auto-association VVVVVVV
      lustre-initialization-1 lustre-initialization - 'lustre-initialization failed'


          Activity

            [LU-10858] lustre-initialization-1 lustre-initialization fails for SLES12 SP2 and SP3
            simmonsja James A Simmons added a comment - - edited

            It's about pulling in the right rpms when you install a lustre binary rpm. Currently you can install a prepackaged lustre rpm on a system that could be lacking libyaml and/or zlib. In that case you think it installed correctly, but when you go to run it you see a nice crash. With this patch, yum can grab libyaml and zlib when installing the lustre binary rpm.


            bogl Bob Glossman (Inactive) added a comment -

            The -devel .rpms are fine. Both libyaml-devel and zlib-devel exist in both RHEL and SLES.
            I question the strategy of enforcing this in the .spec file, though.
            I thought there were already autoconf tests to check for and enforce the right build environment in these cases.

            yujian Jian Yu added a comment -

            It's still zlib-devel tho?

            Yes. It's still zlib-devel.


            bogl Bob Glossman (Inactive) added a comment -

            The zlib dependency isn't a problem. The SLES .rpm libz1 has a Provides for the name "zlib".

            simmonsja James A Simmons added a comment - - edited

            Oh crap. At least it is an easy fix. It's still zlib-devel tho?

            yujian Jian Yu added a comment - - edited

            Yes, James.
            I just found that the 'zlib' package is named 'libz1' on SLES 12.


            bogl Bob Glossman (Inactive) added a comment -

            Yes, this is a problem with naming conventions.
            In RHEL the user-level libyaml .rpm is named "libyaml" and Provides the symbol "libyaml".
            In SLES the user-level libyaml .rpm is named "libyaml-0-2" and has no Provides of the name "libyaml". It only Provides the symbol "libyaml-0-2".

            I would suggest removing the dependency this mod just added.

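
The naming mismatch described in the comments can be illustrated with a toy model of Provides resolution. This is a sketch only: the `INSTALLED` table encodes just the package names and Provides mentioned in this ticket (plus the implicit rule that an RPM provides its own name), and the function `providers` is hypothetical. It does not query a real RPM database.

```python
# Package name -> set of capabilities it Provides, per distribution.
# Entries follow the comments in this ticket; each package also provides
# its own name, which is how RPM behaves implicitly.
INSTALLED = {
    "RHEL": {
        "libyaml": {"libyaml"},
        "zlib": {"zlib"},
    },
    "SLES": {
        "libyaml-0-2": {"libyaml-0-2"},  # no Provides for "libyaml"
        "libz1": {"libz1", "zlib"},      # carries a Provides for "zlib"
    },
}


def providers(distro, requirement):
    """Return the sorted package names that satisfy a Requires on `distro`."""
    packages = INSTALLED[distro]
    return sorted(name for name, caps in packages.items() if requirement in caps)


if __name__ == "__main__":
    for distro in ("RHEL", "SLES"):
        for requirement in ("libyaml", "zlib"):
            found = providers(distro, requirement)
            status = ", ".join(found) if found else "UNSATISFIED"
            print(f"{distro}: Requires {requirement} -> {status}")
```

Under these assumptions a `Requires: zlib` resolves on both distributions, while a `Requires: libyaml` has no provider on SLES. That would leave the package uninstallable there, which matches the symptom reported in the description. On a real node, `rpm -q --whatprovides libyaml` should show whether any installed package satisfies the name.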
            simmonsja James A Simmons added a comment - - edited

            Really? That is just adding in build requirements when building from the lustre source rpms. Are those rpms named differently on SLES12SP4?

            pjones Peter Jones added a comment -

            Bob

            Can you please investigate?

            Peter


            jamesanunez James Nunez (Inactive) added a comment -

            Test to see if reverting the LU-10556 patch https://review.whamcloud.com/#/c/31710/ will fix this issue.

            Revert patch at https://review.whamcloud.com/#/c/31800/

            People

              bogl Bob Glossman (Inactive)
              maloo Maloo