  1. Lustre
  2. LU-768

Hyperion - recovery-double-scale fails

Details

    • Bug
    • Resolution: Won't Fix
    • Minor
    • None
    • Lustre 1.8.7
    • None
    • Hyperion, RHEL5/x86_64
    • 3
    • 10377

    Description

      The recovery-double-scale test fails; detailed results are in Maloo.
      Errors reported:
      MDS

      Lustre: DEBUG MARKER: Failing type2=clients item2=hyperion321,hyperion421 ...
      Lustre: 2861:0:(quota_master.c:1718:mds_quota_recovery()) Only 4/8 OSTs are active, abort quota recovery
      Lustre: lustre-MDT0000: Recovery period over after 0:24, of 126 clients 126 recovered and 0 were evicted.
      Lustre: lustre-MDT0000: sending delayed replies to recovered clients
      Lustre: MDS lustre-MDT0000: lustre-OST0005_UUID now active, resetting orphans
      LustreError: 2910:0:(mds_open.c:1645:mds_close()) @@@ no handle for file close ino 122683904: cookie 0x3b989baac2130b2c req@ffff810f55c08c50 x1382661482849280/t0 o35->34523172-a256-a14a-a765-717695be2aa1@NET_0x50000c0a8723c_UUID:0/0 lens 408/864 e 0 to 0 dl 1318881574 ref 1 fl Interpret:/2/0 rc 0/0
      LustreError: 2910:0:(mds_open.c:1645:mds_close()) Skipped 9 previous similar messages
      Lustre: DEBUG MARKER: Mon Oct 17 12:59:31 2011
      Client

      Lustre: setting import lustre-OST0000_UUID INACTIVE by administrator request
      LustreError: 25042:0:(ldlm_resource.c:519:ldlm_namespace_cleanup()) Namespace lustre-OST0000-osc-ffff81022efde800 resource refcount nonzero (2) after lock cleanup; forcing cleanup.
      LustreError: 25042:0:(ldlm_resource.c:524:ldlm_namespace_cleanup()) Resource: ffff8101f237ae40 (162106/0/0/0) (rc: 2)
      Lustre: Mount still busy with 5 refs! You may try to umount it a bit later
      Lustre: setting import lustre-MDT0000_UUID INACTIVE by administrator request
      Lustre: Skipped 7 previous similar messages
      LustreError: 25042:0:(ldlm_resource.c:519:ldlm_namespace_cleanup()) Namespace lustre-OST0000-osc-ffff81022efde800 resource refcount nonzero (2) after lock cleanup; forcing cleanup.
      LustreError: 25042:0:(ldlm_resource.c:524:ldlm_namespace_cleanup()) Resource: ffff8101f237ae40 (162106/0/0/0) (rc: 1)
      LustreError: 25042:0:(ldlm_request.c:1039:ldlm_cli_cancel_req()) Got rc -108 from cancel RPC: canceling anyway
      LustreError: 25042:0:(ldlm_request.c:1597:ldlm_cli_cancel_list()) ldlm_cli_cancel_list: -108
      Lustre: client ffff81022efde800 umount complete
      Lustre: DEBUG MARKER: Mon Oct 17 12:59:31 2011
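
      When triaging console output like the above, it can help to split each message into its PID, source file, line number, and function. The following is a minimal sketch only: the regular expression is an assumption inferred from the lines quoted in this ticket, not an official Lustre log grammar, and `parse_lustre_line` is a hypothetical helper name.

```python
import re

# Assumed layout, inferred from the messages quoted in this ticket:
#   <Lustre|LustreError>: <pid>:<n>:(<file>:<line>:<function>()) <message>
LOG_RE = re.compile(
    r"^(?P<level>Lustre(?:Error)?): "
    r"(?P<pid>\d+):\d+:"
    r"\((?P<file>[\w.]+):(?P<line>\d+):(?P<func>\w+)\(\)\) "
    r"(?P<msg>.*)$"
)


def parse_lustre_line(line):
    """Return a dict of fields, or None if the line has no source location."""
    m = LOG_RE.match(line.strip())
    if m is None:
        return None
    fields = m.groupdict()
    fields["pid"] = int(fields["pid"])
    fields["line"] = int(fields["line"])
    return fields


# One of the client-side messages from this ticket.
sample = (
    "LustreError: 25042:0:(ldlm_request.c:1039:ldlm_cli_cancel_req()) "
    "Got rc -108 from cancel RPC: canceling anyway"
)
parsed = parse_lustre_line(sample)
print(parsed["file"], parsed["line"], parsed["func"])
```

      Lines without a source location, such as the DEBUG MARKER lines, do not match the pattern and return None, so they can be filtered out or handled separately.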

            People

              mdiep Minh Diep
              cliffw Cliff White (Inactive)
