Uploaded image for project: 'Lustre'
  1. Lustre
  2. LU-15193

qsd_op_begin: more than 8 qids enforced for a transaction?

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Minor Minor
    • Lustre 2.16.0, Lustre 2.15.4
    • Lustre 2.12.5
    • None
    • CentOS 7.9, Lustre 2.12.7 on clients
    • 3
    • 9223372036854775807

      This morning, we hit the following problem (new for us) on Fir (2.12.5 servers, 2.12.7 clients):

      # rmdir /scratch/users/ragoglia/csATAC/For_Rachel
      rmdir: failed to remove ‘/scratch/users/ragoglia/csATAC/For_Rachel’: Invalid argument
      

       
      More info on this directory:

      [root@fir-rbh01 ~]# ls -lisa /scratch/users/ragoglia/csATAC/For_Rachel
      total 8
      198162765779658672 4 drwxr-xr-x  2 atrev    wjg      4096 Oct 20 02:58 .
      198162765779658671 4 drwxrwxr-x+ 3 ragoglia hbfraser 4096 Sep  5  2019 ..
      
      [root@fir-rbh01 ~]# lfs getdirstripe /scratch/users/ragoglia/csATAC/For_Rachel
      lmv_stripe_count: 0 lmv_stripe_offset: 2 lmv_hash_type: none
      
      [root@fir-rbh01 ~]# lfs project -d /scratch/users/ragoglia/csATAC/For_Rachel
      259557 P /scratch/users/ragoglia/csATAC/For_Rachel
      
      [root@fir-rbh01 ~]# lfs getdirstripe /scratch/users/ragoglia/csATAC
      lmv_stripe_count: 0 lmv_stripe_offset: 2 lmv_hash_type: none
      
      [root@fir-rbh01 ~]# rmdir /scratch/users/ragoglia/csATAC/For_Rachel
      rmdir: failed to remove ‘/scratch/users/ragoglia/csATAC/For_Rachel’: Invalid argument
      

      MDS logs show:

      Nov 03 09:52:49 fir-md1-s3 kernel: LustreError: 103307:0:(qsd_handler.c:884:qsd_op_begin()) fir-MDT0002: more than 8 qids enforced for a transaction?
      Nov 03 10:10:09 fir-md1-s3 kernel: LustreError: 103781:0:(qsd_handler.c:884:qsd_op_begin()) fir-MDT0002: more than 8 qids enforced for a transaction?
      Nov 03 10:10:35 fir-md1-s3 kernel: LustreError: 103781:0:(qsd_handler.c:884:qsd_op_begin()) fir-MDT0002: more than 8 qids enforced for a transaction?
      Nov 03 10:10:52 fir-md1-s3 kernel: LustreError: 103697:0:(qsd_handler.c:884:qsd_op_begin()) fir-MDT0002: more than 8 qids enforced for a transaction?
      Nov 03 10:15:55 fir-md1-s3 kernel: LustreError: 103721:0:(qsd_handler.c:884:qsd_op_begin()) fir-MDT0002: more than 8 qids enforced for a transaction?
      

      Which looks like LU-12388
       
      My colleague found a workaround for rmdir: set project ID 0 before rmdir, and it worked:

      # lfs project -d  /scratch/users/ragoglia/csATAC/For_Rachel/
      259557 P /scratch/users/ragoglia/csATAC/For_Rachel/
      # lfs project -p 0  /scratch/users/ragoglia/csATAC/For_Rachel/
      # lfs project -d  /scratch/users/ragoglia/csATAC/For_Rachel/
          0 P /scratch/users/ragoglia/csATAC/For_Rachel/
      # rmdir /scratch/users/ragoglia/csATAC/For_Rachel/
      #
      

      I believe this is the only occurrence we've seen of this issue so far. Let me know if additional logs would be helpful the next time we hit this.
       

            flei Feng Lei
            sthiell Stephane Thiell
            Votes:
            0 Vote for this issue
            Watchers:
            8 Start watching this issue

              Created:
              Updated:
              Resolved: