[LU-1245] general protection fault in lprocfs_fops_read Created: 20/Mar/12  Updated: 02/May/12  Resolved: 30/Apr/12

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.1.0, Lustre 2.1.2
Fix Version/s: Lustre 2.3.0, Lustre 2.1.2

Type: Bug Priority: Minor
Reporter: Ned Bass Assignee: Zhenyu Xu
Resolution: Fixed Votes: 0
Labels: paj
Environment:

https://github.com/chaos/lustre/tree/2.1.0-llnl


Severity: 3
Rank (Obsolete): 4656

 Description   

Our slurm epilog runs a script that cats the files

/proc/fs/lustre/ldlm/namespaces/*/lock_count
/proc/fs/lustre/ldlm/namespaces/*/lock_unused
/proc/fs/lustre/ldlm/namespaces/*/resource_count

One such cat process on a Lustre 2.1 client hit a GPF with the backtrace below.

machine_kexec
crash_kexec
oops_end
die
do_general_protection
general_protection
[exception RIP:lprocfs_rd_ns_locks+88]
lprocfs_fops_read
proc_reg_read
vfs_read
sys_read
system_call_fastpath

We have a crash dump for analysis on our classified network, so please let us know if there's any data structures you want to see.



 Comments   
Comment by Peter Jones [ 21/Mar/12 ]

Bobi

Could you please help out with this one?

Thanks

Peter

Comment by Zhenyu Xu [ 26/Mar/12 ]

Ned Bass,

Can you list where lprocfs_rd_ns_locks+88 locates in the code? Thanks.

Comment by Ned Bass [ 26/Mar/12 ]

lustre-2.1.0/lustre/include/lprocfs_status.h:421

412 static inline __u64 lprocfs_stats_collector(struct lprocfs_stats *stats,
413                                             int idx,
414                                             enum lprocfs_fields_flags field)
415 {
416         __u64 ret = 0;
417         int i;
418 
419         LASSERT(stats != NULL);
420         for (i = 0; i < cfs_num_possible_cpus(); i++)
421                 ret += lprocfs_read_helper(&(stats->ls_percpu[i]->lp_cntr[idx]),
422                                            field);
423         return ret;
424 }
Comment by Zhenyu Xu [ 26/Mar/12 ]

patch tracking at http://review.whamcloud.com/2384

Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,client,sles11,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,server,el5,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,server,el5,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,server,el6,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,client,el5,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,client,el5,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,client,el6,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,client,el5,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,server,el5,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,client,el6,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,server,el6,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,client,el5,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » x86_64,server,el6,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,server,el5,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,server,el6,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,client,el6,ofa #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 06/Apr/12 ]

Integrated in lustre-master » i686,client,el6,inkernel #477
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Christopher Morrone [ 17/Apr/12 ]

This is required on 2.1 as well, which is where we reported the problem. We really need some visibility into the status of things like this. It seems like the ticket should not be marked resolved until it is resolved in all necessary branches.

Comment by Zhenyu Xu [ 17/Apr/12 ]

still need port for Lustre 2.1.2

Comment by Zhenyu Xu [ 17/Apr/12 ]

port for b2_1 at http://review.whamcloud.com/2566
port for b2_2 at http://review.whamcloud.com/2567

Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,client,sles11,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,client,el6,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,server,el5,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » i686,server,el5,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,server,el6,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,server,el5,ofa #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » i686,server,el5,ofa #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,client,el5,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » i686,server,el6,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » i686,client,el5,ofa #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » x86_64,client,el5,ofa #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » i686,client,el5,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 20/Apr/12 ]

Integrated in lustre-b2_1 » i686,client,el6,inkernel #47
LU-1245 lprocfs: use correct cpu number (Revision ef51cfe1a5a85d3f66c73282dfb073acc2c64450)

Result = SUCCESS
Oleg Drokin : ef51cfe1a5a85d3f66c73282dfb073acc2c64450
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » x86_64,client,el5,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » i686,client,el6,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » i686,server,el5,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » x86_64,server,el6,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » i686,client,el5,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » x86_64,server,el5,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Comment by Build Master (Inactive) [ 02/May/12 ]

Integrated in lustre-dev » x86_64,client,el6,inkernel #340
LU-1245 lprocfs: use correct cpu number (Revision debb9042dbe9f004d58e9dd89bcb259e55490b17)

Result = SUCCESS
Oleg Drokin : debb9042dbe9f004d58e9dd89bcb259e55490b17
Files :

  • lustre/include/lprocfs_status.h
Generated at Sat Feb 10 01:14:54 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.