[LU-5402] After rolling upgrade MDS and OSS, sanity test_220 FAIL: test_220 failed with 6
Created: 24/Jul/14  Updated: 12/Aug/22  Resolved: 12/Aug/22

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.6.0, Lustre 2.8.0
Fix Version/s: None

Type: Bug
Priority: Minor
Reporter: Sarah Liu
Assignee: Jian Yu
Resolution: Cannot Reproduce
Votes: 1
Labels: None
Environment:

before upgrade: 2.5.2 ldiskfs
after upgrade:
server: b2_6-rc2
clients: 2.5.2


Attachments: File 5402.tgz    
Severity: 3
Rank (Obsolete): 15036

 Description   

After a rolling upgrade of the OSS and MDS from 2.5.2 to b2_6-rc2, with both clients still on 2.5.2, running sanity hits the following error.

The same error occurs if only the OSS is upgraded to b2_6-rc2 and all other nodes (MDS, clients) remain on 2.5.2.

client console:

== sanity test 220: preallocated MDS objects still used if ENOSPC from OST == 19:16:32 (1406168192)
Lustre: DEBUG MARKER: == sanity test 220: preallocated MDS objects still used if ENOSPC from OST == 19:16:32 (1406168192)
onyx-26: error: get_param: /proc/{fs,sys}/{lnet,lustre}/osc/lustre-OST0000-osc-/prealloc_last_id: Found no match
onyx-26: error: get_param: /proc/{fs,sys}/{lnet,lustre}/osc/lustre-OST0000-osc-/prealloc_next_id: Found no match
UUID                      Inodes       IUsed       IFree IUse% Mounted on
lustre-MDT0000_UUID      1000184        2733      997451   0% /mnt/lustre[MDT:0]
lustre-OST0000_UUID       366720        1031      365689   0% /mnt/lustre[OST:0]

filesystem summary:      1000184        2733      997451   0% /mnt/lustre

fail_val=-1
fail_loc=0x229
onyx-26: Pool lustre.test_220 created

onyx-26: OST lustre-OST0000_UUID added to pool lustre.test_220
pool 'lustre.test_220' has no OSTs
error: setstripe: create stripe file '/mnt/lustre/d220.sanity' failed
preallocated objects on MDS is 0 ( - )
OST still has 0 kbytes free
create 0 files @next_id...
total: 0 creates in 0.00 seconds: -nan creates/second
onyx-26: error: get_param: /proc/{fs,sys}/{lnet,lustre}/osc/lustre-OST0000-osc-/prealloc_last_id: Found no match
onyx-26: error: get_param: /proc/{fs,sys}/{lnet,lustre}/osc/lustre-OST0000-osc-/prealloc_next_id: Found no match
after creation, last_id=, next_id=
UUID                      Inodes       IUsed       IFree IUse% Mounted on
lustre-MDT0000_UUID      1000184        2733      997451   0% /mnt/lustre[MDT:0]
lustre-OST0000_UUID       366720      366720           0 100% /mnt/lustre[OST:0]

filesystem summary:      1000184        2733      997451   0% /mnt/lustre

cleanup...
fail_val=0
fail_loc=0
onyx-26: OST lustre-OST0000_UUID removed from pool lustre.test_220
onyx-26: Pool lustre.test_220 destroyed
unlink 0 files @...
count must be at least one
 sanity test_220: @@@@@@ FAIL: test_220 failed with 6 
Lustre: DEBUG MARKER: sanity test_220: @@@@@@ FAIL: test_220 failed with 6
  Trace dump:
  = /usr/lib64/lustre/tests/test-framework.sh:4343:error_noexit()
  = /usr/lib64/lustre/tests/test-framework.sh:4374:error()
  = /usr/lib64/lustre/tests/test-framework.sh:4613:run_one()
  = /usr/lib64/lustre/tests/test-framework.sh:4648:run_one_logged()
  = /usr/lib64/lustre/tests/test-framework.sh:4516:run_test()
  = sanity.sh:11474:main()
Dumping lctl log to /home/w3liu/toro_home/test_logs/sanity.test_220.*.1406168221.log
FAIL 220 (32s)
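
Both get_param errors in the console output name the device `lustre-OST0000-osc-`, with nothing after the final `osc-`. The following is a minimal sketch, assuming MDS-side OSC device names normally end in an MDT suffix (for example `lustre-OST0000-osc-MDT0000`), of a check that flags such a truncated name before it is used in a parameter path. The function name `check_mdtosc_name` is hypothetical and is not part of test-framework.sh; it only illustrates the symptom and does not identify the root cause.

```shell
#!/bin/bash
# Hypothetical helper, not part of test-framework.sh.
# Assumption: a well-formed MDS-side OSC device name ends in
# "-osc-MDTxxxx", where xxxx is a four-digit hex MDT index.
# Returns 0 for a well-formed name, 1 when the MDT suffix is missing,
# as in "lustre-OST0000-osc-" from the console output.
check_mdtosc_name() {
    local name="$1"
    if [[ "$name" =~ -osc-MDT[0-9a-fA-F]{4}$ ]]; then
        return 0
    fi
    echo "suspicious OSC device name: '$name' (no MDT suffix)" >&2
    return 1
}
```

Calling `check_mdtosc_name "lustre-OST0000-osc-"` prints a warning to stderr and returns 1, which would let a test stop early instead of reading empty `prealloc_last_id` and `prealloc_next_id` values.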


 Comments   
Comment by Sarah Liu [ 24/Jul/14 ]

error logs

Comment by Sarah Liu [ 02/Sep/15 ]

Hit this issue during interop testing between a lustre-master server (DNE mode) and a 2.7.0 client.

server: lustre-master build # 3142

Comment by Sarah Liu [ 20/Nov/15 ]

Hit this issue on an SELinux-enabled client after a clean upgrade from 2.5.5 to master:
https://testing.hpdd.intel.com/test_sets/6b88b5b0-8f0c-11e5-8dd0-5254006e85c2
