[LU-6276] LustreError: error destroying object Created: 24/Feb/15  Updated: 25/Feb/15  Resolved: 25/Feb/15

Status: Closed
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.5.3
Fix Version/s: None

Type: Bug Priority: Minor
Reporter: Wolfgang Baudler Assignee: WC Triage
Resolution: Duplicate Votes: 0
Labels: None
Environment:

Red Hat Enterprise Linux Workstation release 6.6, kernel 2.6.32-431.23.3.el6_lustre.x86_64


Issue Links:
Duplicate
duplicates LU-5973 bad error message in ofd_destroy() Resolved
Epic/Theme: Lustre-2.5.3
Severity: 4
Rank (Obsolete): 17597

 Description   

I see the following LustreErrors showing up in irregular intervals:

Feb 24 15:03:47 oss-10 kernel: LustreError: 2417:0:(ofd_obd.c:1096:ofd_destroy()) vegas-OST0052: error destroying object [0x100000000:0x1d0e5:0x0]: 0
Feb 24 15:03:47 oss-10 kernel: LustreError: 2417:0:(ofd_obd.c:1096:ofd_destroy()) Skipped 300 previous similar messages
Feb 24 15:04:53 oss-10 kernel: Lustre: vegas-OST0052-os: trigger OI scrub by RPC for [0x100000000:0xc4a6:0x0], rc = 0 [1]
Feb 24 15:04:53 oss-10 kernel: Lustre: Skipped 219 previous similar messages
Feb 24 15:13:51 oss-10 kernel: LustreError: 2417:0:(ofd_obd.c:1096:ofd_destroy()) vegas-OST0052: error destroying object [0x100000000:0x20c7:0x0]: 0
Feb 24 15:13:51 oss-10 kernel: LustreError: 2417:0:(ofd_obd.c:1096:ofd_destroy()) Skipped 300 previous similar messages
Feb 24 15:14:57 oss-10 kernel: Lustre: vegas-OST0052-os: trigger OI scrub by RPC for [0x100000000:0x20c7:0x0], rc = 0 [1]
Feb 24 15:14:57 oss-10 kernel: Lustre: Skipped 215 previous similar messages

The object ID is not changing. It is always one of those two mentioned. This is only happening on one OST (out of 90). Filesystem seems to be fine otherwise.

output of lctl get_param -n osd-ldiskfs.vegas-OST0052.oi_scrub
name: OI_scrub
magic: 0x4c5fd252
oi_files: 64
status: completed
flags:
param:
time_since_last_completed: 4 seconds
time_since_latest_start: 4 seconds
time_since_last_checkpoint: 4 seconds
latest_start_position: 12
last_checkpoint_position: 40054145
first_failure_position: N/A
checked: 43262
updated: 0
failed: 0
prior_updated: 0
noscrub: 0
igif: 2
success_count: 10871
run_time: 0 seconds
average_speed: 43262 objects/sec
real-time_speed: N/A
current_position: N/A
lf_scanned: 0
lf_reparied: 0
lf_failed: 0

The OI_scrub seems to restart every 10 seconds or so?

What is the problem and is there a cure?



 Comments   
Comment by Oleg Drokin [ 25/Feb/15 ]

The error is a mistake that's covered in LU-5973

We do not know what hte error is though without that error printed

Generated at Sat Feb 10 01:58:49 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.