Loading...

XML

Word

Printable

Type: Bug
Resolution: Not a Bug
Priority: Major
Fix Version/s: None
Affects Version/s: Lustre 1.8.8
Labels:
None
Environment:
Sun Fire x4540 server, 48 internal 1TB disks, lustre patched kernel - kernel-2.6.18-308.4.1.el5, Lustre 1.8.8

Severity:
3
Rank (Obsolete):
10643

Since our recent upgrade to 1.8.8, we've been experiencing problems with the md subsystem. Our OSTs are constructed as 8+2 RAID6 metadevices using the mdadm utility.
Every Sunday morning, cron.weekly runs the raid.check scripts and starts re-syncing and if it hits a medium error, the md subsytem hangs, for example "cat /proc/mdstat" hangs. The load on the server immediately starts going up until the server becomes unusable and we have to reboot the OSS server
What could be causing this and should we be running raid.check on the ost metadevices?

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

oss06_messages
8.82 MB
12/Sep/12 11:25 AM

Assignee:: Oleg Drokin

Reporter:: Hellen (Inactive)

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 12/Sep/12 11:25 AM

Updated:: 08/Mar/14 12:57 AM

Resolved:: 08/Mar/14 12:57 AM

Details

Description

Attachments

Attachments

Activity

People

Dates