Details

Type: Bug
Resolution: Cannot Reproduce
Priority: Critical
Fix Version/s: None
Affects Version/s: None
Labels: llnl
Severity: 3
Rank (Obsolete): 12530

Description

Some users have reported that the "rm" command is taking a long time. Investigation showed that at least the first "rm" in a directory takes just over 100 seconds, which of course sounds like OBD_TIMEOUT_DEFAULT.

This isn't necessarily the simplest reproducer, but the following reproducer is completely consistent:

- set the directory's default stripe count to 48
- touch a file in that directory on client A
- rm the file on client B
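
The reproducer steps above could be scripted roughly as follows. This is only a sketch under stated assumptions, not the script used for this report: the `prepare`/`remove` roles, the `testfile` name, and the helper function names are hypothetical, and the script has to be run by hand on each client against a directory on the shared Lustre mount. The stripe count of 48 and the 100 second threshold come from the description; `lfs setstripe -c` is the usual way to set a default stripe count on a directory.

```python
import os
import subprocess
import sys
import time

STRIPE_COUNT = 48        # default stripe count used in the report
SUSPECT_DELAY_S = 100    # delay seen in the report (looks like OBD_TIMEOUT_DEFAULT)


def setstripe_command(directory, stripe_count=STRIPE_COUNT):
    """Build the `lfs setstripe` command that sets a directory's default stripe count."""
    return ["lfs", "setstripe", "-c", str(stripe_count), directory]


def timed_unlink(path):
    """Remove `path` and return how long the unlink took, in seconds."""
    start = time.monotonic()
    os.unlink(path)
    return time.monotonic() - start


def looks_like_timeout(elapsed_s, threshold_s=SUSPECT_DELAY_S):
    """True if the removal took at least as long as the suspected timeout."""
    return elapsed_s >= threshold_s


if __name__ == "__main__":
    # Hypothetical usage: `script.py prepare DIR` on client A,
    # then `script.py remove DIR` on client B.
    role, directory = sys.argv[1], sys.argv[2]
    target = os.path.join(directory, "testfile")  # hypothetical file name
    if role == "prepare":
        subprocess.run(setstripe_command(directory), check=True)
        open(target, "a").close()  # same effect as `touch`
    elif role == "remove":
        elapsed = timed_unlink(target)
        print(f"unlink took {elapsed:.1f}s; "
              f"timeout-like: {looks_like_timeout(elapsed)}")
```

Timing the unlink directly, rather than wrapping `rm` in a shell, keeps process startup out of the measurement, although at a delay of around 100 seconds either approach would show the problem.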

The clients are running 2.4.0-19chaos and the servers are at 2.4.0-21chaos. The servers are using ZFS as the backend.

I have some Lustre logs that I will share and talk about in additional posts to this ticket. But essentially it looks like the server always times out on an AST to client A (explaining the 100 second delay). It is not yet clear to me why that happens, because client A appears to be completely responsive. My current suspicion is that the MDT is to blame.

Attachments

- 172.16.66.4@tcp.log.bz2 (40 kB, 06/Feb/14 6:53 PM)
- 172.16.66.5@tcp.log.bz2 (53 kB, 06/Feb/14 6:53 PM)
- 172.20.20.201@o2ib500.log.bz2 (8.52 MB, 06/Feb/14 6:53 PM)
- client_log_20140206.txt (375 kB, 07/Feb/14 2:05 AM)
- inflames.log (2.40 MB, 02/Apr/14 6:58 PM)

Issue Links

duplicates

LU-4963 client eviction during IOR test - lock callback timer expired

Closed

is related to

LU-5525 ASSERTION( new_lock->l_readers + new_lock->l_writers == 0 ) failed

Resolved

LU-5632 ldlm_lock_addref()) ASSERTION( lock != ((void *)0) )

Resolved

LU-5686 (mdt_handler.c:3203:mdt_intent_lock_replace()) ASSERTION( lustre_msg_get_flags(req->rq_reqmsg) & 0x0002 ) failed

Resolved

is related to

LU-2827 mdt_intent_fixup_resent() cannot find the proper lock in hash

Resolved

People

Assignee: Bruno Faccini (Inactive)

Reporter: Christopher Morrone (Inactive)

Votes: 1

Watchers: 29

Dates

Created: 05/Feb/14 2:00 AM

Updated: 13/Oct/21 3:05 AM

Resolved: 12/Dec/17 8:30 AM