Details
-
Bug
-
Resolution: Duplicate
-
Major
-
None
-
Lustre 2.12.0
-
linux 3.10.0-957.1.3.1chaos.ch6.x86_64
lustre-2.12.0_1.chaos-1.ch6.x86_64
Clients OmniPath <-> routers <-> Servers mlx5
See https://github.com/LLNL/lustre/releases for contents of 2.12.0_1.chaos.
-
3
-
9223372036854775807
Description
mdtest intermittently fails and reports EINVAL error when trying to create or remove a file.
mdtest-1.8.3 was launched with 1024 total task(s) on 64 nodes Command line used: /g/g0/faaland1/projects/mdtest/mdtest/mdtest -d /p/lquake/faaland1/lustre-212-reconnects -n 1024 -F -u -v Path: /p/lquake/faaland1 FS: 1867.3 TiB Used FS: 34.2% Inodes: 765.8 Mi Used Inodes: 57.1% 1024 tasks, 1048576 files Operation Duration Rate --------- -------- ---- * iteration 1 02/20/2019 13:37:43 * Tree creation : 0.076 sec, 13.191 ops/sec 02/20/2019 13:39:00: Process 158(opal119): FAILED in create_remove_items_helper, unable to unlink file file.mdtest.158.223 (cwd=/p/lquake/faaland1/lustre-212-reconnects/#test-dir.0/mdtest_tree.158.0): Invalid argument -------------------------------------------------------------------------- MPI_ABORT was invoked on rank 158 in communicator MPI_COMM_WORLD with errorcode 1.
Seen with:
no DoM
no PFL
16 MDTs in the file system, but directory mdtest is using is not striped.
64 nodes x 16 ppn
Attachments
Issue Links
- duplicates
-
LU-11827 Race between llog_cat_declare_add_rec and llog_cat_current_log
- Resolved