[LU-3373] ldiskfs patches for FC19 Created: 21/May/13 Updated: 15/Nov/19 Resolved: 01/Jul/14 |
|
| Status: | Resolved |
| Project: | Lustre |
| Component/s: | None |
| Affects Version/s: | None |
| Fix Version/s: | Lustre 2.7.0 |
| Type: | Improvement | Priority: | Minor |
| Reporter: | Yang Sheng | Assignee: | Yang Sheng |
| Resolution: | Fixed | Votes: | 0 |
| Labels: | None |
| Issue Links: |
|
| Rank (Obsolete): | 8342 |
| Description |
|
This ticket is intended to track ldiskfs patch work on the FC18 3.7 kernel. |
| Comments |
| Comment by Dmitry Eremin (Inactive) [ 22/May/13 ] |
|
FC18 already has the 3.9.2 kernel. Why is the 3.7 kernel going to be supported? |
| Comment by James A Simmons [ 22/May/13 ] |
|
Can you post a link to download the kernel source. I will gladly update the kernel patch in |
| Comment by Dmitry Eremin (Inactive) [ 22/May/13 ] |
|
http://mirrors.med.harvard.edu/fedora/updates/18/SRPMS/kernel-3.9.2-200.fc18.src.rpm |
| Comment by Yang Sheng [ 22/May/13 ] |
|
I think starting the work from 3.7 keeps maximum compatibility for further support. |
| Comment by Yang Sheng [ 07/Aug/13 ] |
|
First patchset: http://review.whamcloud.com/#/c/7263/ I'll continue updating it after the 3.9.5 kernel work is done. |
| Comment by Bob Glossman (Inactive) [ 07/Aug/13 ] |
|
I think this is a little behind the curve. The current kernel in fc18 is 3.9.11; in fc19 it's 3.10.4. As far as I know everything needed for 3.9 has already landed. Not so true of 3.10. I don't see any mods to ldiskfs autoconf here to automatically choose the right (new) patch series. |
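The kind of selection asked for here, picking a patch series from the running kernel version, can be sketched outside autoconf. This is only an illustration of the logic, not the autoconf change discussed in this ticket: the series file names and the helper names `kernel_tuple` and `pick_series` are hypothetical.

```python
# Hypothetical mapping from minimum kernel version to an ldiskfs patch series.
# The series names are placeholders, not the files shipped in the Lustre tree.
# Entries are ordered newest first so the first match is the best match.
SERIES_BY_MIN_KERNEL = [
    ((3, 10), "ldiskfs-3.10-example.series"),
    ((3, 9), "ldiskfs-3.9-example.series"),
]


def kernel_tuple(release):
    """Turn a release string such as '3.10.4' into (3, 10)."""
    major, minor = release.split(".")[:2]
    # Drop any suffix such as '-rc1' that may follow the minor number.
    return int(major), int(minor.split("-")[0])


def pick_series(release):
    """Return the series for the newest minimum version not above the kernel."""
    version = kernel_tuple(release)
    for minimum, series in SERIES_BY_MIN_KERNEL:
        if version >= minimum:
            return series
    return None  # no known series for this kernel


print(pick_series("3.9.11"))
print(pick_series("3.10.4"))
```

Returning `None` for an unknown kernel keeps the failure explicit instead of silently applying a series written for a different kernel.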
| Comment by Yang Sheng [ 07/Sep/13 ] |
| Comment by Yang Sheng [ 11/Sep/13 ] |
|
sanity test fail list:
sanity: FAIL: test_17m e2fsck should not report error upon short/long symlink MDT: rc=4
FAIL 17m (43s)
sanity: FAIL: test_24A Expected 5000 files, got 5005 (5005 unique)
FAIL 24A (5s)
sanity: FAIL: test_27k > 4194304
FAIL 27k (1s)
sanity: FAIL: test_33a create
FAIL 33a (1s)
sanity: FAIL: test_36c
FAIL 36c (1s)
sanity: FAIL: test_36d
FAIL 36d (1s)
sanity: FAIL: test_102c setstripe failed
FAIL 102c (2s)
sanity: FAIL: test_103 permissions failed
FAIL 103 (17s)
sanity: FAIL: test_104b lfs check servers test failed
FAIL 104b (1s)
sanity: FAIL: test_129 exceeded dir size limit 4096 x 1 4096 : 12288 bytes
FAIL 129 (9s)
sanity: FAIL: test_133c The counter for destroy on ost was not incremented
FAIL 133c (16s)
sanity: FAIL: test_180c failed to load module obdecho
FAIL 180c (2s)
sanity: FAIL: test_183 test_183 failed with 1
FAIL 183 (3s)
sanity: FAIL: test_184c concurrent write on /mnt/lustre/d0.sanity/d184/184c/file1 failed
FAIL 184c (448s)
sanity: FAIL: test_218 multiop failed while creating a file
FAIL 218 (3s)
sanity: FAIL: test_219 test_219 failed with 1
FAIL 219 (2s)
FAIL 228a (8s) |
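A failure list like this is easier to compare between runs once the test names are pulled out. A minimal sketch, assuming the output keeps the `sanity: FAIL: test_<name> <message>` shape shown in this comment; the function name `failed_tests` is made up for illustration and is not part of the Lustre test framework.

```python
import re

# Matches the "sanity: FAIL: test_<name>" marker seen in the list above.
FAIL_RE = re.compile(r"sanity: FAIL: test_(\w+)")


def failed_tests(log_text):
    """Return failed test names in order of first appearance, without duplicates."""
    seen = []
    for name in FAIL_RE.findall(log_text):
        if name not in seen:
            seen.append(name)
    return seen


sample = (
    "sanity: FAIL: test_17m e2fsck should not report error upon "
    "short/long symlink MDT: rc=4 FAIL 17m (43s) "
    "sanity: FAIL: test_24A Expected 5000 files, got 5005 (5005 unique) "
    "FAIL 24A (5s)"
)
print(failed_tests(sample))
```

Note that a trailing summary entry without a matching `sanity: FAIL:` line, such as `FAIL 228a (8s)`, is not picked up by this pattern.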
| Comment by James A Simmons [ 23/Oct/13 ] |
|
With the patches for lustre-2.5.50/lustre/osd-ldiskfs/osd_handler.c:66: |
| Comment by Yang Sheng [ 29/Oct/13 ] |
|
http://review.whamcloud.com/7948 |
| Comment by Yang Sheng [ 31/Oct/13 ] |
|
http://review.whamcloud.com/8110 |
| Comment by Yang Sheng [ 03/Nov/13 ] |
|
Latest sanity status: sanity: FAIL: test_17m e2fsck should not report error upon short/long symlink MDT: rc=4 |
| Comment by James A Simmons [ 11/Nov/13 ] |
|
I also had a patch for the osd-ldiskfs iterate change but mine was unstable. Now I see it was due to me not updating the position field in struct dir_context. Your patch doesn't currently build for me on normal rhel6.4 but I am working on fixing that. I should have something ready in the next few hours. Thanks Yang for finding that issue. |
| Comment by Bob Glossman (Inactive) [ 11/Nov/13 ] |
|
I think there is still one more call to ldiskfs_journal_start_sb() in osd_handler.c that hasn't been switched to osd_journal_start_sb() by the existing patches. Maybe could be taken care of by a refresh of #7794? Besides that all that's missing is the readdir -> iterate conversion that James has already mentioned. And of course finding all the bugs. Much of the previous hole has been filled in by recent server patch #8232 from James. We're getting very close to having a full ldiskfs server build. |
| Comment by Yang Sheng [ 12/Nov/13 ] |
| Comment by James A Simmons [ 12/Nov/13 ] |
|
It lives!!!! Behold ldiskfs on a 3.11.1 kernel. I had to fix up some of the ldiskfs patches but it works. jsimmons@spoon46:~$ uname -r;df |
| Comment by Bob Glossman (Inactive) [ 12/Nov/13 ] |
|
with the refresh of #7794 and the addition of #8231 I can now complete a full server build with ldiskfs on the 3.11.7 kernel that is current in fc19 too. major milestone!! I'm almost afraid to try some functional tests. |
| Comment by James A Simmons [ 12/Nov/13 ] |
|
Yes with the refresh now all I have to do is set the ldiskfs series manually. Also you need patch http://review.whamcloud.com/#/c/8237 which I just pushed. I also updated http://review.whamcloud.com/#/c/8116 locally to test if ext4_map_blocks() is available. |
| Comment by Bob Glossman (Inactive) [ 12/Nov/13 ] |
|
I was using Peng's original upstream patch. I see #8237 now. By the way I do have a prototype autoconf patch that selects the ldiskfs series. I haven't pushed it because I'm not sure it's production worthy. |
| Comment by James A Simmons [ 12/Nov/13 ] |
|
Send it my way. I will try it out. Tomorrow I will set up a real file system using a 3.11 kernel to see how it holds up. If it holds up I plan to mount it on our cray test bed and run a bunch of jobs to stress test it. |
| Comment by Bob Glossman (Inactive) [ 13/Nov/13 ] |
|
it lives, take II
[root@fedora19 tests]# llmount.sh
Stopping clients: fedora19 /mnt/lustre (opts:)
Stopping clients: fedora19 /mnt/lustre2 (opts:)
Loading modules from /usr/lib64/lustre/tests/..
detected 1 online CPUs by sysfs
libcfs will create CPU partition based on online CPUs
debug=vfstrace rpctrace dlmtrace neterror ha config ioctl super
subsystem_debug=all -lnet -lnd -pinger
gss/krb5 is not supported
quota/lquota options: 'hash_lqs_cur_bits=3'
Formatting mgs, mds, osts
Format mds1: /tmp/lustre-mdt1
Format ost1: /tmp/lustre-ost1
Format ost2: /tmp/lustre-ost2
Checking servers environments
Checking clients fedora19 environments
Loading modules from /usr/lib64/lustre/tests/..
detected 1 online CPUs by sysfs
libcfs will create CPU partition based on online CPUs
debug=vfstrace rpctrace dlmtrace neterror ha config ioctl super
subsystem_debug=all -lnet -lnd -pinger
gss/krb5 is not supported
Setup mgs, mdt, osts
Starting mds1: -o loop /tmp/lustre-mdt1 /mnt/mds1
Started lustre-MDT0000
Starting ost1: -o loop /tmp/lustre-ost1 /mnt/ost1
Started lustre-OST0000
Starting ost2: -o loop /tmp/lustre-ost2 /mnt/ost2
Started lustre-OST0001
Starting client: fedora19: -o user_xattr,flock fedora19@tcp:/lustre /mnt/lustre
Using TIMEOUT=20
seting jobstats to procname_uid
Setting lustre.sys.jobid_var from disable to procname_uid
Changed after 0s: from '' to 'disable'
Waiting 90 secs for update
Updated after 9s: wanted 'procname_uid' got 'procname_uid'
disable quota as required
[root@fedora19 tests]# lfs df
UUID                   1K-blocks   Used  Available  Use%  Mounted on
lustre-MDT0000_UUID       133560   1576     118624    1%  /mnt/lustre[MDT:0]
lustre-OST0000_UUID       171080   9380     147700    6%  /mnt/lustre[OST:0]
lustre-OST0001_UUID       171080   9380     147700    6%  /mnt/lustre[OST:1]
filesystem summary:       342160  18760     295400    6%  /mnt/lustre |
| Comment by Bob Glossman (Inactive) [ 13/Nov/13 ] |
|
One odd quirk. lustre-ldiskfs.spec is set up with a Requires: modutils. However there isn't any rpm in fedora that provides modutils. I could only install lustre with a --force option on the command line. Looks like in earlier distros modutils was one of the Provides: in the module-init-tools rpm. The fedora rpm that contains files similar to those in the old module-init-tools rpm is named kmod, but it has no Provides: modutils in it. |
| Comment by James A Simmons [ 19/Nov/13 ] |
|
Ran several jobs (various IOR and mdtest configurations, and a science app) against the 3.11-based Lustre file system using ldiskfs. It was successful so I would say we are in pretty good shape. |
| Comment by Yang Sheng [ 25/Nov/13 ] |
|
O_LOV_DELAY_CREATE conflicts with __O_TMPFILE and needs to be changed. The lod and osp procfs patches need some bug fixes. I have commented in gerrit. |
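Open flags are single bits packed into one integer, so two flags collide whenever their values share a set bit. A minimal sketch of that check follows; the bit patterns are placeholders chosen for illustration and are not the real values of O_LOV_DELAY_CREATE or __O_TMPFILE, and `shared_bits` is a made-up helper name.

```python
def shared_bits(flag_a, flag_b):
    """Return the bits set in both flags; non-zero means the flags collide."""
    return flag_a & flag_b


# Placeholder bit patterns for illustration only; they are NOT the real
# values of O_LOV_DELAY_CREATE or __O_TMPFILE on any architecture.
EXAMPLE_PRIVATE_FLAG = 0b0100
EXAMPLE_KERNEL_FLAG = 0b0110
EXAMPLE_FREE_FLAG = 0b1000

# A non-zero result means the private flag reuses a bit the kernel flag owns.
print(bin(shared_bits(EXAMPLE_PRIVATE_FLAG, EXAMPLE_KERNEL_FLAG)))
# A zero result means the two flags can be combined without ambiguity.
print(bin(shared_bits(EXAMPLE_FREE_FLAG, EXAMPLE_KERNEL_FLAG)))
```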
| Comment by Bob Glossman (Inactive) [ 03/Dec/13 ] |
|
I think |
| Comment by James A Simmons [ 06/Dec/13 ] |
|
http://review.whamcloud.com/#/c/8116 looks ready for inspection and possible merger. |
| Comment by James A Simmons [ 13/Dec/13 ] |
|
I see patch 8116 has been abandoned. I believe the point of the patch was to limit the porting of patches for fc19 support. Does this mean the ldiskfs patches will be updated, or will the rhel6.5 patch be enough? |
| Comment by James A Simmons [ 30/Dec/13 ] |
|
http://review.whamcloud.com/#/c/8231 is ready for inspection and possible merger. |
| Comment by Jodi Levi (Inactive) [ 01/Jul/14 ] |
|
Patches landed to Master. Final patch http://review.whamcloud.com/#/c/8116 is being tracked under |