[LU-2466] LNET NETWORKS HASHING Created: 11/Dec/12  Updated: 19/Apr/13  Resolved: 19/Apr/13

Status: Resolved
Project: Lustre
Component/s: None
Affects Version/s: Lustre 2.4.0
Fix Version/s: Lustre 2.4.0

Type: New Feature Priority: Minor
Reporter: Jodi Levi (Inactive) Assignee: Doug Oucharek (Inactive)
Resolution: Fixed Votes: 0
Labels: Fujitsu

Attachments: Microsoft Word NetHash_TestPlan.docx    
Issue Links:
Related
is related to LUDOC-125 Add Remote Network Hash Table Size Pa... Closed
is related to LUDOC-110 LNET Networks Hashing Documentation Closed
Sub-Tasks:
Key
Summary
Type
Status
Assignee
LU-2496 Create and attach test plan to ticket... Technical task Resolved Doug Oucharek  
Rank (Obsolete): 5812

 Comments   
Comment by Doug Oucharek (Inactive) [ 14/Jan/13 ]

Patch for this ticket is: http://review.whamcloud.com/#change,5023

Comment by Doug Oucharek (Inactive) [ 01/Feb/13 ]

Remote Networks Table Hashing test plan.

Comment by James Nunez (Inactive) [ 02/Apr/13 ]

As required by the test plan, the specified code changes were made to lnet/lnet/api-ni.c to print the number of entries in the hash table when LNET starts up, found by looking in dmesg, and to lnet/lnet/router_proc.c to show what hash table entries the given route is assigned when you cat /proc/sys/lnet/routes.

The following unit tests were run and verified locally and manually.

Unit Test 1: Hash Table Default Size – To make sure the default size of the hash table is 64 entries.

  1. lctl network configure
    LNET configured
  2. dmesg | grep -i "hash table"
    LNet: 12124:0:(api-ni.c:227:lnet_create_remote_nets_table()) Hash table entries: 64

Unit Test 2: Hash Table Size For Given Setting – To make sure the module parameter can change the hash table size

  1. lctl network configure
    LNET configured
  2. dmesg | grep -i "hash table"
    LNet: 12188:0:(api-ni.c:227:lnet_create_remote_nets_table()) Hash table entries: 1024

Unit Test 3: Hash Table Distribution – To make sure the hashing algorithm is fairly distributing the entries

  1. cat /etc/modprobe.d/lustre.conf
    options lnet ip2nets="tcp0 192.168.0.*; \
    o2ib0(ib0) 132.6.1.[1-59]" \
    routes="tcp0 132.6.1.[1-8]@o2ib0; \
    o2ib0 192.168.0.[6-7]@tcp0; tcp1 192.168.0.[1-2]@tcp0; \
    o2ib1 192.168.0.[16-17]@tcp0; tcp3 192.168.0.[21-22]@tcp0; \
    o2ib2 192.168.0.[26-27]@tcp0; tcp4 192.168.0.[31-32]@tcp0; \
    o2ib3 192.168.0.[36-37]@tcp0; tcp5 192.168.0.[41-42]@tcp0; \
    o2ib4 192.168.0.[46-47]@tcp0; tcp6 192.168.0.[51-52]@tcp0; \
    o2ib5 192.168.0.[56-57]@tcp0; tcp7 192.168.0.[61-62]@tcp0; \
    o2ib6 192.168.0.[66-67]@tcp0; tcp8 192.168.0.[71-72]@tcp0; \
    o2ib7 192.168.0.[76-77]@tcp0; tcp9 192.168.0.[81-82]@tcp0; \
    o2ib8 192.168.0.[86-87]@tcp0; tcp10 192.168.0.[91-92]@tcp0; \
    tcp2 192.168.0.[11-12]@tcp0;" \
    rnet_htable_size=1070
  1. cat /proc/sys/lnet/routes
    Routing disabled
    net hops state router
    tcp1 1 down 192.168.0.1@tcp (3)
    tcp1 1 down 192.168.0.2@tcp (3)
    tcp2 1 down 192.168.0.12@tcp (4)
    tcp2 1 down 192.168.0.11@tcp (4)
    o2ib 1 down 192.168.0.7@tcp (5)
    o2ib 1 down 192.168.0.6@tcp (5)
    tcp3 1 down 192.168.0.21@tcp (5)
    tcp3 1 down 192.168.0.22@tcp (5)
    o2ib1 1 down 192.168.0.16@tcp (6)
    o2ib1 1 down 192.168.0.17@tcp (6)
    tcp4 1 down 192.168.0.32@tcp (6)
    tcp4 1 down 192.168.0.31@tcp (6)
    o2ib2 1 down 192.168.0.26@tcp (7)
    o2ib2 1 down 192.168.0.27@tcp (7)
    tcp5 1 down 192.168.0.41@tcp (7)
    tcp5 1 down 192.168.0.42@tcp (7)
    o2ib3 1 down 192.168.0.37@tcp (8)
    o2ib3 1 down 192.168.0.36@tcp (8)
    tcp6 1 down 192.168.0.51@tcp (8)
    tcp6 1 down 192.168.0.52@tcp (8)
    o2ib4 1 down 192.168.0.46@tcp (9)
    o2ib4 1 down 192.168.0.47@tcp (9)
    tcp7 1 down 192.168.0.61@tcp (9)
    tcp7 1 down 192.168.0.62@tcp (9)
    o2ib5 1 down 192.168.0.57@tcp (10)
    o2ib5 1 down 192.168.0.56@tcp (10)
    tcp8 1 down 192.168.0.71@tcp (10)
    tcp8 1 down 192.168.0.72@tcp (10)
    o2ib6 1 down 192.168.0.66@tcp (11)
    o2ib6 1 down 192.168.0.67@tcp (11)
    tcp9 1 down 192.168.0.82@tcp (11)
    tcp9 1 down 192.168.0.81@tcp (11)
    o2ib7 1 down 192.168.0.76@tcp (12)
    o2ib7 1 down 192.168.0.77@tcp (12)
    tcp10 1 down 192.168.0.92@tcp (12)
    tcp10 1 down 192.168.0.91@tcp (12)
    o2ib8 1 down 192.168.0.87@tcp (13)
    o2ib8 1 down 192.168.0.86@tcp (13)
Comment by James Nunez (Inactive) [ 08/Apr/13 ]

Regression Test 1: Default
https://maloo.whamcloud.com/test_sessions/69ccd818-9fc3-11e2-9f27-52540035b04c
https://maloo.whamcloud.com/test_sessions/909b229c-9fc3-11e2-86dc-52540035b04c

Regression Test 2: rnet_htable_size=1070
https://maloo.whamcloud.com/test_sessions/c5912834-9fc3-11e2-86dc-52540035b04c

Comment by Jodi Levi (Inactive) [ 19/Apr/13 ]

Landed in 2.4

Generated at Sat Feb 10 01:25:26 UTC 2024 using Jira 9.4.14#940014-sha1:734e6822bbf0d45eff9af51f82432957f73aa32c.