Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 23 Jun 2009 09:42:44 -0400
From:      Andrew Gallatin <gallatin@cs.duke.edu>
To:        Andre Oppermann <andre@FreeBSD.org>
Cc:        svn-src-head@FreeBSD.org, svn-src-all@FreeBSD.org, src-committers@FreeBSD.org
Subject:   Re: svn commit: r194672 - in head/sys: kern netinet sys
Message-ID:  <4A40DBD4.3070904@cs.duke.edu>
In-Reply-To: <200906222308.n5MN856I055711@svn.freebsd.org>
References:  <200906222308.n5MN856I055711@svn.freebsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help
Andre Oppermann wrote:

 >   Add soreceive_stream(), an optimized version of soreceive() for
 >   stream (TCP) sockets.
<....>
 >
 >   Testers, especially with 10GigE gear, are welcome.

Awesome!  On my very weak, ancient consumer grade athlon64 test
machine (AMD Athlon(tm) 64 X2 Dual Core Processor 3800+ (2050.16-MHz
K8-class CPU)) using mxge and LRO, I see a roughly 700Mb/s increase in
bandwidth from 7.7Gb/s to 8.4Gb/s.  For what its worth, this finally
gives FreeBSD performance parity with Linux on this hardware for
10GbE single-stream receive.

TCP SENDFILE TEST from 0.0.0.0 (0.0.0.0) port 0 AF_INET to venice-my 
(192.168.1.15) port 0 AF_INET
Recv   Send    Send                          Utilization       Service 
Demand
Socket Socket  Message  Elapsed              Send     Recv     Send    Recv
Size   Size    Size     Time     Throughput  local    remote   local 
remote
bytes  bytes   bytes    secs.    10^6bits/s  % S      % C      us/KB   us/KB

before:
  65536  65536  65536    60.01      7709.14   13.30    79.60    0.283 
1.692
after:
  65536  65536  65536    60.01      8403.86   14.66    81.63    0.286 
1.592


This is consistent across runs.  Lockstat output for 10 seconds in the
middle of a run is very interesting and shows a huge reduction in
lock contention.

Before:

Adaptive mutex spin: 369333 events in 10.017 seconds (36869 events/sec)

Count indv cuml rcnt     nsec Lock                   Caller 

-------------------------------------------------------------------------------
303685  82%  82% 0.00     1080 0xffffff000f2f98d0     recvit+0x21 

63847  17% 100% 0.00       25 0xffffff000f2f98d0     ip_input+0xad 

  1788   0% 100% 0.00      172 0xffffff0001c57c08 
intr_event_execute_handlers+0x100
     8   0% 100% 0.00      389 vm_page_queue_mtx      trap+0x4ce 

     1   0% 100% 0.00       30 0xffffff8000251598     ithread_loop+0x8e 

     1   0% 100% 0.00      720 0xffffff8000251598 
uhub_read_port_status+0x2d
     1   0% 100% 0.00     1639 0xffffff000f477190     vm_fault+0x112 

     1   0% 100% 0.00        1 0xffffff001fecce10     mxge_intr+0x425 

     1   0% 100% 0.00     1332 0xffffff0001845600 
clnt_reconnect_call+0x105
-------------------------------------------------------------------------------

Adaptive mutex block: 89 events in 10.017 seconds (9 events/sec)

Count indv cuml rcnt     nsec Lock                   Caller 

-------------------------------------------------------------------------------
    83  93%  93% 0.00    20908 0xffffff000f2f98d0     tcp_input+0xd96 

     3   3%  97% 0.00    45234 0xffffff8000259f08     fork_exit+0x118 

     3   3% 100% 0.00    44862 0xffffff8000251598     fork_exit+0x118 

-------------------------------------------------------------------------------


After:

Adaptive mutex spin: 105102 events in 10.020 seconds (10490 events/sec)

Count indv cuml rcnt     nsec Lock                   Caller 

-------------------------------------------------------------------------------
75886  72%  72% 0.00     2860 0xffffff0001fdde20     ip_input+0xad 

28418  27%  99% 0.00     1355 0xffffff0001fdde20     recvit+0x21 

   779   1% 100% 0.00      171 0xffffff0001642808 
intr_event_execute_handlers+0x100
     7   0% 100% 0.00      670 vm_page_queue_mtx      trap+0x4ce 

     5   0% 100% 0.00       46 0xffffff001fecce10     mxge_intr+0x425 

     1   0% 100% 0.00      105 vm_page_queue_mtx      trap_pfault+0x142 

     1   0% 100% 0.00      568 0xffffff8000251598     usb_process+0xd8 

     1   0% 100% 0.00      880 0xffffff8000251598     ithread_loop+0x8e 

     1   0% 100% 0.00      233 0xffffff001a224578     vm_fault+0x112 

     1   0% 100% 0.00       60 0xffffff001a1759b8     syscall+0x28f 

     1   0% 100% 0.00      809 0xffffff0001846000 
clnt_reconnect_call+0x105
     1   0% 100% 0.00     1139 0xffffff0001fdde20     kern_recvit+0x1d4 

-------------------------------------------------------------------------------

Adaptive mutex block: 88 events in 10.020 seconds (9 events/sec)

Count indv cuml rcnt     nsec Lock                   Caller 

-------------------------------------------------------------------------------
    80  91%  91% 0.00    25891 0xffffff0001fdde20     tcp_input+0xd96 

     3   3%  94% 0.00    45979 0xffffff8000259f08     fork_exit+0x118 

     3   3%  98% 0.00    45886 0xffffff8000251598     fork_exit+0x118 

     1   1%  99% 0.00    38254 0xffffff8000259f08 
intr_event_execute_handlers+0x100
     1   1% 100% 0.00    79858 0xffffff001a1760f8     kern_wait+0x7ee 

-------------------------------------------------------------------------------


Drew



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4A40DBD4.3070904>