Date: Wed, 19 Dec 2012 18:02:12 -0500 From: Karim Fodil-Lemelin <fodillemlinkarim@gmail.com> To: Vijay Singh <vijju.singh@gmail.com> Cc: freebsd-net@freebsd.org, Navdeep Parhar <nparhar@gmail.com> Subject: Re: use of V_tcbinfo lock for TCP syncache Message-ID: <50D24774.80800@gmail.com> In-Reply-To: <CALCNsJS8j0wvv4dc1Q6rLckmr0RE%2BgW0%2BmUDdmSvyfjXzKugsQ@mail.gmail.com> References: <CALCNsJRFyQ9ZmfJ3quX2-cUuFHjs2rXw63Tq5ZH-uP1%2BoQmjLw@mail.gmail.com> <50D218BA.7080301@FreeBSD.org> <50D21C3B.8020803@gmail.com> <CALCNsJS8j0wvv4dc1Q6rLckmr0RE%2BgW0%2BmUDdmSvyfjXzKugsQ@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
On 19/12/2012 4:01 PM, Vijay Singh wrote: >>> Holding the pcbinfo lock prevents races between syncache_socket() and >>> accept(). See rwatson's comment just above tcp_usr_accept. I noted >>> this too (the comment above tod->tod_offload_socket() in tcp_syncache.c) >>> back when I updated the TOE code in the kernel. >> er, I think I told you why tcp_usr_accept holds the pcbinfo lock, which >> wasn't your original question... :-) > But it helped. > > So I am thinking about trying a change where syncache_socket() would > call soalloc() first, get a socket, setup the inp, and then do a > (modified) sonewconn to place the socket in the listener's queue. > Robert's comment indicated that this would be a better way to > eliminate the race since we wouldn't need the pcblock when we make the > sonewconn call. > > Sure but syncache_expand() is entered with the tcbinfo already write locked which also protects the unlocking of the listening connection and the locking of the newly created socket. Around this part: /* * Socket is created in state SYN_RECEIVED. * Unlock the listen socket, lock the newly * created socket and update the tp variable. */ INP_WUNLOCK(inp); /* listen socket */ inp = sotoinpcb(so); INP_WLOCK(inp); /* new connection */ tp = intotcpcb(inp); Without the tcbinfo lock the new socket could be closed (getting a reset) which would put it in INP_TIMEWAIT or INP_DROPPED _after_ the check is made in tcp_usr_accept since there is a period of time where tcbinfo is not locked and the new socket inp is not locked either. I could be wrong but it seems that without the tcbinfo lock a lot could happen between the unlocking of the listen socket and the locking of the new one. Karim.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?50D24774.80800>