Skip site navigation (1)Skip section navigation (2)
Date:      Fri, 30 Apr 2004 09:50:16 -0400
From:      Brian Fundakowski Feldman <green@freebsd.org>
To:        Bruce Evans <bde@zeta.org.au>
Cc:        Kris Kennaway <kris@obsecurity.org>
Subject:   Re: code cleanup 
Message-ID:  <200404301350.i3UDoGnX076521@green.homeunix.org>
In-Reply-To: Message from Bruce Evans <bde@zeta.org.au>  <20040430231553.X15963@gamplex.bde.org> 

next in thread | previous in thread | raw e-mail | index | archive | help
Bruce Evans <bde@zeta.org.au> wrote:
> On Thu, 29 Apr 2004, Kris Kennaway wrote:
> 
> > On Thu, Apr 29, 2004 at 04:56:02PM -0400, John Baldwin wrote:
> > > On Thursday 29 April 2004 02:55 pm, Brian Fundakowski Feldman wrote:
> > > > For what it's worth, I don't think it is good to hide things as much as
> > > > FOREACH_PROC_IN_SYSTEM() -- this specific instance -- does, but grep is not
> > > > a good tool for a tree as large as FreeBSD's.  Try using cscope instead.
> > >
> > > I've used glimpse in the past but it is buggy.  Actually, grep -r on ssc/sys
> > > doesn't take that long, esp. if you do it multiple times as most of the tree
> > > is still in cache for subsequent grep's (at least on my laptop).  I also tend
> > > to have lots (around 7 or so) trees that have work going on in them at any
> > > one time.
> >
> > The problem with grep -r in src/sys is that it chokes on the symlinks
> > created by module builds and pollutes the output with hundreds of
> > lines of errors unless you remember to first remove the module build
> > files.
> 
> Use find(1) to not follow symlinks.  E.g.:
> 
> %%%
> Script started on Fri Apr 30 23:15:17 2004
> ttyp0:bde@besplex:/tmp> cd /sys
> ttyp0:bde@besplex:/sys> time find . -type f | time xargs grep fooo
>         0.42 real         0.01 user         0.06 sys
>         0.49 real         0.11 user         0.29 sys
> ttyp0:bde@besplex:/sys> exit
> 
> Script done on Fri Apr 30 23:15:33 2004
> %%%
> 
> This was fast because /sys was already in the disk cache.  It would have
> taken 15 seconds with a cold cache.

Yeah, but this fast is with nothing in the cache ;-)
{"/usr/src/sys"}$ time csgrep fooo
    0.15s real     0.00s user     0.03s system
It's hard to compare because you have to actually manually filter out those 
falses where "fooo" is not actually a C identifier.

> I also have:
> 
> lrwxrwxrwx  1 bde  wheel  28 Mar  6 06:57 /sys/i386/compile@ -> /usr/obj/usr/src/sys/compile
> 
> (see Makefile.i386 rev.1.28)
> 
> so I don't have any object files under /sys to slow down the search,
> except grep -r would follow this symlink too.
> 
> Perhaps it is a bug for grep -r to follow symlinks by default, especially
> since there is no way to change the default and whether symlinks are
> followed is not mentioned in the man page.  diff -r has the same problem.

I think it is very much a bug for grep to print out that it detected and 
stopped a directory loop -- that is very much the expected good-program 
behavior, is it not?  So much software gets this wrong.  Following directory 
symlinks isn't a terrible default behavior if it can detect loops, but 
having no control whatsoever like we do permanently is pretty gross.

Thankfully, cvs(1) is a repository format which could never support 
directory/symlink loops, so cvs diff -r at least doesn't get it wrong
(by luck).

-- 
Brian Fundakowski Feldman                           \'[ FreeBSD ]''''''''''\
  <> green@FreeBSD.org                               \  The Power to Serve! \
 Opinions expressed are my own.                       \,,,,,,,,,,,,,,,,,,,,,,\




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200404301350.i3UDoGnX076521>