Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 30 Dec 2008 23:07:05 +0200
From:      Giorgos Keramidas <keramida@ceid.upatras.gr>
To:        Gary Kline <kline@thought.org>
Cc:        Roland Smith <rsmith@xs4all.nl>, FreeBSD Mailing List <freebsd-questions@freebsd.org>
Subject:   Re: well, blew it... sed or perl q again.
Message-ID:  <87abad4bk6.fsf@kobe.laptop>
In-Reply-To: <20081230205131.GA34211@thought.org> (Gary Kline's message of "Tue, 30 Dec 2008 12:51:31 -0800")
References:  <20081230193111.GA32641@thought.org> <20081230201623.GB65659@slackbox.xs4all.nl> <20081230205131.GA34211@thought.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, 30 Dec 2008 12:51:31 -0800, Gary Kline <kline@thought.org> wrote:
> 	All right, then is this the right syntax.  In other words, do
> 	I need the double quotes to match the "http:" string?
>
>   perl -pi.bak -e 'print unless "/m/http:/" || eof; close ARGV if eof' *

Close, but not exactly right...

You have to keep in mind that the argument to -e is a Perl expression,
i.e. something you might type as part of a script that looks like this:

    #!/usr/bin/perl

    while (<STDIN>) {
        YOUR-EXPRESSION-HERE;
        print $_;
    }

One of the ways to print only the lines that do *not* match the
"http://" pattern is:

    print unless (m/http:\/\//);

Note how the '/' characters that are part of the m/.../ expression need
extra backslashes to quote them.  You can avoid this by using another
character for the m/.../ expression delimiter, like:

    print unless (m!http://!);

But you are not still done.  The while loop above already contains a
print statement _outside_ of your expression.  So if you add this to a
perl -p -e '...' invocation you are asking Perl to run this code:

    #!/usr/bin/perl

    while (<STDIN>) {
        print unless (m!http://!);
        print $_;
    }

Each line of input will be printed _anyway_, but you will be duplicating
all the non-http lines.  Use -n instead of -p to fix that:

    perl -n -e 'print unless (m!http://!)'

A tiny detail that may be useful is that "http://" is not required to be
lowercase in URIs.  It may be worth adding the 'i' modifier after the
second '!' of the URI matching expression:

    perl -n -e 'print unless (m!http://!i)'

Once you have that sort-of-working, it may be worth investigating more
elaborate URI matching regexps, because this will match far too much
(including, for instance, all the non-URI lines of this email that
contain the regexp example itself).




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?87abad4bk6.fsf>