Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 28 Mar 2010 21:27:56 -1000
From:      parv@pair.com
To:        George Sanders <gosand1982@yahoo.com>
Cc:        freebsd-questions@freebsd.org
Subject:   Re: procmail regex help ... sometimes works, sometimes doesn't...
Message-ID:  <20100329072756.GA1322@holstein.holy.cow>
In-Reply-To: <471394.79697.qm@web111611.mail.gq1.yahoo.com>
References:  <471394.79697.qm@web111611.mail.gq1.yahoo.com>

next in thread | previous in thread | raw e-mail | index | archive | help
in message <471394.79697.qm@web111611.mail.gq1.yahoo.com>,
wrote George Sanders thusly...
>
> I have added a very standard, very common regex line to my
> .procmailrc to filter character sets I can't read:
>
>
> UNREADABLE='[^?"]*big5|iso-2022-jp|ISO-2022-KR|euc-kr|gb2312|ks_c_5601-1987|ks_c_5601|3Deuc-kr|koi8'
> :0:
> * ^Content-Type:.*multipart
> * B ?? $ ^Content-Type:.*^?.*charset="?($UNREADABLE)
> unreadable_messages
>
>
> I know that this works because my "unreadable_messages" mail file
> is now full of messages with headers like:
>
>
> From: =?GB2312?B?xMLTq9Or?= <uigvrutit@heki.net>
> Subject: =?GB2312?B?MjAxMMTqyMvBptfK1LS4w9bYytPKssO0?=
> To: "me" <me@me.com>
> Content-Type: text/html;
>         charset="gb2312"
>
>
> However, a lot of mail gets through to my inbox that matches:
>
>
> From: "osdeiiftnvpp@gmail.com" <xjyfgzyjm@gmail.com>
> Reply-To: "osdeiiftnvpp@gmail.com" <xjyfgzyjm@gmail.com>
> Message-ID: <533pbxxy2oc>
> To: me <me@me.com>
> Subject: Fw: \xb8\xf2\xad\xe8\xa5X\xa8\xd3\xbd\xe6~\xb1o\xb4\xa9\xa9f\xaa\xb1\xb5L\xaeM\xa4\xba\xaeg\xb2n\xa7o
> X-Mailer: inhalation
> Organization: Microsoft Outlook Express 6.00.2462.0000
> Mime-Version: 1.0
> Content-Type: multipart/alternative;
>         boundary="1-104247307-2712732737=:8213"
> Status: RO
> X-Status:
> X-Keywords:
> X-UID: 63502
>
> --1-104247307-2712732737=:8213
> Content-Type: text/plain; charset="big5"
> Content-Transfer-Encoding: quoted-printable
>
>
> However, "big5" is very clearly listed in my regex above, and as
> far as I can tell, this mail should match perfectly...
>
> I cannot see why these "big5" emails are not matching my procmail
> regex ... is it obvious to anyone ?

Is "Content-Type:" completely missing from the body of your first
example?  Do you have your examples flipped?  I would have thought
that first example would have delivered in your inbox & second one
in your unreadable_messages one.

In any case, what does the procmail log say? See also
http://www.professional.org/procmail/sandbox.html#.

Do try your luck on procmail@lists.RWTH-Aachen.de list,
http://mailman.rwth-aachen.de/mailman/listinfo/procmail.


  - parv

-- 




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20100329072756.GA1322>