From owner-freebsd-questions@freebsd.org Wed Nov 8 17:48:05 2017 Return-Path: Delivered-To: freebsd-questions@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 876B2E58C6B for ; Wed, 8 Nov 2017 17:48:05 +0000 (UTC) (envelope-from byrnejb@harte-lyne.ca) Received: from inet08.hamilton.harte-lyne.ca (inet08.hamilton.harte-lyne.ca [216.185.71.28]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "inet08.hamilton.harte-lyne.ca", Issuer "CA_HLL_ISSUER_2016" (not verified)) by mx1.freebsd.org (Postfix) with ESMTPS id 571A52D2F for ; Wed, 8 Nov 2017 17:48:04 +0000 (UTC) (envelope-from byrnejb@harte-lyne.ca) Received: from localhost (localhost [127.0.0.1]) by inet08.hamilton.harte-lyne.ca (Postfix) with ESMTP id B240C622E1 for ; Wed, 8 Nov 2017 12:47:56 -0500 (EST) X-Virus-Scanned: amavisd-new at harte-lyne.ca Received: from inet08.hamilton.harte-lyne.ca ([127.0.0.1]) by localhost (inet08.hamilton.harte-lyne.ca [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id QClp9Zo2fn2u for ; Wed, 8 Nov 2017 12:47:55 -0500 (EST) Received: from webmail.harte-lyne.ca (inet04.hamilton.harte-lyne.ca [216.185.71.24]) (using TLSv1 with cipher ECDHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by inet08.hamilton.harte-lyne.ca (Postfix) with ESMTPSA id CE0D260A67 for ; Wed, 8 Nov 2017 12:47:54 -0500 (EST) Received: from 216.185.71.44 (SquirrelMail authenticated user byrnejb_hll) by webmail.harte-lyne.ca with HTTP; Wed, 8 Nov 2017 12:47:54 -0500 Message-ID: <41c47638eec0e1a562f4446c7fe5a2df.squirrel@webmail.harte-lyne.ca> Date: Wed, 8 Nov 2017 12:47:54 -0500 Subject: Regex character and collation calss documentation From: "James B. Byrne" To: freebsd-questions@freebsd.org Reply-To: byrnejb@harte-lyne.ca User-Agent: SquirrelMail/1.4.22-5.el6 MIME-Version: 1.0 Content-Type: text/plain;charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Priority: 3 (Normal) Importance: Normal X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 08 Nov 2017 17:48:05 -0000 I have been perusing the available documentation respecting regex on FreeBSD and cannot find a reference to [.NUL.]. Everything that I have found points to ctype.h. The only class names I can find therein are: int isalnum(int); [:alnum:] int isalpha(int); [:alpha:] int iscntrl(int); [:cntrl:] int isdigit(int); [:digit:] int isgraph(int); [:graph:] int islower(int); [:lower:] int isprint(int); [:print:] int ispunct(int); [:punct:] int isspace(int); [:space:] int isupper(int); [:upper:] int isxdigit(int); [:xdigit:] >From reading the reference at https://docs.freebsd.org/info/regex/regex.pdf and comparing it to the uncommented lines in ctype.h on my FreeBSD-11.1 desktop host one could reasonably deduce that the following should be available on FreeBSD in addition to the above: int isascii(int); [:ascii:] int isblank(int); [:blank:] int ishexnumber(int); [:hexnumber:] int isideogram(int); [:ideogram:] int isnumber(int); [:number:] int isphonogram(int); [:phonogram:] int isrune(int); [:rune:] int isspecial(int); [:special:] But of these only [[:blank:]] is recognized by grep; whatever else might employ the rest. [[:ascii:]] grep: Invalid character class name [[:hexnumber:]] grep: Invalid character class name [[:ideogram:]] grep: Invalid character class name [[:number:]] grep: Invalid character class name [[:phonogram:]] grep: Invalid character class name [[:rune:]] grep: Invalid character class name [[:special:]] grep: Invalid character class name However I see no reference to [.NUL.] anywhere. The sed man page has no reference to nul or NUL at all and tr only has this to say: The tr utility has historically not permitted the manipulation of NUL bytes in its input and, additionally, stripped NUL's from its input stream. This implementation has removed this behavior as a bug. Is there a master list of character/collation classes for FreeBSD regex? I have read the man pages for grep and re_format. In no case is the character or collation class NUL mentioned. Where is the usage of [.NUL.] documented? -- *** e-Mail is NOT a SECURE channel *** Do NOT transmit sensitive data via e-Mail Do NOT open attachments nor follow links sent by e-Mail James B. Byrne mailto:ByrneJB@Harte-Lyne.ca Harte & Lyne Limited http://www.harte-lyne.ca 9 Brockley Drive vox: +1 905 561 1241 Hamilton, Ontario fax: +1 905 561 0757 Canada L8E 3C3