From owner-svn-src-head@FreeBSD.ORG Thu May 21 17:56:00 2009 Return-Path: Delivered-To: svn-src-head@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id D89711065670; Thu, 21 May 2009 17:56:00 +0000 (UTC) (envelope-from des@FreeBSD.org) Received: from svn.freebsd.org (svn.freebsd.org [IPv6:2001:4f8:fff6::2c]) by mx1.freebsd.org (Postfix) with ESMTP id C76D38FC15; Thu, 21 May 2009 17:56:00 +0000 (UTC) (envelope-from des@FreeBSD.org) Received: from svn.freebsd.org (localhost [127.0.0.1]) by svn.freebsd.org (8.14.3/8.14.3) with ESMTP id n4LHu05T074529; Thu, 21 May 2009 17:56:00 GMT (envelope-from des@svn.freebsd.org) Received: (from des@localhost) by svn.freebsd.org (8.14.3/8.14.3/Submit) id n4LHu0Gf074528; Thu, 21 May 2009 17:56:00 GMT (envelope-from des@svn.freebsd.org) Message-Id: <200905211756.n4LHu0Gf074528@svn.freebsd.org> From: Dag-Erling Smorgrav Date: Thu, 21 May 2009 17:56:00 +0000 (UTC) To: src-committers@freebsd.org, svn-src-all@freebsd.org, svn-src-head@freebsd.org X-SVN-Group: head MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Cc: Subject: svn commit: r192561 - head/contrib/groff/font/devutf8 X-BeenThere: svn-src-head@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SVN commit messages for the src tree for head/-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 21 May 2009 17:56:01 -0000 Author: des Date: Thu May 21 17:56:00 2009 New Revision: 192561 URL: http://svn.freebsd.org/changeset/base/192561 Log: When man pages are formatted in UTF-8, .Fl is encoded as U+2212 "MINUS SIGN" instead of U+002D "HYPHEN-MINUS". This is unfortunate for two reasons: 1) this is not the character which is actually used on the command line, and 2) it makes it impossible to search a man page for a specific command-line option. This patch fixes this, but there are other unresolved issues, such as confusion between -, \- and hy: while the latter is always (and only) used for hyphenation, both - and \- are used for negation and subtraction, and \- is used for command-line options and sometimes also for parenthesis. IMHO, the correct Unicode characters are: - hyphenation: either U+2010 or U+00AD, most likely the former (the latter is the so-called soft hyphen, used to indicate a point at which a text processor is allowed to hyphenate a word) - negation and subtraction: U+2212 - parenthesis: in English, U+2214, with spaces suppressed before and after; in some others (such as Norwegian), U+2213 with spaces retained. - command-line options: U+002D, because that is what is actually used on the command line. However, fixing this would require extensive modifications to (at least) the doc and man macro sets... MFC after: 1 week Modified: head/contrib/groff/font/devutf8/R.proto Modified: head/contrib/groff/font/devutf8/R.proto ============================================================================== --- head/contrib/groff/font/devutf8/R.proto Thu May 21 17:55:47 2009 (r192560) +++ head/contrib/groff/font/devutf8/R.proto Thu May 21 17:56:00 2009 (r192561) @@ -726,7 +726,7 @@ st 24 0 0x220B product 24 0 0x220F coproduct 24 0 0x2210 sum 24 0 0x2211 -\- 24 0 0x2212 +\- 24 0 0x002D mi " -+ 24 0 0x2213 ** 24 0 0x2217