Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 25 May 2005 14:30:55 +0400 (MSD)
From:      Dmitry Morozovsky <marck@rinet.ru>
To:        Murray Stokely <murray@freebsdmall.com>
Cc:        freebsd-doc@FreeBSD.org, doc-committers@FreeBSD.org, cvs-doc@FreeBSD.org
Subject:   Re: cvs commit: doc/ru_RU.KOI8-R/books/handbook/firewalls chapter.sgml
Message-ID:  <20050524115209.C58421@woozle.rinet.ru>
In-Reply-To: <20050523204713.E92518@woozle.rinet.ru>
References:  <200505231105.j4NB5Cs3043597@repoman.freebsd.org> <20050523123545.GC11650@freebsdmall.com> <20050523204713.E92518@woozle.rinet.ru>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, 23 May 2005, Dmitry Morozovsky wrote:

DM> MS> >   Unbreak the build second time: now with index.
DM> MS> >   
DM> MS> >   It seems <see> cannot be used inside <indexterm>,
DM> MS> >   at least collateindex.pl produces broken sgml with it.
DM> MS> 
DM> MS> This works in the English build.  It is used in the NIS/yellowpages
DM> MS> entry in the Network Servers chapter, right?  What is the broken SGML
DM> MS> that is produced in the ru_RU handbook?  Is it a KOI8-R encoding
DM> MS> issue?
DM> 
DM> Well, as I wrote in -doc@, the original 
DM> 
DM>   <indexterm>
DM>     <primary>some-russian-term</primary>
DM>     <see>russian reference</see>
DM>   </indexterm>
DM> 
DM> produces broken code in index.sgml:
DM> 
DM> <indexentry>
DM>   <primaryie>some-russian-term
DM>   </primaryie>
DM>     <seeie>russian reference</seeie>
DM> ,
DM>     <ulink url="network-bridging.html#AEN37716" role="AEN37718">Russian part title</ulink></indexentry>

Aha, it seems you're right, as changing terms to some english words leads to 
error to disappear.  The problem seems that terms can be only english letters, 
as there is at least three places with

if (($letter lt 'A') || ($letter gt 'Z')) {

The following patch to textproc/dsssl-docbook-modular should help to fix 
problem, but at least LC_COLLATE or preferrably LANG should be propagated to 
collateindex.pl appropriately.

--- bin/collateindex.pl.orig    Thu Nov  4 09:53:39 2004
+++ bin/collateindex.pl Tue May 24 12:18:39 2005
@@ -18,6 +18,7 @@
 
 =cut
 
+use locale;
 use File::Basename;
 use Getopt::Std;
 
@@ -362,9 +363,10 @@
            $letter = uc(substr($letter, 0, 1));
 
            # symbols are a special case
-           if (($letter lt 'A') || ($letter gt 'Z')) {
+           # if (($letter lt 'A') || ($letter gt 'Z')) {
+           if ($letter !~ /^[[:alpha:]]+$/) {
                if (($group eq '')
-                   || (($group ge 'A') && ($group le 'Z'))) {
+                   || ($group =~ /^[[:alpha:]]+$/)) {
                    print OUT "</indexdiv>\n" if !$first;
                    print OUT "<indexdiv><title>$symbolsname</title>\n\n";
                    $group = $letter;
@@ -647,7 +649,7 @@
        $letter = $idx->{'primary'} if !$letter;
        $letter = uc(substr($letter, 0, 1));
 
-       if (($letter lt 'A') || ($letter gt 'Z')) {
+       if ($letter !~ /^[[:alpha:]]+$/) {
            push (@sym, $idx);
        } else {
            push (@new, $idx);


Sincerely,
D.Marck                                     [DM5020, MCK-RIPE, DM3-RIPN]
------------------------------------------------------------------------
*** Dmitry Morozovsky --- D.Marck --- Wild Woozle --- marck@rinet.ru ***
------------------------------------------------------------------------



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20050524115209.C58421>