Date: Sat, 18 Feb 2006 15:50:04 -0500 From: Gerard Seibert <gerard@seibercom.net> To: freebsd-questions@freebsd.org Subject: Re: Removing BOM from UTF-8 Message-ID: <200602181550.13252.gerard@seibercom.net> In-Reply-To: <1140294574.8388.8.camel@localhost> References: <20060218111849.15E6.GERARD@seibercom.net> <200602181435.07066.gerard@seibercom.net> <1140294574.8388.8.camel@localhost>
next in thread | previous in thread | raw e-mail | index | archive | help
--nextPart7635913.jlODiEy3dC Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline enjamin A'Lee wrote: > From: > "Benjamin A'Lee" <lists@benalee.co.uk> > To: > freebsd-questions@freebsd.org > Date: > Today 03:29:34 pm > =C2=A0 > > On Sat, 2006-02-18 at 14:34 -0500, Gerard Seibert wrote: > > Maybe I am doing something wrong, but it does not appear to be > > working correctly. I named the file nobom.sh and put it in the same > > directory as the files I want to convert. I also set the program > > permission to 0755. > > > > typing the p[program name does nothing; I have to precede it with > > 'perl'. Even then, it does not appear to work correctly. In the > > following example, the file is parsed, but not converted. > > Sorry; try changing the first line to #!/usr/local/bin/perl > > > perl nobom.sh testfile > > > > Am I doing something incorrectly here? > > Try: > > cat testfile | nobom.sh > > Though the way you describe appears to work here: > > $ cat bom-testfile | hd > 00000000 =C2=A0ef bb bf 23 20 42 4f 4d =C2=A020 74 65 73 74 20 66 69 =C2= =A0|...# BOM > test fi| 00000010 =C2=A06c 65 0a =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2= =A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 =C2=A0 = =C2=A0 =C2=A0 =C2=A0 =C2=A0 > =C2=A0|le.| 00000013 > $ bomkill.pl bom-testfile | hd > 00000000 =C2=A023 20 42 4f 4d 20 74 65 =C2=A073 74 20 66 69 6c 65 0a =C2= =A0|# BOM > test file.| 00000010 > > > =C2=A0 =C2=A0 Ben Something appears to be wrong here. First, the file will not run unless=20 I precede it with 'perl'. I have another perl script in the same=20 directory that runs just fine without any special prefixes. Also, the=20 script does not seem to remove the BOM entity. This is the script as I have it entered: #!/usr/local/bin/perl use warnings; use diagnostics -verbose; @file=3D<>; $file[0] =3D~ s/^\xEF\xBB\xBF//; print(@file); I have the file permissions set to 0755. Is there anything else that=20 could be causing this to fail? This is the first line of the file I am attempting to fix (well one of=20 them). =C3=AF=C2=BB=C2=BFSubject: That is what appears when I use pico to view the file. =2D-=20 Gerard Seibert gerard@seibercom.net PGP: http://www.seibercom.net/sig/gerard.asc --nextPart7635913.jlODiEy3dC Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.2 (FreeBSD) iD8DBQBD94iFchM2dIO+3uMRAhyaAKCQbknYMo5eckbi4U/8kvTS/dAg1QCfe96i tLot4w2vyF53LChQhJVg/Yw= =Wz3m -----END PGP SIGNATURE----- --nextPart7635913.jlODiEy3dC--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?200602181550.13252.gerard>