[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: New charsets (7) manpage



On Sun, 5 Aug 2001, Karl Eichwalder wrote:
> > +8859-15 (Latin-9)
> > +This adds the Euro sign and French and Finnish letters that were missing in
> >  Latin-1.
>
> If you're telling the user what was added it's worth noting ½, ¼, and ¾
> have had to go away (not a big loss).

If you are going to start describing detailed differences between the ISO
8859 parts, then please use uniset to generate the differences completely:

$ uniset + 8859-1.TXT - 8859-15.TXT table
00A4 # CURRENCY SIGN
00A6 # BROKEN BAR
00A8 # DIAERESIS
00B4 # ACUTE ACCENT
00B8 # CEDILLA
00BC # VULGAR FRACTION ONE QUARTER
00BD # VULGAR FRACTION ONE HALF
00BE # VULGAR FRACTION THREE QUARTERS
$ uniset + 8859-15.TXT - 8859-1.TXT table
0152 # LATIN CAPITAL LIGATURE OE
0153 # LATIN SMALL LIGATURE OE
0160 # LATIN CAPITAL LETTER S WITH CARON
0161 # LATIN SMALL LETTER S WITH CARON
0178 # LATIN CAPITAL LETTER Y WITH DIAERESIS
017D # LATIN CAPITAL LETTER Z WITH CARON
017E # LATIN SMALL LETTER Z WITH CARON
20AC # EURO SIGN

Uniset is simple yet remarkably powerful Perl tool to do set operations on
Unicode subsets, available on

  http://www.cl.cam.ac.uk/~mgk25/download/uniset.tar.gz

Just updated to handle non-BMP characters smoothly as well.

Markus

-- 
Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>



Reply to: