Re: [gopher] CAPS capability: ServerDefaultCharset

To: gopher-project@lists.alioth.debian.org
Subject: Re: [gopher] CAPS capability: ServerDefaultCharset
From: Mateusz Viste <mateusz@viste.fr>
Date: Sat, 03 Jan 2015 18:23:35 +0100
Message-id: <54A82597.3030306@viste.fr>
Reply-to: Gopher Project Discussion <gopher-project@lists.alioth.debian.org>
In-reply-to: <20150103171829.GM2881@evenstar.elisa-laajakaista.fi>
References: <54A7C6BE.5010601@viste.fr> <CALGqR9JKk8LVAji+xz1Lrmi=tsjyVyeYxZ1DjefeEQqFdj5G1g@mail.gmail.com> <54A7CEC6.2000201@viste.fr> <CAE136rpyiRdm9WJArmAJhmGWORSgTS5NEa=WMaJ+Mi+9RkFmWg@mail.gmail.com> <CALGqR9LhbiaFbtxrUijJ56danveB-=mEdf8fUV2UkE+auDuqjQ@mail.gmail.com> <54A7D4EB.4070509@viste.fr> <7962DBF5-7BDC-425E-B0D2-58DDF93D933E@holviala.com> <A8291B7B-0831-4DFF-B9ED-9391740C1270@holviala.com> <20150103154953.GE2881@evenstar.elisa-laajakaista.fi> <98CCD9EF-8A29-497A-B142-81854A4DE8F4@holviala.com> <20150103171829.GM2881@evenstar.elisa-laajakaista.fi>

Hello,

For those interested in Unicode and UTF-8 handling: not long ago I wrote a converter that decodes UTF-8 content and encodes it into several 8-bit codepages (it can also work the other way). The source code is pretty readable, and it's available here:


http://sourceforge.net/p/utf8tocp/code/HEAD/tree/utf8tocp.c

It comes with several lookup tables already. I developed it primarily for the FreeDOS localization project.
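
As a rough illustration of the lookup-table idea (this is not the utf8tocp code, just a minimal Python sketch), the conversion can be pictured as a table from Unicode code points to byte values of one 8-bit codepage. The function names build_table, utf8_to_codepage and codepage_to_utf8 are made up for this example, and the "?" fallback for unmappable characters is an assumption, not necessarily what utf8tocp does. The table here is derived from Python's built-in cp437 codec instead of a hand-written table.

```python
def build_table(codec):
    """Build a {code point: byte value} lookup table for an 8-bit codepage.

    Hypothetical helper: the table is derived from a Python codec here,
    whereas a standalone tool would ship its own tables.
    """
    table = {}
    for byte in range(256):
        try:
            ch = bytes([byte]).decode(codec)
        except UnicodeDecodeError:
            continue  # this byte is unassigned in the codepage
        table.setdefault(ord(ch), byte)
    return table


def utf8_to_codepage(data, table, fallback=ord("?")):
    """Decode UTF-8 bytes and re-encode them through the lookup table."""
    text = data.decode("utf-8")
    # Characters missing from the table become the fallback byte (assumption).
    return bytes(table.get(ord(ch), fallback) for ch in text)


def codepage_to_utf8(data, table):
    """The other direction: 8-bit codepage bytes back to UTF-8."""
    reverse = {byte: cp for cp, byte in table.items()}
    # Bytes with no mapping become U+FFFD, the Unicode replacement character.
    return "".join(chr(reverse.get(b, 0xFFFD)) for b in data).encode("utf-8")


cp437 = build_table("cp437")
print(utf8_to_codepage("café".encode("utf-8"), cp437))
```

The same table serves both directions, which is why one set of lookup tables per codepage is enough.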


The utf8tocp project's main page is this:

http://sourceforge.net/projects/utf8tocp/

Mateusz




On 01/03/2015 06:18 PM, Nuno Silva wrote:

On 2015-01-03 17:55, Kim Holviala wrote:

On 03 Jan 2015, at 17:49, Nuno Silva <nunojsilva@ist.utl.pt> wrote:

You mean Gophernicus can even handle both ISO-8859-1 and UTF-8 if
they're mixed inside the *same* document? That's neat! (And it also
degrades in a nice way!)


Yep, it works even if they are used within a single line of text. I first tried to use GNU iconv(), but that function was just incredibly stupid, so I wrote my own. While writing it I realized I can just autodetect all input on a char-by-char basis, skip most of the “official” conversion tables and just focus on US-ASCII/Latin-1/first plane of UTF-8. My strniconv() is purely an 80/20 implementation, and that’s good enough for me.


Out of curiosity, have you made a standalone (iconv-like) tool using the
code you wrote? Even if it is just 80/20, that is something I could use
in some situations.
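
For readers following the thread, the char-by-char autodetection Kim describes can be sketched roughly as follows. This is not the Gophernicus strniconv() code, only a minimal Python illustration under stated assumptions: the name decode_mixed is hypothetical, "first plane" is read here as UTF-8 sequences of at most three bytes, and any high byte that does not start a valid UTF-8 sequence is taken to be Latin-1.

```python
def decode_mixed(data):
    """Decode bytes that may mix US-ASCII, Latin-1 and UTF-8, byte by byte."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if b < 0x80:
            out.append(chr(b))  # plain US-ASCII
            i += 1
            continue
        # How many continuation bytes would a UTF-8 sequence need here?
        if 0xC2 <= b <= 0xDF:
            need = 1
        elif 0xE0 <= b <= 0xEF:
            need = 2
        else:
            need = 0  # not a lead byte handled by this sketch
        tail = data[i + 1:i + 1 + need]
        if need and len(tail) == need and all(0x80 <= t <= 0xBF for t in tail):
            try:
                out.append(data[i:i + 1 + need].decode("utf-8"))
                i += 1 + need
                continue
            except UnicodeDecodeError:
                pass  # overlong form or surrogate: fall through to Latin-1
        out.append(chr(b))  # Latin-1: the byte value is the code point
        i += 1
    return "".join(out)


# Latin-1 "é" (0xE9) and UTF-8 "ï" (0xC3 0xAF) within the same line.
print(decode_mixed(b"caf\xe9 na\xc3\xafve"))
```

The heuristic is ambiguous by nature: Latin-1 text that happens to contain a byte pair such as 0xC3 0xA9 will be read as UTF-8, which is the kind of trade-off an 80/20 approach accepts.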


_______________________________________________
Gopher-Project mailing list
Gopher-Project@lists.alioth.debian.org
http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/gopher-project
