emacs and ASCII file to ISO-8859-* to UTF-8

To: debian-user@lists.debian.org
Subject: emacs and ASCII file to ISO-8859-* to UTF-8
From: hendrik@topoi.pooq.com
Date: Tue, 14 Nov 2006 09:45:53 -0500
Message-id: <[🔎] 20061114144553.GA24413@topoi.pooq.com>
In-reply-to: <[🔎] 20061114083125.GC538@fantomas.sk>
References: <[🔎] 457d677b0611120552i6cdaaaf8p9485cdee45aa29f1@mail.gmail.com> <[🔎] 20061113090644.GC18065@fantomas.sk> <[🔎] 20061113141435.GC19996@topoi.pooq.com> <[🔎] 20061114083125.GC538@fantomas.sk>

On Tue, Nov 14, 2006 at 09:31:25AM +0100, Matus UHLAR - fantomas wrote:
> > > On 12.11.06 14:52, Andrea Ganduglia wrote:
> > > > Hi. I have a lots ascii file with ecoding iso-8859-* and I must
> > > > convert those in UTF-8. How?
> 
> > On Mon, Nov 13, 2006 at 10:06:44AM +0100, Matus UHLAR - fantomas wrote:
> > > iconv -f <src-encoding> -t <dst-encoding> <inputfile> > outputfile.
> > > 
> > > There is also 'recode' package, however I found it a bit redundant, since
> > > iconv (part of libc6) has this functionality
> 
> On 13.11.06 09:14, hendrik@topoi.pooq.com wrote:
> > And after you've has converted such a file, how can you tell emacs that 
> > it is supposed to recognise the new encoding?
> 
> pardon?

This is an emacs-specific add-on question.  If it has seen a file in one 
encoding system, and I run a program to change it to another (in my 
case, getting my accented letters converted from the old 8-bit encoding 
into UTF-8) emacs insists on continuing to read it as if it were in the 
old encoding, so my accented characters, which have been expanded into 
two bytes each, show up in the editor as two gibberish characters each.
It seems that emacs keeps a database somewhere of file names and 
encodings.  In theory that would be useful, I guess, because there isn't 
another mechanism in the filesysten to mark files with their encodings, 
but if such a convention isn't a system-wide convention, tools don't 
know about it and it doesn't work.   I'm tryin to run a clean UTF-8 
system, and I want my non-UTF-8 abberations to be converted and treated 
as UTF-8 henceforth, instead of converting them and having them treated 
as non-UTF-8.

-- hendrik

Reply to:

Follow-Ups:
- Re: emacs and ASCII file to ISO-8859-* to UTF-8
  - From: Jhair Tocancipa Triana <jhair.tocancipa@gmail.com>

References:
- ASCII file to ISO-8859-* to UTF-8
  - From: "Andrea Ganduglia" <nonews.org@gmail.com>
- Re: ASCII file to ISO-8859-* to UTF-8
  - From: Matus UHLAR - fantomas <uhlar@fantomas.sk>
- Re: ASCII file to ISO-8859-* to UTF-8
  - From: hendrik@topoi.pooq.com
- Re: ASCII file to ISO-8859-* to UTF-8
  - From: Matus UHLAR - fantomas <uhlar@fantomas.sk>

Prev by Date: Re: [debian-users] Re: Package installation history.
Next by Date: Re: cron-apt with no mta
Previous by thread: Re: ASCII file to ISO-8859-* to UTF-8
Next by thread: Re: emacs and ASCII file to ISO-8859-* to UTF-8
Index(es):
- Date
- Thread