Re: check_typos.pl script

To: debian-i18n@lists.debian.org
Subject: Re: check_typos.pl script
From: Helmut Wollmersdorfer <helmut.wollmersdorfer@gmx.at>
Date: Sun, 09 Oct 2005 23:37:28 +0200
Message-id: <[🔎] dic2ip$6ht$1@sea.gmane.org>
In-reply-to: <[🔎] 20051008114759.GP17406@pluto>
References: <42D7AE5B000A5B1E@mail-7.mail.tiscali.sys> <200509242315.09106.aragorn@tiscali.nl> <20050924224034.GA8428@pluto> <200509250141.49377.aragorn@tiscali.nl> <20050925101519.GA31312@pluto> <20050925105124.GA27920@djedefre.onera> <20050925125705.GA32389@pluto> <pan.2005.09.25.16.19.09.31840@tiscali.it> <20050925181559.GA2736@pluto> <[🔎] di73l1$4sg$1@sea.gmane.org> <[🔎] 20051008114759.GP17406@pluto>

Jens Seidel wrote:

On Sat, Oct 08, 2005 at 02:25:05AM +0200, Helmut Wollmersdorfer wrote:

I know, but since I had already many wrong possitives by a distance of 1

I never tried larger distances.


I use distance=1 mostly.

Long words may indeed contain multiple
typos so it's maybe a good idea to use a maximal distance of
length(word)/10. Especially German and a few other languages would profit.

But the false positives will be very high. E.g. with distance=2 each2-letter word matches against _all_ other 2-letter words.

I don't know these algorithms,


Why invent the wheel?
If I want to solve a problem, then my first step is asking google or CPAN.

But I will definitively
test it (the code looks much cleaner, I'm a C/C++/Fortran77 coder not a
perl hacker :-))

I am coming from (Mainframe)Assembler and Prolog. Now Perl is myfavorite since two years. IMHO it's worth to put your nose deeper into Perl.

Where is it available?

On my workstation, my laptop, USB-stick - unfortunately in differentversions, ugly condition - I will send it per mail.

You refer to d-i, right? There are also many other English documents
which are not of a very high quality -:).

I know. But low quality docs only need only a spellchecker forsufficient 'bug hunting'. On high quality docs I watched myself loosingconcentration and being demotivated after reading some hours withoutfinding an error.

Also not every document use
'file system' instead of 'file-system' or 'filesystem' resp.


Just an example.

Helmut Wollmersdorfer

Reply to:

Follow-Ups:
- Re: check_typos.pl script
  - From: Eddy Petrişor <eddy.petrisor@gmail.com>

References:
- Re: check_typos.pl script
  - From: Helmut Wollmersdorfer <helmut.wollmersdorfer@gmx.at>
- Re: check_typos.pl script
  - From: Jens Seidel <jensseidel@users.sf.net>

Prev by Date: Asking for commit access in Alioth
Next by Date: Re: [D-I] [RANT] Some translations seem abandoned
Previous by thread: Re: check_typos.pl script
Next by thread: Re: check_typos.pl script
Index(es):
- Date
- Thread