[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Re: Removing duplication: Word lists of common words in languages



Ben Finney writes ("Re: Removing duplication: Word lists of common words in languages"):
> Where is a good authoritative source of such words, by frequency, for
> various natural languages, suitable for inclusion in Debian as a data
> package?

I had roughly this question in 2013, and found the answer.  Here is
probably the best starting point:

http://www.chiark.greenend.org.uk/ucgi/~ijackson/git?p=evade-mail-usrlocal.git;a=blob;f=lemma.al-permission.mbox

Ian.


Reply to: