Re: finding similar files
On Wed, 25 Feb 2009 18:58:48 +0000, Hendrik Boom wrote:
> There wouldn't happen to be any handy tools for searching a directory
> tree with a few hundred ASCII files and telling me which ones have
> similar content?
Yes, such tools exist. They are called AI tools, Artificial Intelligent.
Debian has many native AI packages but I doubt that you'll ever want to
touch them, because they all need extensive training.
> What I'm looking for is something that will give me a first cut on
> finding those pairs of files.
As said before, no easy solution for none-easy task, unless you name them
wisely, i.e., give similar files similar names. If so, check out
http://cpansearch.perl.org/src/SUNTONG/File-FindSimilars-2.06/README.html
It's super fast to give you the first cut.
--
Tong (remove underscore(s) to reply)
http://xpt.sourceforge.net/techdocs/
http://xpt.sourceforge.net/tools/
Reply to: