the 'original' string function?
Okay, this is gonna be a challenge to most guys who have been
processing text for a long time.
So, I would like a command, function or script, 'original',
that takes a string STR and a text file TXT and outputs
a score, from 0 to 100, how _original_ STR is, compared to
what is already in TXT.
So if I do
$ original "This isn't just another party" comments.txt
this will score 0 if that exact phrase to the letter already
exists in comments.txt.
But it will score 100 if not a single of those words exists in
the file! Because that would be 100% original.
Those endpoints are easy. But how to make it score - say - 62%
if some of the words are present, mostly spelled like that and
combined in ways that are not completely different?
Note: The above examples are examples, other definitions of
originality are okay. That is not the important part now - but
can be as interesting a part, later.
I'm not sure what function can solve this but I'm sure it will
be very original!
--
underground experts united
https://dataswamp.org/~incal
Reply to: