[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#636288: ITP: libstring-tokenizer-perl -- Simple string tokenizer



Package: wnpp
Severity: wishlist
Owner: Ben Webb <bjwebb67@googlemail.com>


* Package name    : libstring-tokenizer-perl
  Version         : 0.05
  Upstream Author : Stevan Little, <stevan@iinteractive.com>
* URL             : http://search.cpan.org/dist/String-Tokenizer/
* License         : Artistic or GPL-1+
  Programming Lang: Perl
  Description     : Simple string tokenizer

A simple string tokenizer which takes a string and splits it on whitespace.
It also optionally takes a string of characters to use as delimiters, and
returns them with the token set as well. This allows for splitting the string
in many different ways.

This is a very basic tokenizer, so more complex needs should be either
addressed with a custom written tokenizer or post-processing of the output
generated by this module. Basically, this will not fill everyones needs, but
it spans a gap between simple split / /, $string and the other options that
involve much larger and complex modules.

Also note that this is not a lexical analyser. Many people confuse
tokenization with lexical analysis. A tokenizer mearly splits its input into
specific chunks, a lexical analyzer classifies those chunks. Sometimes these
two steps are combined, but not here.



Reply to: