Bug#803342: Iiiittt's baaaack
Package ruby-bdb caused index++ to choke on 'README_en.html':
run-parts: executing /etc/cron.weekly/0anacron
run-parts: executing /etc/cron.weekly/dhelp
terminate called after throwing an instance of 'std::out_of_range'
what(): basic_string::erase: __pos (which is 4294967295) > this->size() (which is 0)
Dhelp::IndexerError: Broken pipe indexing /usr/share/doc/ruby-bdb/rdoc/README_en.html
, /usr/share/doc/ruby-bdb/rdoc/BDB.html
, /usr/share/doc/ruby-bdb/rdoc/Thread.html
, /usr/share/doc/ruby-bdb/rdoc/unknown.html
, /usr/share/doc/ruby-bdb/rdoc/table_of_contents.html
, /usr/share/doc/ruby-bdb/rdoc/index.html
, /usr/share/doc/ruby-bdb/rdoc/String.html
, /usr/share/doc/ruby-bdb/rdoc/RbConfig.html
, /usr/share/doc/ruby-bdb/rdoc/Object.html
, /usr/share/doc/ruby-bdb/rdoc/index.html
<* snip *>
I created another part ("14.part"), removed the README file from
it, and indexed it as follows:
cat 14.part | /usr/bin/index++ --config-file /usr/share/dhelp/config/swish++.conf --index-file ./14.index --follow-links -v4 -
BDB.html (269 words)
index.html (165 words)
Object.html (175 words)
RbConfig.html (25 words)
String.html (40 words)
table_of_contents.html (1004 words)
Thread.html (77 words)
unknown.html (27 words)
index++: ranking index...
"based" discarded (100%)
"berkeley" discarded (100%)
"classes" discarded (100%)
"darkfish" discarded (100%)
"generated" discarded (100%)
"granger" discarded (100%)
"home" discarded (100%)
"interface" discarded (100%)
"load" discarded (100%)
"methods" discarded (100%)
"michael" discarded (100%)
"pages" discarded (100%)
"rdoc" discarded (100%)
"rdoc_rel_prefix" discarded (100%)
"validate" discarded (100%)
index++: writing index...
index++: done:
00:00 (min:sec) elapsed time
8 files, 8 indexed
2342 words, 1782 indexed, 596 unique
cat 14.part
/usr/share/doc/ruby-bdb/rdoc/BDB.html
/usr/share/doc/ruby-bdb/rdoc/index.html
/usr/share/doc/ruby-bdb/rdoc/Object.html
/usr/share/doc/ruby-bdb/rdoc/RbConfig.html
/usr/share/doc/ruby-bdb/rdoc/String.html
/usr/share/doc/ruby-bdb/rdoc/table_of_contents.html
/usr/share/doc/ruby-bdb/rdoc/Thread.html
/usr/share/doc/ruby-bdb/rdoc/unknown.html
I looked at the README file and couldn't see any obvious reason
why the indexer wouldn't like it. This bug is very difficult to find
the root cause.
Reply to: