Back to the Summary tab for the Terrier Core project

.indexing

Summary

Description

Indexing parsers etc.

Issues: Due

  • Bug TR-11 Single pass indexing tries to merge too many run files at once
  • Bug TR-174 Indexing a directory breaks on special pdf- or excel files
  • Bug TR-182 MultiFileCollectionInputFormat has a negative effect on hadoop indexing

Issues: Updated recently

  • Bug TR-191 Last Tuesday 1:29 PM TwitterJSONCollection doesn't work with Hadoop plug-in
  • New Feature TR-171 20/Dec/11 Indexing support for TREC Tweets11 corpus
  • Bug TR-182 23/Nov/11 MultiFileCollectionInputFormat has a negative effect on hadoop indexing

Versions: Due