Terrier Core

BitInputFormat: some minor changes

Details

  • Type: Bug Bug
  • Status: Resolved Resolved
  • Priority: Trivial Trivial
  • Resolution: Fixed
  • Affects Version/s: 3.0
  • Fix Version/s: 3.0
  • Component/s: .structures
  • Description:
    Hide
    Two corner issues:
    1. End splits of less than one byte were not processed
    2. Empty entries were not processed correctly.

    This code is used by Inv2DirectMultiReduce. However, (1) is an unlikely case, as our inverted files are so massive, the chances of the final split being less than 1 byte as very small. (2) cannot happen for an inverted file - terms always have entries.
    Show
    Two corner issues: 1. End splits of less than one byte were not processed 2. Empty entries were not processed correctly. This code is used by Inv2DirectMultiReduce. However, (1) is an unlikely case, as our inverted files are so massive, the chances of the final split being less than 1 byte as very small. (2) cannot happen for an inverted file - terms always have entries.

Activity

Hide
Craig Macdonald added a comment - 23/Nov/09 12:36 PM

Formatting change

Show
Craig Macdonald added a comment - 23/Nov/09 12:36 PM Formatting change
Hide
Craig Macdonald added a comment - 23/Nov/09 12:37 PM

Committed to trunk

Show
Craig Macdonald added a comment - 23/Nov/09 12:37 PM Committed to trunk
Hide
Craig Macdonald added a comment - 23/Nov/09 12:39 PM

Test case was committed to HadoopShakespeareEndToEndTest for testing splitting of a direct index.

Show
Craig Macdonald added a comment - 23/Nov/09 12:39 PM Test case was committed to HadoopShakespeareEndToEndTest for testing splitting of a direct index.

People

Dates

  • Created:
    23/Nov/09 12:36 PM
    Updated:
    05/Mar/10 5:25 PM
    Resolved:
    23/Nov/09 12:37 PM