Document Retrieval Systems

Active / Technical Report | Accession Number: AD0737042

Abstract:

Some of the mathematical properties of a term-matching information retrieval system are investigated. It is shown that the common retrieval method of using a query vector, a matching function, and a threshold is equivalent to retrieving documents by requiring that a specific mathematical combination of the over-indexing and under-indexing errors between the query vector and the document index vector be bounded. Furthermore, the set of over-indexing and under-indexing errors provides a sample space for a probabilistic description of the retrieval process. Using this approach, an explicit form of the expected recall ratio is derived. (Author)
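
The equivalence stated in the abstract can be illustrated with a minimal sketch. The abstract does not name a matching function or define the two error counts, so everything below is an assumption made for illustration: index vectors are modeled as sets of terms, the matching function is taken to be the Jaccard coefficient, the over-indexing error is the number of document terms absent from the query, and the under-indexing error is the number of query terms absent from the document. The function names are hypothetical. Under these assumptions, a match value of at least t is the same condition as under + t * over <= (1 - t) * |query|.

```python
from fractions import Fraction


def indexing_errors(query, doc):
    """Return (over, under) error counts for term sets (assumed definitions)."""
    over = len(doc - query)    # terms indexed in the document but not in the query
    under = len(query - doc)   # terms in the query missing from the document index
    return over, under


def retrieve_by_match(query, doc, threshold):
    """Threshold retrieval with an assumed Jaccard matching function."""
    union = len(query | doc)
    if union == 0:
        return False           # nothing to match when both sets are empty
    return Fraction(len(query & doc), union) >= threshold


def retrieve_by_error_bound(query, doc, threshold):
    """Same decision, expressed as a bound on a combination of the errors."""
    if not query and not doc:
        return False           # keep the empty case consistent with the function above
    over, under = indexing_errors(query, doc)
    return under + threshold * over <= (1 - threshold) * len(query)


# Illustrative data only; exact rational arithmetic avoids rounding at the boundary.
t = Fraction(1, 2)
query = {"retrieval", "index", "vector"}
doc_a = {"retrieval", "index", "threshold"}
doc_b = {"retrieval", "probability", "recall", "sample"}

for doc in (doc_a, doc_b):
    print(indexing_errors(query, doc),
          retrieve_by_match(query, doc, t),
          retrieve_by_error_bound(query, doc, t))
```

Other matching functions would lead to a different combination of the two error counts; the Jaccard form is used here only because the rearrangement is short.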

DOCUMENT & CONTEXTUAL SUMMARY

Distribution:
Approved For Public Release

RECORD

Collection: TR