Multimillion Word Data Bases: A Preliminary Report. Volume 2.
DEFENSE DOCUMENTATION CENTER ALEXANDRIA VA
Cumulative statistics are provided on word distribution and word type for a three-million-word data base. Consonant clusters, word length, and letter frequencies are given for the traditional natural language portion of the vocabulary. Volume 1 presents comparable statistics for three different one-million-word data bases, as well as the statistics for a two-million-word corpus.