RESEARCH ON AUTOMATIC CLASSIFICATION, INDEXING AND EXTRACTING. FRQNCY: A GENERAL-PURPOSE FREQUENCY PROGRAM
Annual progress rept.
IBM FEDERAL SYSTEMS DIV GAITHERSBURG MD
Pagination or Media Count:
To support studies in automatic indexing, classification and extracting, a general purpose frequency program was developed to further theoretical and practical understanding of text word distributions. While the program is primarily designed for counting strings of character-oriented data, it can be used without change for counting any items which can be represented in an integral number of characters. Counts may be obtained simultaneously at several levels of detail, such as for sentences, paragraphs, chapters and entire documents. Both printed outputs and outputs for further computer processing may be obtained, and a variety of summary and detailed outputs are available. The program, titled FRQNCY, is written in the Fortran IV language and was compiled and run on the IBM System360 using Fortran IV G and the System360 Operating System. It uses at least one feature of the IBM System360 Fortran IV language LOGICAL1 and INTEGER2 variables which is not in USASI Fortran and hence may not compile or run under other Fortran systems. The program is extensively parametrized to allow its efficient use on computers with varying amounts of immediate-access storage and inputoutput equipment. The report is a complete writeup of the frequency program. It covers the purpose and usage of the program and also describes its organization and internal operation. Finally, guidelines for modifying the program or adapting it to different computers are also included.
- Information Science
- Computer Programming and Software