Accession Number : AD0456948


Title :   NATURAL LANGUAGE IN COMPUTER FORM


Corporate Author : RAND CORP SANTA MONICA CA


Personal Author(s) : Kay, Martin ; Ziehe, Theodore


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/456948.pdf


Report Date : Feb 1965


Pagination or Media Count : 89


Abstract : This Memorandum describes a scheme for recording text in computer- usable form in such a way that all meaningful typographical distinctions are represented in a standard way. Provision is made for texts in different languages and different alphabets and for subsidiary material such as parallel translations and comments of interest to users and librarians. The basic set of encoding conventions is indefinitely extensible to accommodate new kinds of material. Very large bodies of data require special facilities, and these have been provided by embedding the text encoding scheme in a general file maintenance system. Computer programs are described which simplify conversion of text from these various sources into the standard format. The final section discusses the problem of printing text which has been recorded in the standard format and describes a flexible program for doing this.


Descriptors :   *INFORMATION RETRIEVAL , *READING MACHINES , COMPUTERS , CODING , OPERATION , LANGUAGE , INPUT OUTPUT DEVICES , INFORMATION PROCESSING , MACHINE TRANSLATION


Subject Categories : Information Science
      Computer Hardware


Distribution Statement : APPROVED FOR PUBLIC RELEASE