Accession Number : ADA472458


Title : Statistical Machine Translation of Japanese


Descriptive Note : Master's Thesis


Corporate Author : AIR FORCE INST OF TECH WRIGHT-PATTERSON AFB OH SCHOOL OF ENGINEERING AND MANAGEMENT


Personal Author(s) : Chapla, Erik A


Full Text : https://apps.dtic.mil/dtic/tr/fulltext/u2/a472458.pdf


Report Date : Mar 2007


Pagination or Media Count : 96


Abstract : The purpose of this research was to find ways to improve the performance of a statistical machine translation system that translates text from Japanese to English. Methods included altering the training and test data by adding a priori linguistic knowledge, altering sentence structures, and seeking better ways to statistically adjust how words align between the two languages. In addition, statistical methods for properly segmenting words in Japanese text were examined. Finally, experiments were conducted on Japanese speech to produce the best text transcription of the speech. The best statistical machine translation methods implemented yielded improvements that rivaled the best evaluations from the 2005 International Workshop on Spoken Language Translation, from which the training and test data were drawn. Recommendations, including how the methods presented may be altered for further improvement in future research, are also discussed.


Descriptors : *JAPAN , *STATISTICAL PROCESSES , *LANGUAGE TRANSLATION , *MACHINE TRANSLATION , EXPERIMENTAL DATA , STRUCTURES , ENGLISH LANGUAGE , WORDS(LANGUAGE) , SPEECH , LANGUAGE , STATISTICS


Subject Categories : Linguistics


Distribution Statement : APPROVED FOR PUBLIC RELEASE