DID YOU KNOW? DTIC has over 3.5 million final reports on DoD funded research, development, test, and evaluation activities available to our registered users. Click
HERE to register or log in.
Accession Number:
ADA295618
Title:
Learning from Incomplete Data.
Descriptive Note:
Memorandum rept.,
Corporate Author:
MASSACHUSETTS INST OF TECH CAMBRIDGE ARTIFICIAL INTELLIGENCE LAB
Report Date:
1994-12-10
Pagination or Media Count:
12.0
Abstract:
Real-world learning tasks often involve high-dimensional data sets with complex patterns of missing features. In this paper we review the problem of learning from incomplete data from two statistical perspectives---the likelihood-based and the Bayesian. The goal is two-fold to place current neural network approaches to missing data within a statistical framework, and to describe a set of algorithms, derived from the likelihood-based framework, that handle clustering, classification, and function approximation from incomplete data in a principled and efficient manner. These algorithms are based on mixture modeling and make two distinct appeals to the Expectation-Maximization EM principle Dempster, Laird, and Rubin 1977---both for the estimation of mixture components and for coping with the missing data.
Distribution Statement:
APPROVED FOR PUBLIC RELEASE