Accession Number : AD1037488

Title :   Learning in the context of distribution drift

Descriptive Note : Technical Report,23 Apr 2015,22 Apr 2017

Corporate Author : MONASH UNIVERSITY CLAYTON Australia

Personal Author(s) : Webb,Geoff

Full Text :

Report Date : 09 May 2017

Pagination or Media Count : 17

Abstract : The increasing ubiquity of data and its ever-increasing use to deliver tangible value raises the need for ever more effective technologies for data analysis. Many online data sources are subject to distribution drift: the frequency of different factors and the relationships between them changeover time. This is problematic for machine learning because almost all algorithms assume that distributions are constant. This project investigates new technologies for learning in the context of distribution drift, guided by the insight that different subgroups will change in different ways, at different speeds and at different times. The results are leading towards robust and reliable data analytics, able to make more effective use of big data under real-world conditions of change. The key developments in this project have been the creation of:- a sound and applicable theoretical framework for analyzing concept drift,- efficient and effective techniques for analyzing, understanding and describing concept drift observed in real world data,- efficient and effective algorithms for learning from time varying data sequences,- efficient and effective algorithms for classifying high-dimensional data,- efficient and effective algorithms for handling ordinal data, and- efficient and effective algorithms for learning in the context of concept drift. These new algorithms and techniques greatly improve the community's capacity to learn under the demanding circumstances of concept drift.

Descriptors :   machine learning , probability distributions , bayesian networks , algorithms , classification , data mining , generative models , sequences (MATHEMATICS)

Subject Categories : Cybernetics

Distribution Statement : APPROVED FOR PUBLIC RELEASE