Active learning with evolving streaming data.

Zliobaite, I., Bifet, A., Pfahringer, B. and Holmes, G., 2011. Active learning with evolving streaming data. In: The European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD), 5-9 September 2011, Athens, Greece.

This is the latest version of this eprint.

Full text available as:

[img]PDF - Accepted Version
Restricted to Registered users only


Official URL:


In learning to classify streaming data, obtaining the true labels may require major effort and may incur excessive cost. Active learning focuses on learning an accurate model with as few labels as possible. Streaming data poses additional challenges for active learning, since the data distribution may change over time (concept drift) and classifiers need to adapt. Conventional active learning strategies concentrate on querying the most uncertain instances, which are typically concentrated around the decision boundary. If changes do not occur close to the boundary, they will be missed and classifiers will fail to adapt. In this paper we develop two active learning strategies for streaming data that explicitly handle concept drift. They are based on uncertainty, dynamic allocation of labeling efforts over time and randomization of the search space. We empirically demonstrate that these strategies react well to changes that can occur anywhere in the instance space and unexpectedly.

Item Type:Conference or Workshop Item (Paper)
Subjects:Generalities > Computer Science and Informatics > Artificial Intelligence
Group:Faculty of Science and Technology
ID Code:18345
Deposited By:Dr Indre Zliobaite LEFT
Deposited On:26 Jul 2011 15:46
Last Modified:10 Sep 2014 15:52

Available Versions of this Item


Downloads per month over past year

More statistics for this item...
Repository Staff Only -