eRules: A Modular Adaptive Classification Rule Learning Algorithm for Data Streams.

Stahl, F.; Medhat Gaber, M.; Salvador, Manuel Martin

eRules: A Modular Adaptive Classification Rule Learning Algorithm for Data Streams.

Tools

Stahl, F., Medhat Gaber, M. and Salvador, M. M., 2012. eRules: A Modular Adaptive Classification Rule Learning Algorithm for Data Streams. In: AI-2012, The Thirty-second SGAI International Conference on Innovative Techniques and Applications of Artificial Intelligence, 11 - 13 December 2013, Cambridge, England, 65 - 78.

Full text available as:

Preview

PDF
AI2011.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
436kB

Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk.

Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.

Official URL: http://link.springer.com/chapter/10.1007/978-1-447...

DOI: 10.1007/978-1-4471-4739-8_5

Abstract

Advances in hardware and software in the past decade allow to capture, record and process fast data streams at a large scale. The research area of data stream mining has emerged as a consequence from these advances in order to cope with the real time analysis of potentially large and changing data streams. Examples of data streams include Google searches, credit card transactions, telemetric data and data of continuous chemical production processes. In some cases the data can be processed in batches by traditional data mining approaches. However, in some applications it is required to analyse the data in real time as soon as it is being captured. Such cases are for example if the data stream is infinite, fast changing, or simply too large in size to be stored. One of the most important data mining techniques on data streams is classification. This involves training the classifier on the data stream in real time and adapting it to concept drifts. Most data stream classifiers are based on decision trees. However, it is well known in the data mining community that there is no single optimal algorithm. An algorithm may work well on one or several datasets but badly on others. This paper introduces eRules, a new rule based adaptive classifier for data streams, based on an evolving set of Rules. eRules induces a set of rules that is constantly evaluated and adapted to changes in the data stream by adding new and removing old rules. It is different from the more popular decision tree based classifiers as it tends to leave data instances rather unclassified than forcing a classification that could be wrong. The ongoing development of eRules aims to improve its accuracy further through dynamic parameter setting which will also address the problem of changing feature domain values.

Item Type:	Conference or Workshop Item (Paper)
Group:	Faculty of Science & Technology (Until 31/07/2025)
ID Code:	23390
Deposited By:	Symplectic RT2
Deposited On:	12 Apr 2016 14:34
Last Modified:	14 Mar 2022 13:55

Downloads

Downloads per month over past year

More statistics for this item...

Repository Staff Only -