Apeh, E. and Gabrys, B., 2006. Clustering for Data Matching. In: Gabrys, B., Howlett, R.J. and Jain, L.C., eds. Knowledge-Based Intelligent Information and Engineering Systems: 10th International Conference, KES 2006, Bournemouth, UK, October 9-11 2006. Berlin: Springer, pp. 1216-1225.
Full text not available from this repository.
Official URL: http://www.springerlink.com/content/dwht56u3431u15...
DOI: 10.1007/11892960_146
Abstract
The problem of matching data has as one of its major bottlenecks the rapid deterioration in performance of time and accuracy, as the amount of data to be processed increases. One reason for this deterioration in performance is the cost incurred by data matching systems when comparing data records to determine their similarity (or dissimilarity). Approaches such as blocking and concatenation of data attributes have been used to minimize the comparison cost. In this paper, we analyse and present Keyword and Digram clustering as alternatives for enhancing the performance of data matching systems. We compare the performance of these clustering techniques in terms of potential savings in performing comparisons and their accuracy in correctly clustering similar data. Our results on a sampled London Stock Exchange listed companies database show that using the clustering techniques can lead to improved accuracy as well as time savings in data matching systems.
| Item Type: | Book Section |
|---|---|
| ISBN: | 3540465359 (pbk. : pt. 1); 3540465375 (pbk. : pt. 2); 3540465421 (pbk. : pt. 3) |
| Series Name: | Lecture Notes in Artificial Intelligence |
| Volume: | 1 |
| Number of Pages: | 1297 |
| ISSN: | 0302-9743 |
| Series Name: | Lecture Notes in Artificial Intelligence |
| Subjects: | Generalities > Computer Science and Informatics > Artificial Intelligence Generalities > Computer Science and Informatics |
| Group: | School of Design, Engineering & Computing > Smart Technology Research Centre |
| ID Code: | 8528 |
| Deposited By: | INVALID USER |
| Deposited On: | 19 Dec 2008 20:09 |
| Last Modified: | 07 Mar 2013 15:02 |
| Repository Staff Only - | |
| BU Staff Only - | |
| Help Guide - | Editing Your Items in BURO |

Tools
Tools