Quality and Complexity Measures for Data Linkage and Deduplication

Christen, Peter; Goiser, Karl

Description

Deduplicating one data set or linking several data sets are increasingly important tasks in the data preparation steps of many data mining projects. The aim of such linkages is to match all records relating to the same entity. Research interest in this area has increased in recent years, with techniques originating from statistics, machine learning, information retrieval, and database research being combined and applied to improve the linkage quality, as well as to increase performance and...

Collections: ANU Research Publications
Date published: 2007
Type: Book chapter
URI: http://hdl.handle.net/1885/34693
DOI: 10.1007/978-3-540-44918-8_6

Download

File: 01_Christen_Quality_and_Complexity_2007.pdf
Size: 263.69 kB
Format: Adobe PDF


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
