A Survey of Indexing Techniques for Scalable Record Linkage and Deduplication

Christen, Peter


Record linkage is the process of matching records from several databases that refer to the same entities. When applied to a single database, this process is known as deduplication. Matched data are becoming increasingly important in many application areas, because they can contain information that is not available otherwise, or that is too costly to acquire. Removing duplicate records in a single database is a crucial step in the data cleaning process, because duplicates can severely influence...

Collections: ANU Research Publications
Date published: 2012
Type: Journal article
Source: IEEE Transactions on Knowledge and Data Engineering
DOI: 10.1109/TKDE.2011.127


File: 01_Christen_A_Survey_of_Indexing_2012.pdf (1.8 MB, Adobe PDF)
File: 02_Christen_A_Survey_of_Indexing_2012.pdf (380.33 kB, Adobe PDF)

Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
