Skip navigation
Skip navigation

An Efficient Two-Party Protocol for Approximate Matching in Private Record Linkage

Vatsalan, Dinusha; Christen, Peter; Verykios, Vassilios

Description

The task of linking multiple databases with the aim to identify records that refer to the same entity is occurring increasingly in many application areas. If unique identifiers for the entities are not available in all the databases to be linked, techniques that calculate approximate similarities between records must be used for the identification of matching pairs of records. Often, the records to be linked contain personal information such as names and addresses. In many applications, the...[Show more]

dc.contributor.authorVatsalan, Dinusha
dc.contributor.authorChristen, Peter
dc.contributor.authorVerykios, Vassilios
dc.date.accessioned2015-12-13T22:45:22Z
dc.date.available2015-12-13T22:45:22Z
dc.identifier.issn1445-1336
dc.identifier.urihttp://hdl.handle.net/1885/79729
dc.description.abstractThe task of linking multiple databases with the aim to identify records that refer to the same entity is occurring increasingly in many application areas. If unique identifiers for the entities are not available in all the databases to be linked, techniques that calculate approximate similarities between records must be used for the identification of matching pairs of records. Often, the records to be linked contain personal information such as names and addresses. In many applications, the exchange of attribute values that contain such personal details between organisations is not allowed due to privacy concerns. The linking of records between databases without revealing the actual attribute values in these records is the research problem known as 'privacy-preserving record linkage' (PPRL).While various approaches have been proposed to deal with privacy within the record linkage process, a viable solution that is well applicable to real-world conditions needs to address the major aspect of scalability of linking very large databases while preserving security and linkage quality. We propose a novel two-party protocol for PPRL that addresses scalability, security and quality/ accuracy. The protocol is based on (1) the use of reference values that are available to both database owners, and allows them to individually calculate the similarities between their attribute values and the reference values; and (2) the binning of these calculated similarity values to allow their secure exchange between the two database owners. Experiments on a real-world database with nearly two million records yield linkage results that have a linear scalability to large databases and high linkage accuracy, allowing for approximate matching in the privacy-preserving context. Since the protocol has a low computational burden and allows quality approximate matching while still preserving the privacy of the databases that are matched, the protocol can be useful for many real-world applications requiring PPRL.
dc.publisherAustralian Computer Society Inc.
dc.sourceConferences in Research and Practice in Information Technology - CRPIT
dc.subjectKeywords: Application area; Approximate matching; Attribute values; Binning; Computational burden; Large database; Personal information; Privacy concerns; Privacy preserving; Real-world application; Real-world database; Record linkage; Reference values; Research pr Approximate matching; Binning; Entity resolution; Privacy technologies; Scalability; Similarity measure; Two-party protocol
dc.titleAn Efficient Two-Party Protocol for Approximate Matching in Private Record Linkage
dc.typeJournal article
local.description.notesImported from ARIES
local.identifier.citationvolume121
dc.date.issued2010
local.identifier.absfor080201 - Analysis of Algorithms and Complexity
local.identifier.absfor080107 - Natural Language Processing
local.identifier.absfor080399 - Computer Software not elsewhere classified
local.identifier.ariespublicationf5625xPUB8112
local.type.statusPublished Version
local.contributor.affiliationVatsalan, Dinusha, College of Engineering and Computer Science, ANU
local.contributor.affiliationChristen, Peter, College of Engineering and Computer Science, ANU
local.contributor.affiliationVerykios, Vassilios, Helenic Open University
local.bibliographicCitation.startpage125
local.bibliographicCitation.lastpage136
dc.date.updated2016-02-24T09:39:43Z
local.identifier.scopusID2-s2.0-84870477881
CollectionsANU Research Publications

Download

There are no files associated with this item.


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  19 May 2020/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator