Skip navigation
Skip navigation

Clustering-Based Scalable Indexing for Multi-party Privacy

Ranbaduge, Thilina; Vatsalan, Dinusha; Christen, Peter

Description

The identification of common sets of records in multiple databases has become an increasingly important subject in many application areas, including banking, health, and national security. Often privacy concerns and regulations prevent the owners of the databases from sharing any sensitive details of their records with each other, and with any other party. The linkage of records in multiple databases while preserving privacy and confidentiality is an emerging research discipline known as...[Show more]

dc.contributor.authorRanbaduge, Thilina
dc.contributor.authorVatsalan, Dinusha
dc.contributor.authorChristen, Peter
dc.coverage.spatialHo Chi Minh City, Vietnam
dc.date.accessioned2016-06-14T23:21:12Z
dc.date.createdMay 19-22 2015
dc.identifier.isbn9783319180311
dc.identifier.urihttp://hdl.handle.net/1885/103766
dc.description.abstractThe identification of common sets of records in multiple databases has become an increasingly important subject in many application areas, including banking, health, and national security. Often privacy concerns and regulations prevent the owners of the databases from sharing any sensitive details of their records with each other, and with any other party. The linkage of records in multiple databases while preserving privacy and confidentiality is an emerging research discipline known as privacy-preserving record linkage (PPRL). We propose a novel two-step indexing (blocking) approach for PPRL between multiple (more than two) parties. First, we generate small mini-blocks using a multi-bit Bloom filter splitting method and second we merge these mini-blocks based on their similarity using a novel hierarchical canopy clustering technique. An empirical study conducted with large datasets of up-to one million records shows that our approach is scalable with the size of the datasets and the number of parties, while providing better privacy than previous multi-party indexing approaches.
dc.publisherSpringer International Publishing AG
dc.relation.ispartofseriesPacific-Asia Conference, on Knowledge Discovery and Data Mining, PAKDD 2015
dc.sourceEfficient Interactive Training Selection for Large-Scale Entity Resolution
dc.titleClustering-Based Scalable Indexing for Multi-party Privacy
dc.typeConference paper
local.description.notesImported from ARIES
local.description.refereedYes
dc.date.issued2015
local.identifier.absfor080109 - Pattern Recognition and Data Mining
local.identifier.ariespublicationu4334215xPUB1486
local.type.statusPublished Version
local.contributor.affiliationRanbaduge, Thilina, College of Engineering and Computer Science, ANU
local.contributor.affiliationVatsalan, Dinusha, College of Engineering and Computer Science, ANU
local.contributor.affiliationChristen, Peter, College of Engineering and Computer Science, ANU
local.description.embargo2037-12-31
local.bibliographicCitation.startpage549
local.bibliographicCitation.lastpage561
local.identifier.doi10.1007/978-3-319-18032-8_43
local.identifier.absseo970108 - Expanding Knowledge in the Information and Computing Sciences
dc.date.updated2016-06-14T09:02:46Z
local.identifier.scopusID2-s2.0-84945585620
CollectionsANU Research Publications

Download

File Description SizeFormat Image
01_Ranbaduge_Clustering-Based_Scalable_2015.pdf381.66 kBAdobe PDF    Request a copy


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator