Multiple Instance Learning for Group Record Linkage

Authors

Fu, Sally
Zhou, Jun
Christen, Peter
Boot, Hector

Publisher

Springer

Abstract

Record linkage is the process of identifying records from different data sources that refer to the same entities. While most research focuses on linking individual records, approaches have recently been proposed to link groups of records across databases. Group record linkage aims to determine whether two groups of records in two databases refer to the same entity. One application where group record linkage is highly important is the linking of census data that contain household information across time. In this paper we propose a novel method for group record linkage based on multiple instance learning. Our method treats group links as bags and individual record links as instances. We extend multiple instance learning from bag to instance classification to reconstruct bags from candidate instances. The classified bag and instance samples lead to a significant reduction in multiple group links, thereby improving the overall quality of linked data. We evaluate our method with both synthetic data and real historical census data.
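
The bag and instance framing in the abstract can be illustrated with a minimal sketch. It is not the authors' method: it swaps in the standard multiple instance assumption (a bag is as positive as its best instance) in place of a trained classifier, and every name in it (`record_similarity`, `build_bag`, `bag_score`, `link_groups`, the `first_name` and `surname` fields, and the 0.8 threshold) is hypothetical. Each candidate group link is a bag, each record pair inside it is an instance, and each group keeps at most one link, which is one simple way to avoid multiple group links.

```python
from difflib import SequenceMatcher
from itertools import product


def record_similarity(rec_a, rec_b, fields=("first_name", "surname")):
    """Hypothetical instance score: mean string similarity over the given fields."""
    scores = [SequenceMatcher(None, rec_a[f], rec_b[f]).ratio() for f in fields]
    return sum(scores) / len(scores)


def build_bag(group_a, group_b):
    """A bag is one candidate group link; its instances are all record pairs."""
    return [(ra, rb, record_similarity(ra, rb)) for ra, rb in product(group_a, group_b)]


def bag_score(bag):
    """Standard MIL assumption (not a learned model): take the best instance score."""
    return max(score for _, _, score in bag)


def link_groups(groups_a, groups_b, threshold=0.8):
    """Keep at most one link per group in A: the best bag at or above the threshold."""
    links = {}
    for id_a, group_a in groups_a.items():
        best_id, best_score = None, threshold
        for id_b, group_b in groups_b.items():
            score = bag_score(build_bag(group_a, group_b))
            if score >= best_score:
                best_id, best_score = id_b, score
        if best_id is not None:
            links[id_a] = best_id
    return links


if __name__ == "__main__":
    # Toy households invented for illustration only.
    census_1 = {"h1": [{"first_name": "john", "surname": "smith"},
                       {"first_name": "mary", "surname": "smith"}]}
    census_2 = {"g1": [{"first_name": "john", "surname": "smith"}],
                "g2": [{"first_name": "peter", "surname": "jones"}]}
    print(link_groups(census_1, census_2))
```

Taking the maximum instance score keeps the sketch short, but it lets a single strong record match link two groups; a learned bag classifier, as the abstract describes, would weigh all instances in the bag.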

Book Title

Advances in Knowledge Discovery and Data Mining: 16th Pacific-Asia Conference, PAKDD 2012: Kuala Lumpur, Malaysia, May 29 - June 1, 2012: Proceedings, Part I

Restricted until

2037-12-31