Multiple Instance Learning for Group Record Linkage
| dc.contributor.author | Fu, Sally | |
| dc.contributor.author | Zhou, Jun | |
| dc.contributor.author | Christen, Peter | |
| dc.contributor.author | Boot, Hector | |
| dc.date.accessioned | 2015-12-10T22:58:13Z | |
| dc.date.issued | 2012 | |
| dc.date.updated | 2016-02-24T11:59:35Z | |
| dc.description.abstract | Record linkage is the process of identifying records that refer to the same entities from different data sources. While most research efforts are concerned with linking individual records, new approaches have recently been proposed to link groups of records across databases. Group record linkage aims to determine if two groups of records in two databases refer to the same entity or not. One application where group record linkage is of high importance is the linking of census data that contain household information across time. In this paper we propose a novel method to group record linkage based on multiple instance learning. Our method treats group links as bags and individual record links as instances. We extend multiple instance learning from bag to instance classification to reconstruct bags from candidate instances. The classified bag and instance samples lead to a significant reduction in multiple group links, thereby improving the overall quality of linked data. We evaluate our method with both synthetic data and real historical census data. | |
| dc.identifier.isbn | 9783642302176 | |
| dc.identifier.uri | http://hdl.handle.net/1885/60764 | |
| dc.publisher | Springer | |
| dc.relation.ispartof | Advances in Knowledge Discovery and Data Mining: 16th Pacific-Asia Conference, PKDD 2012: Kuala Lumpur, Malaysia, May 29 - June 1, 2012: Proceedings, Part I | |
| dc.relation.isversionof | 1st Edition | |
| dc.subject | Keywords: Across time; Census data; Data source; Linked datum; Multiple instance learning; Multiple-group; Overall quality; Record linkage; Research efforts; Synthetic data; Data mining; Learning systems; Population statistics; Data handling entity resolution; historical census data; instance classification; Multiple instance learning; record linkage | |
| dc.title | Multiple Instance Learning for Group Record Linkage | |
| dc.type | Book chapter | |
| local.bibliographicCitation.lastpage | 182 | |
| local.bibliographicCitation.placeofpublication | Berlin Germany | |
| local.bibliographicCitation.startpage | 171 | |
| local.contributor.affiliation | Fu, Sally, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Zhou, Jun, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Christen, Peter, College of Engineering and Computer Science, ANU | |
| local.contributor.affiliation | Boot, Hector, College of Arts and Social Sciences, ANU | |
| local.contributor.authoruid | Fu, Sally, u4802791 | |
| local.contributor.authoruid | Zhou, Jun, u1818501 | |
| local.contributor.authoruid | Christen, Peter, u4021539 | |
| local.contributor.authoruid | Boot, Hector, u7000502 | |
| local.description.embargo | 2037-12-31 | |
| local.description.notes | Imported from ARIES | |
| local.description.refereed | Yes | |
| local.identifier.absfor | 080109 - Pattern Recognition and Data Mining | |
| local.identifier.absfor | 080306 - Open Software | |
| local.identifier.absfor | 160305 - Population Trends and Policies | |
| local.identifier.absseo | 970108 - Expanding Knowledge in the Information and Computing Sciences | |
| local.identifier.absseo | 970116 - Expanding Knowledge through Studies of Human Society | |
| local.identifier.absseo | 970121 - Expanding Knowledge in History and Archaeology | |
| local.identifier.ariespublication | u9406909xPUB561 | |
| local.identifier.doi | 10.1007/978-3-642-30217-6_15 | |
| local.identifier.scopusID | 2-s2.0-84861452098 | |
| local.type.status | Published Version |
Downloads
Original bundle
1 - 1 of 1
Loading...
- Name:
- 01_Fu_Multiple_Instance_Learning_for_2012.pdf
- Size:
- 241.32 KB
- Format:
- Adobe Portable Document Format