A Clustering-Based Framework to Control Block Sizes for Entity Resolution
-
Altmetric Citations
Fisher, Jeffrey; Christen, Peter; Wang, Qing (Ms); Rahm, Erhard
Description
Entity resolution (ER) is a common data cleaning task that involves determining which records from one or more data sets refer to the same real-world entities. Because a pairwise comparison of all records scales quadratically with the number of records in the data sets to be matched, it is common to use blocking or indexing techniques to reduce the number of comparisons required. These techniques split the data sets into blocks and only records within blocks are compared with each other. Most...[Show more]
Collections | ANU Research Publications |
---|---|
Date published: | 2015 |
Type: | Conference paper |
URI: | http://hdl.handle.net/1885/103790 |
Source: | A Clustering-Based Framework to Control Block Sizes for Entity Resolution |
DOI: | 10.1145/2783258.2783396 |
Download
File | Description | Size | Format | Image |
---|---|---|---|---|
01_Fisher_A_Clustering-Based_Framework_2015.pdf | 516.76 kB | Adobe PDF | Request a copy |
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.
Updated: 17 November 2022/ Responsible Officer: University Librarian/ Page Contact: Library Systems & Web Coordinator