Febrl - An open source data cleaning, deduplication and record linkage system with a graphical user interface
Matching records that refer to the same entity across data-bases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs to be matched in order to enrich data or improve its quality. Significant advances in record linkage techniques have been made in recent years. However, many new techniques are either implemented in research proof-of-concept systems only, or they are hidden within expensive 'black box' commercial software. This makes...[Show more]
|Collections||ANU Research Publications|
|Source:||Proceedings of 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining|
|01_Christen_Febrl_-_An_open_source_data_2008.pdf||561.63 kB||Adobe PDF||Request a copy|
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.