Wang, Qing (Ms); Schewe, Klaus-Dieter; Wang, Woods
Entity resolution (ER), the process of identifying records that refer to the same real-world entity, is pervasive across many application areas. Nevertheless, resolving entities is hardly ever completely accurate. In this paper, we investigate a provenance-aware framework for ER. We first propose an indexing structure for provenance storage that can be built efficiently in support of an ER process. Then a generic repairing strategy, called coordinate-split-merge (CSM), is developed to...