Wang, Qing (Ms); Cui, Mingyuan; Liang, Huizhi
In this paper, we propose a semantic-aware blocking framework for entity resolution (ER). The proposed framework is built using locality-sensitive hashing (LSH) techniques, which efficiently unifies both textual and semantic features into an ER blocking process. In order to understand how similarity metrics may affect the effectiveness of ER blocking, we study the robustness of similarity metrics and their properties in terms of LSH families. Then, we present how the semantic similarity of...[Show more]
Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.