Cultural advice

The Australian National University acknowledges, celebrates and pays our respects to the Ngunnawal and Ngambri people of the Canberra region and to all First Nations Australians on whose traditional lands we meet and work, and whose cultures are among the oldest continuing cultures in human history.

Aboriginal and Torres Strait Islander peoples are advised that ANU Library collections may include images, names, voices, and other representations of deceased persons.

Material in the collection may contain terms, language or views that reflect the period in which the item was created and may be considered inappropriate today.

A comparison of personal name matching: Techniques and practical issues

dc.contributor.authorChristen, Peter
dc.contributor.editorConference Program Committee
dc.coverage.spatialHong Kong
dc.date.accessioned2007-02-07T05:16:34Zen_US
dc.date.accessioned2011-01-05T08:38:21Z
dc.date.available2007-02-07T05:16:34Zen_US
dc.date.available2011-01-05T08:38:21Z
dc.date.created2006-09en_US
dc.date.issued2006-09en_US
dc.date.updated2015-12-08T09:07:36Z
dc.description.abstractFinding and matching personal names is at the core of an increasing number of applications: from text and Web mining, information retrieval and extraction, search engines, to deduplication and data linkage systems. Variations and errors in names make exact string matching problematic, and approximate matching techniques based on phonetic encoding or pattern matching have to be applied. When compared to general text, however, personal names have different characteristics that need to be considered. ¶ In this paper we discuss the characteristics of personal names and present potential sources of variations and errors. We overview a comprehensive number of commonly used, as well as some recently developed name matching techniques. Experimental comparisons on four large name data sets indicate that there is no clear best technique. We provide a series of recommendations that will help researchers and practitioners to select a name matching technique suitable for a given data set.
dc.identifier.citationhttp://cs.anu.edu.au/techreports/2006/TR-CS-06-02.html
dc.identifier.isbn1601320043
dc.identifier.urihttp://hdl.handle.net/1885/44521en_US
dc.identifier.urihttp://digitalcollections.anu.edu.au/handle/1885/44521
dc.language.isoenen_US
dc.publisherCanberra, ACT: Dept. of Computer Science / Computer Sciences Laboratory, The Australian National Universityen_AU
dc.relation.ispartofseriesJoint Computer Science Technical Report Series, no.06-02en_US
dc.sourceProceedings of the 2006 International Conference Conference on Data Mining
dc.source.urihttp://www.world-academy-of-science.org/worldcomp06/ws/publications/dmin06/index_html
dc.subjectString matching
dc.subjectphonetic encoding
dc.subjectpattern matching
dc.subjectdata linkage
dc.subjectpersonal name characteristics
dc.subjectTR-CS
dc.titleA comparison of personal name matching: Techniques and practical issues
dc.typeWorking/Technical Paperen_AU
dcterms.accessRightsOpen Accessen_AU
local.bibliographicCitation.lastpage294
local.bibliographicCitation.startpage290
local.citationTR-CS-06-02en_US
local.contributor.affiliationANUen_US
local.contributor.affiliationDepartment of Computer Science, FEITen_US
local.contributor.authoruidChristen, Peter, u4021539
local.description.refereednoen_US
local.identifier.absfor080109 - Pattern Recognition and Data Mining
local.identifier.ariespublicationu4251866xPUB103
local.identifier.scopusID2-s2.0-78449293191
local.rights.ispublishedyesen_US
local.type.statusPublished versionen_AU

Downloads

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR-CS-06-02.pdf
Size:
247.64 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.92 KB
Format:
Plain Text
Description:
abcd