Distance measures and smoothing methodology for imputing features of documents

Date

2005

Authors

Feuerverger, Andrey
Tilahun, Gelila
Hall, Peter
Gervers, Michael

Journal Title

Journal ISSN

Volume Title

Publisher

American Statistical Association

Abstract

We suggest a new class of metrics for measuring distances between documents, generalizing the well-known resemblance distance. We then show how to combine distance measures with statistical smoothing to develop techniques for imputing missing features of

Description

Keywords

Keywords: Bandwidth; Correspondence distance; Cross-validation; Dating; Kernel; Resemblance distance; Shingle

Citation

Source

Journal of Computational and Graphical Statistics

Type

Journal article

Book Title

Entity type

Access Statement

License Rights

Restricted until

2037-12-31