Skip navigation
Skip navigation

Bandwidth choice for nonparametric classification

Hall, Peter; Kang, Kee-Hoon

Description

It is shown that, for kernel-based classification with univariate distributions and two populations, optimal bandwidth choice has a dichotomous character. If the two densities cross at just one point, where their curvatures have the same signs, then minimum Bayes risk is achieved using bandwidths which are an order of magnitude larger than those which minimize pointwise estimation error. On the other hand, if the curvature signs are different, or if there are multiple crossing points,...[Show more]

dc.contributor.authorHall, Peter
dc.contributor.authorKang, Kee-Hoon
dc.date.accessioned2015-09-11T05:04:29Z
dc.date.available2015-09-11T05:04:29Z
dc.identifier.issn0090-5364
dc.identifier.urihttp://hdl.handle.net/1885/15344
dc.description.abstractIt is shown that, for kernel-based classification with univariate distributions and two populations, optimal bandwidth choice has a dichotomous character. If the two densities cross at just one point, where their curvatures have the same signs, then minimum Bayes risk is achieved using bandwidths which are an order of magnitude larger than those which minimize pointwise estimation error. On the other hand, if the curvature signs are different, or if there are multiple crossing points, then bandwidths of conventional size are generally appropriate. The range of different modes of behavior is narrower in multivariate settings. There, the optimal size of bandwidth is generally the same as that which is appropriate for pointwise density estimation. These properties motivate empirical rules for bandwidth choice.
dc.rights© Institute of Mathematical Statistics, 2005. Author can archive pdf http://www.sherpa.ac.uk/romeo/issn/0090-5364/ as at 11/9/15.
dc.sourceAnnals of Statistics 2005, Vol. 33, No. 1, 284-306
dc.subjectBayes risk
dc.subjectbootstrap
dc.subjectcross-validation
dc.subjectclassification error
dc.subjectdiscrimination
dc.subjecterror rate
dc.subjectkernel methods
dc.subjectnonparametric density estimation
dc.titleBandwidth choice for nonparametric classification
dc.typeJournal article
local.identifier.citationvolume33
dc.date.issued2005-04-25
local.type.statusPublished Version
local.contributor.affiliationHall, Peter, The Australian National University
local.contributor.affiliationKang, Kee-Hoon, The Australian National University
local.bibliographicCitation.issue1
local.bibliographicCitation.startpage284
local.bibliographicCitation.lastpage306
local.identifier.doi10.1214/009053604000000959
CollectionsANU Research Publications

Download

File Description SizeFormat Image
Hall and Kang Bandwidth choice 2005.pdf195.96 kBAdobe PDFThumbnail


Items in Open Research are protected by copyright, with all rights reserved, unless otherwise indicated.

Updated:  17 November 2022/ Responsible Officer:  University Librarian/ Page Contact:  Library Systems & Web Coordinator