Nonparametric density estimation for stratified samples [February 2001]
Date
2001
Authors
Breunig, Robert
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
In this paper, we consider the non-parametric, kernel estimate of the density, f(x), for data drawn from stratified samples. Much of the data used by economists is gathered in some type of complex survey (stratified, clustered, systematic, etc.), resulting in violations of the usual assumptions of independently and identically distributed data. Such effects induced by the survey structure are rarely considered in the literature on non-parametric density estimation, yet they may have serious consequences for our analysis, as shown in this paper. A weighted estimator is developed which provides asymptotically unbiased density estimation for stratified samples. A data-based method for choosing the optimal bandwidth is suggested, using information on within-stratum variances and means. The weighted estimator and proposed bandwidth are shown to give smaller mean squared error for stratified samples than an unweighted estimator and a commonly used method of choosing the bandwidth. Several illustrations from simulation are provided. We also show that the optimal sampling scheme in this case is always stratified sampling proportional to size, irrespective of the stratum-specific densities.
Description
Keywords
nonparametric density estimation, bandwidth selection, stratified sampling, optimal sampling
Citation
Collections
Source
Type
Working/Technical Paper
Book Title
Entity type
Access Statement
License Rights
DOI
Restricted until
Downloads
File
Description