Nonparametric density estimation for stratified samples [February 2001]

Date

2001

Authors

Breunig, Robert

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

In this paper, we consider the non-parametric, kernel estimate of the density, f(x), for data drawn from stratified samples. Much of the data used by economists is gathered in some type of complex survey (stratified, clustered, systematic, etc.), resulting in violations of the usual assumptions of independently and identically distributed data. Such effects induced by the survey structure are rarely considered in the literature on non-parametric density estimation, yet they may have serious consequences for our analysis, as shown in this paper. A weighted estimator is developed which provides asymptotically unbiased density estimation for stratified samples. A data-based method for choosing the optimal bandwidth is suggested, using information on within-stratum variances and means. The weighted estimator and proposed bandwidth are shown to give smaller mean squared error for stratified samples than an unweighted estimator and a commonly used method of choosing the bandwidth. Several illustrations from simulation are provided. We also show that the optimal sampling scheme in this case is always stratified sampling proportional to size, irrespective of the stratum-specific densities.

Description

Keywords

nonparametric density estimation, bandwidth selection, stratified sampling, optimal sampling

Citation

Source

Type

Working/Technical Paper

Book Title

Entity type

Access Statement

License Rights

DOI

Restricted until

Downloads