The Classical Occupancy Distribution: Computation and Approximation
Date
Authors
O'Neill, Ben
Journal Title
Journal ISSN
Volume Title
Publisher
American Statistical Association
Abstract
We examine the discrete distributional form that arises from the "classical occupancy problem," which looks at the behavior of the number of occupied bins when we allocate a given number of balls uniformly at random to a given number of bins. We review the mass function and moments of the classical occupancy distribution and derive exact and asymptotic results for the mean, variance, skewness and kurtosis. We develop an algorithm to compute a cubic array of log-probabilities from the classical occupancy distribution. This algorithm allows the computation of large blocks of values while avoiding underflow problems in computation. Using this algorithm, we compute the classical occupancy distribution for a large block of values of balls and bins, and we measure the accuracy of its asymptotic approximation using the normal distribution. We analyze the accuracy of the normal approximation with respect to the variance, skewness and kurtosis of the distribution. Based on this analysis, we give some practical guidance on the feasibility of computing large blocks of values from the occupancy distribution, and when approximation is required.
Description
Citation
Ben O’Neill (2020): The Classical Occupancy Distribution: Computation and Approximation, The American Statistician, DOI: 10.1080/00031305.2019.1699445
Collections
Source
The American Statistician
Type
Book Title
Entity type
Access Statement
License Rights
Restricted until
2037-12-31
Downloads
File
Description