Benford's Law and Imputation of Missing Data

John T. Quinn, Bryant University
Alan D. Olinsky, Bryant University
Phyllis Schumacher, Bryant University

ABSTRACT
Benford's Law and its generalizations are a set of rules which describe the distribution of digits in numbers for many types of data sets. It is often used to help detect data that are fraudulently created. We investigate whether adding a subset of randomized values to a data set will alter its distribution of digits as analyzed through various Benford tests. We consider two sets of data - Medicare charges and Medicare payments for thousands of hospitals - and analyze them using various Benford tests. In addition, we remove 20% of the observations for each set and replace them with uniform random values, and examine the differences.

(Return to Program Resources)

Updated 03/19/2014