Developing synthetic microdata through machine learning for firm-level business surveys
arxiv.org·15h
🧠Machine Learning

Abstract: Public-use microdata samples (PUMS) on individuals have been available from the United States (US) Census Bureau for decades. However, large increases in computing power and the greater availability of Big Data have dramatically increased the probability of re-identifying anonymized data, potentially violating the pledge of confidentiality given to survey respondents. Data science tools can be used to produce synthetic data that preserve critical moments of the empirical data but do not contain the records of any existing individual respondent or business. Developing public-use firm data from surveys presents unique challenges, distinct from those of demographic data, because…
