Do We Really Even Need Data? A Modern Look at Drawing Inference with Predicted Data
arxiv.org·1d

Abstract: As artificial intelligence and machine learning tools become more accessible, and scientists face new obstacles to data collection (e.g., rising costs, declining survey response rates), researchers increasingly use predictions from pre-trained algorithms as substitutes for missing or unobserved data. Though appealing for financial and logistical reasons, using standard tools for inference can misrepresent the association between independent variables and the outcome of interest when the true, unobserved outcome is replaced by a predicted value. In this paper, we characterize the statistical challenges inherent to drawing inference with predicted data (IPD) and show …
