References

Aruoba, S. Boragan, and Jesús Fernández-Villaverde. 2018. “A Comparison of Programming Languages in Economics: An Update.” https://www.sas.upenn.edu/~jesusfv/Update_March_23_2018.pdf.
Ashraf, N., D. Karlan, and W. Yin. 2006. “Tying Odysseus to the Mast: Evidence From a Commitment Savings Product in the Philippines.” The Quarterly Journal of Economics 121 (2): 635–72. https://doi.org/10.1162/qjec.2006.121.2.635.
Barrientos, Andrés F., Aaron R. Williams, Joshua Snoke, and Claire McKay Bowen. 2021. “A Feasibility Study of Differentially Private Summary Statistics and Regression Analyses with Evaluations on Administrative and Survey Data.” https://doi.org/10.48550/ARXIV.2110.12055.
Bishop, Christopher M. 2006. Pattern Recognition and Machine Learning. Information Science and Statistics. New York: Springer.
Blumenstock, Joshua. n.d. “Calling for Better Measurement: Estimating an Individual’s Wealth and Well-Being from Mobile Phone Transaction Records.” Center for Effective Global Action. https://escholarship.org/uc/item/8zs63942.
Blumenstock, Joshua, Gabriel Cadamuro, and Robert On. 2015. “Predicting Poverty and Wealth from Mobile Phone Metadata.” Science 350 (6264): 1073–76. https://doi.org/10.1126/science.aac4420.
Brown, Lawrence D., T. Tony Cai, and Anirban DasGupta. 2001. “Interval Estimation for a Binomial Proportion.” Statistical Science 16 (2). https://doi.org/10.1214/ss/1009213286.
Bruce, Antonio, and Gregory Robinson. 2003. “The Planning Database: Its Development and Use as an Effective Tool in Census 2000.” https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=b2edb90180a9132f64f8287af2db92c031b5d40b.
Bruce, Antonio, Gregory Robinson, and Monique V. Sanders. 2001. “Hard-to-Count Scores and Broad Demographic Groups Associated with Patterns of Response Rates in Census 2000.” Proceedings of the Social Statistics Section, American Statistical Association.
Casella, George, and Roger L. Berger. 2002. Statistical Inference. 2nd ed. Australia ; Pacific Grove, CA: Thomson Learning.
Chernick, Michael R., and Robert A. LaBudde. 2011. An Introduction to Bootstrap Methods with Applications to R. Hoboken, N.J: Wiley.
Chetty, Raj, John N. Friedman, Søren Leth-Petersen, Torben Heien Nielsen, and Tore Olsen. 2014. “Active Vs. Passive Decisions and Crowd-Out in Retirement Savings Accounts: Evidence from Denmark.” The Quarterly Journal of Economics 129 (3): 1141–1219. https://doi.org/10.1093/qje/qju013.
Erdman, Chandra, and Nancy Bates. 2014. “The U.S. Census Bureau Mail Return Rate Challenge: Crowdsourcing to Develop a Hard-to-Count Score.” https://www.census.gov/content/dam/Census/library/working-papers/2014/adrm/rrs2014-08.pdf.
———. 2017. “The Low Response Score (LRS).” Public Opinion Quarterly 81 (1): 144–56. https://doi.org/10.1093/poq/nfw040.
Eubank, Nick. 2016. “Embrace Your Fallibility: Thoughts on Code Integrity.” https://www.nickeubank.com/wp-content/uploads/2016/06/Eubank_EmbraceYourFallibility.pdf.
Fellegi, I. P. 1972. “On the Question of Statistical Confidentiality.” Journal of the American Statistical Association 67 (337): 7–18. https://www.jstor.org/stable/2284695?seq=1#metadata_info_tab_contents.
Ginsberg, Jeremy, Matthew H. Mohebbi, Rajan S. Patel, Lynnette Brammer, Mark S. Smolinski, and Larry Brilliant. 2009. “Detecting Influenza Epidemics Using Search Engine Query Data.” Nature 457 (7232): 1012–14. https://doi.org/10.1038/nature07634.
Hastie, Trevor, Robert Tibshirani, and J. H. Friedman. 2009. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. 2nd ed. Springer Series in Statistics. New York, NY: Springer.
Higgins, James J. 2004a. An Introduction to Modern Nonparametric Statistics. Pacific Grove, CA: Brooks/Cole.
———. 2004b. An Introduction to Modern Nonparametric Statistics. Pacific Grove, CA: Brooks/Cole.
James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani. 2017. An Introduction to Statistical Learning: With Applications in R. Corrected at 8th printing. Springer Texts in Statistics. New York Heidelberg Dordrecht London: Springer. https://doi.org/10.1007/978-1-4614-7138-7.
Knuth, Donald E. 1984. “Literate Programming.” Comput. J. 27 (2): 97–111. https://doi.org/10.1093/comjnl/27.2.97.
Kolenikov, Stas J. 2016. “Post-Stratification or a Non-Response Adjustment?” Survey Practice 9 (3): 1–12. https://doi.org/10.29115/SP-2016-0014.
Leisch, Friedrich. 2004. “FlexMix: A General Framework for Finite Mixture Models and Latent Class Regression in R.” Journal of Statistical Software 11 (8). https://doi.org/10.18637/jss.v011.i08.
Li, Jinjing, and Cathal O’Donoghue. 2014. “Evaluating Binary Alignment Methods in Microsimulation Models.” Journal of Artificial Societies and Social Simulation 17 (1): 15. https://doi.org/10.18564/jasss.2334.
McClelland, Robert, Surachai Khitatrakun, and Chenxi Lu. 2020. “Estimating Confidence Intervals in a Tax Microsimulation Model.” International Journal of Microsimulation 13 (2): 2–20. https://doi.org/10.34196/IJM.00216.
Murphy, Kevin P. 2022. Probabilistic Machine Learning: An Introduction. Adaptive Computation and Machine Learning Series. Cambridge, Massachusetts: The MIT Press.
Orcutt, Guy H. 1957. “A New Type of Socio-Economic System.” The Review of Economics and Statistics 39 (2): 116. https://doi.org/10.2307/1928528.
Peng, Roger. 2018. “Teaching R to New Users - from Tapply to the Tidyverse.” https://simplystatistics.org/posts/2018-07-12-use-r-keynote-2018/.
Potash, Eric, Joe Brew, Alexander Loewi, Subhabrata Majumdar, Andrew Reece, Joe Walsh, Eric Rozier, Emile Jorgenson, Raed Mansour, and Rayid Ghani. 2015. “Predictive Modeling for Public Health: Preventing Childhood Lead Poisoning.” In KDD ’15: The 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2039–47. Sydney NSW Australia: ACM. https://doi.org/10.1145/2783258.2788629.
Ravn, Signe, Ashley Barnwell, and Barbara Barbosa Neves. 2020. “What Is Publicly Available Data? Exploring Blurred Public–Private Boundaries and Ethical Practices Through a Case Study on Instagram.” Journal of Empirical Research on Human Research Ethics 15 (1-2): 40–45. https://doi.org/10.1177/1556264619850736.
Rizzo, Maria L. 2008. Statistical Computing with R. Chapman & Hall/CRC Computer Science and Data Analysis Series. Boca Raton: Chapman & Hall/CRC.
Rodrigues, Bruno. 2022. Modern R with the tidyverse. https://modern-rstats.eu.
Salganik, Matthew J. 2018. Bit by Bit: Social Research in the Digital Age. Princeton: Princeton University Press.
Scott, David W., and Stephan R. Sain. 2005. “Multidimensional Density Estimation.” In Handbook of Statistics, 24:229–61. Elsevier. https://doi.org/10.1016/S0169-7161(04)24009-3.
Somepalli, Gowthami, Vasu Singla, Micah Goldblum, Jonas Geiping, and Tom Goldstein. 2023. “Diffusion Art or Digital Forgery? Investigating Data Replication in Diffusion Models.” Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 6048–58. https://openaccess.thecvf.com/content/CVPR2023/html/Somepalli_Diffusion_Art_or_Digital_Forgery_Investigating_Data_Replication_in_Diffusion_CVPR_2023_paper.html.
Wickham, Hadley. 2010. “A Layered Grammar of Graphics.” Journal of Computational and Graphical Statistics 19 (1): 3–28. https://doi.org/10.1198/jcgs.2009.07098.
———. 2014. “Tidy Data.” https://doi.org/10.18637/jss.v059.i10.
———. n.d. The tidyverse style guide. https://style.tidyverse.org/index.html.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. 2nd edition. Sebastopol, CA: O’Reilly.
Wickham, Hadley, and Garrett Grolemund. 2017. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. 1st ed. Paperback; O’Reilly Media. http://r4ds.had.co.nz/.
Zheng, Vivian. 2020. “How Urban Piloted Data Science Techniques to Collect Land-Use Reform Data.” https://urban-institute.medium.com/how-urban-piloted-data-science-techniques-to-collect-land-use-reform-data-475409903b88.