References
Aruoba, S. Boragan, and Jesus Fernndez-Villaverde. 2018. “A
Comparison of Programming Languages in Economics: An Update.” https://www.sas.upenn.edu/~jesusfv/Update_March_23_2018.pdf.
Ashraf, N., D. Karlan, and W. Yin. 2006. “Tying Odysseus to the
Mast: Evidence From a Commitment Savings Product in the
Philippines.” The Quarterly Journal of Economics 121
(2): 635–72. https://doi.org/10.1162/qjec.2006.121.2.635.
Barrientos, Andrés F., Aaron R. Williams, Joshua Snoke, and Claire McKay
Bowen. 2021. “A Feasibility Study of Differentially Private
Summary Statistics and Regression Analyses with Evaluations on
Administrative and Survey Data.” https://doi.org/10.48550/ARXIV.2110.12055.
Bishop, Christopher M. 2006. Pattern Recognition and Machine
Learning. Information Science and Statistics. New York: Springer.
Blumenstock, Joshua. n.d. “Calling for Better Measurement:
Estimating an Individual’s Wealth and Well-Being from
Mobile Phone Transaction Records.” Center for Effective
Global Action. https://escholarship.org/uc/item/8zs63942.
Blumenstock, Joshua, Gabriel Cadamuro, and Robert On. 2015.
“Predicting Poverty and Wealth from Mobile Phone Metadata.”
Science 350 (6264): 1073–76. https://doi.org/10.1126/science.aac4420.
Brown, Lawrence D., T. Tony Cai, and Anirban DasGupta. 2001.
“Interval Estimation for a Binomial Proportion.”
Statistical Science 16 (2). https://doi.org/10.1214/ss/1009213286.
Bruce, Anotnio, and Gregory Robinson. 2003. “The Planning
Database: Its Development and Use as an Effective Tool in Census
2000.” https://citeseerx.ist.psu.edu/document?repid=rep1&type=pdf&doi=b2edb90180a9132f64f8287af2db92c031b5d40b.
Bruce, Antonio, Gregory Robinson, and Monique V. Sanders. 2001.
“Hard-to-Count Scores and Broad Demographic Groups Associated with
Patterns of Response Rates in Census 2000.” Proceedings of
the Social Statistics Section, American Statistical Association.
Casella, George, and Roger L. Berger. 2002. Statistical
Inference. 2nd ed. Australia ; Pacific Grove, CA: Thomson Learning.
Chernick, Michael R., and Robert A. LaBudde. 2011. An Introduction
to Bootstrap Methods with Applications to r. Hoboken, N.J: Wiley.
Chetty, Raj, John N. Friedman, Søren Leth-Petersen, Torben Heien
Nielsen, and Tore Olsen. 2014. “Active Vs. Passive Decisions and
Crowd-Out in Retirement Savings Accounts: Evidence from
Denmark*.” The Quarterly Journal of Economics 129 (3):
1141–1219. https://doi.org/10.1093/qje/qju013.
Erdman, Chandra, and Nancy Bates. 2014. “The u.s. Census Bureau
Mail Return Rate Challenge: Crowdsourcing to Develop a Hard-to-Count
Score.” https://www.census.gov/content/dam/Census/library/working-papers/2014/adrm/rrs2014-08.pdf.
———. 2017. “The Low Response Score (LRS).” Public
Opinion Quarterly 81 (1): 144–56. https://doi.org/10.1093/poq/nfw040.
Eubank, Nick. 2016. “Embrace Your Fallibility: Thoughts on Code
Integrity.” https://www.nickeubank.com/wp-content/uploads/2016/06/Eubank_EmbraceYourFallibility.pdf.
Fellegi, I. P. 1972. “On the Question of Statistical
Confidentiality.” Journal of the American Statistical
Association 67 (337): 7–18. https://www.jstor.org/stable/2284695?seq=1#metadata_info_tab_contents.
Ginsberg, Jeremy, Matthew H. Mohebbi, Rajan S. Patel, Lynnette Brammer,
Mark S. Smolinski, and Larry Brilliant. 2009. “Detecting Influenza
Epidemics Using Search Engine Query Data.” Nature 457
(7232): 1012–14. https://doi.org/10.1038/nature07634.
Hastie, Trevor, Robert Tibshirani, and J. H. Friedman. 2009. The
Elements of Statistical Learning: Data Mining, Inference, and
Prediction. 2nd ed. Springer Series in Statistics. New York, NY:
Springer.
Higgins, James J. 2004a. An Introduction to Modern Nonparametric
Statistics. Pacific Grove, CA: Brooks/Cole.
———. 2004b. An Introduction to Modern Nonparametric Statistics.
Pacific Grove, CA: Brooks/Cole.
James, Gareth, Daniela Witten, Trevor Hastie, and Robert Tibshirani.
2017. An introduction to statistical learning: with applications in
R. Corrected at 8th printing. Springer texts in statistics. New
York Heidelberg Dordrecht London: Springer. https://doi.org/10.1007/978-1-4614-7138-7.
Knuth, Donald E. 1984. “Literate Programming.” Comput.
J. 27 (2): 97–111. https://doi.org/10.1093/comjnl/27.2.97.
Kolenikov, Stas J. 2016. “Post-Stratification or a Non-Response
Adjustment?” Survey Practice 9 (3): 1–12. https://doi.org/10.29115/SP-2016-0014.
Leisch, Friedrich. 2004. “FlexMix: A General Framework for Finite
Mixture Models and Latent Class Regression in
R.” Journal of Statistical
Software 11 (8). https://doi.org/10.18637/jss.v011.i08.
Li, Jinjing, and Cathal O’Donoghue. 2014. “Evaluating Binary
Alignment Methods in Microsimulation Models.” Journal of
Artificial Societies and Social Simulation 17 (1): 15. https://doi.org/10.18564/jasss.2334.
McClelland, Robert, Surachai Khitatrakun, and Chenxi Lu. 2020.
“Estimating Confidence Intervals in a Tax Microsimulation
Model.” International Journal of Microsimulation 13 (2):
2–20. https://doi.org/10.34196/IJM.00216.
Murphy, Kevin P. 2022. Probabilistic Machine Learning: An
Introduction. Adaptive Computation and Machine Learning Series.
Cambridge, Massachusetts: The MIT Press.
Orcutt, Guy H. 1957. “A New Type of Socio-Economic System.”
The Review of Economics and Statistics 39 (2): 116. https://doi.org/10.2307/1928528.
Peng, Roger. 2018. “Teaching r to New Users - from Tapply to the
Tidyverse.” https://simplystatistics.org/posts/2018-07-12-use-r-keynote-2018/.
Potash, Eric, Joe Brew, Alexander Loewi, Subhabrata Majumdar, Andrew
Reece, Joe Walsh, Eric Rozier, Emile Jorgenson, Raed Mansour, and Rayid
Ghani. 2015. “KDD ’15: The 21th ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining.” In, 2039–47.
Sydney NSW Australia: ACM. https://doi.org/10.1145/2783258.2788629.
Ravn, Signe, Ashley Barnwell, and Barbara Barbosa Neves. 2020.
“What Is “Publicly Available Data”?
Exploring Blurred PublicPrivate Boundaries and Ethical
Practices Through a Case Study on Instagram.” Journal of
Empirical Research on Human Research Ethics 15 (1-2): 40–45. https://doi.org/10.1177/1556264619850736.
Rizzo, Maria L. 2008. Statistical Computing with r. Chapman
& Hall/CRC Computer Science and Data Analysis Series. Boca Raton:
Chapman & Hall/CRC.
Rodrigues, Bruno. 2022. Modern R with the tidyverse. https://modern-rstats.eu.
Salganik, Matthew J. 2018. Bit by Bit: Social Research in the
Digital Age. Princeton: Princeton University Press.
Scott, David W., and Stephan R. Sain. 2005. “Multidimensional
Density Estimation.” In, 24:229–61. Elsevier. https://doi.org/10.1016/S0169-7161(04)24009-3.
Somepalli, Gowthami, Singla, Micah Goldblum, Jonas Geiping, and Tom
Goldstein. 2023. “Diffusion Art or Digital Forgery? Investigating
Data Replication in Diffusion Models.” Proceedings of the
IEEE/CVF Conference on Computer Vision and Pattern Recognition
(CVPR), 6048–58. https://openaccess.thecvf.com/content/CVPR2023/html/Somepalli_Diffusion_Art_or_Digital_Forgery_Investigating_Data_Replication_in_Diffusion_CVPR_2023_paper.html.
Wickham, Hadley. 2010. “A Layered Grammar of Graphics.”
Journal of Computational and Graphical Statistics 19 (1): 3–28.
https://doi.org/10.1198/jcgs.2009.07098.
———. 2014. “Tidy Data.” https://doi.org/10.18637/jss.v059.i10.
———. n.d. The tidyverse style guide. https://style.tidyverse.org/index.html.
Wickham, Hadley, Mine Çetinkaya-Rundel, and Garrett Grolemund. 2023.
R for Data Science: Import, Tidy, Transform, Visualie, and Model
Data. 2nd edition. Sebastopol, CA: O’Reilly.
Wickham, Hadley, and Garrett Grolemund. 2017. R for Data Science:
Import, Tidy, Transform, Visualize, and Model Data. 1st ed.
Paperback; O’Reilly Media. http://r4ds.had.co.nz/.
Zheng, Vivian. 2020. “How Urban Piloted Data Science Techniques to
Collect Land-Use Reform Data.” https://urban-institute.medium.com/how-urban-piloted-data-science-techniques-to-collect-land-use-reform-data-475409903b88.