Cynthia Rudin Research Interests Education
Total Page:16
File Type:pdf, Size:1020Kb
Cynthia Rudin Professor of Computer Science, Electrical and Computer Engineering, and Statistical Science Duke University D342 LSRC Building, Box 90129, Durham, NC 27708 [email protected] https://users.cs.duke.edu/ cynthia/ Research Interests My work focuses on interpretable machine learning, which is important for responsible and trustworthy AI. This includes the design of algorithms for interpretable modeling, interpretable policy design, variable importance measures, causal infer- ence methods, ranking methods that assist with prioritization, uncertainty quantification, and methods that can incorporate domain-based constraints and other types of domain knowledge into machine learning. These techniques are applied to critical societal problems in criminology, healthcare, energy grid reliability, and other areas. The interpretable machine learning algorithms heavily rely on efficient discrete optimization techniques. Some accomplishments: My collaborators and I developed the first practical code for decision lists and decision trees that provably optimize • accuracy and sparsity. This work had an oral presentation at KDD 2017, a spotlight at NeurIPS 2019, and the latest work appeared in ICML 2020. Our work on optimal scoring systems (sparse linear models with integer coefficients) has been applied to several important • healthcare and criminal justice applications. For instance, my collaboration’s work on seizure prediction in ICU patients allowed for better allocation of ICU patient monitoring resources. This work won the 2019 INFORMS Innovative Applications in Analytics Award. It allowed doctors to monitor 2.8X more patients and saved over $6M at two major hospitals in FY2018. It helps to prevent severe brain damage in critically ill patients. I led a team in the first major effort to maintain an underground electrical distribution network using machine learning, in • joint work with Con Edison in NYC. This won the 2013 INFORMS Innovative Applications in Analytics Award. I solved a well-known (previously open) theoretical problem in machine learning as a PhD student, which is whether • AdaBoost maximizes the margin like SVM. Subsequent work solved a published COLT open problem, earning a prize. My collaborators and I developed code for detecting crime series in cities. This methodology (specifically, the Series • Finder algorithm) was adapted by the NYPD and their application (Patternizr) has been running live in NYC since 2016 for determining whether each new crime is related to past crimes. I have given invited and keynote talks at KDD (2014 and 2019), AISTATS, ECML-PKDD, ML in Healthcare, FAT-ML • (Fairness, Accountability, and Transparency), and at several other venues. I enjoy competing in data science competitions and coaching student teams. We have won awards in several competitions, • including the FICO Recognition Award for the first Explainable Machine Learning Competition in 2018, NTIRE Superresolution competition in 2018, PoeTix Literary Turing Competition in 2018, and we placed second in one of the ICML Exploration and Exploitation competitions. Education Ph.D.: Princeton University. Program in Applied and Computational Mathematics. Title: Boosting, Margins, and • Dynamics. Advisors: Ingrid Daubechies and Robert Schapire. BS/BA: University at Buffalo (SUNY), Honors Program. Outstanding Senior Award in the Arts and Sciences (one • awarded per year university-wide), separate outstanding senior awards from the Physics Department, Mathematics Department, and Music Department, BS Mathematical Physics, BA Music Theory, Minor in Computer Science, Summa Cum Laude, 1999 Cynthia Rudin 2 Employment History Duke University, Computer Science Department (50%), Electrical and Computer Engineering Department (50%), Sec- • ondary appointment in Statistical Science, Secondary appointment in Mathematics. Associate Professor 2016-2019, Professor 2019-present Massachusetts Institute of Technology, MIT Computer Science and Artificial Intelligence Laboratory and Sloan School • of Management, Associate Professor of Statistics 2013-2016, Assistant Professor of Statistics 2009-2013. Columbia University, Center for Computational Learning Systems, Associate Research Scientist, 2007-2009. • NSF Postdoctoral Research Fellow, New York University, 2004-2007 • Princeton University, Graduate Research Assistant, Program in Applied and Computational Mathematics, September • 1999 - June 2004, Postdoctoral Research Fellow, summer 2004 Peer-Reviewed Publications Papers associated with major awards (winner and finalist) 1. Gah-Yi Ban and Cynthia Rudin. MSOM Best OM paper in OR Award, INFORMS. (Awarded to the best paper in Oper- ations Research within the last 3 years, awarded by Manufacturing and Service Operations Management Society of INFORMS.) The Big Data Newsvendor: Practical Insights from Machine Learning, Operations Research, Vol. 67, No. 1, pages 90-108, 2019. 2. Aaron F. Struck, Berk Ustun, Andres Rodriguez Ruiz, Jong Woo Lee, Suzette LaRoche, Lawrence J. Hirsch, Emily J Gilmore, Jan Vlachy, Hiba Arif Haider, Cynthia Rudin, M Brandon Westover. 2019 INFORMS Innovative Applica- tions in Analytics Award (shared with paper below). Association of an Electroencephalography-Based Risk Score With Seizure Probability in Hospitalized Patients, JAMA Neurology, 74 (12), 1419-1424, 2017. 3. Berk Ustun and Cynthia Rudin. 2019 INFORMS Innovative Applications in Analytics Award, also 2017 INFORMS Com- puting Society Student Paper Prize. Learning Optimized Risk Scores. Journal of Machine Learning Research, 2019. Shorter version Learning Optimized Risk Scores from Large-Scale Datasets. Knowledge Discovery in Databases (KDD), 2017. 4. Chaofan Chen, Kangcheng Lin, Cynthia Rudin, Yaron Shaposhnik, Sijia Wang, Tong Wang. A Holistic Approach to Interpretability in Financial Lending: Models, Visualizations, and Summary-Explanations. Decision Support Systems, Accepted, 2021. Preliminary work: Chaofan Chen, Kangcheng Lin, Cynthia Rudin, Yaron Shaposhnik, Sijia Wang, Tong Wang. Winner of the FICO • Recognition Award for the Explainable Machine Learning Challenge, 2018. An Interpretable Model with Globally Consistent Explanations for Credit Risk. NIPS 2018 Workshop on Challenges and Opportunities for AI in Financial Services: the Impact of Fairness, Explainability, Accuracy, and Privacy, 2018. 5. Cynthia Rudin and Berk Ustun. Finalist for 2017 Daniel H. Wagner Prize for Excellence in Operations Research, Institute for Operations Research and Management Science (INFORMS). Optimized Scoring Systems: Towards Trust in Machine Learning for Healthcare and Criminal Justice. INFORMS Journal on Applied Analytics, Special Issue: 2017 Daniel H. Wagner Prize for Excellence in Operations Research Practice, 48(5), pages 399–486, September- October, 2018. 6. William Souillard-Mandar, Randall Davis, Cynthia Rudin, Rhoda Au, David J. Libon, Rodney Swenson, Catherine C. Price, Melissa Lamar, Dana L. Penney. 2016 INFORMS Innovative Applications in Analytics Award (shared with paper below). Learning Classification Models of Cognitive Conditions from Subtle Behaviors in the Digital Clock Drawing Test. Machine Learning, volume 102, number 3, 2016. 7. Berk Ustun and Cynthia Rudin. 2016 INFORMS Innovative Applications in Analytics Award and also Runner up, Invenia Labs SEE Award 2018 - Supporting Machine Learning Research with a Positive Impact on Social, Economic, or Environmental (SEE) Challenges. Supersparse Linear Integer Models for Optimized Medical Scoring Systems. Machine Learning, volume 102, number 3, 2016. Cynthia Rudin 3 8. Cynthia Rudin, S¸eyda Ertekin, Rebecca Passonneau, Axinia Radeva, Ashish Tomar, Boyi Xie, Stanley Lewis, Mark Riddle, Debbie Pangsrivinij, Tyler McCormick. 2013 INFORMS Innovative Applications in Analytics Award Analytics for Power Grid Distribution Reliability in New York City. INFORMS Journal on Applied Analytics, volume 44, issue 4, pages 364-383, 2014. 9. Indraneel Mukherjee, Cynthia Rudin, and Robert E. Schapire. This paper answered an open question published in COLT 2010. The Rate of Convergence of AdaBoost, Journal of Machine Learning Research, volume 14, pages 2315-2347, August 2013. Preliminary work: Indraneel Mukherjee, Cynthia Rudin, and Robert E. Schapire. The Rate of Convergence of AdaBoost, Proceedings • of the 24th Annual Conference on Learning Theory (COLT), 2011. Peer-Reviewed Publications (not including those above) 2021 10. Tong Wang and Cynthia Rudin. Subgroup Identification for Enhanced Treatment Effect with Decision Rules, IN- FORMS Journal on Computing, Accepted, 2021. 11. Yingfan Wang, Haiyang Huang, Cynthia Rudin, and Yaron Shaposhnik. Understanding How Dimension Reduction Tools Work: An Empirical Approach to Deciphering t-SNE, UMAP, TriMAP, and PaCMAP for Data Visual- ization, Journal of Machine Learning Research, accepted with minor revision, 2021. 12. Tianyu Wang, Marco Morucci, M. Usaid Awan, Yameng Liu, Sudeepa Roy, Cynthia Rudin, Alexander Volfovsky. FLAME: A Fast Large-scale Almost Matching Exactly Approach to Causal Inference. Journal of Machine Learning Re- search, 2021. 13. Stefano Traca`, Cynthia Rudin, Weiyu Yan. (Best paper award, INFORMS 2016 Data Mining & Decision Analytics Workshop, also Finalist for 2015 IBM Service Science Best Paper Award.) Regulating Greed over Time in Multi- Armed Bandits. Journal of Machine Learning Research, 2021. 14. Jianyou Wang, Xiaoxuan Zhang, Yuren Zhou, Christopher Suh, Cynthia Rudin. There once was a really bad poet, it was automated but you didn’t know it. Transactions of the Association for