Quantifying Uncertainty and Robustness at Scale Tamara Broderick ITT Career Development Assistant Professor [email protected]

Quantifying Uncertainty and Robustness at Scale Tamara Broderick ITT Career Development Assistant Professor Tbroderick@Mit.Edu

Bayesian Machine Learning Quantifying uncertainty and robustness at scale Tamara Broderick ITT Career Development Assistant Professor [email protected] Raj Agrawal Trevor Campbell Lorenzo Masoero Will Stephenson Microcredit Experiment 1 Microcredit Experiment • Simplified from Meager (2016) 1 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) [amcharts.com 2016] 1 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site [amcharts.com 2016] 1 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? [amcharts.com 2016] 1 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ [amcharts.com 2016] 1 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: • Interpretability [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: • Interpretability; incorporate expert info [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: • Interpretability; incorporate expert info • Quantify uncertainty (coherently) [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Share information across experiments [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Share information across experiments [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+) [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+); easy to use [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+); easy to use • Finite-data, finite-run-time theoretical guarantees [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+); easy to use • Finite-data, finite-run-time theoretical guarantees • Robustness quantification [amcharts.com 2016] 2 Microcredit Experiment • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+); easy to use • Finite-data, finite-run-time theoretical guarantees • Robustness quantification [amcharts.com 2016] 2 Bayesian machine learning • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+); easy to use • Finite-data, finite-run-time theoretical guarantees • Robustness quantification [amcharts.com 2016] 2 Bayesian machine learning • Assistive technology • Simplified from Meager (2016) • 7 sites with microcredit trials (in Mexico, Mongolia, Bosnia, India, Morocco, Philippines, Ethiopia) • ~900 to ~17K businesses at each site • Cybersecurity • Q: how much does microcredit increase business profit? τ • Desiderata: Bayesian methods • Interpretability; incorporate expert info • Quantify uncertainty (coherently) • Model complex phenomena • Get fast results (K, M, B+); easy to use • Finite-data, finite-run-time theoretical guarantees • Robustness quantification [amcharts.com 2016] 2 [image sources: Webb, Caverlee, Pu 2006 , www.itv.com/news/central/story/2014-08-05/locked-in-syndrome-woman-earns-degree-by-blinking] Data summarization 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” Football Curling 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” Football Curling 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” Football Curling 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” • Coresets: pre-process data to get a smaller, weighted data set Football Curling [Agarwal et al 2005; Feldman & Langberg 2011] 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” • Coresets: pre-process data to get a smaller, weighted data set with theoretical guarantees on quality Football Curling [Agarwal et al 2005; Feldman & Langberg 2011] 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” • Coresets: pre-process data to get a smaller, weighted data set with theoretical guarantees on quality Football Curling • Previous heuristics: data squashing, inducing points [Agarwal et al 2005; Feldman & Langberg 2011; DuMouchel et al 1999; Madigan et al 1999] 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” • Coresets: pre-process data to get a smaller, weighted data set with theoretical guarantees on quality Football Curling • Previous heuristics: data squashing, inducing points • Cf. subsampling [Agarwal et al 2005; Feldman & Langberg 2011; DuMouchel et al 1999; Madigan et al 1999] 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” • Coresets: pre-process data to get a smaller, weighted data set with theoretical guarantees on quality Football Curling • Previous heuristics: data squashing, inducing points • Cf. subsampling [Agarwal et al 2005; Feldman & Langberg 2011; DuMouchel et al 1999; Madigan et al 1999] 3 Data summarization • Observe: redundancies can exist even if data isn’t “tall” • Coresets: pre-process data to get a smaller, weighted data set with theoretical

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    50 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us