<<

Curriculum Vitae

Tandy Warnow Grainger Distinguished Chair in Engineering

1 Contact Information

Department of The University of Illinois at Urbana-Champaign Email: [email protected] Homepage: http://tandy.cs.illinois.edu

2 Research Interests

Phylogenetic tree inference in biology and historical linguistics, multiple , metage- nomic analysis, big data, statistical inference, probabilistic analysis of , machine learning, combinatorial and graph-theoretic algorithms, and experimental performance studies of algorithms.

3 Professional Appointments

• Co-chief scientist, C3.ai Digital Transformation Institute, 2020-present • Grainger Distinguished Chair in Engineering, 2020-present • Associate Head for Computer Science, 2019-present

• Special advisor to the Head of the Department of Computer Science, 2016-present • Associate Head for the Department of Computer Science, 2017-2018. • Founder Professor of Computer Science, the University of Illinois at Urbana-Champaign, 2014- 2019

• Member, Carl R. Woese Institute for Genomic Biology. Affiliate of the National Center for Supercomputing Applications (NCSA), Coordinated Sciences Laboratory, and the Unit for Criticism and Interpretive Theory. Affiliate faculty member in the Departments of Mathe- matics, Electrical and Computer Engineering, Bioengineering, Statistics, Entomology, Plant Biology, and Evolution, Ecology, and Behavior, 2014-present.

• National Science Foundation, Program Director for Big Data, July 2012-July 2013. • Member, Big Data Senior Steering Group of NITRD (The Networking and Information Tech- nology Research and Development Program), subcomittee of the National Technology Council (coordinating federal agencies), 2012-2013

• Departmental Scholar, Institute for Pure and Applied Mathematics, UCLA, Fall 2011 • Visiting Researcher, University of Maryland, Spring and Summer 2011.

1 • Visiting Researcher, Smithsonian Institute, Spring and Summer 2011. • Professeur Invit´e,Ecole Polytechnique F´ed´erale de Lausanne (EPFL), Summer 2010. • Visiting Researcher, Microsoft New England, Fall 2010. • Visiting Scholar, UC Berkeley, Summer 2009; co-taught (with Johanna Nichols and Donald Ringe) the course Computational Methods in Linguistic Reconstruction at the 2009 Linguistic Institute held at Berkeley. • Visiting Scholar, Program for Evolutionary Dynamics at , 2004-2005. • Radcliffe Institute for Advanced Studies, Emeline Bigelow Conland Fellow, 2003-2004. • Visiting Scholar, University of California at Berkeley, 2002-2004. • University of Texas at Austin, 9/1999 - 2014 – Co-Director, Center for and , 2001-2003 – Professor, Department of Computer Science, University of Texas at Austin (effective 9/1/2003). Assoc. Professor 9/1999-9/2003. – Member, Texas Institute for Computational and Applied Mathematics, and the Institute for Cellular and . – Member, Graduate Programs in Computer Sciences, Molecular Biology, Computational and Applied Mathematics, and the Program in Ecology, Evolution, and Behavior. • University of Pennsylvania, 9/1993-8/1999. – Associate Professor (tenured January 1998), Department of Computer and Information Sciences. – Member, Institute for Research in Cognitive Sciences. – Co-PI, Graduate and Postdoctoral Research Training Program (RTG) in Computational Biology. (PI: Warren Ewens). • University of Arizona, 9/1998-8/1999, Visiting Professor, Departments of Computer Science and Ecology and Evolutionary Biology. • , 1997-1998, Visiting Researcher, Department of Computer Science. • Princeton University, 1997-1998, Visiting Professor, Departments of Mathematics and Com- puter Sciences. • DIMACS, 1996, visitor.

4 Education

• Postdoctoral Fellowship (1991-1992), University of Southern California, with Michael Water- man and Simon Tavar´e. • Ph.D. Mathematics (1991), University of California, Berkeley. Dissertation: Combinatorial Algorithms for Constructing Phylogenetic Trees. Committee: (advisor), , , Dan Gusfield, and Richard Karp. • B.A. Mathematics (1984), magna cum lauda, University of California, Berkeley.

2 5 Honors

• Language (Journal of the Linguistic Society of America), included one of my papers (Nakhleh, Ringe, and Warnow, ‘Perfect Phylogenetic Networks: A new methodology for reconstructing the evolutionary history of natural languages, ” Language (Journal of the Linguistic Society of America), 81(2), pp. 382-420, June 2005) in their selection of top 20 papers from the last 30 years (see https://www.linguisticsociety.org/content/best-language-volume-iii)

• Teaching: I have been included in the List of Teachers Rated Excellent by their students four times (Spring 2017 for CS 466 and CS 581, Spring 2018 for CS 581, and Fall 2020 for CS 581). In the last of these, I was listed as “Outstanding”. • Elected member, Electorate Nominating Committee (ENC) of the Section on Information, Computing, & Communication, American Association for the Advancement of Science (AAAS), 2019-present • Fellow of the International Society for Computational Biology (ISCB), 2017. • Fellow of the Association for Computing Machinery (ACM), 2015. Citation: For contributions to mathematical theory, algorithms, and software for large-scale molecular and historical linguistics. • Founder Professor of Engineering, the University of Illinois at Urbana-Champaign, 2014- present • David Bruton, Jr. Centennial Professorship in Computer Science, 2010 - 2014

• John Simon Guggenheim Foundation Fellowship, 2011, New problems in evolutionary estima- tion. • Radcliffe Institute for Advanced Study, Emeline Bigelow Conland Fellow, 2003-2004. • David and Lucile Packard Foundation Fellowship, 1996-2001, Algorithms for reconstructing evolutionary trees in biology and linguistics.

• NSF National Young Investigator Award, 1994-99. Combinatorial Problems in Evolutionary Tree Construction.

6 Current and Former Graduate Students

Current:

• Gillian Chu (MS student, degree expected 2022) • Baqiao Liu (PhD student, degree expected 2026) • Minhyuk Park (PhD student, degree expected 2025) • Chengze Shen (PhD student, degree expected 2025)

• Yasamin Tabatabaee (PhD student, degree expected 2026) • Eleanor Wedell (MS student, degree expected 2022) • James Willson (PhD student, degree expected 2026)

3 Former: • Md. Shamsuzzoha Bayzid (PhD Fall 2016). Associate Professor, Department of Computer Science and Engineering (CSE), Bangladesh University of Engineering and Technology. • Sarah Christensen (PhD student in Computer Science, co-advised with Mohammed El-Kebir), PhD 2020. Now working in industry • Ganesh Ganapathy (PhD August 2006), Apple Computers • Ashu Gupta (MS May 2016, Computer Science at the University of Illinois at Urbana-Champaign), now at Apple

• Kevin Liu (PhD May 2011), Assistant Professor of Computer Science, Michigan State Univer- sity • Siavash Mirarab (PhD August 2015), Associate Professor of Electrical and Computer Engi- neering, UCSD • Erin Molloy (PhD August 2020), Assistant Professor of Computer Science, Univ Maryland starting 2021. Co-advised with Bill Gropp, PhD August 2020) • Luay Nakhleh (PhD May 2004), Dean of Engineering and Professor of Computer Science at Rice University. • Serita Nelesen (PhD December 2009), Software developer for Neural Planet

• Nam-phuong Nguyen (PhD August 2014), Principle Scientist, Bioinformatics, Boundless Bio • Michael Nute (PhD student 2019), postdoctoral researcher at Rice University • Usman Roshan (PhD May 2004), Associate Professor of Computer Science at NJIT.

• Vladimir Smirnov (PhD, August 2021), now postdoctoral researcher • Michelle (Shel) Swenson (PhD May 2009), Lecturer, Department of Mathematics, University of Tennessee at Knoxville • Pranjal Vachaspati (PhD December 2019), now Software Engineer at Google

• Li-San Wang (PhD May 2003), Professor of Pathology and Laboratory Medicine at the Uni- versity of Pennsylvania. • Shibu Yooseph (PhD received 2000), Professor of Computer Science at Central Florida Uni- versity • Xilin Yu (MS awarded 2019), now software engineer at Amazon

7 Current and Former Postdoctoral advisees

Current: • Paul Zaharias, Feb 2020-present Former:

• Kevin Atteson (Chief Quant at Summer Road)

4 • Fran¸cois Barban¸con(now self-employed) • Ruth Davidson (NSF Postdoctoral Fellow in Mathematics, now J.L. Doob Research Assistant Professor in the Department of Mathematics at UIUC) • Dannie Durand (Associate Professor of Biology and of Computer Science, Carnegie Mellon University) • Daniel Huson (C4 Professor of Bioinformatics, Tubingen University, Germany) • Nam-phuong Nguyen (now Principle Scientist, Bioinformatics, Boundless BIO) • Ken Rice (now retired)

• Katherine St. John (Professor of Computer Science, Hunter College, CUNY) • Elizabeth Sweedyk (Associate Professor of Computer Science, Harvey Mudd College) • Shel Swenson (Lecturer, University of Tennessee at Knoxville, Mathematics Department)

8 Current Grant Support

• National Science Foundation (NSF) 2006069, IIBR Informatics: Advancing Bioinformatics Methods using Ensembles of Profile Hidden Markov Models. PI Warnow (Co-PI: Jian Peng), August 15, 2020 to August 14, 2023. This project will extend the theory foundations of ensembles of profile HMMs, and use them for protein structure and function prediction.

9 Recent Grant Support

• National Science Foundation (NSF) DBI:1458652. ABI Innovation: New methods for multiple sequence alignment with improved accuracy and scalability. PI Warnow. August 15, 2015 to July 31, 2021 (no-cost extension). Award Amount: $861,625. (No Co-PIs). • National Science Foundation (NSF) grant III:AF:1513629. This is a collaborative grant with the University of Maryland, for new methods for metagenomic dataset analysis, building on our TIPP method for taxon identification of reads in a metagenomic sample.

• National Science Foundation (NSF) CCF 15-35977, AitF: Full: Collaborative Research: Graph- Theoretic Algorithms to Improve Phylogenomic Analyses. I am the overall PI, and this project is collaborative with Satish Rao (UC Berkeley PI) and Chandra Chekuri (UIUC). We are devel- oping new theoretical computer science and discrete algorithms for improving the estimation of large species and gene trees, and specifically enabling statistical methods to scale to ultra-large datasets. • NSF DBI-1062335/1461364. Collaborative Research: Novel Methodologies for Genome-scale Evolutionary Analysis of Multi-locus data. PI, 2010-2013. $349,999. Collaborative grant with Rice University and Stanford University. 2010-2013 (extended to 2016). • NSF DEB 0733029 (ATOL): Collaborative Research: Large-Scale Simultaneous Multiple Align- ment and Phylogeny Estimation. Overall PI: Warnow. Collaborative grant with the University of Georgia, The University of Nebraska, and the University of Kansas. Total amount: $1.7 Million for the entire project. 2007-2011 (extended to 2014).

5 • NSF ITR 0331453 (Large): Building the Tree of Life – A National Resource for Phyloinfor- matics and Computational Phylogenetics. Overall PI: Warnow, 2003-2008 (extended to 2010). Collaborative with four other institutions (UCSD, UNM, , and UC Berkeley). Total amount: $11.6 Million for the entire project. Final report in abridged form available at http://www.cs.utexas.edu/users/tandy/CIPRES-FINAL.pdf.

• NSF ITR BCS 0312830 (small): Collaborative Research, Algorithms for Inferring Reticulate Evolution in Historical Linguistics. Overall PI: Warnow, (with Donald Ringe, at the University of Pennsylvania), 2003-2006 (extended to 2009). Total amount: $344,517. • NSF ITR/AP 0121680 (medium): Collaborative Research, Reconstructing Complex Evolu- tionary Histories. Overall PI: Warnow, 2001-2006 (extended to 2009); collaborative with the University of New Mexico. Total amount: $1,707,458. • NSF IGERT DGE 0114387: IGERT: Computational Phylogenetics and Applications to Bi- ology. PI: David Hillis, Co-PIs Warnow, Jansen, and Gutell, 2001-2006 (extended to 2008). Total amount: $2,741,575.

• NSF DEB 0120709: Comparative Chloroplast : Integrating Computational Methods, Molecular Evolution, and Phylogeny. PI: Robert Jansen, Co-PIs Warnow and Raubeson, 2001- 2006. Total amount: $1,350,000. • NSF ITR/AP 0121682 (medium): Exploring the Tree of Life, PI: , 2001-2004 (extended to 2005). Total amount: $785,780.

• NSF ITR/AP DEB 011354: Collaborative Research, Computing Optimal Phylogenetic Trees Under Genome Rearrangement Metrics. PI: Robert Jansen, 2001-2004. Participant: Warnow. Total amount: $288,030. • NSF 9985991: CISE Research Instrumentation. PI: Doug Burger. Co-PIs: Tandy Warnow, Harrick Vin, Steve Keckler, and Inderjit Dhillon, 2000-2003. Total amount: $139,481.

• NSF 9512092 (Linguistics): Character-based Methods for Reconstructing the Evolutionary History of Natural Languages. PI: Tandy Warnow. Co-PI’s: Donald Ringe and Ann Taylor. 1995-1998. Total amount: $161,000. • NSF 9413215 (Research Training, Computational Biology): Statistical and Computational Methods for Data Management and Analysis in Molecular Genetics. PI: Warren Ewens. Co- PI: Tandy Warnow, David Roos, and David Searls. 1994-1999. Total amount: $1,592,579. • NSF 9457800 (Theory of Computing): NSF Young Investigator: Computational Problems in Evolutionary Tree Reconstruction, 1994-2000. Total amount: $275,000.

10 National and International Service

• Member, AAAS Election Nomination Committee (Section T), 2019-2020 • Faculty, Strategies and Techniques for Analyzing Microbial Population Structures (STAMPS) at Marine Biology Laboratory at Woods Hole, 2017-2019 • Member, Faculty of 1000, 2017-present • Genome 10K, Council Member, 2015-2019 • Paris Kanellakis Award Committee Member, 2015-2019 (chair 2017-2018) • WABI (Workshop on Algorithms for Bioinformatics) Steering Committee 2013-present

6 • Co-organizer, IPAM (Institute for Pure and Applied Mathematics) long program, “Science at Extreme Scales: Where Big Data Meets Large-Scale Computing” Fall 2018 • Co-organizer, IPAM (Institute for Pure and Applied Mathematics) workshop on Multiple Se- quence Alignment, 2015 • Co-organizer, Programme on Mathematical, Statistical and Computational Aspects of the new science of Metagenomics, Sir Isaac Newton Institute of Mathematical Sciences, Cambridge University, 2012-2014 • Program Director for Big Data, National Science Foundation, July 2012 - July 2013 • Chair, NIH BDMA (Biodata Management and Analysis) Study Section, 2010-2012 • PLoS Currents: Tree of Life, Board of Reviewers, 2010-present. • Faculty Recruiting Committee, ETH-Zurich, 2010-2011. • Advisory Board, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2006-2012. • Advisory Board, Springer, Computational Biology Book Series, 2005 - present. • Participant, Institutes of Medicine Workshop on Interdisciplinary Programs and Academic Health Centers on Tuesday, September 26th, 2006. • Member, Board of Directors, the International Society for Computational Biology (ISCB), 2001-2004. • Committee member, National Academy of Sciences Commmittee on Interdisciplinarity (2003- 2004). • Program committee co-chair, National Academy of Sciences (USA), Japanese-American Fron- tiers of Science (2003). • Committee member, National Academy of Sciences, Committee for Biological Cyberinfranstruc- ture (BIOACCI). (2003). • Committee member, National Research Council Committee on Interdisciplinarity. (2003). • Program committee, National Academy of Sciences (USA), Japanese-American Frontiers of Science (2002). • Panelist, Howard Hughes Medical Institute, Postdoctoral Fellowships in Biological Sciences, 2003. • Program committee co-chair: Great Lakes Bioinformatics (GLBIO) 2017; Combinatorics and Computing (COCOON) 2003; European Conference on Computational Biology (ECCB) 2008; Track for Evolution and Phylogeny, Intelligent Systems for Molecular Biology (ISMB) 2006, 2007, 2008, 2011, 2013, and 2014; ACM-SIAM Symposium on Discrete Algorithms (SODA) 1999; Workshop on Algorithms for Bioinformatics (WABI) 2009; Great Lakes Bioinformatics (GLBIO) 2016. • Conference co-chair, Intelligent Systems for Molecular Biology (ISMB) 2018. • Program committee member: ACM-BCB 2017; AAAI 2015; ALENEX 2005; COCOON 1998 and 2003; EACL 2012 Joint Workshop of LINGVIS (Visualization of Linguistic Patterns) and UNCLH (Uncovering Language History from Multilingual Resources); FOCS 1996; FUN 2001; GLBIO 2016 and 2017; ICALP 2005; ISMB 2006-2009, 2011-2014, 2016, and 2019; JOBIM 2000; RECOMB 1997, 2001, 2002, 2004, 2009-2011, 2014, 2016, 2018, 2019, and 2021. RECOMB-CG 2015-2019; SODA 2001; STOC 1998; WABI 2001, 2002, 2004-2006, 2009, 2010, 2015, 2018-2021. and WADS 2001. • NSF Panelist, 1995, 1996, 2000, 2001, 2002, 2003, 2007, 2016, and 2019. • Proposal reviewer for NSERC, 1999-2006. • Reviewer for faculty candidates at the Royal Institute of Technology in Stockholm, Sweden, 1999.

7 11 University Service

University service at UIUC • Faculty Senate, 2018-2020 • NCSA Allocations: PC member 2014, 2021

• Blue Waters Allocation Committee, 2015-2020

College of Engineering at UIUC

• Chair, Anti-Racism Task Force, 2020 • Chair, Grainger Engineering Breakthroughs Initiative (GEBI) Professorship committee, 2020- present • Member, Climate Diversity and Inclusiveness committee (COE), 2019

• Member, Search committee for the Dean of Engineering, 2018 • Member, Grainger Chairs and Professorship search committee (BigData), 2015-2016, 2018- present • Member, COE Promotion and Tenure Committee, 2017-2019

• Member, Awards committee for College of Engineering, 2016-2017

Department of Computer Science at UIUC

• Associate Head of the Department of Computer Science, 2017-2018, 2019-present • Special Advisor to the Head of the CS Department, 2016-present • Chair (elected), Promotion and Tenure Committee, CS Department, 2018-2019 • Chair (elected), Faculty Advisory Committee, Department of Computer Science, 2016-2017

• Chair, Tenure Track Faculty Mentoring Committee, 2018-2019 • Chair, CS Department P&T committee, 2018-present • Area Chair, Bioinformatics and Computational Biology group, CS department, 2015-2019

• Member, P&T committee, CS department, 2015-2017, 2018-2019, 2021-present • Member, FAA (Graduate Admissions Committee), Department of Computer Science, 2014- 2016 • Member, Internal Advisory Committee, CS department, 2015-2017.

• Member, Named Professorship committee, CS department, 2015-present • Chair, Big Research Initiatives Committee, 2017-2018, 2019- • Member, GEBI committee for Department of Computer Science, 2017-2019 • Member, Governance committee, CS department, 2015-2017

8 Department of Bioengineering at UIUC • Chair (elected), Internal Advisory Committee, Department of Bioengineering (2017) • Chair, Awards Committee, Bioengineering Department, 2016-2017 • Member, Internal Advisory Committee for Bioengineering, 2015-2017

• Member, PhD preliminary exam committee, Bioengineering, 2015-2016 • Member, P&T committee for Bioengineering, 2014-2017 • Member, Named Professorship committee for Bioengineering, 2014-2017

• PhD preliminary exam committee, Bioengineering, 2014-2015

12 Invited Lectures

2021 • February 1, IHP, Paris, France, conference on Phylogeny and Inference. • June 16, Keynote talk (10:30 AM) at MCEB

• June 22, SBE (Systematics, Biogeography, and Evolution). • July 8 SMBE (Society for Molecular Biology and Evolution) 2021. Invited talk in Symposium 32, ”Large-scale phylogeny using novel computational and statistical tools”. • Aug 30-Sept 3, 2021. IMSI (University of Chicago), Symposium on Eliciting Structure in Genomics Data. • September 28-30, 2021. Genomic population structures of microbial pathogens. Royal Society, London, UK.

2020 • February 24-26, 2020, Copenhagen, Denmark, Bird 10K Workshop • February 27, 2020. Natural History Museum, London, UK

• All invitations from March onward were cancelled due to COVID-19

2019

• February 13, 2019. University of Pennsylvania, Penn Bioinformatics Forum. March 22, 2019. University of Pennsylvania, Penn Linguistics Conference, Panel on “Interplay between Linguis- tics and Biology” • March 22, 2019. University of Pennsylvania, Penn Linguistics Conference, Panel on “Interplay between Linguistics and Biology”

• April 5, 2019. Harvey Mudd College, Computer Science Department seminar • April 12-14, 2019. “Trees in the Desert 2019”, a workshop on ultra-large phylogenetic trees, Tucson AZ

9 • April 30, 2019. European Bioinformatics Institute, Cambridge UK • May 13-15, NIMBIOS, Knoxville TN Scientific Collaboration Enabled by High Performance Computing • May 24, University of Chicago, Department of Computer Science

• May 27, Symposium at UC Berkeley on Phylogenomics • May 28-30, The 6th International Conference on Algorithms for Computational Biology (Al- CoB 2019), Berkeley • June 17-19, Systematics Association Biennial, Bristol UK

• August 28, Genome 10K Conference, New York • September 13, UCLA, Department of Statistics, Los Angeles CA • October 18, Statistics Graduate Student Seminar, UIUC

• November 7, Mathematical Biology Seminar, UIUC • November 15, UC Irvine, Department of Computer Science • November 22, Plant Biology, UIUC

2018 • January 5, 2018. Hunter College, Department of Mathematics • March 16, 2018. UCLA, Bioinformatics program

• April 4, 2018. UIUC, Pathobiology Seminar • April 19, 2018. Imperial College, London. Colloquium, Department of Mathematics. • April 21-24, 2018, Plenary talk, RECOMB (Paris, France) • May 8, 2018. CRA-W Distinguished Lecture, UCLA Computer Science.

• June 1-3, 2018. Society for Systematic Biologists, standalone meeting, Ohio State University. • June 4-8, 2018. SIAM Discrete Mathematics Annual Meeting, Foundations of Data Science. Denver, Colorado. • August 13, 2018. Statistics Department seminar, Oxford University.

• September 14, 2018. Advances in Phylogenomics, Genome 10K conference • October 29, 2018. Northwestern University, Department of Applied Mathematics • November 7, 2018, IPAM (at UCLA)

• November 16, 2018. Kew Royal Botanical Gardens, London UK

10 2017 • NIPS Workshop on Advances in Modeling and Learning Interactions from Complex Data, Long Beach CA, December 8, 2017. • Kew Royal Botanical Gardens, October 5, 2017.

• Keynote speaker, ACM-BCB and WABI, Cambridge, MA. August 20-23, 2017. • STAMPS, Marine Biology Laboratory at Woods Hole, August 6, 2017. • Keynote speaker. IPDPS (IEEE International Parallel and Distributed Processing) Sympo- sium, Orlando FL. May 30, 2017.

• NeLLi: From New Lineages of Life to New Functions at the DOE Joint Genome Institute (JGI). April 5, 2017. • Second Workshop on Statistical and Algorithmic Challenges in Microbiome Data Analysis at The Broad Institute of MIT and Harvard, in Cambridge, MA. February 16-17, 2017.

• Plant Biology, UIUC. January 27, 2017.

2016

• Simons Center, New York City. December 2016. • National Institutes of Health, NLM. December 2016. • NYU CS department, November 2016. • CLIMB conference, Lausanne, Switzerland, November 2016.

• Statistics Department Colloquium, University of Chicago. October 2016. • Georgia Tech, Math Department IMPACT Distinguished Lecture. October, 2016. • Georgia Tech, CSE Department. Distinguished Lecture. October, 2016.

• Princeton CS Department. October, 2016. • University of Pennsylvania, Math Department. October, 2016. • CMU Computational Biology. September, 2016. • Using Ensembles of HMMs for Grand Challenges in Bioinformatics, as part of the Schloss Dagstuhl seminar Next generation sequencing - Algorithms and Software for Biomedical Ap- plications August, 2016. • Oxford University Statistics Department. August 2016. • University of Trento, Italy. August, 2016.

• SIAM Conference Discrete Math, Georgia State University, Atlanta, Georgia, USA. June, 2016. • Blue Waters Symposium, Oregon. June, 2016. • Austin, Texas Phylogenomics Symposium, Advances in Multiple Sequence Alignment June, 2016.

11 • Austin, Texas. Evolution 2016 meeting. June, 2016. • UCSD Bioinformatics seminar, May 2016 • MIT Computer Science conference on historical linguistics, May 2016 • Molecules as documents of evolutionary history: 50 years after Roscoff (Brittany), France, May 2016 • Co-evolution in proteins and RNA, theory and experiments, Cargese, Corsica, April 2016 • PhyloPizza, Smithsonian Institute, March 2016

• CEHG Symposium at Stanford University, March 2016 • Toyota Technology Institute of Chicago, February 2016 • ITA (Information Theory and Applications), Plenary Talk, February 2016 • University of Washington, Combi seminar, February 2016

• UC Davis, Genomics Institute, January 2016. • UC San Francisco, January 2016 • UC Berkeley, Museum of Comparative Zoology, January 2016 • Pacific Symposium on Biocomputing, January 2016

2015 • HHMI (Howard Hughes Medical Institute), November 2015

• University of California at San Diego, Distinguished Lecture, November 2015 • American Mathematical Society, Chicago, October 2015 • Program for Evolutionary Dynamics, Harvard University, October 2015

• University of Maryland, September 2015 • Duke University, September 2015 • Annual meeting of the Society for Molecular Biology and Evolution, July 2015 • Great Lakes Bioinformatics Conference; Keynote speaker, May 2015

• Genome 10K, March 2015 • National Center for Supercomputing Applications, February 2015 • IPAM (Institute for Pure and Applied Mathematics at UCLA), January 2015

12 2014 • Department of Computer Science, Swarthmore College, December 2014 • Institute for Genomic Biology, University of Illinios at Urbana-Champaign, November 2014 • Institute for Biomedical Informatics, The University of Pennsylvania, October 2014 • Mathematics Colloquium, University of Illinois at Urbana-Champaign, October 2014 • Gotham Seminar on Genomics & Statistics, Columbia University, October 2014 • Department of Computer Science, Case Western University, September 2014 (Distinguished Lecture) • Pacific Northwestern National Laboratories, September 2014 • Simons Institute for the Theory of Computing (Berkeley), April 2014 • Copenhagen University, March 2014 • Newton Institute for Mathematical Sciences at Cambridge University, March 2014 • Janelia Farm (HHMI), February 2014

2013 • MAGE (Models and Algorithms for Genome Evolution), plenary speaker August 2013; invited by Nadia El-Mabrouk (Univ , Computer Science) • Evolution 2014 annual meeting, Symposium on Practical Phylogenomics, invited by David Posada (Biology, University of Vigo) • IPAM (Institute for Pure and Applied Mathematics) plenary speaker, June 2013; invited by (UCLA CS) • 10K Genome, Hollywood; invited by (UCSC, Computer Science) • Janelia Farm (Howard Hughes Medical Institute), March 2013; invited by Sean Eddy • Smithsonian Institution, Frontiers in Phylogenomics, invited by Charles Mitter (University of Maryland, Entomology)

2012 • Computational and Statistical Phylogenomics, Vigo, Spain, Sept 2012; invited by David Posada (University of Vigo Biology) • M.I.T. Bioinformatics, May 2012; invited by (MIT CS) • Smithsonian Institution, May 2012; invited by Mike Braun • J Craig Venter Institute, May 2012; invited by Karen Nelson • UCLA Computational Biosciences, April 2012; invited by Matteo Pelligrini (UCLA Biology) • University of Southern California, Symposium for Mike Waterman and Simon Tavar´e, March 2012; invited by Fengzhu Sun (USC Computational Biology) • University of Pennsylvania, Bio-Mathematics Seminar, March 2012; invited by Robin Pemantle (Penn Mathematics) and Junhyong Kim (Penn Biology)

13 2011 • Janelia Farm, Dec 2011; invited by Sean Eddy • Science Faculty Colloquium, Bristol University, distinguished lecture, Dec 2011 • Univ Wisconsin Biostatistics, Nov 2011; invited by Cecile An´e(Biostatistics)

• UCLA Institute for Pure and Applied Mathematics (IPAM), Nov 2011; invited by Eleazar Eskin (UCLA CS) • Heilbronn Annual Conference, University of Bristol, England, September 2011; plenary speaker, invited by Trevor Wooley (Bristol Mathematics)

• J Craig Venter Institute, Oct 2011; invited by Shibu Yooseph • Newton Institute, University of Cambridge, June 2011; invited by Vincent Moulton (University of East Anglia, UK) • UC Berkeley, Lens on the Sciences, May 2011; invited by Mike Jordan and Dick Karp (Berkeley CS) • Princeton University, Ecology and Evolutionary Biology, May 2011; invited by Laura Landwe- ber (Princeton EEB) • University of Maryland, April 2011

• Harvard University, invited by Stuart Shieber (Harvard CS) and Naomi Pierce (Harvard OEB) • Bardonnechia, Italy; invited by Riccardo Zecchina (Physics, University of Turin), February 2011

2010 • Brown University, December 2010; distinguished lecture, invited by Ben Raphael (Brown CS) • North Carolina State University, Department of Mathematics, December 2010; invited by Loek Helminck • Microsoft Research, Computational Aspects of Biological Information, December 2010; invited by Jennifer Chayes (MSRE) and Riccardo Zecchina (Turin Physics) • Washington Area Phylogenetics Consortium PHYLO-PIZZA, December 2010; invited by Mike Braun (Smithsonian Institute)

• Microsoft Research New England, November 2010; invited by Jennifer Chayes • Johns Hopkins University, Department of Computer Science, November 2010; invited by Rao Kosaraju (JHU CS) • Dartmouth College, Department of Computer Science, November 2010; invited by Peter Win- kler (Darthouth Math and CS) • iEvolBio conference, Evolution Meeting, June 2010. • RECOMB Bioinformatics Education Conference, May 2010; invited by Pavel Pevzner (UCSD CS)

14 • Illinois Language and Linguistics Society conference (ILLS 2), May 2010. • University of Illinois at Chicago, April 2010; invited by Jerry Bona (UIC Mathematics) • University of Chicago, April 2010; invited by Wen-Hsiung Li (Univ Chicago Biology) • Princeton University, Computer Science Colloquium, Feb 2010; invited by (Princeton CS) • NESCENT (National Evolutionary Synthesis Center), Feb 2010; invited by Allen Rodrigo (NESCENT) • CMU-UPitt, Computational Biology Seminar, Feb 2010;

• Yale University Computer Science Colloquium, Jan 2010; invited by Joan Feigenbaum (Yale CS)

2009 • Workshop on Molecular Evolution and Phylogenetics, SAMSI, April 2009. • Swadesh Centenary Conference, Leipzig, Germany, plenary speaker, January, 2009.

2008 • Harvard Symposium on The Science of The Human Past, Dec 5, 2008; invited by Stuart Shieber (Harvard, CS)

• European Conference on Complex Systems Science (ECCS08), Sept 11-17, 2008; plenary speaker, invited by Riccardo Zecchina (Turin, Physics) • School of Computer and Communication Sciences at EPFL, June 12, 2008; distinguished lec- ture, invited by (EPFL Computer Science)

• MIEP (Mathematics and Informatics in Evolution and Phylogeny), June 2008; invited by Olivier Gascuel (Montpellier Computer Science) • Department of Computer Sciences, Northwestern University, May 2, 2008 • School of Informatics, Indiana University, April 11, 2008

• Department of EECS, UC Berkeley, Distinguished Lecture, Feb. 6, 2008; invited by Dick Karp (Berkeley CS) • EECS Colloquium, UC Berkeley, Feb. 2008; invited by Satish Rao (Berkeley CS)

Distinguished lectures and keynote talks before 2008 • The Sir Isaac Newton Institute for Mathematical Sciences, September 4, 2007 • Department of Computer Sciences, University of Nebraska, Distinguished Lecture, Oct. 30, 2007.

• College of Computing, Georgia Inst. of Technology, Distinguished Lecture, Nov. 1, 2007; invited by David Bader (GaTech CS)

15 • Department of Computer Science, UC Davis, Distinguished Lecture, Nov. 15, 2007; invited by Dan Gusfield (UCD CS) • Plenary talk, Workshop on Graph Algorithms (WG’06), Bergen, Norway, June 2006. • Mathematics Department, Distinguished Lecture, The University of South Carolina, March 28, 2006; invited by Laszlo Sz´ek´ely (USC Math) • Symposium on Languages and Genes, The University of California at Santa Barbara, Sept. 2006. • Symposium on Stochastic Processes and Applications, Plenary talk, June 27, 2005).

• Grace Hopper 2004, invited speaker. • The McDonald Institute for Archaeological Research at Cambridge University, Summer 2004. • Virginia Tech (East Coast Indo-European Conference), May 26, 2004 • University of Illinois at Chicago, Department of Computer Sciences, distinguished lecture, 2003. • Annual meeting, Mathematics Association of America (2003), Invited Lecture. • SMBE (Society for Molecular Biology and Evolution) invited lecture, 2003. • Radcliffe Institute of Advanced Studies, Conference on Computational Biology: Function, Pathways, Phylogenies and Populations. May 19, 2003 • Invited talk, Mathfest 2003 (annual meeting of the Mathematics Association of America), Boulder, Colorado, July 31, 2003. • University of Toronto, Department of Computer Science. Distinguished Lecture. April 2002; invited by Toni Pitassi (Toronto CS) • Distinguished Lecture. Department of Computer Science, University of British Columbia, January 2001; invited by Anne Condon (UBC CS) • SCOPH (Showcase on Competing Technologies for Phylogenetics: A RECOMB Phylogenetics Conference, April 2001, Montreal ; invited by David Sankoff

• Plenary talk at the Annual Meeting of the Canadian Applied and Industrial Mathematics Society (CAIMS), June, 2001, Victoria, British Columbia Canada. • CRA Distinguished lecture, EECS Joint Colloquium, University of California at Berkeley, October, 2001, Berkeley, CA.

• Plenary talk, Deep Green. June 2, 2000, University of Maryland, College Park, MD.

13 Diversity Activities

My diversity statement is on my webpage at http://tandy.cs.illinois.edu/warnow-diversity.html.

• Chair, Anti-Racism Task Force, Grainger College of Engineering, 2020 • Member, Climate Diversity and Inclusiveness committee (College of Engineering), 2019

16 • Visit to Monticello Middle School to talk with 6th, 7th, and 8th grade students about computer science, 2019 • Visits to student groups focusing on women (e.g., CRA-W at UCLA) in computer science, bioinformatics, etc., 2017-present

• Mentoring (informal) of women assistant professors and postdocs in CS and mathematics about how to succeed in academia (e.g., at NIMBIOS, 2019). • Regular participation in programs within the CS department for undergraduate women stu- dents.

17 14 Publications

Books 1. Warnow, T. Computational Phylogenetics: An Introduction to Designing Methods for Phy- logeny Estimation. Cambridge University Press, 2017.

Edited volumes 1. Bioinformatics and Phylogenetics: Seminal Contributions of Bernard Moret, Edited by Tandy Warnow, Springer 2019, DOI: 10.1007/978-3-030-10837-3

Edited Conference Proceedings 1. Proceedings of the Tenth Annual ACM-SIAM Symposium on Discrete Algorithms, 17-19 Jan- uary 1999, Baltimore, Maryland, USA. Robert Endre Tarjan and Tandy J. Warnow, eds. ACM/SIAM 1999, ISBN 0-89871-434-6. 2. Computing and Combinatorics: Proceedings of the 9th Annual International Conference, CO- COON 2003, Big Sky, MT, USA, July 25-28, 2003. Binhui Zhu and Tandy Warnow, eds. Lecture Notes in Computer Science DOI: 10.1007/3-540-45071-8.

3. Algorithms in Bioinformatics, Proceedings of the 9th International Workshop, WABI 2009, Philadelphia, PA, USA, September 12-13, 2009. and Tandy Warnow, eds. Lecture Notes in Computer Science 5724, Springer 2009, ISBN 978-3-642-04240-9

Journal and Refereed Conference Publications 1. S. Kannan, E. Lawler, and T. Warnow, 1990. Determining the Evolutionary Tree. Proceedings of the First ACM-SIAM Symposium on Discrete Algorithms (SODA) 1990, pp. 475-484. 2. S. Kannan and T. Warnow, 1990. Inferring Evolutionary History from DNA Sequences. Pro- ceedings of the Foundations of Computer Science (FOCS) 1990, pp. 362-371. 3. Kannan, S. and T. Warnow, 1992. “Triangulating Three-Colored Graphs,”SIAM J. on Discrete Mathematics, Vol. 5 No. 2, pp. 249-258. (A preliminary version appeared in the Proceedings of the ACM/SIAM Symposium on Discrete Algorithms, 1991.) 4. Bodlaender, H., M. R. Fellows, and T. J. Warnow, 1992. “Two Strikes Against Perfect Phy- logeny”, Proc. 19th Int’l Colloq. on Automata, Languages, and Programming (ICALP92), Springer Verlag, pages 273–283, in LNCS 623. 5. Steel, M. and T. Warnow, 1993. “Kaikoura Tree Theorems: The Maximum Agreement Subtree Problem.” Information Processing Letters, 48, pp. 77-82.

6. M. Farach, S. Kannan, and T. Warnow, 1993. A robust model for finding optimal evolutionary trees. Proceedings of the Symposium on the Theory of Computing (STOC) 1993. 7. T. Warnow, 1993. Tree Compatibility and Inferring Evolutionary History. Proceedings, ACM- SIAM Symposium on Discrete Algorithms (SODA) 1993. 8. Warnow, T., 1993. “Constructing phylogenetic trees efficiently using compatibility criteria.” New Zealand Journal of Botany, Vol. 31, pp. 239-248.

18 9. Kannan, S. and T. Warnow, 1994. “Inferring Evolutionary History from DNA Sequences.” SIAM J. on Computing, Vol. 23, No. 4, pp. 713-737. (A preliminary version of this paper appeared at FOCS 1990.) 10. Warnow, T., 1994. “Tree Compatibility and Inferring Evolutionary History.” Journal of Algo- rithms, 16, pp. 388-407. (A preliminary version of this paper appeared at SODA 1993.) 11. McMorris, F. R., T. Warnow, and T. Wimer, 1994. “Triangulating Vertex Colored Graphs.” SIAM J. on Discrete Mathematics, Vol. 7, No. 2, pp. 296-306. (A preliminary version of this paper appeared at SODA 1993.) 12. Farach, M., S. Kannan, and T. Warnow, 1995. “A Robust Model for Finding Optimal Evo- lutionary Trees.” Algorithmica, special issue on Computational Biology, Vol. 13, No. 1, pp. 155-179. (A preliminary version of this paper appeared at STOC 1993.) 13. Kannan, S., and T. Warnow, 1995. “Tree Reconstruction from Partial Orders.” SIAM J. on Computing, Vol. 24 No. 3, pp. 511-520. (A preliminary version of this paper appeared at WADS.) 14. A. Taylor, D. Ringe, and T. Warnow. “Character-based reconstruction of a linguistic clado- gram.” Proceedings of the 12th International Conference on Historical Linguistics (Manchester, August 1995). 15. Kannan, S. and T. Warnow, 1995. “A fast for the computation and enumeration of perfect phylogenies” SIAM J. Computing, Vol. 26, No. 6, pp. 1749-1763. (A preliminary version appeared in SODA 1995). 16. Kannan, S., T. Warnow, and S. Yooseph, 1995. “Computing the local consensus of trees.” Proceedings SODA 1995, pp. 68-77. 17. Warnow, T.. D. Ringe, and A. Taylor, 1996. “Reconstructing the evolutionary history of natural languages.” Proceedings of ACM-SIAM Symposium on Discrete Algorithms (SODA), 1996, pp. 314-322. 18. M. Bonet, C.A. Phillips, T. Warnow, and S. Yooseph, 1999. Constructing evolutionary trees in the presence of polymorphic characters. Proceedings, ACM Symposium on the Theory of Computing (STOC), 1996. 19. Phillips, C. A., and T. Warnow, 1996. “The Asymmetric Median Tree: a new model for building consensus trees.” Discrete Applied Mathematics, Special Issue on Computational Molecular Biology, 71, pp. 311-335. 20. Goldberg, L. A., P.W. Goldberg, C.A. Phillips, E. Sweedyk, and T. Warnow, 1996. “Minimizing phylogenetic number to find good evolutionary trees.” Discrete Applied Mathematics, Volume 71, Numbers 1-3, pp. 111-136. (A preliminary version of this paper appeared in Combinatorial Pattern 1995) 21. Benham, C., S. Kannan, M. Paterson, and T. Warnow, 1996. “Hen’s Teeth and Whale’s Feet: Generalized Character Compatibility.” Journal of Computational Biology. Vol 2. No 4. pp 527-536. (A preliminary version of this appeared in Combinatorial Pattern Matching, 1995.) 22. M. Henzinger, V. King, and T. Warnow, 1996. Constructing a tree from homeomorphic sub- trees, with applications to computational molecular biology, SODA 1996, pp. 333-340. 23. Kannan, S., E. Lawler, and T. Warnow, 1996. “Determining the Evolutionary Tree.” Journal of Algorithms, 21(1): 26-50. (A preliminary version of this paper appeared at SODA 1990.)

19 24. Ringe, D., T. Warnow, A. Taylor, A. Michailov, and L. Levison, 1997. “Computational cladis- tics and the position of Tocharian.” In V. Mair (Ed.), The Bronze Age and Early Iron Age Peoples of Eastern Central Asia, a special volume of the Journal of Indoeuropean Studies. 25. Erd˝os,P. L., M. Steel, L. Sz´ek´ely, and T. Warnow, 1997. “Local quartet splits of a binary tree infer all quartet splits via one dyadic inference rule.” Computers and Artificial Intelligence, Number 2, Vol 16, pp. 217-227. 26. Warnow, T. 1997. “Mathematical approaches to comparative linguistics.” Proceedings of the National Academy of Sciences, Vol. 94, pp. 6585-6590, 1997. 27. Kannan, S. and T. Warnow, 1997. “A fast algorithm for the computation and enumeration of perfect phylogenies.” SIAM J. Computing, Vol. 26, No. 6, pp. 1749-1763. (A preliminary version appeared in SODA 1995). 28. Rice, K. and T. Warnow. 1997. “Parsimony is Hard to Beat!” Proceedings, Third Annual In- ternational Conference of Computing and Combinatorics (COCOON), Shanghai, China, 1997, pp. 124-133. T. Jiang and D.T. Lee, Eds. 29. Erd¨os, P. L., M. Steel, L. Szekeley, and T. Warnow, 1997. “Inferring big trees from short sequences.” Springer-Verlag Lecture Notes in Computer Science, 1256, P. Degano, R. Gorrieri, A. Marchetti-Spaccamela (Eds.), Proceedings of the 24th International Congress on Automata, Languages, and Programming (ICALP), Bologna, Italy, pp. 827-837. 30. Kannan, S., T. Warnow, and S. Yooseph, 1998. “Computing the local consensus of trees.” SIAM J. Computing, Vol. 27, No. 6, pp. 1695-1724. (This is an extended version of a paper that appeared in SODA 1995). 31. Huson, D., S. Nettles, L. Parida, T. Warnow, and S. Yooseph. 1998. “A Divide-and-Conquer Approach to Tree Reconstruction.” Algorithms and Experiments (ALEX) 1998. Trento, Italy. 32. Bonet, M., M. Steel, T. Warnow, and S. Yooseph. 1998. “Faster algorithms for solving parsimony and compatibility.” The Journal of Computational Biology, Vol. 5, No. 3, pp. 409-422. By invitation, for the special issue on selected papers from RECOMB 1998. 33. Huson, D., K. A. Smith and T. Warnow. 1999. “Correcting Large Distances for Phylogenetic Reconstruction.”Proceedings, 3rd Workshop on Algorithms Engineering (WAE), London, Eng- land, 1999, pp. 273-286. 34. Warnow, T.. 1999. “Some combinatorial problems in phylogenetics.” Invited paper, Pro- ceedings of the International Colloquium on Combinatorics and Graph Theory, Balatonlelle, Hungary, July 15-20, 1996, eds. A. Gy´arf´as, L. Lov´asz, L.A. Sz´ekely, Volume 7 of Bolyai Society Mathematical Studies, Budapest, pp. 363-413. 35. Erd˝os,P. L., M. Steel, L. Sz´ek´ely, and T. Warnow. 1999. “A few logs suffice to build almost all trees - I.” Random Structures and Algorithms, 14, 153-184. (Also appears as DIMACS Technical Report 97-71.) 36. Erd˝os,P. L., M. Steel, L. Sz´ek´ely, and T. Warnow. 1999. “A few logs suffice to build almost all trees - II.” Theoretical Computer Science, 221 (1-2) (1999) pp. 77-118, by invitation, in the issue of selected papers from ICALP 1997. (Also appears as DIMACS Technical Report 97-72.) 37. Bonet, M., C.A. Phillips, T. Warnow, and S. Yooseph, 1999. “Constructing evolutionary trees in the presence of polymorphic characters.” SIAM J. Computing, Vol. 29. No. 1, pp. 103-131. (A preliminary version appeared in the ACM Symposium on the Theory of Computing, 1996.)

20 38. Henzinger, M., V. King, and T. Warnow. 1999. “Constructing a tree from homeomorphic subtrees, with applications to computational molecular biology.” Algorithmica, 24(1): 1-13 (1999). (A preliminary version appeared in the Association for Computing Machinery and the Society of Industrial and Applied Mathematics, Proceedings, ACM/SIAM Symposium on Discrete Algorithms, 1996, pp. 333-340.) 39. Huson, D., S. Nettles, and T. Warnow. 1999. “Disk-Covering, a fast converging method for reconstruction.” Special issue of the Journal of Computational Biology for selected papers from RECOMB 1999, Vol. 6, No. 3, 1999, pp. 369-386. (This appeared in a preliminary form in the Proceedings of RECOMB 1999, as “Obtaining highly accurate topology estimates of evolutionary trees from very short sequences.” Lyon, France.) 40. Huson, D., S. Nettles, K. Rice, T. Warnow, and S. Yooseph. 1999. “The Hybrid tree re- construction method.” The Journal of Experimental Algorithmics, Volume 4, Article 5, 1999. Special issue for selected papers from The Workshop on Algorithms Engineering, Saarbrucken, Germany, 1998. http://www.jea.acm.org/1999/HusonHybrid/. 41. Huson, D., L. Vawter, and T. Warnow. 1999. “Solving large scale phylogenetic problems using DCM-2.”Proceedings of ISMB (Intelligent Systems for Molecular Biology), (ISMB), Heidelberg 1999, pp. 118-129. 42. Kim, J. and T. Warnow, 1999. “Tutorial on Phylogenetic Tree Estimation.” Intelligent Systems for Molecular Biology, Heidelberg 1999. See: http://ismb99.gmd.de/TUTORIALS/Kim/4KimTutorial.ps. 43. Cosner, M.E., R.K. Jansen, B.M.E. Moret, L.A. Raubeson, L.-S. Wang, T. Warnow, and S. Wyman, 2000. “A new fast heuristic for computing the breakpoint phylogeny and experimental phylogenetic analyses of real and synthetic data,” Proc. 8th Int’l Conf. on Intelligent Systems for Molecular Biology (ISMB 2000), San Diego (2000), pp. 104-115. 44. Cosner, M.E., R.K. Jansen, B.M.E. Moret, L.A. Raubeson, L.-S. Wang, T. Warnow, T., and S. Wyman, 2000. “An empirical comparison of phylogenetic methods on chloroplast gene order data in Campanulaceae,” in : Empirical and Analytical Approaches to Gene Order Dynamics, Map Alignment, and the Evolution of Gene Families, D. Sankoff and J. Nadeau, eds., Kluwer Academic Publishers, Dordrecht, 2000, pp. 99-121. 45. Bodlaender, H. L., M.R. Fellows, M. T. Hallett, H. T. Wareham, and T. Warnow, 2000. “The hardness of perfect phylogeny, feasible register assignment and other problems on thin colored graphs”, Theoretical Computer Science 244 (2000), pp. 167-188. 46. St. John, K., T. Warnow, B.M.E. Moret, and L. Vawter, 2001. “Performance study of phyloge- netic methods: (unweighted) quartet methods and neighbor-joining,” Proc. 12th Ann. Symp. Discrete Algs. (SODA 01), Washington DC, SIAM Press (2001), pp. 196-205. 47. Ganapathy, G. and T. Warnow, 2001. “Finding the Maximum Compatible Tree for a Bounded Number of Trees with Bounded Degree is Solvable in Polynomial Time.” In Proceedings of the First International Workshop on Algorithms and Bioinformatics (WABI), 2001, pp. 156-163, Springer Verlag, Olivier Gascuel and Bernard M.E. Moret, eds. 48. Warnow, T., B.M.E. Moret, and K. St. John, 2001. “Absolute convergence: true trees from short sequences,” Proc. 12th Ann. Symp. Discrete Algs. (SODA 01), Washington DC, SIAM Press (2001), pp. 186-195. 49. Wang, L.-S. and T. Warnow, 2001. “Estimating true evolutionary distances between genomes.” Proceedings, Symposium on the Theory of Computing (STOC), 2001, pp. 637-646.

21 50. Moret, B.M.E., S. Wyman, D.A. Bader, T. Warnow, and M. Yan, 2001. “A new implementation and detailed study of breakpoint analysis,” Proc. 6th Pacific Symp. on Biocomputing (PSB 2001), Hawaii, World Scientific Pub. (2001), pp. 583-594. 51. Moret, B.M.E., L.-S. Wang, T. Warnow, and S. Wyman, 2001. “New approaches for recon- structing phylogenies based on gene order,” 9th Int’l Conf. on Intelligent Systems for Molecular Biology (ISMB 2001), Copenhagen, in Bioinformatics 17, Suppl. 1, (2001), pp. S165-S173; chosen as one of the 5 best papers at the conference. 52. Nakhleh, L., K. St. John, U. Roshan, J. Sun, and T. Warnow, 2001. “Designing fast converging phylogenetic methods.” 9th Int’l Conf. on Intelligent Systems for Molecular Biology (ISMB 2001), Copenhagen, in Bioinformatics 17, Suppl. 1, (2001), pp. S190-S198.

53. Nakhleh, L., U. Roshan, K. St. John, J. Sun, and T. Warnow, 2001. “The performance of phylogenetic methods on trees of bounded diameter.” Proceedings, The First International Workshop on Algorithms in Bioinformatics (WABI), 2001, Lecture Notes in Computer Science (LNCS #2149) pp. 214-226, Springer Verlag, Olivier Gascuel and Bernard M.E. Moret, eds. 54. Moret, B.M.E., D.A. Bader, and T. Warnow, 2001. “High-performance algorithmic engineering for computational phylogenetics,” Proc. 2001 Int’l Conf. Computational Science (ICCS 2001), San Francisco, Lecture Notes in Computer Science 2074, pp. 1012-1021, Springer-Verlag. 55. Ringe, D., T. Warnow, and A. Taylor, 2002. “Indo-European and Computational Cladistics”, Transactions of the Philological Society, Volume 100: Issue 1, March 2002, pp. 59-129.

56. Nakhleh, L., B.M.E. Moret, U. Roshan, K. St. John, J. Sun, and T. Warnow, 2002. “The Accuracy of Fast Phylogenetic Methods for Large Datasets”, Proc. 7th Pacific Symp. on Biocomputing (PSB 2002), Hawaii, World Scientific Pub. (2002), pp. 211-222. 57. Ganapathy, G. and T. Warnow, 2002. Approximating the Complement of the Maximum Compatible Subset of Leaves of k Trees. In Proceedings of the Fifth International Workshop on Approximation Algorithms for Combinatorial Optimization, pp. 122-134, 2002. 58. Wang, L., R.K. Jansen, B.M.E. Moret, L.A. Raubeson, and T. Warnow, 2002. “Fast Phyloge- netic Methods For Genome Rearrangement Evolution: An Empirical Study,” Proc. 7th Pacific Symp. on Biocomputing (PSB 2002), Hawaii, World Scientific Pub. (2002), pp. 524-535. 59. Moret, B.M.E., D.A. Bader, and T. Warnow, 2002. “High-performance algorithm engineering for computational phylogenetics,” J. Supercomputing 22 (2002), pp. 99–111 (special issue on best papers from ICCS’01). 60. Moret, B.M.E., U. Roshan, and T. Warnow, 2002. “Sequence length requirements for phyloge- netic methods,”Proc. 2nd International Workshop on Algorithms in Bioinformatics (WABI’02), Rome (2002), Lecture Notes in Computer Science (LNCS #2452), pp. 343-356, Springer Ver- lag, Roderic Guig´oand Dan Gusfield, eds. 61. Nakhleh, L., U. Roshan, L. Vawter, and T. Warnow, 2002. “Estimating the deviation from a molecular clock”, Proc. 2nd International Workshop on Algorithms in Bioinformatics (WABI02), Rome, Italy (2002), Lecture Notes in Computer Science 2452, pp. 287-299, Springer-Verlag, R. Guido and D. Gusfield, eds.

62. Moret, B.M.E., J. Tang, L.-S. Wang, and T. Warnow, 2002. “Steps toward accurate recon- struction of phylogenies from gene-order data,” J. Comput. Syst. Sci. (invited, special issue on computational biology), 65, 3 (2002), pp. 508-525.

22 63. Stockham, C., L.-S. Wang, and T. Warnow, 2002. “Statistically-Based Postprocessing of Phy- logenetic Analysis Using Clustering,” Bioinformatics, Vol. 18, Suppl. 1, special issue for the Proceedings of 10th Int’l Conf. on Intelligent Systems for Molecular Biology (ISMB’02). Edmonton, Canada, pp. S285-S293. 64. Moret, B.M.E., L.-S. Wang, and T. Warnow, 2002. “Towards New Software for Computational Phylogenetics,” IEEE Computer 35, 7 (July 2002), special issue on Bioinformatics, pp. 55-64. 65. Moret, B.M.E., and T. Warnow, 2002. “Reconstructing optimal phylogenetic trees: A challenge in experimental algorithmics,” Lecture Notes in Computer Science 2547, Springer Verlag, 2002, pp. 163-180 (2002). 66. Nakhleh, L., J. Sun, T. Warnow, C.R. Linder, B.M.E. Moret, and A. Tholse, 2003. “Towards the development of computational tools for evaluating phylogenetic network reconstruction methods,” Proc. 8th Pacific Symp. on Biocomputing (PSB 2003), pp. 315-326. 67. Spencer, M., B. Bordalejo, L.-S. Wang, A.C. Barbrook, L.R. Mooney, P. Robinson, T. Warnow and C.J. Howe. 2003. “Gene Order Analysis Reveals the History of The Canterbury Tales Manuscripts.” Computers and the Humanities, 37 (1): pp. 97-109. Feb. 2003. Kluwer Aca- demic Pub. 68. Ganapathy, G., V. Ramachandran, and T. Warnow, 2003. “Better Hill-Climbing Seaches for Parsimony.” In Proceedings of the Third International Workshop on Algorithms in Bioinfor- matics (WABI), pp. 245–258, 2003. 69. St. John, K., T. Warnow, B.M.E. Moret, and L. Vawter, 2003. “Performance study of phy- logenetic methods: (unweighted) quartet methods and neighbor-joining”. J. Algorithms 48, 1 (2003), pp. 173-193 (special issue on the best papers from SODA 2001). 70. Roshan, U., B.M.E. Moret, T.L. Williams, and T. Warnow. 2004. “Performance of supertree methods on various dataset decompositions”. In O. R. P. Bininda-Emonds, editor, Phylogenetic Supertrees: Combining Information to Reveal the Tree of Life, Volume 3 of Computational Biology, pp. 301-328, Kluwer Academics, 2004 (Andreas Dress, series editor). 71. Ganapathy, G., V. Ramachandran, and T. Warnow. 2004. “On Contract-and-Refine-Transformations Between Phylogenetic Trees.” In Proceedings of the Fifteenth ACM-SIAM Symposium on Dis- crete Algorithms (SODA 2004), pp. 893-902. 72. Nakhleh, L., T. Warnow, C.R. Linder 2004. “Reconstructing reticulate evolution in species - theory and practice.” In Proceedings of the Eighth Annual International Conference on Re- search in Computational Molecular Biology (RECOMB 2004), pp. 337-346. 73. Roshan, U., B.M.E. Moret, T.L. Williams, and T. Warnow, 2004. “Rec-I-DCM3: A fast algo- rithmic technique for reconstructing large phylogenetic trees,” Proc. IEEE Computer Society Bioinformatics Conference (CSB) 2004, Stanford University, pp. 98-109 (2004). 74. Moret, B.M.E., L. Nakhleh, T. Warnow, C.R. Linder, A. Tholse, A. Padolina, J. Sun, and R. Timme, 2004. “Phylogenetic networks: modeling, reconstructibility, and accuracy,”IEEE/ACM Transactions on Computational Biology and Bioinformatics, Jan. 2004, 1:(1), pp. 13-23. 75. Evans, S.N. and T. Warnow, 2004, “Unidentifiable divergence times in rates-across-sites mod- els.” IEEE/ACM Transactions on Computational Biology and Bioinformatics, 1, July 2004, pp. 130-134. 76. Wang, L.-S. and T. Warnow. “Distance-based genome rearrangement phylogeny,” 2005. In Mathematics of Evolution and Phylogeny, O. Gascuel, ed., Oxford Univ. Press, pp. 353-380.

23 77. Moret, B.M.E., J. Tang, and T. Warnow, 2005. “Reconstructing phylogenies from gene-content and gene-order data.” In Mathematics of Evolution and Phylogeny, pp. 321-352, O. Gascuel, ed., Oxford Univ. Press (2005). 78. Moret, B.M.E. and T. Warnow, 2005. “Advances in phylogeny reconstruction from gene order and content data,” in Molecular Evolution: Producing the Biochemical Data, Part B, E.A. Zimmer and E.H. Roalson, eds., Vol. 395 of Methods in Enzymology, Elsevier (May 2005), pp. 673-700. 79. Nakhleh, L., D. Ringe, and T. Warnow, 2005. “Perfect Phylogenetic Networks: A new method- ology for reconstructing the evolutionary history of natural languages.” Language (Journal of the Linguistic Society of America), 81(2), pp. 382-420, June 2005.

80. Nakhleh, L., T. Warnow, C.R. Linder, and K. St. John. 2005. “Reconstructing reticulate evolution in species - theory and practice.” The Journal of Computational Biology (special issue for selected papers from RECOMB 2004), 12(6), July 2005, pp. 796-811. 81. Nakhleh, L., T. Warnow, D. Ringe, and S.N. Evans, 2005. “A comparison of phylogenetic reconstruction methods on an IE dataset.” The Transactions of the Philological Society, Vol. 103, Issue 2, pp. 171-192, August 2005. 82. Ganapathy, G., B. Goodson, R. Jansen, V. Ramachandran, and T. Warnow, 2005. “Pat- tern identification in biogeography: metrics and algorithms for comparing area cladograms.” Proceedings of the Fifth International Workshop on Algorithms for Bioinformatics (WABI), October 2005.

83. Wang, L.-S. and T. Warnow, 2006. “Reconstructing Chromosomal Evolution.” SIAM J. Com- puting, Vol. 36, Issue 2, 99-131. 84. Wang, L.-S., T. Warnow, B.M.E. Moret, R.K. Jansen, and L.A. Raubeson, 2006. “Distance- based Genome Rearrangement Phylogeny.” Journal of Molecular Evolution. 63(4):473-83.

85. Ganapathy, G., B. Goodson, R. Jansen, H. Le, V. Ramachandran, and T. Warnow, 2006. “Pattern identification in biogeography”, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 3(4):334-346. 86. Leebens-Mack, J.H., T. Vision, E. Brenner, J.E. Bowers, S. Cannon, M.J. Clement, C.W. Cunningham, C. dePamphilis, R. Desalle, J.J. Doyle, J.A. Eisen, X. Gu, J. Harshman, R.K. Jansen, E.A. Kellogg, E.V. Koonin, B.D. Mishler, H. Philippe, J.C. Pires, Y-L. Qiu, S.Y. Rhee, K. Sjolander, D.E. Soltis, P.S. Soltis, D.W. Stevenson, K. Wall, T. Warnow, C. Zmasek, 2006. “Taking the First Steps towards a Standard for Reporting on Phylogenies: Minimum Information about a Phylogenetic Analysis (MIAPA).” OMICS 10(2):231-237. 87. Nelesen, S., K. Liu, D. Zhao, R. Linder, and T. Warnow, 2008. “The effect of the guide tree on multiple sequence alignment and subsequent phylogenetic analyses”, Proceedings of the 2008 Pacific Symposium on Biocomputing. 88. Snir, S., T. Warnow and S. Rao, 2008. “Short Quartet Puzzling: A New Quartet-based Phylogeny Reconstruction Algorithm”. Journal of Computational Biology, Vol. 15, No. 1, pp. 91-103.

89. Nichols, J. and T. Warnow, 2008. “Tutorial on computational linguistic phylogeny.” Linguistics and Language Compass, Vol. 2, Issue 5, September 2008, pages 760-820.

24 90. Liu, K., S. Nelesen, S. Raghavan, C. R. Linder, and T. Warnow, 2009. “Barking up the wrong treelength: the impact of gap penalty on alignment and tree accuracy.” IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 6, no. 1, pp. 7-21, Jan.-Mar. 2009, doi:10.1109/TCBB.2008.63 91. Liu, K., S. Raghavan, S. Nelesen, C. R. Linder, T. Warnow, 2009. “Rapid and accurate large- scale coestimation of sequence alignments and phylogenetic trees.” Science, vol. 324, no. 5934, pp. 1561-1564, 19 June 2009, doi: 10.1126/science.1171243. 92. Swenson, M., F. Barban¸con, C.R. Linder, and T. Warnow, 2009. “A simulation study compar- ing supertree and combined analysis methods using SMIDGen.” Proceedings of the Workshop on Algorithms in Bioinformatics (WABI), 2009. 93. Bradner, J.E., N. West, M.L. Grachan, E. Greenberg, S.J. Haggarty, T. Warnow and R. Mazitschek, 2010. “Chemical phylogenetics of histone deacetylases”, Nature Chemical Biology, 6, pp. 238-243; published online 7 February 2010. 94. Swenson, M.S., F. Barban¸con, C.R. Linder, and T. Warnow, 2010. “A simulation study com- paring supertree and combined analysis methods using SMIDGen.”2010. Journal of Algorithms for Molecular Biology 5:8 (4 January 2010), special issue of selected papers from WABI 2009. (This journal version contains additional results compared to the WABI conference version.) 95. Swenson, M.S., R. Suri, C.R. Linder, and T. Warnow, 2010. “An experimental study of Quar- tets MaxCut and other supertree methods.” Proceedings of WABI (Workshop on Algorithms for Bioinformatics) 2010. 96. Linder, C.R., R. Suri, K. Liu, and T. Warnow, 2010. “Benchmark datasets and software for developing and testing methods for large-scale multiple sequence alignment and phyloge- netic inference”. PLoS Currents: Tree of Life, 2010 Nov 18;2:RRN1195. doi: 10.1371/cur- rents.RRN1195. 97. Liu, K., C.R. Linder, and T. Warnow, 2010. “Multiple sequence alignment: a major chal- lenge to large-scale phylogenetics,” 2010. PLoS Currents: Tree of Life, 2010 November 19; 2: RRN1198. doi: 10.1371/currents.RRN1198. 98. Swenson, M.S., R. Suri, C.R. Linder, and T. Warnow, 2011. “An experimental study of Quartets MaxCut and other supertree methods.” Journal of Algorithms for Molecular Biology, 6(7) (19 April 2011), special issue of selected papers from WABI 2010. (This journal version contains additional results compared to the WABI conference version.) 99. Wang, L.-S., J. Leebens-Mack, K. Wall, K. Beckmann, C. de Pamphilis, and T. Warnow., 2011. “The impact of multiple sequence alignment on phylogeny estimation”, IEEE Transactions on Computational Biology and Bioinformatics (TCBB), July 2011, pp. 1108-1119. 100. Yu, Y., T. Warnow, and L. Nakhleh, 2011. “Algorithms for MDC-based Multi-locus Phylogeny Inference”, Proceedings of RECOMB 2011. 101. Y. Yu, T. Warnow, and L. Nakhleh, 2011. “Algorithms for MDC-Based Multi-Locus Phylogeny Inference: Beyond Rooted Binary Gene Trees on Single Alleles,” J. Computational Biology November 2011, Vol. 18, No. 11, pp 1543-1559. 102. Yang, J. and T. Warnow, 2011. “Fast and accurate methods for phylogenomic analyses.” 2011. RECOMB-CG 2011 and BMC Bioinformatics 12(Suppl 9): S4 (5 October 2011). 103. Mirarab, S. and T. Warnow, 2011. “FastSP: Linear time calculation of alignment accuracy.” Bioinformatics 27(23):3250-3258.

25 104. Liu, K., C. Randal Linder, and T. Warnow, 2012. “RAxML and FastTree: Comparing Two Methods for Large-Scale Maximum Likelihood Phylogeny Estimation.” PLoS-ONE 6(11): e27731. doi:10.1371/journal.pone.0027731. 105. Liu, K., T.J. Warnow, M.T. Holder, S. Nelesen, J. Yu, A. Stamatakis, and C.R. Linder, 2012. “SAT´e-II: Very Fast and Accurate Simultaneous Estimation of Multiple Sequence Alignments and Phylogenetic Trees”. Systematic Biology 61(1):90-106. 106. Swenson, M.S., R. Suri, C.R. Linder, and T. Warnow, 2012. “SuperFine: fast and accurate supertree estimation.” Systematic Biology, 61(2):214-227. 107. Mirarab, S., N. Nguyen, and T. Warnow, 2012. “SEPP: SAT´e-Enabled Phylogenetic Place- ment.” Pacific Symposium on Biocomputing. 108. Neves, D.T., J.L. Sobral, K. Pingali, and T. Warnow, 2012. “Parallelizing Superfine.” 27th Symposium on Applied Computing (ACM-SAC), Bioinformatics, 2012, pages 1361–1367, doi = 10.1145/2231936.2231992. 109. Bayzid, Md. S., and T. Warnow, 2012. “Estimating optimal species trees from incomplete gene trees under deep coalescence.” Journal of Computational Biology, June 2012, Vol. 19, No. 6:591-605, special issue for Simon Tavar´eand Mike Waterman. 110. Nguyen, N., S. Mirarab, and T. Warnow, 2012. “MRL and SuperFine+MRL: new supertree methods.” 2012. J. Algorithms for Molecular Biology, 7:3, special issue for the Programme for Phylogenetics at the Sir Isaac Newton Institute for Mathematical Sciences, Cambridge University. 111. Liu, K. and T. Warnow, 2012. “Treelength Optimization for Phylogeny Estimation.” 2012. PLoS One, 7(3):e33104. 112. Warnow, T., 2012. “Standard maximum likelihood analyses of alignments with gaps can be statistically inconsistent.” PLoS Currents Tree of Life, doi:10.1371/currents.RRN1308. 113. Nelesen, S., K. Liu, L.-S. Wang, C.R. Linder, and T. Warnow, 2012. “DACTAL: divide-and- conquer trees (almost) without alignments.” ISMB 2012 and Bioinformatics, Vol. 28, pages i274-i282. 114. Barban¸con, F., S. Evans, L. Nakhleh, Dr. Ringe, and T. Warnow, 2013. “An experimental study comparing linguistic phylogenetic reconstruction methods”. Diachronica 2013, 30(2): 143-170. (An initial version of this paper was presented at the conference Languages and Genes, held at UC Santa Barbara, and organized by Bernard Comrie.) 115. Bayzid, M.S., S. Mirarab, and T. Warnow, 2013. “Inferring optimal species trees under dupli- cation and loss”. Proceedings, Pacific Symposium on Biocomputing, Volume 18, 250-261. 116. Bayzid, M. S. and T. Warnow, 2013. “Naive binning improves phylogenomic analyses”. Bioin- formatics. 29(18):2277-2284. 117. Mirarab, S., N. Nguyen, and T. Warnow, 2014. “PASTA: ultra-large multiple sequence align- ment.” Proceedings RECOMB 2014. An extended version of this paper appears in the Journal of Computational Biology. 118. Mirarab, S. R. Reaz, Md. S. Bayzid, T. Zimmermann, M.S. Swenson, and T. Warnow (2014). “ASTRAL: Genome-Scale Coalescent-Based Species Tree Estimation”. Proceedings of the Eu- ropean Conference on Computational Biology (ECCB) 2014 and Bioinformatics (2014) 30 (17): i541-i548, doi: 10.1093/bioinformatics/btu462.

26 119. Bayzid, Md. S., T. Hunt, and T. Warnow (2014). “Disk Covering Methods Improve Phy- logenomic Analyses”. Proceedings of RECOMB-CG (Comparative Genomics), 2014 and BMC Genomics 15 (Suppl. 6): S7. 120. Zimmermann, T., S. Mirarab and T. Warnow (2014). “BBCA: Improving the scalability of *BEAST using random binning”. Proceedings of RECOMB-CG (Comparative Genomics), 2014 and BMC Genomics 15 (Suppl. 6): S11. 121. Mirarab, S., Md. S. Bayzid, and T. Warnow (2014). “Evaluating summary methods for multi- locus species tree estimation in the presence of incomplete lineage sorting”. Systematic Biology 65 (3): 366-380, 2016. (First appeared online in 2014, doi = 10.1093/sysbio/syu063)

122. Wickett, N., S. Mirarab, N. Nguyen, T. Warnow, E. Carpenter, N. Matasci, S. Ayyampalayam, M. S. Barker, J. G. Burleigh, M. A. Gitzendanner, B. R. Ruhfel, E. Wafula, J. P. Der, S. W. Graham, S. Mathews, M. Melkonian, D. E. Soltis, P. S. Soltis, N. W. Miles, C. J. Rothfels, L. Pokorny, A. J. Shaw, L. DeGironimo, D. W. Stevenson, B. Surek, J. C. Villarreal, B. Roure, H. Philippe, C. W. dePamphilis, T. Chen, M. K. Deyholos, R. S. Baucom, T. M. Kutchan, M. M. Augustin, J. Wang, Y. Zhang, Z. Tian, Z. Yan, X. Wu, X. Sun, G. Ka-Shu Wong, and J. Leebens-Mack. “Phylotranscriptomic analysis of the origin and early diversification of land plants”, Proceedings of the National Academy of Sciences, 2014; volume 111 no. 45, E4859-E4868, published ahead of print October 29, 2014, doi:10.1073/pnas.1323926111. 123. Nguyen, N., S. Mirarab, B. Liu, M. Pop, and T. Warnow (2014). “TIPP:Taxonomic Identifi- cation and Phylogenetic Profiling.” Bioinformatics, 2014, 30 (24): 3548-3555.

124. Mirarab, S., N. Nguyen, and T. Warnow (2014). “PASTA: ultra-large multiple sequence align- ment for nucleotide and amino acid sequences”. Journal of Computational Biology, 22(5): 377-386, 2015 (first appeared online in 2014). This is an extended version of the RECOMB paper (#114). 125. Mirarab, S., Md. S. Bayzid, B. Boussau, and T. Warnow (2014). “Statistical binning enables an accurate coalescent-based estimation of the avian tree”. Science, 12 December 2014: 1250463. 126. Jarvis, E.D., S. Mirarab, A. J. Aberer, B. Li, P. Houde, C. Li, S. Y. W. Ho, B. C. Faircloth, B. Nabholz, J. T. Howard, A. Suh, C. C. Weber, R. R. da Fonseca, J. Li, F. Zhang, H. Li, L. Zhou, N. Narula, L. Liu, G. Ganapathy, B. Boussau, Md. S. Bayzid, V. Zavidovych, S. Subramanian, T. Gabaldon, S. Capella-Gutierrez, J. Huerta-Cepas, B. Rekepalli, K. Munch, M. Schierup, B. Lindow, W. C. Warren, D. Ray, R. E. Green, M. W. Bruford, X. Zhan, A. Dixon, S. Li, N. Li, Y. Huang, E. P. Derryberry, M. F. Bertelsen, F. H. Sheldon, R. T. Brumfield, C. V. Mello, P. V. Lovell, M. Wirthlin, M. P. C. Schneider, F. Prosdocimi, J. A. Samaniego, A. M. V. Velazquez, A. Alfaro-Nunez, P. F. Campos, B. Petersen, T. Sicheritz-Ponten, A. Pas, T. Bailey, P. Scofield, M. Bunce, D. M. Lambert, Q. Zhou, P. Perelman, A. C. Driskell, B. Shapiro, Z. Xiong, Y. Zeng, S. Liu, Z. Li, B. Liu, K. Wu, J. Xiao, X. Yinqi, Q. Zheng, Y. Zhang, H. Yang, J. Wang, L. Smeds, F. E. Rheindt, M. Braun, J. Fjeldsa, L. Orlando, F. K. Barker, K. A. Jansson, W. Johnson, K.-P. Koepfli, S. O’Brien, D. Haussler, O. A. Ryder, C. Rahbek, E. Willerslev, G. R. Graves, T. C. Glenn, J. McCormack, D. Burt, H. Ellegren, P. Alstrom, S. V. Edwards, A. Stamatakis, D. P. Mindell, J. Cracraft, E. L. Braun, T. Warnow, W. Jun, M. T. P. Gilbert, and G. Zhang (2014). “Whole-genome analyses resolve early branches in the tree of life of modern birds.” Science 12 December 2014: 1320-1331. (Co-corresponding author) 127. Matasci, N., L.-H. Hung, Z. Yan, E. Carpenter, N. J. Wickett, S. Mirarab, N. Nguyen, T. Warnow, S. Ayyampalayam, M. Barker, J. G. Burleigh, M. A. Gitzendanner, E. Wafula, J. P. Der, C. dePamphilis, B. Roure, H. Philippe, B. Ruhfel, N. Miles, S. Graham, S. Mathews, B.

27 Surek, M. Melkonian, D. Soltis, P. Soltis, C. Rothfels, L. Pokorny, J. Shaw, L. DeGironimo, D. Stevenson, J. Villarreal, T. Chen, T. Kutchan, M. Rolf, R. Baucom, M. Deyholos, R. Samudrala, Z. Tian, X. Wu, X. Sun, Y. Zhang, J. Wang, J. Leebens-Mack, and G. K-S. Wang (2014). Data access for the 1,000 Plants (1KP) project. GigaScience 2014 3:17, DOI: 10.1186/2047-217X-3-17.

128. Nguyen, N., S. Mirarab, K. Kumar, and T. Warnow (2015). “Ultra-large alignments using phylogeny-aware profiles”, Proceedings of Research in Computational Biology (RECOMB) 2015 and Genome Biology 16:124, 2015. Highly accessed 129. Bayzid, Md. S., S. Mirarab, B. Boussau, and T. Warnow (2015). “Weighted Statistical Bin- ning: enabling statistically consistent genome-scale phylogenetic analyses”, PLOS One 10(6): e0129183, 2015. 130. Mirarab, S. and T. Warnow (2015). “ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes”, ISMB 2015, and Bioinformatics, 31(12):i44- i52, 2015.

131. Roch, S., and T. Warnow (2015). “On the robustness to gene tree estimation error (or lack thereof) of coalescent-based species tree methods”, Points of View, Systematic Biology, 64(4):663-676, 2015. 132. Warnow, T (2015). “Concatenation analyses in the presence of incomplete lineage sorting”, PLOS Currents: Tree of Life 2015 May 22. Edition 1. doi: 10.1371/currents.tol.8d41ac0f13d1abedf4c4a59f5d17b1f7.

133. Davidson, R., P. Vachaspati, S. Mirarab, and T. Warnow (2015). “Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer”. RECOMB-Comparative Genomics, and BMC Genomics, 2015, 16 (Suppl 10): S1. 134. Chou, J., A. Gupta, S. Yaduvanshi, R. Davidson, M. Nute, S. Mirarab and T. Warnow (2015). “A comparative study of SVDquartets and other coalescent-based species tree estimation meth- ods”. RECOMB-Comparative Genomics, and BMC Genomics, 2015, 16 (Suppl 10): S2. 135. Vachaspati, P. and T. Warnow (2015). “ASTRID: Accurate Species TRees from Internode Distances”. RECOMB-Comparative Genomics, and BMC Genomics, 2015, 16 (Suppl. 10): S3.

136. Cracraft, J., P. Houde, S.Y.W. Ho, D.P. Mindell, J. FjeldsAˇe,B.˜ Lindow, S.V. Edwards, C. Rahbek, S. Mirarab, T. Warnow, M.T.P. Gilbert, G. Zhang, E.L. Braun, and E.D. Jarvis (2015). Response to Comment on “Whole-genome analyses resolve early branches in the tree of life of modern birds”. Science, 2015, volume 349, number 6255, doi = 10.1126/science.aab1578. 137. Mirarab, S., Md. S. Bayzid, B. Boussau, and T. Warnow (2015). Response to Comment on “Statistical binning enables an accurate coalescent-based estimation of the avian tree”. Science, 2015, volume 350, number 6257, p. 171, DOI: 10.1126/science.aaa7719. 138. Jarvis, E.D., S. Mirarab, A.A. Aberer, B. Li, P. Houde, C. Li, S. Ho, B. Faircloth, B. Nab- holz, J. T. Howard, A. Suh, C. C. Weber, R. R. da Fonseca, A. Alfaro-Nunez, N. Narula, L. Liu, D. Burt, H. Ellegren, S. Edwards, A. Stamatakis, D. Mindell, J. Cracraft, E. Braun, T. Warnow, W. Jun, M.T.P. Gilbert, G. Zhang, and The Avian Phylogenomics Consortium (2015). Phylogenomic analyses data of the avian phylogenomics project. GigaScience 2015, 4:4, DOI:10.1186/s13742-014-0038-1.

28 139. Tarver, J.E., M. d. Reis, S. Mirarab, R. J. Moran, S. Parker, J.E. O’Reilly, B.L. King, M.J. O’Connell, R.J. Asher, T. Warnow, K. J. Peterson, P.C.J. Donoghue, and D. Pisani (2016). The interrelationships of placental mammals and the limits of phylogenetic inference. Genome Biology and Evolution 2016, 8(2): 330-344. 140. Nguyen, N., T. Warnow, M. Pop, and B. White (2016). A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity. Nature Biofilms and Microbiomes 2, Article number 16004 (2016), doi:10.1038/npjbiofilms.2016.4. 141. Hansen, T., S. Mollerup, N. Nguyen, L. Vinner, N. White, M. Coghlan, D. Alquezar-Planas, T. Joshi, R. Jensen, H. Fridholm, K. Kjaransdottir, T. Mourier, T. Warnow, G. Belsham, T. Gilbert, L. Orlando, M. Bunce, E. Willerslev, L. Nielsen, and A. Hansen (2016). High diver- sity of picornaviruses in rats from different continents revealed by deep sequencing. (Nature Publishing Group) Emerging Microbes & Infections 5(8): e90. 142. Nguyen, N., M. Nute, S. Mirarab, and T. Warnow (2016). HIPPI: Highly Accurate Protein Family Classification with Ensembles of HMMs. BMC Bioinformatics 17(10): 89, special issue for RECOMB-CG. 143. Nute, M. and T. Warnow (2016). Scaling statistical multiple sequence alignment to large datasets. BMC Bioinformatics 17(1): 135, special issue for RECOMB-CG. 144. Uricchio, L., T. Warnow, and N. Rosenberg (2016). An analytical upper bound on the num- ber of loci required for all splits of a species tree to appear in a set of gene trees. BMC Bioinformatics 17(14): 241, special issue for RECOMB-CG. 145. Vachaspati, P. and T. Warnow (2016). FastRFS: Fast and Accurate Robinson-Foulds Su- pertrees using Constrained Exact Optimization Bioinformatics 2016; doi: 10.1093/bioinfor- matics/btw600. (Special issue for papers from RECOMB-CG) 146. Boyd, B.M., J.M. Allen, N. Nguyen, A.D. Sweet, T. Warnow, M.D. Shapiro, S.M. Villa, S.E. Bush, D.H. Clayton, and K.P. Johnson (2017). Phylogenomics using target-restricted assembly resolves intra-generic relationships of parasitic lice (Phthiraptera: Columbicola). Systematic Biology 2017, doi: 10.1093/sysbio/syx027. 147. Allen, J.M., B. Boyd, N. Nguyen, P. Vachaspati, T. Warnow, D.I. Huang, P.G. Grady, K.C. Bell, Q.C. Cronk, L. Mugisha, B.R. Pittendrigh, L.M. Soledad, D.L. Reed, and K.P. John- son. Phylogenomics from whole genome sequences using aTRAM. Systematic Biology 2017, doi:10.1093/sysbio/syw105. 148. Boyd, B.M., J.M. Allen, N. Nguyen, P. Vachaspati, Z.S. Quicksall, T. Warnow, L. Mugisha, K.P. Johnson, and D.L. Reed. Primates, Lice and Bacteria: Speciation and Genome Evolution in the Symbionts of Hominid Lice. Molecular Biology and Evolution 2017; 34 (7): 1743-1757. doi: 10.1093/molbev/msx117 149. Christensen, S., E. Molloy, P. Vachaspati, and T. Warnow. Optimal Completion of Incomplete Gene Trees in Polynomial Time. , 17th International Workshop on Algorithms for Bioinfor- matics (WABI) 2017. Lecture Notes in Computer Science (LNCS), Springer. 150. Bayzid, M.S. and T. Warnow. Gene Tree Parsimony for Incomplete Gene Trees. 17th Interna- tional Workshop on Algorithms for Bioinformatics (WABI) 2017. Lecture Notes in Computer Science (LNCS), Springer. 151. Vachaspati, P. and T. Warnow. Enhancing searches for optimal trees using SIESTA. Pro- ceedings RECOMB-CG (RECOMB International Workshop on Comparative Genomics) 2017, Lecture Notes in Computer Science book series (LNCS, volume 10562), pages 232-255.

29 152. Molloy, E. and T. Warnow. To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods. Systematic Biology, Volume 67, Issue 2, 1 March 2018, pages 285-303. https://doi.org/10.1093/sysbio/syx077. 153. Bayzid, Md S. and T. Warnow. Gene tree parsimony for incomplete gene trees: addressing true biological loss. Algorithms for Molecular Biology, 13:1., 2018. DOI: 10.1186/s13015-017- 0120-1. 154. Eiserhardt, W., A. Antonelli, D.J. Bennett, L.R. Botigue, J.G. Burleigh, S. Dodsworth, B.J. Enquist, F. Forest, J.T. Kim, A.M. Kozlov, I.J. Leitch, B.S. Maitner, S. Mirarab, W.H. Piel, O.A. Perez-Escobar, L. Pokorny, C. Rahbek, B. Sandel, S.A. Smith, A. Stamatakis, R.A. Vos, T. Warnow, W.J. Baker. A roadmap for global synthesis of the plant tree of life. American Journal of Botany, Vol. 105, Issue 3, 2018. doi:10.1002/ajb2.1041. 155. Christensen, S., E. Molloy, P. Vachaspati, and T. Warnow. OCTAL: Optimal Completion of Gene Trees in Polynomial Time. Algorithms for Molecular Biology, volume 13, number 6, 2018. https://doi.org/10.1186/s13015-018-0124-5. 156. Vachaspati, P. and T. Warnow. SIESTA: Enhancing searches for optimal supertrees and species trees. BMC Genomics, 19(Suppl 5):252, 2018. DOI: 10.1186/s12864-018-4621-1. 157. Nute, M., E. Molloy, J. Chou, and T. Warnow. The Performance of Coalescent-Based Species Tree Estimation Methods under Models of Missing Data. BMC Genomics, 19(Suppl 5):286, 2018. DOI: 10.1186/s12864-018-4619-8.

158. Vachaspati, P. and T. Warnow. SVDquest: Improving SVDquartets species tree estimation using exact optimization within a constrained search space. Molecular Phylogenetics and Evolution, Vol. 124, pp. 122-136, 2018. DOI:10.1016/j.ympev.2018.03.006. 159. Johnson, K.P., N. Nguyen, A.D. Sweet, B.M. Boyd, T. Warnow, and J.M. Allen (2018). Si- multaneous radiation of bird and mammal lice following the K-Pg boundary. Biology Letters, Vol. 14, page 20180141, DOI: 10.1098/rsbl.2018.0141 160. Zhang, Q., S. Rao, and T. Warnow. New absolute fast converging phylogeny estimation meth- ods with improved scalability and accuracy. Proceedings of WABI (Workshop on Algorithms for Bioinformatics) 2018. DOI: 10.4230/LIPIcs.WABI.2018 161. Pattabiraman, S. and T. Warnow. Are Profile Hidden Markov Models Identifiable? Proceed- ings of ACM-BCB 2018, pages 448-456, doi:10.1145/3233547.3233563. 162. Collins, K. and T. Warnow. PASTA for Proteins. Bioinformatics, Applications Note, 2018. https://doi.org/10.1093/bioinformatics/bty495 163. N. Shah, M. Nute, T. Warnow, and M. Pop. Misunderstood parameter of NCBI BLAST impacts the correctness of bioinformatics workflows. Bioinformatics, byt833, 2018. 164. Molloy, E. and T. Warnow. NJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees. Proceedings of RECOMB-CG (Satellite confer- ence for Comparative Genomics), Lecture Notes in Computer Science, vol 11183, pp. 260-276, 2018, Springer.

165. Roch, S., M. Nute, and T. Warnow. Long-branch attraction in species tree estimation: incon- sistency of partitioned likelihood and topology-based summary methods. Systematic Biology, Volume 68, Issue 2, March 2019, Pages 281-297, https://doi.org/10.1093/sysbio/syy061

30 166. Nute, M.G., E. Saleh, and T. Warnow. Evaluating Statistical Multiple Sequence Alignment in Comparison to Other Alignment Methods on Protein Data Sets. Systematic Biology Volume 68, Issue 3, May 2019, Pages 396-411, https://doi.org/10.1093/sysbio/syy068. 167. Zhang, Q., S. Rao, and T. Warnow. Constrained incremental tree building: new absolute fast converging phylogeny estimation methods with improved scalability and accuracy. Algorithms for Molecular Biology, 14(2), 2019. https://rdcu.be/blBXm 168. Le, T., A. Sy, E. Molloy, Q. Zhang, S. Rao and T. Warnow. Using INC within Divide- and-Conquer Phylogeny Estimation. Algorithms for Computational Biology, Chapter 12, DOI:10.1007/978-3-030-18174-1 12, 2019. 169. Molloy, E. and T. Warnow. TreeMerge: A new method for improving the scalability of species tree estimation methods. Bioinformatics (special issue for Intelligent Systems for Molecular Biology (ISMB) 2019), Volume 35, Issue 14, July 2019, Pages i417-i426, doi: 10.1093/bioinfor- matics/btz344 170. Molloy, E. and T. Warnow. Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge. Algorithms for Molecular Biology, 14:14, doi: 10.1186/s13015- 019-0151-x, 2019 (special issue of extended versions of selected papers from RECOMB-CG 2018) 171. Pattabiraman, S. and T. Warnow. Profile Hidden Markov Models are not identifiable. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2019. (Invited, extended versions of selected papers from ACM-BCB 2018) DOI: 10.1109/TCBB.2019.2933821. 172. Leebens-Mack, James H. and the One Thousand Plant Transcriptomes Initiative (2019). One thousand plant transcriptomes and the phylogenomics of green plants, Nature, 574, 679ˆaA¸S685,˘ https://doi.org/10.1038/s41586-019-1693-2. 173. Bradley, J., S. Devarakonda, A. Davey, D. Korobskiy, S. Liu, D. Lakhdar-Hamina, T. Warnow and G. Chacko (2020). Co-citations in context: disciplinary heterogeneity is relevant. Quan- titative Science Sciences (MIT Press) https://doi.org/10.1162/qss_a_00007 174. Christensen, S., E. Molloy, P. Vachaspati, and T. Warnow (2020). Non-parametric correction of estimated gene trees using TRACTION. Algorithms for Molecular Biology 15, 1 (2020) doi:10.1186/s13015-019-0161-8 (extended version of a paper from WABI 2019). 175. Smirnov, V. and T. Warnow (2020). Unblended Disjoint Tree Merging using GTM improves species tree estimation. BMC Genomics (special issue of selected papers from RECOMB-CG), Volume 21 (Suppl 2): 235, https://doi.org/10.1186/s12864-020-6605-1 176. Legried, B., E.K. Molloy, T. Warnow, and S. Roch (2019). Polynomial-time statistical estima- tion of species trees under gene duplication and Loss. International Conference on Research in Computational Molecular Biology (RECOMB 2020), pp. 120-135. The extended version of the paper appears in the Journal of Computational Biology, special issue for RECOMB 2020, 15 Dec 2020, https://doi.org/10.1089/cmb.2020.0424 177. E.K. Molloy and T. Warnow (2020). FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models. Accepted to ISMB 2020, Bioinformatics, Vol. 36, pages i57-i65, doi:10.1093/bioinformatics/btaa444 178. T. Le, A. Sy, E. K. Molloy, Q. Zhang, S. Rao, and T. Warnow (2020). Using Constrained- INC for Large-scale Gene Tree and Species Tree Estimation. IEEE/ACM Transactions on Computational Biology and Bioinformatics. doi: 10.1109/TCBB.2020.2990867

31 179. X. Yu, T. Le, S. Christensen, E. Molloy, and T. Warnow. Advancing Divide-and-Conquer Phylogeny Estimation using Robinson-Foulds Supertrees. Proceedings, WABI 2020. 180. S. Devarakonda, D. Korobskiy, T. Warnow, and G. Chacko (2020). Viewing computer science through citation analysis: Salton and Bergmark Redux. Scientometrics (2020). https://doi.org/10.1007/s11192-020-03624-0.

181. S. Devarakonda, J. Bradley, D. Korobskiy, T. Warnow, and G. Chacko (2020). Frequently Co-cited Publications: Features and Kinetics, Quantitative Science Studies (MIT Press), doi: 10.1162/qss_a_00075 182. S. Chandrasekharan, M. Zaka, S. Gallo, W. Zhao, D. Korobskiy, T. Warnow, & G. Chacko (2020). Finding Scientific Communities in Citation Graphs. Quantitative Science Studies (MIT Press). https://doi.org/10.1162/qss_a_00095 183. V. Smirnov and T. Warnow (2020). MAGUS: Multiple Sequence Alignment using Graph Clustering. In Press, Bioinformatics, btaa992, https://doi.org/10.1093/bioinformatics/ btaa992 184. V. Smirnov and T. Warnow (2021). ”Phylogeny Estimation Given Sequence Length Hetero- geneity.” Systematic Biology, Vol. 70, Issue 2, pages 268-282, https://doi.org/10.1093/ sysbio/syaa058 185. P. Zaharias, M. Grosshauser, and T. Warnow (2021). Re-evaluating deep neural networks for phylogeny estimation: The issue of taxon sampling. To appear, RECOMB 2021.

186. N. Shah, E.K. Molloy, M. Pop, and T. Warnow (2021). TIPP2: metagenomic taxonomic profiling using phylogenetic markers. Bioinformatics, btab023 https://doi.org/10.1093/ bioinformatics/btab023 187. P. Dibaiena, S. Tabe Bordhbar, and T. Warnow (2021). FASTRAL: Improving scalability of phylogenomic analysis. Bioinformatics, btab093. 188. P. Zaharias, V. Smirnov, and T. Warnow (2021). The Maximum Weight Trace Alignment Merging Problem. In: Martin-Vide C., Vega-Rodriguez M.A., Wheeler T. (eds) Algorithms for Computational Biology. AlCoB 2021. Lecture Notes in Computer Science, vol 12715. Springer, Cham. DOI: https://doi.org/10.1007/978-3-030-74432-8 12 189. E. Wedell, Y. Cai, and T. Warnow (2021). Scalable and Accurate Phylogenetic Placement using pplacer-XR. In: Martin-Vide C., Vega-Rodriguez M.A., Wheeler T. (eds) Algorithms for Computational Biology. AlCoB 2021. Lecture Notes in Computer Science, vol 12715. Springer, Cham. DOI: https://doi.org/10.1007/978-3-030-74432-8 7 190. J. Willson, M.S. Ruddur, and T. Warnow (2021). Comparing Methods for Species Tree Esti- mation With Gene Duplication and Loss. In: Martin-Vide C., Vega-Rodriguez M.A., Wheeler T. (eds) Algorithms for Computational Biology. AlCoB 2021. Lecture Notes in Computer Science, vol 12715. Springer, Cham. DOI: https://doi.org/10.1007/978-3-030-74432-8 8 191. A. Rhie, S.A. McCarthy, O. Fedrigo, et al. Towards complete and error-free genome assemblies of all vertebrate species. Nature 592, 737-746 (2021). 192. M. Park, P. Zaharias, and T. Warnow. Disjoint Tree Mergers for Large-Scale Maximum Likelihood Tree Estimation. Algorithms 2021, 14(5), 148. Special issue on Algorithms for Computational Biology.

32 193. E. Koning, M. Phillips, and T. Warnow. pplacerDC: a New Scalable Phylogenetic Placement Method. To appear, ACM-BCB 2021. 194. M. Gupta, P. Zaharias, and T. Warnow. Accurate Large-scale Phylogeny-Aware Alignment using BAli-Phy. Bioinformatics 2021

Publications as part of the ISCB Education Committee 1. Welch, L., C. Brooksbank, R. Schwartz, S.L. Morgan, B. Gaeta, A.M. Kilpatrick, D. Mietchen, B.L. Moore, N. Mulder, M. Pauley, W. Pearson, P. Radivojac, N. Rosenberg, A. Rosenwald, G. Rustici, and T. Warnow (2016). Applying, Evaluating and Refining Bioinformatics Core Competencies (An Update from the Curriculum Task Force of ISCB’s Education Committee). PLoS Comput Biol 12(5): e1004943, 2016.

Refereed Book Chapters 1. Warnow, T. (2005). “Large-scale phylogenetic reconstruction.” Book chapter, in S. Aluru (edi- tor), Handbook of Computational Biology, Chapman & Hall, CRC Computer and Information Science Series, December 2005.

2. Linder, C.R. and T. Warnow (2005). “Overview of phylogeny reconstruction.” Book chapter, in S. Aluru (editor), Handbook of Computational Biology, Chapman & Hall, CRC Computer and Information Science Series, December 2005. 3. Evans, S.N., D. Ringe, and T. Warnow (2006). ‘Inference of divergence times as a statistical inverse problem.” Book chapter in Phylogenetic Methods and the Prehistory of Languages, pp. 119-129. Edited by Peter Forster and Colin Renfrew. Edited for the Institute by Chris Scarre (Series Editor) and Dora A. Kemp (Production Editor). Publisher: McDonald Institute for Archaeological Research/University of Cambridge, 2006. 4. Warnow, T., S. N. Evans, D. Ringe, and L. Nakhleh (2006). “A Stochastic Model of Language Evolution that Incorporates Homoplasy and Borrowing.”Book chapter in Phylogenetic methods and the prehistory of languages, pp.75-87. Edited by Peter Forster and Colin Renfrew. Edited for the Institute by Chris Scarre (Series Editor) and Dora A. Kemp (Production Editor). Publisher: McDonald Institute for Archaeological Research/University of Cambridge. 5. Ringe, D. and T. Warnow (2008) “Linguistic History and Computational Cladistics”, book chapter (pages 257-271) in Origin and Evolution of Languages: Approaches, Models, Paradigms. Edited by Bernard Laks, pp. 75-87. Equinox Publishing. 6. Warnow, T. (2013). “Large-scale multiple sequence alignment and phylogeny estimation”, chapter 6 (pages 85-146) in Models and Algorithms for Genome Evolution, edited by Cedric Chauve, Nadia El-Mabrouk and Eric Tannier, Springer series on Computational Biology, Vol- ume 19.

7. Warnow, T. (2019). “Divide-and-Conquer Tree Estimation: Opportunities and Challenges.” In Bioinformatics and Phylogenetics, (T. Warnow, ed.), pp. 121-150, Springer series on Compu- tational Biology. 8. Warnow T. (2021) Revisiting Evaluation of Multiple Sequence Alignment Methods. In Multiple Sequence Alignment (K. Katoh, ed). Methods in Molecular Biology, vol 2231. Humana, New York, NY. https://doi.org/10.1007/978-1-0716-1036-7_17

33 9. Warnow T., Mirarab S. (2021) Multiple Sequence Alignment for Large Heterogeneous Datasets Using SAT´e, PASTA, and UPP. In Multiple Sequence Alignment (K. Katoh, ed). Meth- ods in Molecular Biology, vol 2231. Humana, New York, NY. https://doi.org/10.1007/ 978-1-0716-1036-7_7

34