Curriculum Vitae of Srinivas Aluru Executive Director, Institute for Data and Professor, School of Computational Science and Engineering Georgia Institute of Technology Contact: [email protected]

I. Education Ph. D. Computer Science Iowa State University 1994 M.S. Computer Science Iowa State University 1991 B. Tech Computer Science Indian Institute of Technology Madras, India 1989

II. Professional Appointments Academic Appointments Georgia Institute of Technology July 2013 – Present Professor, School of Computational Science and Engineering, College of Computing Adjunct Professor, School of Electrical and Computer Engineering Iowa State University Aug 1999 – July 2013 Faculty Member, Dept. of Electrical and Computer Engineering Courtesy Appointment, Dept. of Computer Science – Professor (2005-2013), Associate Professor (2001-2005), Assistant Professor (1999-2001) – Ross Martin Mehl and Marylyne Munas Mehl Professor (2009-2013) – Stanley Chair in Interdisciplinary Engineering, College of Engineering (2006-2009) – Member, L.H. Baker Center for Bioinformatics & Biological Statistics – Member, Center for Plant Genomics (from July 2014) New Mexico State University Aug 1996 – July 1999 Assistant Professor, Dept. of Computer Science Syracuse University Aug 1994 – July 1996 Visiting Assistant Professor, School of Computer and Information Science Administrative Appointments Georgia Institute of Technology Executive Director, Institute for Data Engineering and Science Oct 2019 – present Co-Executive Director, Institute for Data Engineering and Science July 2016 – Sep 2019 Interim Chair, School of Computational Science and Engineering June 2019 – Aug 2020 Iowa State University Chair, Bioinformatics and Computational Biology Program Aug 2005 – July 2007 Associate Chair, Dept. of Electrical and Computer Engineering Jan 2003 – June 2006 Associate Chair, Bioinformatics and Computational Biology Program Aug 2003 – July 2005 International Engagements Indian Institute of Technology (IIT) Bombay July 2009 – June 2014 Professor, Department of Computer Science and Engineering – Swarnajayanti Fellowship from Department of Science and Technology (2009-2014) – Bajaj Chair Professorship (2009-2012) Visiting Professor, Kanwal Rekhi School of Information Technology (Fall 2006) Indraprastha Institute of Information Technology (IIIT Delhi) 2013-2016 Consulting External Head, Center for Computational Biology Other Leadership Roles Lead PI, NSF South Big Data Regional Innovation Hub Sep 2015 – Present The NSF South Big Data Hub serves sixteen U.S. Southern states and Washington D.C. It nucleates multi-stakeholder partnerships in pursuit of addressing regional challenges through data-driven approaches. The Hub nurtures public-private partnerships, grows data science com- munities, facilitates data sharing, and engages in education and workforce development.

III. Honors and Awards

Professional Society Honors 2020 Fellow, Association for Computing Machinery (ACM) 2020 Fellow, Society for Industrial and Applied Mathematics (SIAM) 2010 Fellow, American Association for the Advancement of Science (AAAS) 2010 Fellow, Institute for Electrical and Electronics Engineers (IEEE) 2017 Distinguished Member, Association for Computing Machinery (ACM) 2019 IEEE Technical Committee on Parallel Processing (TCPP) Outstanding Service Award 2015 IEEE Computer Society Golden Core Award 2015 IEEE Computer Society Meritorious Service Award 2004-2006 IEEE Computer Society Distinguished Visitor 2002 IEEE Senior Member Research Awards 2017 John V. Atanasoff Discovery Award, College of Liberal Arts and , Iowa State University 2016 ACM SIGHPC Certificate of Recognition for Reproducible Methods in Scientific Re- search 2009 Swarnajayanti Fellowship, Department of Science and Technology, Government of India (awarded 2-4 per year in all branches of engineering and computer science combined) 2007 Computer World Honors Program Laureate and Finalist for 21st Century Achievement Award in Energy, Environment and Agriculture (for innovative use of information tech- nology for the benefit of society) 2002 IBM Faculty Award 1997 NSF CAREER Award Best Paper Awards 2015 Best student paper, ACM/IEEE Supercomputing Conference (SC) 2014 Best paper finalist, ACM/IEEE Supercomputing Conference (SC) 2007 Best paper finalist, ACM/IEEE Supercomputing Conference (SC) 2006 Best paper award, IEEE International Parallel and Distributed Processing Symposium (IPDPS) 2005 Best paper award, IEEE Computational Systems Bioinformatics (CSB) conference 2002 Best paper finalist, ACM/IEEE Supercomputing Conference (SC) University Awards 2019 Outstanding Achievement in Research Program Development Award, Georgia Institute of Technology 2016 Outstanding Senior Faculty Research Award, College of Computing, Georgia Institute of Technology 2015 Dean’s Award, College of Computing, Georgia Institute of Technology 2011 Iowa State University Award for Outstanding Achievement in Research 2006 Iowa State University Foundation Award for Mid-Career Achievement in Research 2002 Young Engineering Faculty Research Award, Iowa State University 1989 Premium for Academic Excellence Award, Iowa State University Teaching Awards 2005 Warren B. Boast Undergraduate Teaching Award, Iowa State University 1993 Teaching Excellence Award, Iowa State University.

Endowed Chairs and Professorships 2009-2013 Ross Martin Mehl and Marylyne Munas Mehl Professorship, Dept. of Electrical and Computer Engineering, Iowa State University 2009-2012 Bajaj Chair Professorship, Computer Science and Engineering, Indian Institute of Tech- nology Bombay 2006-2009 Stanley Chair in Interdisciplinary Engineering, Iowa State University

IV. Teaching

Courses Developed and Taught Georgia Institute of Technology 1. CSE 6220 High-Performance Computing 2. CX 4220 Introduction to High-Performance Computing

Iowa State University 3. CprE/ComS/BCB/Gen 548 Fundamental Algorithms in Computational Biology 4. CprE/ComS/BCB 549 Advanced Algorithms in Computational Biology 5. CprE 426/526 Introduction to Parallel Algorithms and Programming 6. CprE 626 Parallel Algorithms for Scientific Applications

New Mexico State University 7. CS 562 Advanced Topics in Computational Science and Engg. 8. CS 552 Introduction to Computational Science and Engg. 9. CS 490 Introduction to Parallel Computing

Syracuse University 10. CIS 600 Computational Geometry 11. CIS 700 Parallel and Distributed Algorithms

Courses Taught Iowa State University 1. CprE 310 Theoretical Foundations of Computer Engineering

New Mexico State University 2. CS 473 Architectural Concepts I 3. CS 573 Architectural Concepts II 4. CS 272 Introduction to Data Structures

Syracuse University 5. CIS 600 Principles of Database Systems 6. CIS 710 Parallel Computing 7. ECS 101 Introduction to Engineering and Computer Science 8. CIS 655 Computer Architecture 9. CIS 657 Operating Systems

Research Interests

1. Data Science – Scalable methods for large-scale data analysis, Bayesian networks, Data- intensive problems in science and engineering. 2. High Performance Computing — Parallel algorithms and applications, Applications on emerg- ing platforms (multicores, manycores, clouds). 3. Bioinformatics and Computational Biology — Computational genomics and metagenomics, Systems biology, and High performance/Parallel computational biology. 4. Combinatorial Scientific Computing — Spatial domain decomposition methods, Fast multi- pole method, Computational electromagnetics, and Materials informatics. V. Grants and Contracts

External Grants for Community Benefit 1. S. Aluru (PI), S. Kalidindi, D. Sherrill, D. Shoemaker and R. Vuduc, National Science Foundation, “MRI: Acquisition of an HPC System for Data-Driven Discovery in Computa- tional Astrophysics, Biology, , and Materials Science”, $3,699,317, Oct 2018 – Sep 2021.

2. S. Aluru (PI), S. Kalidindi, M. Marathe, R. Rawlings-Goss and P. Sullivan, National Science Foundation, “BD Hubs: Collaborative Proposal: SOUTH: The South Big Data Innovation Hub”, $2,031,560, June 2019 – May 2023.

3. D. Randall, S. Aluru (co-PI), N. Washburn and J. Wing, National Science Foundation, “TRIPODS+X: VIS: Creating an Annual Data Science Forum”, $200,000, Jan 2019 – Dec 2021.

4. X. Huo, S. Aluru (co-PI), D. Randall, P. Tetali and J. Wu, National Science Foundation, “TRIPODS: Transdisciplinary Research Institute for Advancing Data Science (TRIAD)”, $1,500,000, Sep 2017 – Aug 2020.

5. S. Aluru (PI), National Science Foundation, “Big Data Regional Innovation Hubs and Spokes Workshop”, $33,188, May 2017 – May 2018.

6. S. Aluru (Collaborative lead-PI) and S. Ahalt, Computing Research Association, “South Big Data Hub Industry-Academic Early Career Collaboration Program”, $300,000, Apr 2016 – Sep 2016.

7. S. Aluru (PI) and R. Rawlings-Goss, National Science Foundation, “BD Hubs: Collabora- tive Proposal: SOUTH: A Big Data Innovation Hub for the South Region”, $1,150,432, Oct 2015 – Sep 2019.

8. A. Somani, S. Aluru (co-PI), R. Fox, E. Takle and M. Gordon, National Science Foundation, “MRI: Acquisition of a HPC system for Data-Driven Discovery in Science and Engineering”, $1,841,346, Aug 2012 – July 2015.

9. S. Aluru (PI), Roy J. Carver Charitable Trust, “High Performance Computing Facility for Iowa State University”, $158,596, April 2009.

10. S. Aluru (PI), M. Aluru, J. McCalley, K. Rajan, and A. Somani, National Science Foun- dation, “CRI:IAD Acquisition of a Cluster and High Performance Storage for Data-intensive Applications in Materials Science, Power Systems and Systems Biology”, $719,000, April 2008 – March 2011.

11. A.K. Somani, S. Aluru (co-PI), R. Jernigan, R. Kanwar, J.D. McCalley and D.T. Rover, Department of Education, “GAANN: Training Students for Computing Oriented Interdisci- plinary Careers in Research and Education”, $506,688, Aug 2006 – Aug 2010. 12. R.K. Wilson, W.R. McCombie, R. Martienssen, D. Ware, L. Stein, P.S. Schnable, S. Aluru (co-PI), R. Wing and S. Clifton, National Science Foundation, “Sequencing the Maize Genome”, $29,400,000, Nov 2005 – May 2010.

13. S. Aluru (PI), R. Jernigan, P.S. Schnable and A.K. Somani, National Science Founda- tion, “Acquisition of a 512-node BlueGene/L Supercomputer for Large-Scale Applications in Genomics and Systems Biology”, $600,000, Aug 2005 – Jul 2008.

14. S. Aluru (PI), S. Balasubramaniam, V. Brendel, S. Kothari and D. Rover, National Science Foundation, “CISE Research Resources: Acquisition of a Cluster for Experimental Parallel Computing Research in Scientific Computing and Computational Biology”, $166,518, Sep 2001 – Sep 2004.

15. A. Karshmer (PI), S. Aluru (co-PI), M. Auguston, G. Gupta, S. Castillo, R. Hartley, R. Hills, B. Nassersharif, E. Pontelli, D. Ranjan, and J. Wiebe, National Science Foundation Minority Institution Infrastructure Award, “Dynamic and Irregular Parallelism: Its Manage- ment in Symbolic and Scientific Computing”, $1,498,343, September 1998 – August 2003.

16. S. Aluru (PI), S. Castillo and E. Johnson, National Science Foundation CISE Research Instrumentation Award, “Application Oriented Parallel Computing and Scientific Computing Research on a Commodity Parallel System”, $75,150, December 1997 – December 1999.

External Grants for Personal Research 17. S. Aluru (subcontract PI), D.A. Bader, E. Chow and R. Vuduc, National Science Foun- dation, “Operations and Maintenance for the Endless Frontier”, $250,000 (subcontract from University of Texas at Austin), Oct 2019 – Sep 2024.

18. L. Song and S. Aluru (co-PI), National Science Foundation, “III: EAGER: A Framework for Learning Graph Algorithms with Applicaitons to Social and Gene Networks”, $300,000, Aug 2018 – Aug 2020.

19. S. Aluru (PI), National Science Foundation, “AF: Small: Algorithmic Techniques for High- throughput Analysis of Long Reads”, $424,992, May 2018 – May 2021.

20. S. Aluru (PI), National Science Foundation, “Collaborative Research: Sequential and Par- allel Algorithms for Approximate Sequence Matching with Applications to Computational Biology”, $525,000, Aug 2017 – Aug 2020.

21. S. Aluru (PI), National Science Foundation, “Reproducibility and Comprehensive Assess- ment of Next Generation Sequencing Bioinformatics Software”, $499,984, July 2017 – June 2020.

22. S. Aluru (Task Lead), Pacific Northwest National Laboratory, “Subgraph Matching in Streaming Graphs using the Micron Automata Processor”, $350,000, July 2016 – May 2017.

23. S. Aluru (PI), Intel Corporation, “Intel Parallel Computing Center: Big Data in Biosciences and Public Health”, $400,000, July 2015 – June 2017. 24. S. Aluru (PI), National Science Foundation, “Collaborative Research: ABI Innovation: Towards High Performance Flexible Transcription factor-DNA Docking”, $156,722, Aug 2014 – July 2018.

25. S. Aluru (PI), National Science Foundation, “EAGER: Exploratory Research on the Micron Automata Processor”, $300,000, Aug 2014 – July 2019.

26. S. Aluru (PI), National Science Foundation, “Collaborative Research:XPS:CLCCA: Perfor- mance Portable Abstractions for Large-Scale Irregular Computations”, $299,999, Sep 2013 – Aug 2018.

27. Y. Yin, M. Aluru and S. Aluru (co-PI), National Science Foundation, “Signaling Network for Brassinosteroid-Regulated Gene Expression in Arabidopsis”, $863,615, Feb 2013 – Jan 2017.

28. S. Aluru (PI), P. Schnable, C. Sing and J. Zola, NSF/NIH Bigdata Initiative, “BIGDATA: Mid-Scale: DA: Collaborative Research: Genomes Galore - Core Techniques, Libraries, and Domain Specific Languages for High-Throughput DNA Sequencing”, $1,300,000, Jan 2013 – Dec 2018.

29. S. Aluru (PI), K. Dorman and P.S. Schnable, National Science Foundation, “AF:Medium: Parallel Algorithms and Software for High-throughput Sequence Assembly”, $1,000,000, May 2012 – Apr 2018.

30. K. Dorman, S. Aluru (co-PI), B. Blitvich, and A. Ramamoorthy, National Science Founda- tion, “ATD: Models for (Meta)Genome Identification from Next Generation Sequence Data with Errors”, $660,922, Sep 2011 – July 2015.

31. S. Aluru (PI), Micron Technology Foundation, “High-throughput Computational Biology”, $195,000, Sep 2011 – Aug 2013.

32. S. Balasubramaniam and S. Aluru (co-PI), National Science Foundation, “AF:Small: Par- allel Transient Solvers for Multiscale Electromagnetics”, $496,335, Aug 2010 – Aug 2013.

33. S. Aluru (PI), Department of Science and Technology Swarnajayanti Fellowship, “Large Scale Gene Expression Analysis on High Performance Parallel Computers for Functional Ge- nomics and Systems Biology”, $370,000, Nov 2009 – Nov 2014.

34. K. Rajan, S. Aluru (co-PI) and B. Ganapathysubramanian, National Science Foundation, “CDI: Dimensionality-Reduction and Reconstruction Tools for Atom Probe Tomography”, $952,884, Sep 2009 – Aug 2013.

35. S. Aluru (PI), B. Ganapathysubramanian and K. Rajan, National Science Foundation, “Parallel Methods for Large, Atomic-scale Quantitative Analysis of Materials”, $497,784, May 2009 – May 2012.

36. S. Aluru (PI), M. Aluru and D.S. Nettleton, National Science Foundation, “CPA-ACR: Parallel Algorithms and Software for Large Scale Microarray Data Analysis and Gene Network Inference”, $383,000, July 2008 – June 2012. 37. S. Aluru (PI), S. Balasubramaniam and S.J. Plimpton, National Science Foundation, “SGER: Exploring Timescale Parallelization for Long-timescale Molecular Dynamics”, $103,063, May 2008 – Oct 2009.

38. S. Aluru (PI), National Science Foundation, “Collaborative Research: PACE - Parallel Accelerated Cartesian Expansions with Application to Molecular Dynamics”, $102,600, Sep 2007 – Aug 2010.

39. P.S. Schnable, S. Aluru (co-PI), U.S.-Israel Binational Agricultural Research and Develop- ment Fund, “Building Consensus Genetic Maps for Maize and Wheat”, $283,000, July 2006 – June 2010.

40. S. Aluru (PI), National Science Foundation, “Techniques and Tools for Parallel Solution of Irregular Problems”, $262,000, Nov 2004 – Nov 2008.

41. S. Aluru (PI) and P.S. Schnable, National Science Foundation, “Efficient Representation and Manipulation of Large-Scale Biological Sequence Data”, $440,494, Sep 2004 – Aug 2008.

42. S. Aluru (subcontract PI), National Science Foundation, “CREST: Center for Research Excellence in Bioinformatics and Computational Biology’, $144,000 (subcontract from New Mexico State University), Sep 2004 – Aug 2009.

43. S. Aluru (PI), National Science Foundation, “Collaborative Research: Parallel Hybrid Dif- ferential and Integral Equation Based Solvers for Time Domain Electromagnetic Analysis with Application to Circuits”, $162,849, Aug 2003 – Aug 2007.

44. S. Aluru (PI), V. Brendel and S. Kothari, National Science Foundation, “Parallel Algo- rithms and Software for EST Clustering”, $160,000, Jul 2002 – Dec 2004.

45. S. Aluru (PI), IBM Corporation Faculty Award, $30,000, August 2002 – July 2003.

46. S. Aluru (PI), National Science Foundation, “CISE Postdoctoral Research Associates in Experimental Computer Science: Parallel Hierarchical Methods for Computational Electro- magnetics”, $66,000, Aug 2001 – Aug 2004.

47. S. Aluru (PI), S. Balasubramaniam and E. Michielssen (Univ. of Illinois at Urbana- Champaign), National Science Foundation, “Parallel Algorithms and Software for the Steepest Descent Fast Multipole Method”, $397,306, Sep 2000 – Aug 2004.

48. S. Aluru (PI), SUN Academic Equipment Grant, “A Parallel Software System for Gene Identification and Annotation from EST Data”, $86,124, Aug 2000.

49. S. Aluru (PI), Sandia National Laboratories, “Distributed Data Structures for Parallel Scientific Computing”, $35,000, October 1997 – September 1998.

50. S. Aluru (PI) and S. Ranka, Ballistic Missile Defense Organization, “Scalable Parallelization of Spatial Data Structures with Applications”, $216,604, September 1997 – August 1999. 51. S. Aluru (PI), Development of a program in Computational Science, ADMI Visiting Scien- tist Program, provides funding for an expert consultant to assist in developing a program in computational science, August 1997 – July 1998.

52. S. Aluru (PI), National Science Foundation CAREER Award, “Parallel Algorithms and Software for Irregular Scientific Applications”, $234,999, August 1997 – July 2002.

53. S. Castillo and S. Aluru (co-PI), Sandia National Laboratories, “Development of a Complex Version of Aztec”, $44,625, July 1997 – December 1997.

54. S. Aluru (PI), Sandia National Laboratories, “Efficient Parallel Algorithms for Hierarchical N-body Methods”, $35,000, October 1996 – September 1997.

Pending External Grants 55. S. Aluru (PI), National Science Foundation, “OAC Core: Scalable Parallel Algorithms and Software for Sparse Symmetric and Hierarchical Tensors”, $499,214, May 2021 – May 2024.

University Grants 56. S. Aluru (PI), J. Dickerson, K. Dorman, L. Dong, L. Jarboe, D. Nettleton, B. Nikolau, S. Pandy, A. Ramamoorthy, P.S. Schnable and J.V. Shanks, College of Engineering Dean’s Research Initiative, $500,000, “Dean’s Research Initiative in High Throughput Computational Biology”, Apr 2011 – Mar 2014.

57. S. Aluru (PI), Plant Sciences Institute Genomics Research Initiative, $56,982, “Software Tools for Analysis of High-throughput Data from Next Generation Sequencers”, Jul 2008 – Jun 2010.

58. S. Aluru (PI) and S.C. Kothari, Grow Iowa Values Fund Grant Program, $96,016, “Model- based Reasoning for Software: Advanced Algorithmic Techniques and Prototype Implemen- tation”, Jan 2006 – May 2007.

59. S. Aluru (PI), Seed funding from Information Infrastructure Institute (ICube), $16,843, Jan 2004 – Dec 2004.

60. S. Aluru (PI), Special Research Initiation Grant, “Design and Development of Rapid Access Databases for Biological Sequences”, $16,000, Jan 2002 – Dec 2002.

61. S. Aluru (PI), International Travel Grant, $1,038, Dec 2001.

62. S. Aluru (PI), Provost’s Office, Study in Second Discipline: Molecular Biology, $15,000, Fall 2001.

63. S. Aluru (PI) and V. Brendel, University Research Grant, $26,788, “Parallel Computational Methods for Gene Identification from EST Collections”, Jul 2000 – Dec 2001.

64. S. Aluru (PI), International Travel Grant, $1,125, Dec 1999. VI. Technical Publications

Books 1. S. Aluru (Editor), Handbook of Computational Molecular Biology, Chapman & Hall/CRC Computer and Information Science Series, 2005 (1104 pages).

• The Editor co-authored five chapters: – Pairwise Sequence Alignments (with B. Jackson) – Look-up Tables, Suffix Trees and Suffix Arrays (with P. Ko) – Suffix Trees Applications in Computational Biology (with P. Ko) – Expressed Sequence Tags: Clustering and Applications (with A. Kalyanaraman) – Algorithms for Large Scale Sequence Clustering and Assembly (with S.J. Emrich and A. Kalyanaraman)

2. C.S.R. Murthy, K.N.B. Murthy, and S. Aluru, New Parallel Algorithms for Direct Solution of Linear Equations, John Wiley & Sons Publishing Company, October 2000.

Book Chapters 1. A. Sarje, J. Zola and S. Aluru, “Pairwise computations on the Cell processor with applications in computational biology,” In Scientific Computing with Multicore and Accelerators, Edited by Jack Dongarra, David A. Bader, and Jakub Kurzak, Chapman & Hall/CRC Computer and Information Science Series, Chapter 14, pp. 297-327, 2010.

2. A. Sarje and S. Aluru, “Parallel Algorithms for Alignments on the Cell Broadband Engine,” In Emerging Parallel Architectures for Bioinformatics Applications, Edited by B. Schmidt, Chapman & Hall/CRC Computer and Information Science Series, Chapter 4, pp. 59-84, 2010.

3. X. Yang and S. Aluru, “Comparative Genomics: Algorithms and Applications,” In Algorithms in Computational Molecular Biology: Techniques, Approaches and Applications, Edited by M. Elloumi and A. Zomaya, Wiley, Chapter 32 (25 pages), 2010.

4. T.M. Murali and S. Aluru, “Computational Systems Biology,” In Algorithms and Theory of Computation Handbook. Volume II: Special Topics and Techniques, Edited by M.J. Atallah and M. Blanton, Chapman & Hall/CRC Computer and Information Science Series, Chapter 32 (26 pages), 2009.

5. S. Aluru, “Text Indexing,” In Encyclopedia of Algorithms, Edited by Ming-Yang Kao, Springer, 2008.

6. S. Seal and S. Aluru, “Spatial Domain Decomposition Methods for Parallel Scientific Com- puting”, In Handbook of Parallel Computing: Models, Algorithms and Applications, Edited by S. Rajasekaran and J. Reif, Chapman & Hall/CRC Computer and Information Science Series, Chapter 44 (24 pages), 2007. 7. S. Aluru, N. Amato, D. Bader, S. Bhandarkar, L. Kale, D. Marinescu, and N. Samatovan, “Parallel Computational Biology,” In Frontiers of Scientific Computing, Edited by Mike Her- oux, Padma Raghavan and Horst Simon, SIAM Publishing, pp. 357-378, 2006.

8. S. Aluru, “Bioinformatics,” In Handbook of Nature-Inspired and Innovative Computing: Inte- grating Classical Models with Emerging Technologies, Edited by Albert Y. Zomaya, Springer, Chapter 21, pp. 657-695, 2006.

9. S. Aluru, “Quad and Oct Trees,” In Handbook of Data Structures and Applications, Edited by Dinesh P. Mehta and Sartaj Sahni, Chapman & Hall/CRC Computer and Information Science Series, Chapter 19 (26 pages), 2004.

10. S. Aluru, “Suffix Trees and Suffix Arrays,” In Handbook of Data Structures and Applica- tions, Edited by Dinesh P. Mehta and Sartaj Sahni, Chapman & Hall/CRC Computer and Information Science Series, Chapter 29 (21 pages), 2004.

Journal Articles 11. H. Zhang, C. Jain and S. Aluru, “A comprehensive evaluation of long read error correction methods,” BMC Genomics, Vol. 21 (Suppl 6), Article No. 889, 15 pages, 2020.

12. S.P. Chockalingam, S.V. Thankachan and S. Aluru, “Sequential and parallel algorithms for all-pair k-mismatch maximal common substrings,” Journal of Parallel and Distributed Com- puting, Vol. 144, pp. 68-79, 2020.

13. C. Jain, H. Zhang, Y. Gao and S. Aluru, “On the complexity of sequence-to-graph alignment,” Journal of Computational Biology, Vol. 27, No. 4, pp. 640-654, 2020.

14. I. Roy, A. Srivastava, M. Grimm and S. Aluru, “Interval stabbing on the automata processor,” Journal of Parallel and Distributed Computing, Vol. 135, pp. 234-245, 2020.

15. I. Roy, A. Srivastava, M. Grimm, M. Nouria, M. Becchi and S. Aluru, “Evaluating high per- formance pattern matching on the Automata Processor,” IEEE Transactions on Computers, doi:10.1109/TC.2019.2901466, 12 pages, 2019.

16. R. Corona, S. Sudarshan, S. Aluru and J. Guo, “An SVM-based method for assessment of transcription factor-DNA complex models,” BMC Bioinformatics, Vol. 19(Suppl 20), Article No. 506, pp. 49-57, 2018.

17. C. Jain, L.M. Rodriguez, A.M. Phillippy, K.T. Konstantinidis and S. Aluru, “High-throughput ANI analysis of 90K prokaryotic genomes reveals clear special boundaries,” Nature Commu- nications, Vol. 9, Article No. 5514, 8 pages, 2018.

18. T. Pan, R. Nihalani and S. Aluru, “Fast de Bruijn graph compaction in distributed memory environments,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, doi: https://doi.org/10.1109/TCBB.2018.2858797, 13 pages, 2018. 19. C. Jain, A. Dilthey, S. Koren, S. Aluru and A. Phillippy, “A fast approximate algorithm for mapping long reads to large reference databases,” Journal of Computational Biology, Vol. 25, No. 7, pp. 766-779, 2018.

20. T. Pan, P. Flick, C. Jain, Y. Liu and S. Aluru, “Kmerind: A flexible parallel library for k-mer indexing of biological sequences on distributed memory systems,” IEEE Transactions on Computational Biology and Bioinformatics, doi:10.1109/TCBB.2017.2760829, 15 pages, 2017.

21. P. Flick, C. Jain, T. Pan and S. Aluru, “Reprint of: A parallel connectivity algorithms for de Bruijn graphs in metagenomic applications,” Parallel Computing, Special Issue on SC16 Student Cluster Competition Reproducibility Initiative, Vol. 70, pp. 54-65, 2017.

22. C. Jain, P. Flick, T. Pan, O. Green and S. Aluru, “An Adaptive Parallel Algorithm for Computing Connectivity,” IEEE Transactions on Parallel and Distributed Systems, Vol. 28, No. 9, pp. 2428-2439, 2017.

23. S. V. Thankachan, S. Chockalingam, Y. Liu, A. Krishnan and S. Aluru, “A greedy alignment- free distance estimator for phylogenetic inference,” BMC Bioinformatics, Vol. 18 (Suppl 8), No. 238, pp. 1-8, 2017.

24. I. Rytsareva, D.S. Campo, Y. Zheng, S. Sims, S.V. Thankachan, C. Tetik, C. Jain, S. Chock- alingam, A. Sue, S. Aluru and Y. Khudyakov, “Efficient detection of viral transmission with next-generation sequencing data,” BMC Genomics, Vol. 18 (Suppl 4), No. 372, pp. 11-17, 2017.

25. H. Ye, S. Liu, B. Tang, J. Chen, Z. Xie, T. Nolan, H. Jiang, H. Guo, H.Y. Lin, L. Li, Y. Wang, H. Tong, M. Zhang, Z. Li, C. Chu, M. Aluru, S. Aluru, P.S. Schnable and Y. Yin, “RD26 mediates crosstalk between drought and brassinosteroid signaling pathway,” Nature Communications, Vol. 8, Article No. 14573, 2017.

26. S. Chockalingam, M. Aluru and S. Aluru, “Microarray data processing techniques for genome- scale network inference from large public repositories,” Microarrays, special issue on Next Generation Microarray Bioinformatics, Vol. 5, No. 3, pp. 23-35, 2016.

27. Y. Liu, M. Loewer, S. Aluru and B. Schmidt, “SNVSniffer: an integrated caller for germline and somatic single-nucleotide and indel mutations,” BMC Systems Biology, Vol. 10(Suppl 2):47, pp. 215-229, 2016.

28. S.V. Thankachan, A. Apostolico and S. Aluru, “A provably efficient algorithm for k-mismatch average common substring problem,” Journal of Computational Biology, Vol. 23, No. 6, pp. 472-482, 2016.

29. S.V. Thankachan, S.P. Chockalingam, Y. Liu, A. Apostolico and S. Aluru, “ALFRED: A practical method for alignment-free distance computation,” Journal of Computational Biol- ogy, Vol. 23, No. 6, pp. 452-460, 2016. 30. I. Roy and S. Aluru, “Discovering motifs in biological sequences using the Micron Automata Processor,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 13, No. 1, pp. 99-111, 2016.

31. S. Misra, K. Pamnany and S. Aluru, “Parallel mutual information based construction of genome-scale networks on the Intel Xeon Phi coprocessor,” IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 12, No. 5, pp. 1008-1020, 2015.

32. S. Pal and S. Aluru, “In search of perfect reads,” BMC Bioinformatics, 16 (Suppl 17):S7, 8 pages, 2015.

33. S. Aluru and Y. Simmhan, “Editorial: Scalable systems for big data management and ana- lytics,” Journal of Parallel and Distributed Computing, Vol. 79-80, pp. 1-2, 2015.

34. D. Mester, Y. Ronin, P. Schnable, S. Aluru and A. Korol, “Fast and accurate construction of ultra-dense consensus genetic maps using evolution strategy optimization,” PLOS ONE, Vol. 10, No. 4, doi:10.1371/journal.pone.0122485, 16 pages, 2015.

35. S.K. Samudrala, J. Zola, S. Aluru and B. Ganapathysubramanian, “Parallel framework for dimensionality reduction of large-scale datasets,” Journal of Scientific Programming, Vol. 2015, Article ID 180214, doi:10.1155/2015/180214, 12 pages, 2015.

36. S. Aluru and D. Slonim, “Guest-editor’s introduction: Selected papers from ACM-BCB 2013,”, IEEE/ACM Transactions on Computational Biology and Bioinformatics, Vol. 12, No. 1, pp. 2-3, 2015.

37. S. Aluru and N. Jammula, “A review of hardware acceleration for computational genomics,” IEEE Design and Test of Computers, Vol. 31, No.1, pp. 19-30, 2014.

38. H. Guo, L. Li, M. Aluru, S. Aluru, and Y. Yin, “Mechanisms and networks for brassinosteroid regulated gene expression,” Current Opinion in Plant Biology, Vol. 16, No. 5, pp. 545-553, 2013.

39. O. Nikolova, J. Zola and S. Aluru, “Parallel globally optimal structure learning of Bayesian networks,” Journal of Parallel and Distributed Computing, Vol. 73, pp. 1039-1048, 2013.

40. A. Sarje and S. Aluru, “All-pairs computations on many-core graphics processors,” Parallel Computing, Vol. 39, pp. 79-93, 2013.

41. X. Yang, J. Zola and S. Aluru, “Large Scale Metagenomic Sequence Clustering on Map- Reduce Clusters,” Journal of Bioinformatics and Computational Biology, Vol. 11, No. 1, Article No. 1340001, 15 pages, 2013.

42. M. Aluru, J. Zola, D. Nettleton and S. Aluru, “Reverse engineering and analysis of large genome-scale gene networks,” Nucleic Acids Research, Vol. 41, No. 1, pp. e24 (13 pages), doi: 10.1093/nar/gks904, 2013.

43. Y. Ronin, D. Mester, D. Minkov, R. Belotserkovski, B.N. Jackson, P.S. Schnable, S. Aluru and A. Korol, “Two-phase analysis in consensus genetic mapping,” Genes, Genomes and Genetics, Vol. 2, No. 5, pp. 537-549, 2012. 44. X. Yang, S.P. Chockalingam and S. Aluru, “A survey of error correction methods for next generation sequencing,” Briefings in Bioinformatics, doi: 10.1093/bib/bbs015, 11 pages, 2012.

45. S. Aluru, “Bioinformatics for next generation sequencing,” CSI Journal of Computing, Vol. 1, No. 1, pp. 65-79, 2012.

46. M. Vikram, B. Shanker, S. Seal and S. Aluru, “A scalable parallel wideband MLFMA for efficient electromagnetic simulations on large scale clusters,” IEEE Transactions on Antennas and Propagation, Vol. 59, No. 7, pp. 2565-2577, 2011.

47. X. Yang, S. Aluru and K. Dorman, “Repeat-aware modeling and correction of short read errors,” BMC Bioinformatics, Vol. 12 (Suppl 1):S52 (10 pages), 2011.

48. X. Yu, L. Li, J. Zola, M. Aluru, H. Ye, A. Foudree, H. Guo, S. Anderson, S. Aluru, P. Liu, S. Rodermel and Y. Yin, “A brassinosteroid transcriptional network revealed by genome-wide identification of BES1 target genes in Arabidopsis thaliana,” The Plant Journal, Vol. 65, No. 4, pp. 634-646, 2011.

49. A. Sarje, J. Zola and S. Aluru, “Accelerating pairwise computations on Cell processors,” IEEE Transactions on Parallel and Distributed Systems, Vol. 22, No. 1, pp. 69-77, 2011.

50. X. Yang, K. Dorman and S. Aluru, “Reptile: Representative tiling for short read error cor- rection,” Bioinformatics, Vol. 26, No. 20, pp. 2526-2533, 2010.

51. J. Zola, M. Aluru, A. Sarje and S. Aluru, “Parallel information theory based construction of gene regulatory networks,” IEEE Transactions on Parallel and Distributed Systems, Vol. 21, No. 12, pp. 1721-1733, 2010.

52. A. Wehe, W-C. Chang, O. Eulenstein and S. Aluru, “A scalable parallelization of the gene duplication problem,” Journal of Parallel and Distributed Computing, Vol. 70, pp. 237-244, 2010.

53. P.S. Schnable, D. Ware, ... , S. Aluru, R.A. Martienssen, S.W. Clifton, W.R. McCombie, R.A. Wing and R.K. Wilson, “The B73 maize genome: complexity, diversity and dynamics,” Science (cover article), Vol. 326, Issue 5956, pp. 1112-1115, 2009.

54. F. Wei, J.C. Stein, ... , J. Bennetzen, R. Martienssen, W.R. McCombie, S. Aluru, S.W. Clifton, P.S. Schanble, D. Ware, R.K. Wilson and R.A. Wing, “Detailed analysis of a con- tiguous 22-Mb region of the maize genome,” PLOS Genetics, Vol. 5, No. 11, 16 pages, 2009.

55. A. Sarje and S. Aluru, “Parallel genomic alignments on the Cell broadband engine,” IEEE Transactions on Parallel and Distributed Systems, Vol. 20, No. 11, pp. 1600-1610, 2009.

56. B.G. Jackson, P.S. Schnable and S. Aluru, “Parallel short sequence assembly of transcrip- tomes,” BMC Bioinformatics, Vol. 10, Suppl. 1:S14, 2009.

57. D. A. Bader and S. Aluru, “Guest editorial: High-performance computational biology,” Par- allel Computing, Vol. 34, No. 11, pp. 613-615, 2008. 58. M. Ott, J. Zola, S. Aluru, A.D. Johnson, D. Janies and A. Stamatakis, “Large-scale phylo- genetic analysis on current HPC architectures,” Journal of Scientific Programming (special issue on Supercomputing 2007 best papers), Vol. 16, pp. 255-270, 2008.

59. B. Jackson, P.S. Schnable and S. Aluru, “Consensus genetic maps as median orders from in- consistent sources,” ACM/IEEE Transactions on Computational Biology and Bioinformatics, Vol. 5, No. 2, pp. 161-171, 2008.

60. A. Kalyanaraman, S.J. Emrich, P.S. Schnable and S. Aluru, “Assembling genomes on large- scale parallel computers,” Journal of Parallel and Distributed Computing (special issue on IPDPS 2006 best papers), Vol. 67, pp. 1240-1255, 2007.

61. S.J. Emrich, L. Li, T.-J. Wen, M.D. Yandeau-Nelson, Y. Fu, L. Guo, H.-H. Chou, S. Aluru, D.A. Ashlock and P.S. Schanble, “Nearly identical paralogs (NIPs): implications for maize (Zea mays L.) genome evolution,” Genetics, Vol. 175, pp. 429-439, 2007. (featured in Science, Vol. 315, No. 5810, pp. 302 in Editor’s Choice: Highlights of the recent literature)

62. S. Aluru, N. Amato and D.A. Bader, “Editorial: Special section on high performance com- putational biology,” IEEE Transactions on Parallel and Distributed Systems, Vol. 17, No. 8, pp. 737-740, 2006.

63. A. Kalyanaraman and S. Aluru, “Efficient algorithms and software for detection of full-length LTR retrotransposons,” Journal of Bioinformatics and Computational Biology, Vol. 4, No. 2, pp. 197-216, 2006.

64. R. Sangireddy, N. Futamura, S. Aluru and A.K. Somani, “Scalable, memory efficient, high- speed lookup and update algorithms for IP routing,” ACM/IEEE Transactions on Network- ing, Vol. 13, No. 4, pp. 802-812, 2005.

65. Y. Fu, S.J. Emrich, L. Guo, T.J. Wen, D. Ashlock, S. Aluru, and P.S. Schnable, “Quality assessment of maize assembled genomic islands (MAGIs) and large-scale experimental verifi- cation of predicted novel genes,” Proceedings of the National Academy of Sciences USA, Vol. 102, No. 34, pp. 12282-12287, 2005.

66. B. Hariharan and S. Aluru, “Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods,” Parallel Computing, Vol. 31, pp. 311-331, 2005.

67. P. Ko and S. Aluru, “Space efficient linear time construction of suffix arrays,” Journal of Discrete Algorithms, Vol. 3, No. 2-4, pp. 143-156, 2005.

68. S. Seal, S. Komarina and S. Aluru, “An optimal hierarchical clustering algorithm for gene expression data,” Information Processing Letters, Vol. 93, No. 3, pp. 143-147, 2005.

69. F. Sevilgen, S. Aluru and N. Futamura, “Parallel algorithms for tree accumulations,” Journal of Parallel and Distributed Computing, Vol. 65, No.1, pp. 85-93, 2005.

70. S. Rajko and S. Aluru, “Space and time optimal parallel sequence alignments,” IEEE Trans- actions on Parallel and Distributed Systems, Vol. 15, No. 12, pp. 1070-1081, 2004. 71. M. Mitreva, A.A. Elling, M. Dante, A.P. Kloek, A. Kalyanaraman, S. Aluru, S.W. Clifton, D.McK. Bird, T.J. Baum and J.P. McCarter, “A survey of SL1-spliced transcripts from the root-lesion nematode Pratylenchus penetrans,” Molecular Genetics and Genomics, Vol. 272, pp. 138-148, 2004.

72. D.A. Bader and S. Aluru, “Special issue: High performance computational biology,” Concur- rency and Computation: Practice and Experience, Vol. 16, No. 9, pp. 817-821, 2004.

73. S.J. Emrich, S. Aluru, Y. Fu, T.J. Wen, M. Narayanan, L. Guo, D.A. Ashlock, and P.S. Schnable, “A Strategy for Assembling the Maize (Zea mays L.) Genome,” Bioinformatics, Vol. 20, No. 2, pp. 140-147, 2004.

74. N. Futamura, S. Aluru and X. Huang, “Parallel syntenic alignments,” Parallel Processing Letters, Vol. 13, No. 4, pp. 689-703, 2003.

75. A. Kalyanaraman, S. Aluru, V. Brendel and S. Kothari, “Space and time efficient parallel algorithms and software for EST clustering,” IEEE Transactions on Parallel and Distributed Systems, Vol. 14, No. 12, pp. 1209-1221, 2003.

76. X. Huang, J. Wang, S. Aluru, S.P. Yang and L. Hillier, “PCAP: A whole-genome assembly program,” Genome Research, Vol. 13, pp. 2164-2170, 2003.

77. S. Aluru and D.A. Bader, “Guest editors’ introduction: Special issue on high performance computational biology,” Journal of Parallel and Distributed Computing, Vol. 63, No. 7-8, pp. 671-673, 2003.

78. A. Kalyanaraman, S. Aluru, S. Kothari, and V. Brendel, “Efficient clustering of large EST data sets on parallel computers,” Nucleic Acids Research, Vol. 31, No. 11, pp. 2963-2974, 2003.

79. S. Aluru, N. Futamura and K. Mehrotra, “Parallel biological sequence comparison using prefix computations,” Journal of Parallel and Distributed Computing, Vol. 63, No. 3, pp. 264-272, 2003.

80. N. Futamura, S. Aluru, D. Ranjan and B. Hariharan, “Efficient parallel algorithms for solvent accessible surface area of proteins,” IEEE Transactions on Parallel and Distributed Systems, Vol. 13, No. 6, pp. 544-555, 2002.

81. I. Al-furaih, S. Aluru, S. Goil and S. Ranka, “Parallel construction of multidimensional binary search trees,” IEEE Transactions on Parallel and Distributed Systems, Vol. 11, No. 2, pp. 136-148, 2000.

82. S. Aluru, G.M. Prabhu, J. Gustafson and F. Sevilgen, “Distribution-independent hierarchical algorithms for the N-body problem,” Journal of Supercomputing, Vol. 12, pp. 303-323, 1998.

83. S. Aluru, “Lagged Fibonacci random number generators for distributed memory parallel computers,”, Journal of Parallel and Distributed Computing, Vol. 45, No. 1, pp. 1-12, 1997. 84. I. Al-furaih, S. Aluru, S. Goil and S. Ranka, “Practical algorithms for selection on coarse- grained parallel computers,” IEEE Transactions on Parallel and Distributed Systems, Vol. 8, No. 8, pp. 313-324, 1997.

85. J. Gustafson and S. Aluru,“Massively parallel searching for better algorithms or how to do a cross product in five multiplications,” Journal of Scientific Programming, pp. 203-217, 1996.

86. S. Aluru, “Greengard’s N-body Algorithm is not Order N,” SIAM Journal on Scientific Computing, Vol. 17, No. 3, pp. 773-776, 1996.

87. S. Aluru, G.M. Prabhu and J. Gustafson, “A random number generator for parallel comput- ers,” Parallel Computing, Vol. 18, pp. 839-847, 1992.

Refereed Conference Publications 88. H. Shrivastava, X. Zhang, L. Song and S. Aluru, “An unrolled deep learning framework for single cell gene regulatory networks,” Proc. International Conference on Research in Computational Molecular Biology (RECOMB), 2021, in press.

89. A. Srivastava, S. Chockalingam and S. Aluru, “A parallel framework for constraint-based Bayesian network structure learning via Markov blanket discovery,” ACM/IEEE Supercom- puting Conference (SC), Article No. 7, pp. 1–15, 2020.

90. H. Shrivastava, X. Chen, B. Chen, G. Lan, S. Aluru, H. Liu and L. Song, “GLAD: Learn- ing sparse graph recovery,” Proc. 8th International conference on Learning Representations (ICLR), 2020.

91. P. Flick and S. Aluru, “Distributed enhanced suffix arrays: Efficient algorithms for construc- tion and querying,” ACM/IEEE Supercomputing Conference (SC), Article No. 71, pp. 1–17, doi.org/10.1145/3295500.3356211, 2019. (25% acceptance rate)

92. C. Jain, H. Zhang, A. Dilthey and S. Aluru, “Validating paired-end read alignments in se- quence graphs,” Proc. 19th International Workshop on Algorithms in Bioinformatics (WABI), pp. 17:1–17:13, 2019. (51% acceptance rate)

93. N. Jammula and S. Aluru, “ParRefCom: Parallel Reference-based Compression of Paired-end Genomics Read Datasets,” Proc. 10th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics, pp. 447–456, 2019.

94. J. Pannu, S. Hooshmand, S.P. Chockalingam, S.V. Thankachan and S. Aluru, “An alignment- free heuristic for fast sequence comparisons with applications to phylogeny reconstruction,” Proc. 15th International Symposium on Bioinformatics Research and Applications (ISBRA), 2019. (23% acceptance rate)

95. Vasimuddin Md, S. Misra, H. Li and S. Aluru, “Efficient architecture-aware acceleration of BWA-MEM for multicore systems,” Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 314–324, 2019. (28% acceptance rate) 96. C. Jain, S. Misra, H. Zhang, A. Dilthey and S. Aluru, “Accelerating sequence alignment to graphs,” Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 451–461, 2019. (28% acceptance rate)

97. C. Jain, H. Zhang, Y. Gao and S. Aluru, “On the complexity of sequence to graph align- ment,” Proc. International Conference on Research in Computational Molecular Biology (RECOMB), pp. 85–100, 2019. (21% acceptance rate)

98. H. Shrivastava, E. Bart, B. Price, H. Dai, B. Dai and S. Aluru, “Cooperative neural networks (CoNN): Exploiting prior independence structure for improved classification,” Advances in Neural Information Processing Systems (NeurIPS), pp. 4130–4140, 2018. (21% acceptance rate)

99. T.C. Pan, S. Misra and S. Aluru, “Optimizing high performance distributed memory parallel hash tables for DNA k-mer counting,” Proc. ACM/IEEE Supercomputing Conference (SC), 11 pages, 2018. (19% acceptance rate)

100. S. Misra, T. Pan, K. Mahadik, G. Powley, P. Vaidya, M. Vasimuddin and S. Aluru, “Per- formance extraction and suitability analysis of multi- and many-core architectures for next generation sequencing secondary analysis,” Proc. 27th International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 3:1–3:14, 2018. (29% acceptance rate)

101. C. Jain, S. Koren, A. Dilthey, A. Phillippy and S. Aluru, “A fast adaptive algorithm for com- puting whole-genome homology maps,” 17th European Conference on Computational Biology (ECCB), Bioinformatics, Vol. 34, i748-756, 2018. (17% acceptance rate)

102. Vasimuddin Md, S.P. Chockalingam and S. Aluru, “A parallel algorithm for Bayesian net- work inference using arithmetic circuits,” Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 34-43, 2018. (25% acceptance rate)

103. S.V. Thankachan, C. Aluru, S.P. Chockalingam and S. Aluru, “Algorithmic framework for ap- proximate matching under bounded edits with applications to sequence analysis,” Proc. Inter- national Conference on Research in Computational Molecular Biology (RECOMB), Springer Lecture Notes in Computer Science, pp. 211-224, 2018. (20% acceptance rate)

104. M. Vasimuddin and S. Aluru, “Parallel Exact Dynamic Bayesian Network Structure Learning with Application to Gene Networks,” Proc. 24th IEEE International Conference on High Performance Computing (HiPC), pp. 42-51, 2017. (23% acceptance rate; best paper finalist)

105. R. Corona, S. Sudarshan, S. Aluru and J. Guo, “Confidence assessment of protein-DNA complex models,” Proc. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 9-15, 2017. (19% acceptance rate)

106. R. Nihalani, S.P. Chockalingam, S. Zhu, V. Vazirani and S. Aluru, “Probabilistic estimation of overlap graphs for large sequence datasets,” Proc. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 247-252, 2017. (38% acceptance rate) 107. S.P. Chockalingam, M. Aluru, H. Guo, Y. Yin and S. Aluru, “Reverse engineering gene networks: A comparative study at genome-scale,” Proc. 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB), pp. 480- 490, 2017. (33% acceptance rate)

108. N. Jammula, S.P. Chockalingam and S. Aluru, “Distributed memory partitioning of high- throughput sequencing datasets for enabling parallel genomics analysis,” Proc. 8th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB), pp. 417-424, 2017. (54% acceptance rate)

109. P. Flick and S. Aluru, “Parallel construction of suffix trees and the all-nearest-smaller-values problem,” Proc. 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 12-21, 2017. (23% acceptance rate)

110. C. Jain, A. Dilthey, S. Koren, S. Aluru and A. Phillippy, “A fast approximate algorithm for mapping long reads to large reference databases,” Proc. International Conference on Re- search in Computational Molecular Biology (RECOMB), Springer Lecture Notes in Computer Science, pp. 66-81, 2017. (21% acceptance rate)

111. I. Roy, A. Srivastava, M. Grimm and S. Aluru, “Parallel interval stabbing on the Automata processor,” Proc. 6th Workshop on Irregular Applications: Architectures and Algorithms (IA3), held in conjunction with Supercomputing Conference (SC), pp. 10-17, 2016. (20% acceptance rate)

112. S. Chockalingam, S.V. Thankachan and S. Aluru, “A parallel algorithm for k-mismatch all- pair maximal common substrings,” Proc. ACM/IEEE Supercomputing Conference (SC), 11 pages, 2016. (18% acceptance rate)

113. Y. Liu, T. Pan and S. Aluru, “Parallel pairwise correlation computation On Intel Xeon Phi clusters,” Proc. 28th International Symposium on Computer Architecture and High Perfor- mance Computing (SBAC-PAD), pp. 141-149, 2016. (35% acceptance rate)

114. R. Nihalani and S. Aluru, “Effective utilization of paired reads to improve length and accuracy of contigs in genome assembly,” Proc. 7th ACM International Conference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB), 9 pages, 2016.

115. T. Pan, P. Flick, C. Jain, Y. Liu and S. Aluru, “A flexible parallel library for k-mer indexing of biological sequences on distributed memory systems,” Proc. 7th ACM International Con- ference on Bioinformatics, Computational Biology, and Health Informatics (ACM-BCB), 12 pages, 2016.

116. I. Roy, A. Srivastava and S. Aluru, “Programming techniques for the Automata Processor,” Proc. 45th International Conference on Parallel Processing (ICPP), pp. 512-521, 2016. (29% acceptance rate)

117. S.V. Thankachan, S. Chockalingam and S. Aluru, “An efficient algorithm for finding all pairs k-mismatch maximal common substrings,” Proc. 12th International Symposium on Bioinfor- matics Research and Applications (ISBRA), in Lecture Notes in Bioinformatics (LNBI) Vol. 9683, pp. 3014, 2016. (26% acceptance rate) 118. I. Roy, N. Jammula and S. Aluru, “Algorithmic techniques for solving graph problems on the automata processor,” Proc. 30th International Parallel and Distributed Processing Symposium (IPDPS), pp. 283-292, 2016. (23% acceptance rate)

119. I. Roy, A. Srivastava, M. Nourian, M. Becchi and S. Aluru, “High performance pattern matching using the automata processor,” Proc. 30th International Parallel and Distributed Processing Symposium (IPDPS), pp. 1123-1132, 2016. (23% acceptance rate)

120. V. Sachdeva, S. Aluru, D.A. Bader, “A memory and time scalable parallelization of the Rep- tile error-correction code,” Proc. 15th IEEE International Workshop on High Performance Computational Biology (HiCOMB), pp. 453-462, 2016.

121. S.P. Chockalingam, M. Aluru and S. Aluru, “Information theory based genome-scale gene networks construction using MapReduce,” Proc. 22nd International Conference on High Per- formance Computing (HiPC), pp. 464-473, 2015. (24% acceptance rate)

122. N. Jammula, S. Chockalingam and S. Aluru, “A parallel algorithm for read error correction of big genomic datasets,” Proc. 22nd International Conference on High Performance Computing (HiPC), pp. 446-455, 2015. (24% acceptance rate)

123. Y. Liu, M. Loewer, S. Aluru and B. Schmidt, “SNVSniffer: An integrated caller for germline and somatic SNVs based on Bayesian models,” Proc. IEEE International Conference on Bioinformatics and Biomedicine, pp. 83-90, 2015. (19% acceptance rate)

124. S.V. Thankachan, S.P. Chockalingam, Y. Liu. A. Krishnan and S. Aluru, “A greedy alignment- free distance estimator for phylogenetic inference,” Proc. 5th IEEE International Conference on Computational Advances in Bio and Medical Sciences, 10.1109/ICCABS.2015.7344711, 2015. (50% acceptance rate)

125. I. Rytsareva, D.S. Campo, Y. Zheng, S. Sims, C. Tetik, C. Jain, S. Chockalingam, S. Thankachan, A. Sue, S. Aluru and Y. Khudyakov, “Efficient detection of viral transmission with threshold-based methods,” Proc. 5th IEEE International Conference on Computational Advances in Bio and Medical Sciences, DOI: 10.1109/ICCABS.2015.7344723, 2015. (50% acceptance rate)

126. P. Flick and S. Aluru, “Parallel distributed memory construction of suffix and longest common prefix arrays,” Proc. ACM/IEEE Supercomputing Conference (SC), 10 pages, 2015. (22% acceptance rate; best student paper)

127. P. Flick, C. Jain, T. Pan and S. Aluru, “A parallel connectivity algorithm for de Bruijn graphs in metagenomic applications,” Proc. ACM/IEEE Supercomputing Conference (SC), 11 pages, 2015. (22% acceptance rate)

128. K. Pamnany, S. Misra, M. Vasimuddin, X. Liu, E. Chow and S. Aluru, “Dtree: Dynamic task scheduling at petascale,” Proc. International Supercomputing Conference, Springer Lecture Notes in Computer Science, Vol. 9137, pp. 122-138, 2015. (40% acceptance rate) 129. S. Aluru, A. Apostolico and S.V. Thankachan, “ Efficient alignment free sequence comparison with bounded mismatches,” Proc. International Conference on Research in Computational Molecular Biology (RECOMB), Springer Lecture Notes in Computer Science, Vol. 9029, pp. 1-12, 2015. (21% acceptance rate)

130. S. Misra, M. Vasimuddin, K.Pamnany, S.P. Chockalingam, Y. Dong, M. Xie, M. Aluru and S. Aluru, “Parallel Bayesian network structure learning for genome-scale gene networks,” Proc. ACM/IEEE Supercomputing Conference (SC), 11 pages, 2014. (21% acceptance rate; best paper finalist)

131. S. Pal and S. Aluru, “In search of perfect reads,” Proc. 4th IEEE International Conference on Computational Advances in Bio and Medical Sciences, doi: 10.1109/ICCABS.2014.6863919, 2014.

132. S. Misra, K. Pamnany and S. Aluru, “Parallel mutual information based construction of whole-genome networks on the Intel Xeon Phi Coprocessor,” Proc. 28th International Parallel and Distributed Processing Symposium, pp. 241-250, 2014. (21% acceptance rate)

133. I. Roy and S. Aluru, “Finding motifs in biological sequences using the Micron automata processor,” Proc. 28th International Parallel and Distributed Processing Symposium (IPDPS), pp. 415-424, 2014. (21% acceptance rate)

134. R. Nihalani, J. Zola and S. Aluru, “Resolving read assignment ambiguities in metagenomic clustering,” Proc. 5th International Conference on Bioinformatics and Computational Biology (BICoB), 2013.

135. O. Nikolova and S. Aluru, “Parallel Bayesian network structure learning with application to gene networks,” Proc. ACM/IEEE Supercomputing Conference (SC), 13 pages, 2012. (21% acceptance rate)

136. A. Shah, S. Chockalingam and S. Aluru, “A parallel algorithms for spectrum-based short read error correction,” Proc. 26th International Parallel and Distributed Processing Symposium (IPDPS), pp. 60-70, 2012. (21% acceptance rate)

137. X. Yang, F. Sikora, G. Blin, S. Hamel, R. Rizzi and S. Aluru, “An algorithmic view on multi- related-segments: A unifying model for approximate common interval,” Proc. 9th annual conference on Theory and Applications of Models of Computation (TAMC), Springer-Verlag Lecture Notes in Bioinformatics, Vol. 7287, pp. 319-329, 2012.

138. O. Nikolova and S. Aluru, “Parallel discovery of direct causal relations and Markov blan- kets with applications to gene networks,” Proc. 40th International Conference on Parallel Processing (ICPP), pp. 512-521, 2011. (22% acceptance rate)

139. X. Yang, J. Zola and S. Aluru, “Parallel metagenomic sequence clustering via sketching and quasi-clique enumeration on map-reduce clouds,” Proc. 25th International Parallel and Distributed Processing Symposium (IPDPS), pp. 1223-1233, 2011. (20% acceptance rate) 140. A. Varma, A. Ranade and S. Aluru, “An improved maximum likelihood formulation for accurate genome assembly,” Proc. 1st IEEE International Conference on Computational Advances in Bio and medical Sciences (ICCABS), pp. 165-170, 2011. (43% acceptance rate)

141. X. Yang, S. Aluru and K. Dorman, “Repeat-aware modeling and correction of short read errors,” Proc. 9th Asia Pacific Bioinformatics Conference (APBC), 2011. (30% acceptance rate)

142. A. Sarje and S. Aluru, “A MapReduce style framework for computations on trees with appli- cations to k-nearest neighbors and FMM simulations,” Proc. 39th International Conference on Parallel Processing (ICPP), pp. 343-352, 2010.

143. B. Jackson, M. Regennitter, X. Yang, P.S. Schnable and S. Aluru, “Parallel de novo assembly of large genomes from high-throughput short reads,” Proc. 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2010, 15 pages. (24% acceptance rate)

144. X. Yang and S. Aluru, “An improved model for gene cluster inference,” Proc. ISCA Inter- national Conference on Bioinformatics and Computational Biology (BiCoB), pp. 190-195, 2010.

145. O. Nikolova, J. Zola and S. Aluru, “A parallel algorithm for exact Bayesian structure infer- ence,” Proc. 16th International Conference on High Performance Computing (HiPC), pp. 342-349, 2009. (19% acceptance rate)

146. J. Zola, A. Sarje and S. Aluru, “Constructing gene regulatory networks on clusters of Cell processors,” Proc. 38th International Conference on Parallel Processing (ICPP), pp. 108-115, 2009. (30% acceptance rate)

147. M. Vikram, B. Shanker and S. Aluru, “Provably scalable parallel FMM algorithm for mul- tiscale electromagnetic simulations,” Proc. Antennas and Propagation Society International Symposium (APSURSI), 4 pages, 2009.

148. M. Vikram, A. Baczewski, B. Shanker and S. Aluru, “Parallel accelerated Cartesian expan- sions for particle dynamics simulations,” Proc. IEEE International Parallel and Distributed Processing Symposium (IPDPS), 11 pages, 2009. (23% acceptance rate)

149. B.G. Jackson, P.S. Schanble and S. Aluru, “Assembly of large genomes from paired short reads,” Proc. 1st ISCA International Conference on Bioinformatics and Computational Biol- ogy (BiCOB), Springer-Verlag Lecture Notes in Bioinformatics, Vol. 5462, pp. 30-43, 2009. (invited paper)

150. B.G. Jackson, P.S. Schnable and S. Aluru, “Parallel short sequence assembly of transcrip- tomes,” Proc. 7th Asia Pacific Bioinformatics Conference (APBC), pp. 141-152, 2009. (37% acceptance rate)

151. J. Zola, M. Aluru and S. Aluru, “Parallel information theory based construction of gene regulatory networks,” Proc. 15th International Conference on High Performance Computing (HiPC), Springer-Verlag Lecture Notes in Computer Science, Vol. 5374, pp. 336-349, 2008. (15% acceptance rate) 152. A. Janowczyk, S. Chandran and S. Aluru, “Fast, processor-cardinality agnostic PRNG with a tracking application,” 6th Proc. Indian Conference on Computer Graphics, Vision and Image Processing, pp. 171-178, 2008.

153. S. Seal, M. Moody, A. Ceguerra, S. Ringer, K. Rajan and S. Aluru, “Tracking nanostructural evolution in alloys: Large-scale analysis of atom probe tomography data on Blue Gene/L,” Proc. 37th International Conference on Parallel Processing (ICPP), pp. 338-345, 2008. (31% acceptance rate)

154. B.G. Jackson and S. Aluru, “Parallel construction of bidirected string graphs for genome assembly,” Proc. 37th International Conference on Parallel Processing (ICPP), pp. 346-353, 2008. (31% acceptance rate)

155. A. Sarje and S. Aluru, “Parallel biological sequence alignments on the Cell broadband engine,” Proc. 22ndIEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 1-11, 2008. (26% acceptance rate)

156. M. Ott, J. Zola, S. Aluru and A. Stamatakis, “Large-scale maximum likelihood-based phy- logenetic analysis on the IBM Blue Gene/L,” Proc. ACM/IEEE Supercomputing Conference (SC), 2007. (20% acceptance rate; best paper finalist).

157. P. Ko and S. Aluru, “Optimal self-adjusting suffix tree layout for dynamic string data in secondary storage,” Proc. 14th Symposium on String Processing and Information Retrieval (SPIRE), Springer Verlag Lecture Notes in Computer Science, Vol. 4726, pp. 184-194, 2007. (35% acceptance rate)

158. J. Zola, X. Yang, A. Rospondek and S. Aluru, “Parallel-TCoffee: A parallel multiple sequence aligner,” Proc. ISCA Parallel and Distributed Computing Systems (PDCS), pp. 248-253, 2007.

159. S. Emrich, A. Kalyanaraman and S. Aluru, “Massively parallel expressed sequence tag clus- tering ,” Proc. ISCA Parallel and Distributed Computing Systems (PDCS), pp. 254-261, 2007.

160. S. Seal and S. Aluru, “Communication-aware parallel domain decomposition using space filling curves,” Proc. ISCA 19th International Conference on Parallel and Distributed Computing Systems (PDCS), pp. 159-164, 2006. (45% acceptance rate)

161. A. Kalyanaraman, S. Aluru and P.S. Schnable, “Turning repeats to advantage: scaffolding genomic contigs using LTR retrotransposons,” Proc. Life Sciences Society Computational Systems Bioinformatics (CSB) Conference, pp. 167-178, 2006. (19% acceptance rate)

162. S. Tirthapura, S. Seal and S. Aluru, “A formal analysis of space filling curves for parallel domain decomposition,” Proc. International Conference on Parallel Processing (ICPP), pp. 505-512, 2006. (32% acceptance rate)

163. P. Ko and S. Aluru, “Obtaining provably good performance from suffix trees in secondary storage,” Proc. Combinatorial Pattern Matching (CPM) Conference, Springer-Verlag Lecture Notes in Computer Science, Vol. 4009, pp. 72-83, 2006. (38% acceptance rate) 164. A. Kalyanaraman, S.J. Emrich, P.S. Schnable and S. Aluru, “Assembling Genomes on Large- Scale Parallel Computers”, Proc. 20thIEEE International Parallel and Distributed Processing Symposium (IPDPS), 2006. (23% acceptance rate; best paper award)

165. Z. Shen, S. Aluru and S. Tirthapura, “Indexing for subscription covering in publish-subscribe systems,” Proc. ISCA 18th International Conference on Parallel and Distributed Computing Systems (PDCS), pp. 328-333, 2005.

166. B. Jackson, S. Aluru and P.S. Schnable, “Consensus Genetic Maps: A Graph Theoretic Approach,” Proc. IEEE Computational Systems Bioinformatics (CSB) Conference, pp. 35- 43, 2005. (12% acceptance rate)

167. A. Kalyanaraman and S. Aluru, “Efficient Algorithms and Software for Detection of Full- length LTR Retrotransposons,” Proc. IEEE Computational Systems Bioinformatics (CSB) Conference, pp. 56-64, 2005. (12% acceptance rate; best paper award)

168. S. Aluru and F.E. Sevilgen, “Efficient Methods for Database Storage and Retrieval Using Space-filling Curves,” Proc. 19th International Symposium on Computer and Information Sciences, Spring Verlag Lecture Notes in Computer Science, Vol. 3280, pp. 503-512, 2004. (31% acceptance rate)

169. P. Ko, M. Narayanan, A. Kalyanaraman, and S. Aluru, “Space-conserving optimal DNA- protein alignment,” Proc. IEEE Computational Systems Bioinformatics Conference, pp. 80- 88, 2004. (21% acceptance rate)

170. M.S. Wu, R.A. Kendall, and S. Aluru, “Exploring collective communications on a cluster of SMPs,” Proc. IEEE International Conference on High Performance Computing – Asia, pp. 114-117, 2004. (62% acceptance rate)

171. M.S. Wu, R.A. Kendall, and S. Aluru, “A tunable collective communication framework on cluster of SMPs,” Proc. IASTED International Conference on Parallel and Distributed Com- puting and Networks, 2004. (58% acceptance rate)

172. S.J. Emrich, S. Aluru, Y. Fu, T.J. Wen, M. Narayanan, L. Guo, D.A. Ashlock, and P.S. Schn- able, “A strategy for assembling the maize genome,” Plant and Animal Genomes Conference Bioinformatics Workshop, 2004. (28% acceptance rate)

173. N. Futamura, R. Sangireddy, S. Aluru and A. Somani, “Scalable, memory efficient, high- speed lookup and update algorithms for IP routing,” Proc. 12th International Conference on Computer Communications and Networks, pp. 257-263, 2003. (31% acceptance rate)

174. S. Rajko and S. Aluru, “Space and time optimal parallel sequence alignment,” Proc. Inter- national Conference on Parallel Processing, pp. 39-47, 2003. (36% acceptance rate)

175. S. Aluru, “Computational biology on parallel computers,” European Controls Conference, 2003.

176. P. Ko and S. Aluru, “Space-efficient linear time construction of suffix arrays,” Proc. Combi- natorial Pattern Matching, pp. 200-210, 2003. (49% acceptance rate) 177. N. Futamura, S. Aluru and X. Huang, “Parallel Syntenic Alignments,” Proc. 9th IEEE International Conference on High Performance Computing (HiPC), pp. 420-430, 2002. (39% acceptance rate)

178. B. Hariharan, S. Aluru and B. Shanker, “A scalable parallel fast multipole method for analysis of scattering from perfect electrically conducting surfaces,” Proc. IEEE/ACM Supercomput- ing Conference, www.sc-2002.org, 2002. (29% acceptance rate)

179. M.S. Wu, S. Aluru and R.A. Kendall, “Mixed mode matrix multiplication,” Proc. IEEE International Conference on Cluster Computing, pp. 195-203, 2002.

180. A. Kalyanaraman, S. Aluru and S. Kothari, “Space and time efficient parallel algorithms and software for EST clustering,” Proc. 31st International Conference on Parallel Processing, pp. 331-339, 2002. (36% acceptance rate)

181. A. Kalyanaraman, S. Aluru and S. Kothari, “Parallel EST clustering,” First International Workshop on High Performance Computational Biology (HiCOMB), In Proc. IEEE Inter- national Parallel and Distributed Processing Symposium (IPDPS), pp. 185-192, 2002. (60% acceptance rate)

182. B. Hariharan and S. Aluru, “Efficient parallel algorithms and software for compressed octrees with application to hierarchical methods,” Proc. 8th IEEE International Conference on High Performance Computing (HiPC), Spring Verlag Lecture Notes in Computer Science, pp. 125- 136, 2001. (27% acceptance rate)

183. N. Futamura, S. Aluru and S. Kurtz, “Parallel suffix sorting,” Proc. 9th International Con- ference on Advanced Computing and Communications, pp. 76-81, 2001. (33% acceptance rate for regular papers)

184. N. Futamura, S. Aluru, D. Ranjan and K. Mehrotra, “Efficient algorithms for protein sol- vent accessible surface area,” Proc. IEEE International Conference on High Performance Computing – Asia, 2000. (30% acceptance rate for regular papers)

185. F. Sevilgen, S. Aluru and N. Futamura, “A provably optimal, distribution-independent, paral- lel fast multipole method,” Proc. 14thIEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 77-84, 2000. (35% acceptance rate)

186. S. Aluru, N. Futamura and D. Ranjan, “A parallel Monte Carlo algorithm for protein acces- sible surface area computation,” Proc. 6th IEEE International Conference on High Perfor- mance Computing (HiPC), Springer Verlag Lecture Notes in Computer Science, vol. 1745, pp. 339-348, 1999. (18% acceptance rate for regular papers)

187. S. Aluru and F. Sevilgen, “Dynamic compressed hyperoctrees with application to the N- body problem,” Proc. 19th International Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS), Springer Verlag Lecture Notes in Computer Science, vol. 1738, pp. 21-33, 1999. (36% acceptance rate) 188. F. Sevilgen and S. Aluru, “A unifying data structure for hierarchical methods,” Proc. IEEE/ACM Supercomputing Conference (SC), http://www.supercomp.org/sc99, 1999. (29% acceptance rate)

189. F. Sevilgen, S. Aluru and N. Futamura, “Distributed memory tree accumulations,” Proc. Parallel and Distributed Computing Systems, pp. 389-395, 1999. (46% acceptance rate)

190. K.N.B. Murthy and S. Aluru, “Bidirectional division-free Gaussian elimination algorithm for solving linear equations,” Proc. Parallel and Distributed Computing Systems, pp. 520-527, 1999. (46% acceptance rate)

191. S. Aluru and S. Maneewongvatana, “A practical parallel algorithm for line segment visibility,” Proc. Parallel and Distributed Processing Technology and Applications, vol. III, pp. 1676- 1682, 1999. (28% acceptance rate)

192. S. Aluru, N. Futamura and K. Mehrotra, “Biological sequence comparison using prefix compu- tations,” Proc. 13th IEEE International Parallel Processing Symposium (IPPS), pp. 653-659, 1999. (45% acceptance rate)

193. S. Aluru and F. Sevilgen,“Parallel domain decomposition and load balancing using space- filling curves,” Proc. 4th IEEE International Conference on High Performance Computing (HiPC), pp. 230-235, 1997. (57% acceptance rate)

194. S. Goil, S. Aluru and S. Ranka, “Concatenated parallelism: a technique for efficient parallel divide and conquer,” 8th IEEE Symposium on Parallel and Distributed Processing (SPDP), pp. 488-495, 1996. (39% acceptance rate)

195. S. Aluru and S. Maneewongvatana, “Practical parallel algorithms for constructing line ar- rangements,” Proc. International Conference on Parallel and Distributed Processing Tech- niques and Applications, pp. 125-135, 1996.

196. S. Aluru, “Parallel additive lagged Fibonacci random number generators,” Proc. 10th ACM International Conference on Supercomputing (ICS), pp. 102-108, 1996. (43% acceptance rate)

197. I. Al-furaih, S. Aluru, S. Goil and S. Ranka, “Parallel construction of multidimensional binary search trees,”Proc. 10th ACM International Conference on Supercomputing (ICS), pp. 205- 212, 1996. (43% acceptance rate)

198. I. Al-furaih, S. Aluru, S. Goil and S. Ranka, “Practical algorithms for selection on coarse- grained parallel computers,” Proc. 10th IEEE International Parallel Processing Symposium (IPPS), pp. 309-313, 1996. (36% acceptance rate)

199. I. Al-furaih, S. Aluru, S. Goil and S. Ranka, “Parallel construction of k-d tree and related problems,” Proc. 2nd Workshop on Solving Irregular Problems on Distributed Memory Ma- chines, 1996. 200. S. Aluru, “Properties of binomial coefficients and implications to parallelizing lagged Fi- bonacci random number generators,” Proc. International Conference on Parallel Processing, pp. 25-28, 1995. (28% acceptance rate)

201. S. Aluru, G.M. Prabhu and J. Gustafson, “Truly distribution-independent algorithms for the N-body problem,” Proc. IEEE/ACM Supercomputing Conference, pp. 420-428, 1994.

202. S. Aluru and J. Gustafson, “Subtle issues of SIMD tree search,” Proc. Parallel Computing, in Parallel Computing: Trends and Applications, Elsevier Publishers, pp. 49-56, 1993.

203. S. Aluru and J. Gustafson, “A massively parallel optimizer for expression evaluation,” Proc. 7th ACM International Conference on Supercomputing, pp. 97-106, 1993.

204. S. Aluru and G.M. Prabhu, “Random number generation for parallel computers,” Proc. NATO Advanced Study Institute on Parallel Computing on Distributed Memory Multipro- cessors, pp. 315-320, 1991.

Software 1. FastANI (with C. Jain, L.M. Rodriguez-R, A.M. Phillippy, and K.T. Konstantinidis), Fast alignment-free software for computing Average Nucleotide Identity

2. PaSGAL (with C. Jain and S. Misra), Parallel Sequence to Graph Aligner

3. ParBLiSS (with P. Flick, C. Jain, and T. Pan), Parallel Bioinformatics Library for Short Sequences.

4. MashMap (with C. Jain, S. Koren, A. Dilthey and A. Phillippy), Fast approximate aligner for long DNA sequences.

5. PARABLE (with O. Nikolova), Parallel Bayesian Learning software for large-scale Bayesian networks.

6. TINGe (with J. Zola and M. Aluru), Tool for Inferring Network of Genes. Parallel software for inferring gene networks from multiple microarray experiments. Scales to whole genome networks and thousands of experiments.

7. GeNA (with J. Zola and M. Aluru), Gene Network Analyzer. Software for extracting context- specific biological subnetworks from whole genome networks, using seeds genes from partially characterized pathways.

8. ParaBayL (with O. Nikolova and J. Zola), Parallel software for exact Baeysian Network Structure Learning.

9. Parallel Reptile (with A. Shan and S. Chockalingam), Parallel software for error correction of next-gen DNA sequences, particularly for Illumina.

10. Reptile (with X. Yang and K. Dorman), Software for error correction of next-gen DNA se- quences, particularly for Illumina. 11. Redeem (with X. Yang and K. Dorman), Software for error correction of reads from repeat- rich genomes, targeted for Illumina.

12. PaCE, Parallel Clustering of Expressed sequenced tags (with A. Kalyanaraman). Bioinfor- matics software that clusters large collections of DNA sequences based on sequence similarity. Runs on multiprocessor systems and can handle large scale problems in clustering and assem- bly of DNA sequences.

13. LTR Par (with A. Kalyanaraman). Software for identifying long terminal repeat retrotrans- posons in genomic sequences. This software can be used to explore large-scale genomes for retrotransposons and has been successfully used for the Chimpanzee genome.

14. Software for generating consensus genetic maps from individual genetic maps of the same species. Produces output to enable visualization using graph drawing software.

15. Parallel-TCoffee (with J. Zola). Parallelization of the popular TCoffee multiple sequence alignment software. It can easily align thousands of sequences.

16. PanDA, Protein and DNA alignment software (with P. Ko, M. Narayanan, and A. Kalya- naraman). The software models the DNA-Protein alignment problem as a combinatorial optimization problem and computes an optimal solution. It can be used for gene annotation and finding the structure of genes.

VII. Invited and Keynote Presentations

1. Genome graphs: Models, Algorithms, and Applications, keynote at the 16th International Symposium on Bioinformatics Research and Applications (ISBRA), Moscow, Russia, Decem- ber 1, 2020. 2. Long read mapping at scale: Algorithms and applications, Minisymposium on Applied and Computational Discrete Algorithms, Second Joint SIAM/CAIMS Annual Meeting (AN20), July 6, 2020. 3. Panelist, Is AI transforming HPC or HPC transforming AI?, SIAM Conference on Parallel Processing for Scientific Computing, Seattle, Washington, February 13, 2020. 4. Genomes galore: Big data challenges in computational genomics and systems biology, Com- puter Science and Mathematics Division (CSMD) Recognition Lecture Series, Oak Ridge National Laboratory, September 26, 2019. 5. Genomes galore: Big data challenges in computational genomics, University of Brasilia, May 27, 2019. 6. Long read mapping at scale: Algorithms and applications, keynote at the International Work- shop on String Algorithms in Bioinformatics, University of Central Florida, Orando, October 26, 2018. 7. Long read mapping at scale: Algorithms and applications, keynote at the 8th IEEE Interna- tional Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Las Vegas, October 19, 2018. 8. Parallel machine learning approaches for reverse engineering genome-scale networks, invited talk at Symposium on Machine Learning in Science and Engineering, Carnegie Mellon Uni- versity, Pittsburgh, June 7, 2018. 9. High performance computing for biology and , invited talk at the CCC Workshop on Digital Computing Beyond Moore’s Law, San Francisco, May 3, 2018. 10. Automata processor and its applications in bioinformatics, keynote at the 1st International Workshop on Accelerator Architecture in Computational Biology and Bioinformatics (AACBB), held in conjunction with International Symposium on High Performance Computer Architec- ture (HPCA), Vienna, Austria, February 24, 2018. 11. Genomes galore: Big data challenges in the life sciences, Invited talk at the University of Illinois at Urbana-Champaign, Urbana, Illinois, February 12, 2018. 12. South Big Data Regional Innovation Hub, Keynote at Research Computing Day: Tackling Big Data Challenges, Old Dominion University, Norfolk, Virginia, September 27, 2017. 13. Genomes galore: Big data challenges in the life sciences, Keynote at the 23rd IEEE Interna- tional Conference on High Performance Computing (HiPC), Hyderabad, India, December 20, 2016. 14. Parallel Machine Learning Approaches for Reverse Engineering Genome-scale Networks, In- vited talk at SIAM Workshop on Combinatorial Scientific Computing (CSC), Albuquerque, October 10, 2016. 15. Automata processing: A new paradigm for computing?, Keynote at the Seventh International Workshop on Accelerating Analytics and Data Management Systems Using Modern Processor and Storage Architectures (ADSM), New Delhi, India, September 9, 2016. 16. Genomes galore: Parallel methods and software for tackling NGS big data challenges, Keynote at the Ninth International Workshop on Parallel Programming Models and Systems Software for High-End Computing (P2S2), Philadelphia, August 16, 2016. 17. NSF Big Data Regional Hubs and collaboration opportunities in data science, Invited talk at the Informatics Institute, University of Florida, Gainesville, April 15, 2016. 18. Approximate sequence matching: Algorithms and applications, Invited talk at the Depart- ment of Computer and Information Science and Engineering, University of Florida, Gainesville, April 15, 2016. 19. Automata processing: A new paradigm for computing?, Keynote at Supercomputing Frontiers Conference, Singapore, March 17, 2016. 20. Big data in life sciences and public health, Keynote talk at Workshop on Foundations of Big Data Computing, Bengaluru, India, December 16, 2015. 21. Genomes galore: Parallel methods and software for tackling NGS big data challenges, Keynote at the IEEE Workshop on High Performance Computing for Big Data Computational Biology (HPC-BCB), Washington D.C., November 12, 2015. 22. Parallel machine learning approaches for reverse engineering genome-scale networks, Keynote at the IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Washing- ton D.C., November 11, 2015. 23. Genomes Galore: Big Data Challenges in the Life Sciences, Invited talk at the School of Computer Science, Georgia Institute of Technology, October 9, 2015. 24. Parallel methods for Bayesian network structure learning, Emory University, August 28, 2015. 25. Genomes galore: Big data challenges in the life sciences, Distinguished lecture, University of Delaware, Newark, Delaware, April 30, 2015. 26. Parallel machine learning approaches for reverse engineering genome-scale networks, Distin- guished Lecture, Georgia State University, Atlanta, Georgia, April 23, 2015. 27. Parallel machine learning approaches for reverse engineering genome-scale networks, invited talk at the annual event of Algorithms and Randomness Center, Georgia Institute of Tech- nology, April 17, 2015. 28. Parallel machine learning approaches for reverse engineering genome-scale networks, Univer- sity of Rochester, Rochester, New York, September 15, 2014. 29. Parallel methods for Bayesian network structure learning, Emory University, Atlanta, Geor- gia, August 28, 2014. 30. Big data challenges in the biosciences, International Workshop on Big Data Analytics for Predictive Organization and Big Transformations, Stanford, California, May 27, 2014. 31. Next generation sequencing bioinformatics and high performance computing, Centers for Disease Control, Atlanta, Georgia, April 24, 2014. 32. Parallel methods for Bayesian network structure learning, Minisymposium on Scalable Algo- rithms for Computational Statistical Inference, SIAM Conference on Parallel Processing for Scientific Computing (PP), Portland, Oregon, February 19, 2014. 33. Large-scale metagenomic sequence clustering via maximal quasi-clique enumeration, Min- isymposium on Graph Analysis for Scientific Discovery, SIAM Conference on Parallel Pro- cessing for Scientific Computing (PP), Portland, Oregon, February 20, 2014. 34. Reverse engineering and analysis of genome-scale gene networks, Plenary talk at the 9th In- ternational Conference on Genome Biology and Bioinformatics, Atlanta, Georgia, November 7, 2013. 35. Large-scale metagenomic clustering via quasi-clique enumeration and read assignment am- biguity resolution, Invited talk at the First International Workshop on Big Data in Life Sciences, New Orleans, Louisiana, June 13, 2013. 36. Parallel methods for Bayesian network structure learning, Keynote at the Workshop on Paral- lel and Distributed Computing for Machine Learning and Inference Problems (ParLearning), Boston, Massachusetts, May 24, 2013. 37. Genomes galore: Bigdata challenges in the biosciences, Keynote at the 41st International Conference on Parallel Processing, Pittsburgh, Pennsylvania, September 13, 2012. 38. Bigdata Analytics for Next Generation Sequencing, Life Technologies Corporation, Foster City, California, August 20, 2012. 39. BLiSS: A Library Proposal for NGS Analytics, DOE ICiS Summer Workshop on Developing a Next Generation Systems Analysis Library, Park City, Utah, July 28 – Aug 4, 2012. 40. The new era in genomics: Opportunities and challenges for high performance computing, Computer and Information Science and Engineering Department, University of Florida, April 13, 2012. 41. Improving NGS data quality through error correction algorithms, Genome Sequencing and Annotation Symposium, University of Florida, April 12, 2012. 42. The new era in genomics: Opportunities and challenges for high performance computing, Computational Science and Engineering Distinguished Lecture, Georgia Institute of Technol- ogy, Atlanta, Georgia, March 9, 2012. 43. Reverse engineering whole-genome networks, 2nd IEEE International Conference on Compu- tational Advances in Bio and Medical Sciences (ICCABS), Las Vegas, February 24, 2012. 44. Taxonomic clustering of large-scale metagenomic data, 2nd Workshop on Computational Ad- vances for Next Generation Sequencing (CANGS), Las Vegas, February 24, 2012. 45. Assembling complex genomes: Experiences from maize B73 sequencing, Monsanto Corpora- tion, Bengaluru, India, December 20, 2011. 46. Parallel graph algorithmic challenges in high-throughput sequence assembly, Workshop on parallel algorithms and software for analysis of massive graphs (ParGraph), held in conjunc- tion with 18th International Conference on High Performance Computing, Bengaluru, India, December 17, 2011. 47. Challenges in experimental design and algorithm development for high-throughput data anal- ysis, 1st Global Cancer Genomics Consortium Symposium, Tata Memorial Cancer Center, Mumbai, India, November 11, 2011. 48. Reverse engineering whole-genome networks using high performance parallel computers, 2nd Conference on Systems Biology and New Sequencing Techniques, International Center for Theoretical , Trieste, Italy, November 3, 2011. 49. The new era in genomics: Opportunities and challenges for next generation systems, IBM Centennial Colloquium and IBM Collaborative Academia Research Exchange (I-CARE), New Delhi, India, October 13, 2011. 50. Error correction algorithms for next-generation sequencing, NSF Workshop on Next-generation Sequencing Technology and Algorithms for Primary Data Analysis, Institute for Physical and Applied Mathematics, University of California Los Angeles, Los Angeles, California, October 4, 2011. 51. Bioinformatics for next-gen sequencing: Error correction and genome assembly, Institute for Genomics and Integrative Biology, New Delhi, India, September 6, 2011. 52. Reverse engineering whole-genome networks, 26th International Supercomputing Conference (ISC), Hamburg, Germany, June 23, 2011. 53. Flexible read decomposition for improved short read error correction, DTRA/NSF workshop on Algorithms for Threat Detection, Boston, Massachusetts, June 7, 2011. 54. Next-gen sequencing: Data intensive computing in the biosciences, Dept. of Computer Sci- ence, Rice University, May 11, 2011. 55. Next-generation sequencing, Minisymposium on Combinatorial Methods in Applications of CSE, SIAM Conference on Computational Science and Engineering (CSE), Reno, Nevada, February 28, 2011. 56. Next-gen sequencing: Data intensive computing in the biosciences, International Center for Theoretical Science, Bangaluru, India, February 9, 2011. 57. Flexible read decomposition for short read error correction, 1st IEEE International Confer- ence on Computational Advances in Bio and Medical Sciences (ICCABS), Orlando, Florida, February 4, 2011. 58. Why should computer science students work in life sciences?, Student Research Symposium, 17th International Conference on High Performance Computing (HiPC), Goa, India, Decem- ber 19, 2010. 59. The new era in genomics: Opportunities and challenges for high performance computing, IBM T.J. Watson Research Center, Yorktown Heights, New York, May 19, 2010. 60. The new era in genomics: Opportunities and challenges for high performance computing, keynote talk at the 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Atlanta, Georgia, April 21, 2010. 61. De novo Assembly of Large Genomes from Paired Short Reads, Dept. of Computer Science, Brigham Young University, Provo, Utah, March 4, 2010. 62. The new era in computational genomics: Opportunities and challenges for high performance computing, distinguished lecture, Dept. of Computer Science, Texas A&M University, College Station, Texas, February 16, 2010. 63. Parallel assembly of large genomes from paired short reads, invited talk at High Performance Computing workshop, Dept. of Energy (DoE) Joint Genome Institute (JGI), January 25, 2010. 64. Parallel assembly of large genomes from paired short reads, invited talk at IEEE Interna- tional Conference on Bioinformatics and Biomedicine (BIBM), Washington D.C., November 2, 2009. 65. Parallel assembly of large genomes from paired short reads, School of Technology and Com- puter Science, Tata Institute of Fundamental Research, October 26, 2009. 66. Parallel assembly of genomes from paired short reads, keynote talk at the 8th International Conference on Parallel Processing and Applied Mathematics, Wroclaw, Poland, September 16, 2009. 67. Assembling large genomes from paired short reads, invited talk at the 1st International Con- ference on Bioinformatics and Computational Biology, New Orleans, Louisiana, April 9, 2009. 68. Systems biology on a Cell cluster: Constructing gene regulatory networks, invited talk at the Workshop on Supercomputing, Multicore Architectures and Biomedical Informatics at 21st ACM/IEEE Supercomputing Conference (SC), November 17, 2008. 69. Consensus genetic maps: A graph theoretic approach, Chalmers University of Technology, G¨oteborg, Sweden, October 16, 2008. 70. Systems biology on a cell cluster: Constructing gene regulatory networks, Chalmers University of Technology, G¨oteborg, Sweden, October 13, 2008. 71. The combinatorics of sequencing the corn genome, keynote talk at the 1st International Conference on Contemporary Computing (ICCC), New Delhi, India, August 9, 2008. 72. The combinatorics of sequencing the corn genome, keynote talk at the 13th Annual Inter- national Computing and Combinatorics Conference (COCOON), Alberta, Canada, July 16, 2007. 73. Parallel methods in computational genomics, First Bertinoro Systems Biology Workshop, Bertinoro, Italy, May 2007. 74. Assembling the maize genome on large-scale parallel computers, Diamond Jubilee Conference of the Indian Society of Agricultural Statistics, New Delhi, India, Dec 28, 2006. 75. Parallel methods in computational biology, Supercomputing Education and Research Center, Indian Institute of Science (IISc), Bangalore, India, Dec 15, 2006. 76. Bioinformatics: Past accomplishments and future challenges, Kanwal Rekhi School of Infor- mation Technology, Indian Institute of Technology (IIT) Mumbai, September 28, 2006. 77. Parallel methods for maize genome assembly, College of Computing, Georgia Institute of Technology, Atlanta, Georgia, March 29, 2006. 78. Parallel methods for maize genome assembly, Workshop on Algorithms, Indian Statistical Institute, Kolkata, India, December 16, 2005. 79. New techniques for parallel solution of irregular problems, Workshop on Algorithms, Indian Statistical Institute, Kolkata, India, December 16, 2005. 80. Maize genome assembly on the IBM BlueGene/L Supercomputer, IBM T.J. Watson Research Center, New York, June 22, 2005. 81. Parallel methods for maize genome assembly, University of Memphis, Memphis, Tennessee, April 26, 2005. 82. Parallel methods for maize genome assembly, Advanced Techology Center, Tata Consultancy Services, Hyderabad, India, January 6, 2005. 83. Parallel methods for maize genome assembly, Kanwal Rekhi School of Information Technology, IIT Mumbai, December 27, 2004. 84. Maize genome assembly on the IBM BlueGene/L, IBM Global Services, Bangalore, India, December 23, 2004. 85. Parallel Methods in Computational Genomics, Coordinated Laboratory for Computational Genomics, University of Iowa, Iowa City, November 29, 2004. 86. High-performance computing for Maize genome assembly, SIAM minisymposium on Parallel Computational Biology, San Francisco, February, 2004. 87. Is the problem irregular? Or, do we need better algorithms?, SIAM minisymposium on Combinatorial Algorithms and Parallel Computing, San Francisco, February, 2004. 88. Parallel EST clustering, Department of Computer Science and Engineering, Indian Institute of Technology, Kanpur, India, January 2, 2003. 89. Parallel EST clustering, IBM Global Services, Bangalore, India, December 19, 2002. 90. Parallel EST clustering, Supercomputing Education and Research Center, Indian Institutes of Science, Bangalore, India, December 16, 2002. 91. Linear time suffix sorting, Department of Electrical Engineering and Computer Science, Syra- cuse University, Syracuse, New York, November 4, 2002. 92. Parallel EST clustering, National Center for Biotechnology Information, National Institutes of Health, Bethesda, Maryland, June 13, 2002. 93. Parallel EST clustering, Second Joint ISU/U of I symposium on Bioinformatics and Compu- tational Biology, Ames, Iowa, April 6, 2002. 94. New techniques for parallel solution of irregular problems, Fourth International Conference on Information Technology, Gopalapur-on-Sea, Orissa, India, December 22, 2001. 95. Parallel algorithms and software systems for computational biology, IBM T.J. Watson Center, June 5, 2001. 96. Opportunities and challenges in computational biology, 6hr tutorial presentation, Syracuse University, Syracuse, New York, March 23-24, 2001. 97. A distribution-independent framework for scientific applications, Department of Computer Science, University of Iowa, Iowa City, Iowa, March 6, 2001. 98. A distribution-independent framework for scientific applications, Department of Electrical and Computer Engineering, Northwestern University, Evanston, Illinois, April 24, 2000. 99. Parallel processing for computational biology, Iowa Computational Biology Laboratory, Ames, Iowa, November 1, 1999. 100. Cluster computing, Department of Physics, New Mexico State University, Las Cruces, New Mexico, May 7, 1999. 101. A distribution independent framework for scientific applications, Department of Computer Science and Engineering, Indian Institute of Technology, Chennai, India, December 15, 1998. 102. Dynamic compressed hyperoctrees with applications, Department of Computer Science, Iowa State University, Ames, Iowa, September 10, 1998. 103. Dynamic compressed hyperoctrees with applications, Department of Computer Science and Engineering, University of Florida, Gainsville, Florida, June 8, 1998. 104. Panelist, “Parallel systems across industry, academia, and the national laboratories”, Rio Grande ACM meeting, El Paso, Texas, March 27, 1998. 105. Distribution-independent organization of spatial data with applications to parallel and scien- tific computing, Sandia National Laboratories, Albuquerque, New Mexico, May 21, 1997. 106. Space-filling curves for arbitrary and dynamic points with applications, Los Alamos National Laboratories, Los Alamos, New Mexico, April 21, 1997. 107. Distribution-independent organization of spatial data with applications to parallel and sci- entific computing, Los Alamos National Laboratories, Los Alamos, New Mexico, April 21, 1997. 108. Algorithm design for parallel computers: some case studies and future directions, Summer Institute on Parallel Discrete Algorithms (PARADISE), Halifax, Canada, July 1996. 109. Distribution-Independent Hierarchical N-body Methods, Michigan Technological University, Houghton, Michigan, April 1996. 110. Distribution-Independent Algorithms for the N-body Problem, Boise State University, Boise, Idaho, April 1996.

VIII. Extension and Outreach Activities

1. Distinguished Visitor, Chalmers University of Technology, G¨oteborg, Sweden, Oct 13-17, 2008.

2. Instructor, NSF-NIH Bioinformatics and Computational Biology Summer Institute, Iowa State University, 2003 – 2005.

3. Hosted sabbatical visit of Prof. Desh Ranjan, New Mexico State University, Aug 2000 – May 2001.

4. Visit to Dept. of Computer and Information Science and Engg., University of Florida, June 2-9, 1998.

IX. Graduate Students and Research Staff

Current Group 1. Tony Pan (Senior Research Scientist)

2. Sriram Chockalingam (Research Scientist)

3. Shruti Shivakumar (Ph.D.; expected Summer 2022)

4. Ankit Srivastava (Ph.D.; expected Summer 2022)

5. Harsh Shrivastava (Ph.D.; expected Spring 2021)

6. Neda Tavakoli (Ph.D.; expected Summer 2022)

7. Haowen Zhang (Ph.D.; expected Summer 2022)

Research Scientists/Postdocs Supervised 1. Indranil Roy, November 2016 - June 2017.

2. Yongchao Liu, Feb 2015 - June 2017.

3. Sharma Thankachan, June 2014 - Dec 2016.

4. Soumitra Pal, Jan 2013 - Sep 2014. 5. Olga Nikolova, Aug 2012 - Dec 2012.

6. Jaroslaw Zola, June 2006 - July 2010.

7. Takako Soma, January 2003 - July 2004.

8. K.N.B. Murthy, July 1998 - January 1999.

Ph.D. Degrees Supervised (Initial employment following graduation in parenthesis)

1. Patrick Flick (2019S), “Parallel and scalable combinatorial string algorithms on distributed memory systems” (ACM SIGHPC Dissertation Award; Google, Inc.)

2. Chirag Jain (2019S), “Long read mapping at scale: Algorithms and applications” (College of Computing Dissertation Award; post-doc at National Institutes of Health)

3. Rahul Nihalani (2019S), “Techniques to improve genome assembly quality”

4. Nagakishore Jammula (2018F), “Parallel algorithms for enabling fast and scalable analysis of high-throughput sequencing datasets”

5. Tony Pan (2018S), “Distributed memory building blocks for massive biological sequence anal- ysis” (Senior Research Scientist at Georgia Institute of Technology)

6. Sriram Chockalingam (2017F), “Parallel algorithmic techniques for large-scale problems in computational biology” (Research Scientist at Georgia Institute of Technology)

7. Indranil Roy (2015Su), “Algorithmic techniques for the Micron Automata Processor” (Col- lege of Computing Dissertation Award; Micron Technology, Inc.)

8. Olga Nikolova (2012Su), “Parallel algorithms for Bayesian networks structure learning with applications in systems biology” (Postdoctoral Research Associate at Sage Networks).

9. Andre Wehe (co-major professor; 2012S), “Large-scale tree parsimony” (Senior software en- gineer at Atmospheric and Environmental Research).

10. Xiao Yang (2011Su), “Error correction and clustering algorithms for next generation sequenc- ing” (Cornette Fellowship; ISU Research Excellence Award; Computational Biologist at Broad Institute).

11. Abhinav Sarje (2010Su), “Applications on emerging paradigms in parallel computing” (IEEE TCPP Ph.D. Forum award; ISU Research Excellence Award; Postdoctoral Research Associate at Lawrence Berkeley National Laboratories).

12. Benjamin Jackson (2009Su), “Parallel methods for short read assembly” (USDA MGET Fellowship; GAANN Fellowship; AOL Corporation). 13. Scott Emrich (co-major professor; 2007Su), “Assembly and analysis of complex plant genomes” (ISU Research Excellence Award; NSF IGERT Fellowship; Faculty member at University of Notre Dame).

14. Pang Ko (2007Su), “Algorithms for large-scale analysis of biological sequence data” (ISU Research Excellence Award; Postdoctoral Research Associate at University of California, San Diego).

15. Sudip seal (2007Su), “Parallel methods for large-scale problems in computational electromag- netics and materials science” (Staff Scientist at Oak Ridge National Laboratories).

16. Anantharaman Kalyanaraman (2006Su), “Large-scale methods in computational genomics” (ISU Research Excellence Award; IBM Ph.D. Fellowship; Faculty member at Washington State University)

17. Wei Zhu (co-major professor; 2003S), “Spliced alignment and its application in Arabidopsis thaliana” (The Institute for Genome Research (TIGR)).

18. Natsuhiko Futamura (2002F), “Algorithms for large scale problems in computational biology” (Faculty member at Wright State University).

19. Fatih Sevilgen (2000Su), “Distribution independent spatial data structures and algorithms with applications”, (Syracuse University Doctoral Fellowship award; Faculty member at Gebze Institute of Technology, Turkey).

Masters Degrees Supervised 1. Ankita Mawandia (2012Su), “DIStiled: Ion Torrent data error correction”.

2. Rahul Nihalani (2012Su), “Resolving ambiguities in metagenomic clustering”.

3. Dhanavath Saida (2012Su), “Multicore and manycore algorithms for octrees”.

4. Ankit R. Shah (2012Su), “Parallel methods for NGS error correction”.

5. Robert Farnham (2009S), M.S. Non-thesis.

6. Srikanth Komarina (2004F), “Algorithms for hierarchical clustering of microarray data”.

7. Mahesh Narayanan (2004Su), “Optimal DNA-protein alignments with application to large- scale genome analysis”.

8. Stjepan Rajko (2004S; ISU Research Excellence Award), “Space and time optimal par- allel sequence alignments”.

9. Nicky Heijadit (2003F; co-major professor), “Implementation of VIA on PowerPC Architec- ture”.

10. Sachet Shukla (2003Su; Co-major professor), “Identification of regional motifs in the 50 UTR and their implication in translational control mechanisms”. 11. Bhanu Hariharan (2003S; ISU Research Excellence Award), “Distribution independent parallel algorithms and software for hierarchical methods with applications to computational electromagnetics”.

12. Ananth Kalyanaraman (2002Su; ISU Research Excellence Award), “Parallel clustering of expressed sequence tags”.

13. W. Chen (2001F), “Implementation of MP Lite for the VI architecture”.

14. Liang Cao (1999F), “A comparative evaluation of N-body methods based on spherical and rectangular coordinates”.

15. Sudheer Chandarreddy (1999Su), “A parallel software library for hierarchical spatial data structures”.

16. Guo Wei (1999Su), “Multiple biological sequence alignments”.

17. Songrit Maneewongvatana (1996Su), “Parallel algorithms for visibility and line arrange- ments”.

X. Professional Activities

Memberships in Professional Societies Fellow, Association for Computing Machinery (ACM) Fellow, American Association for the Advancement of Science (AAAS) Fellow, Institute for Electrical and Electronics Engineers (IEEE) Fellow, Society for Industrial and Applied Mathematics (SIAM) Member, International Society of Computational Biology (ISCB)

Editorships 1. Associate Editor, ACM/IEEE Transactions on Computational Biology and Bioinformatics, 2017 – present.

2. Associate Editor, IEEE Transactions on Big Data, 2015 – 2018.

3. Associate Editor, Journal of Parallel and Distributed Computing, 2011 – present.

4. Associate Editor, IEEE Transactions on Parallel and Distributed Systems, 2010 – 2014.

5. Editor, International Journal of Data Mining and Bioinformatics, Inderscience Publishers, 2009 – 2018.

6. Editor, Journal of Computing, Computer Society of India, 2011 – 2017.

7. Editor, Encyclopedia of Parallel Computing, Springer Publishers. Guest Editorships 1. Guest Editor, Journal of Parallel and Distributed Computing special issue on Scalable Systems for Big Data Management and Analytics, Vol. 79-80, 2015.

2. Guest Editor, ACM/IEEE Transactions on Computational Biology and Bioinformatics special issue on ACM Bioinformatics and Computational Biology conference best papers, Vol. 12, No. 1, 2015 (Co-editor with Donna Slonim).

3. Guest Editor, Parallel Computing special issue on High Performance Computational Biology, Vol. 34, No. 11, November 2008 (Co-editor with David A. Bader).

4. Guest Editor, IEEE Transactions on Parallel and Distributed Systems special issue on High Performance Computational Biology, Vol. 17, No. 8, August 2006 (Co-editor with Nancy Amato and David A. Bader).

5. Guest Editor, Concurrency and Computation: Practice and Experience special issue of se- lected papers from Workshop on High Performance Computational Biology, Vol. 16, No. 9, August 2004 (Co-editor with David A. Bader).

6. Guest Editor, Journal of Parallel and Distributed Computing special issue on High Perfor- mance Computational Biology, Vol. 63, No. 7-8, August 2003 (Co-editor with David A. Bader).

Strategic Planning Workshops 1. Fourth U.S. Data Science Leadership Summit, October 12-14, 2020.

2. Third U.S. Data Science Leadership Summit, Santa Fe, New Mexico, November 7-9, 2019.

3. Chair, Second U.S. Data Science Leadership Summit, Salt Lake City, Utah, October 12-13, 2018.

4. International Data Science Leaders Roundtable, Alan Turing Institute, London, August 20, 2018.

5. First U.S. Data Science Leadership Summit, Columbia University, New York, March 26, 2018.

6. ASCR Exascale Requirements Crosscut Review, Department of Energy, Tysons, Virginia, March 9-10, 2017.

7. Organizer, NSF/JST US-Japan Big Data Symposium, Tokyo, Japan, May 11-12, 2016.

8. NSF Workshop on Algorithmic, Mathematical, and Statistical Foundations of Data Science, Arlington, Virginia, April 28-30, 2016.

9. Exascale Requirements Review for Biological and Environmental Research, Department of Energy, Washington D.C., March 29-30, 2016.

10. White House Workshop on the National Strategic Computing Initiative, Washington D.C., McLean, Virginia, October 20-21, 2015. 11. Defense Science Research Council (DSRC) Workshop on Quantum Entanglement, Washington D.C., January 28-29, 2014.

12. White House Event, Data to Knowledge to Action: Building New Partnerships, Washington D.C., November 12, 2013.

13. White House Big Data Workshop, White House Conference Center, Washington D.C., May 3, 2013.

14. Indo-U.S. Workshop on Accelerating Bigdata Applications, Pune, India, December 21-22, 2012.

15. DOE ICiS Summer Workshop on Developing a Next Generation Systems Analysis Library, Park City, Utah, July 28 – Aug 4, 2012.

16. DELSA (Data-Enabled Life Sciences Alliance International) Workshop on Supporting Ecosys- tem for the Life Sciences in the Beginning of the 21st Century, Bethesda, Maryland, May 3-4, 2012.

17. US/India Workshop on Virtual Institutes for Computational and Data-Enabled Science and Engineering, Bengaluru, India, December 21-22, 2011.

18. NSF workshop on Accelerators for Data Intensive Applications: A Workshop to Engage the Science and Engineering Community, Arlington, Virginia, Oct 14-15, 2010.

19. DOE Workshop on Computational Research Needs in Alternative and Renewable Energy (CRNARE), Rockville, Maryland, Sep 19-20, 2007.

20. NSF Workshop on Petascale Computing in the Biological Sciences, Arlington, Virginia, Aug 29-30, 2006.

21. DOE SCaLeS workshop (Science Case for Large-Scale Simulation), Arlington, Virginia, June 24-25, 2003.

Leadership Roles in Professional Organizations 1. Chair, ACM Special Interest Group on Bioinformatics, Computational Biology, and Biomed- ical Informatics, 2015 – present.

2. IEEE Computer Society Fellows Committee, 2011 – 2015, 2018, 2020.

3. Member, SIAM Activity Group on Applied and Computational Discrete Algorithms (ACDA) Nominations Committee

4. Board Member, Indo-U.S. Collaboration for Engineering Education, 2013 – 2018.

5. Member, SIAM Activity Group on Supercomputing Career and Early Career prize selection committee

6. Executive member, Board of Directors, Great Lakes Consortium for Petascale Computation, 2009 – 2013. 7. Vice Chair of IEEE Technical Committee on Parallel Processing, 2007 – 2011.

Conference Leadership Roles 1. Co-General Chair, IEEE International Conference on Big Data (BigData), December 2020.

2. Co-General Chair, ACM Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), September 2020.

3. Co-Chair, Third Annual NSF TRIPODS PI meeting, April 23-24, 2020.

4. Track Chair for HPC Applications, ISC High Performance Conference, Frankfurt, Germany, June 2019.

5. Program Chair for Data Science, 25th IEEE International Conference on High Performance computing (HiPC), Bengaluru, India, December 2018.

6. Program Co-Chair, IEEE International Conference on Data Mining (ICDM), New Orleans, November 2017.

7. Program Co-Chair, 10th International Conference on Contemporary Computing (IC3), New Delhi, August 2017.

8. Track Chair, Distributed Big Data Systems & Analytics, 37th IEEE International Conference on Distributed Computing Systems (ICDCS), Atlanta, June 2017.

9. Steering Committee Member, ACM Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2016-present.

10. Co-General Chair, IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Atlanta, Georgia, October 2015.

11. Co-General Chair, ACM Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), Atlanta, Georgia, September 2015.

12. General Chair, Workshop on Algorithms for Bioinformatics (WABI), Atlanta, Georgia, Septem- ber 2015.

13. Program Chair, IEEE International Parallel and Distributed Processing Symposium (IPDPS), Hyderabad, India, May 2015.

14. Program Vice Chair for Applications, 43rd International Conference on Parallel Processing (ICPP), Minneapolis, Minnesota, September 2014.

15. Program Co-Chair, ACM Conference on Bioinformatics, Computational Biology and Biomed- ical Informatics (BCB), Washington D.C., October 2013.

16. Co-General Chair, IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), New Orleans, Louisiana, August 2013. 17. Program Chair, Workshop on Parallel Algorithms and Software for Analysis of Massive Graphs (ParGraph), International Conference on High Performance Computing, Pune, In- dia, December 18, 2012.

18. Panel Chair, ACM Bioinformatics, Computational Biology and Biomedicine (BCB) Confer- ence, Orlando, Florida, October 2012.

19. Program Vice-Chair for Algorithms Design and Parallelism, 41st International Conference on Parallel Processing (ICPP), Pittsburgh, Pennsylvania, September 2012.

20. Steering committee member, IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), 2011 - present.

21. Steering committee member, IEEE International Conference on High Performance computing (HiPC), 2006-2008, 2010 - present.

22. Program Chair, 4th International Conference on Contemporary Computing (IC3), New Delhi, August 2011.

23. Program Vice-Chair for Cross-Cutting Computational Methods and Bioinformatics Infras- tructure, IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Wash- ington D.C., November 2009.

24. Program Co-Chair, 1st ISCA International Conference on Bioinformatics and Computational Biology (BiCoB), New Orleans, Louisiana, April 2009.

25. Program Vice-Chair for Applications, ACM/IEEE Supercomputing Conference (SC), Austin, Texas, November 2008.

26. Tutorials Chair, 1st International Conference on Contemporary Computing (ICCC), New Delhi, India, August 2008.

27. Algorithms and Bioinformatics track chair, 6th ACS/IEEE International Conference on Com- puter Systems and Applications (AICCSA), Doha, Qatar, April 2008.

28. Program Chair, 14th IEEE International Conference on High Performance Computing, Goa, India, December 2007.

29. Program Vice-Chair for Algorithms and Applications, International Conference on Parallel Processing (ICPP), Si-An, China, August 2007.

30. Program Vice-Chair for Applications, IEEE International Parallel and Distributed Processing Symposium (IPDPS), California, April 2007.

31. Steering committee member, High Performance Computing Symposium, March 2007.

32. Program Vice-Chair for Algorithms, 13th IEEE International Conference on High Perfor- mance Computing, Bangalore, India, December 2006.

33. Tutorials Chair, 9th through 12th IEEE International Conference on High Performance Com- puting. – Goa, India, December 2005. – Bangalore, India, December 2004. – Hyderabad, India, December 2003. – Bangalore, India, December 2002.

34. Organizer, Minisymposium on Parallel Computational Biology, SIAM Conference on Parallel Processing for Scientific Computing, San Francisco, California, February 2004.

35. Algorithms and Applications Area Chair of the Technical Papers Committee, ACM/IEEE Supercomputing Conference, Phoenix, Arizona, November 2003.

Tutorial Presentation at Conferences 1. High performance computational biology, short course at the International School on High Performance Computing, San Sebastian, Spain, April 25-26, 2018.

2. Teaching parallel computing through parallel prefix, HPC Educator Session tutorial, ACM/IEEE Supercomputing Conference (SC), Salt Lack City, Utah, November 15, 2012.

3. Algorithmic challenges of next generation sequencing, Workshop on Introduction to Mathe- matical Techniques in Life Sciences, Indian Institute of Science, Bengaluru, India, January 6, 2011.

4. Exploring Computational Biology with a Massively Parallel High Performance Computing Environment (with K. Jordon), 15th International Conference on Intelligent Systems for Molecular Biology (ISMB), Vienna, Austria, July 2007.

5. High-performance Computing Methods for Computational Genomics (with A. Kalyanara- man and D.A. Bader), IEEE International Parallel and Distributed Processing Symposium (IPDPS), Long Beach, California, March 2007.

6. High-performance Computing Methods for Computational Genomics (with A. Kalyanaraman and D.A. Bader), IEEE/ACM Supercomputing Conference (SC), Tampa, Florida, November 2006.

7. Exploring Computational Biology with a Massively Parallel High Performance Computing Environment (with K. Jordon and S.E. Emrich), 14th International Conference on Intelligent Systems for Molecular Biology (ISMB), Fortaleza, Brazil, 2006.

8. Indexing Methods for Biological Sequences, Tutorial at 12th International Conference on Management of Data (CoMAD), Hyderabad, India, December 2005.

9. Opportunities and Challenges in Computational Biology (with D.A. Bader), IEEE/ACM Supercomputing Conference (SC), Baltimore, Maryland, November 2002.

10. Opportunities and Challenges in Computational Biology, 7th and 8th IEEE International Conferences on High Performance Computing (HiPC).

– Hyderabad, India, December 2001. – Bangalore, India, December 2000. Workshop Organization 1. Founding co-chair, First through thirteenth International Workshops in High-Performance Computational Biology (HiCOMB), held in conjunction with IEEE Parallel and Distributed Processing Symposiums.

– Vancouver, Canada, May 2018. – Orlando, Florida, May 2017. – Chicago, Illinois, May 2016. – Hyderabad, India, May 2015. – Pheonix, Arizona, May 2014. – Boston, Massachusetts, May 2013. – Shanghai, China, May 2012. – Anchorage, Alaska, May 2011. – Atlanta, Georgia, April 2010. – Rome, Italy, May 2009. – Tampa, Florida, March 2008. – Long Beach, California, March 2007. – Rhodes Island, Greece, April 2006. – Denver, Colorado, April 2005. – Santa Fe, New Mexico, April 2004. – Nice, France, April 2003. – Ft. Lauderdale, Florida, April 2002.

2. Founding co-chair, First through Third Workshops on Bioinformatics and Computational Bi- ology, held in conjunction with IEEE International Conferences on High Performance Com- puting.

– Hyderabad, India, December 2003. – Bangalore, India, December 2002. – Hyderabad, India, December 2001.

Program Committee Memberships 1. ACM-IMS Foundations of Data Science Conference (FODS), October 2020. 2. Applications track, ACM/IEEE Supercomputing Conference (SC), Denver, Colorado, Novem- ber 2019. 3. Algorithms track, ACM/IEEE Supercomputing Conference (SC), Dallas, Texas, November 2018. 4. Experiments track, IEEE International Parallel and Distributed Processing Symposium (IPDPS), Vancouver, British Columbia, May 2018. 5. Applications track, ACM/IEEE Supercomputing Conference (SC), Denver, Colorado, Novem- ber 2017. 6. 26th International Conference on Parallel Architectures and Compilation Techniques (PACT), Portland, Oregon, September 2017. 7. Applications track, IEEE International Parallel and Distributed Processing Symposium (IPDPS), Orlando, Florida, May 2017. 8. Algorithms track, IEEE International Parallel and Distributed Processing Symposium (IPDPS), Chicago, Illinois, May 2016. 9. Algorithms track, ACM/IEEE Supercomputing Conference, Austin, Texas, November 2015. 10. Extreme Science and Engineering Discovery Environment (XSEDE), Atlanta, Georgia, July 2014. 11. 10th International Symposium on Bioinformatics Research and Applications (ISBRA), Zhangji- ajie, China, June 2014. 12. 4th IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Miami Beach, Florida, June 2014. 13. Mining Networks and Graphs: A Big Data Analytic Challenge, Workshop in SIAM Interna- tional Conference on Data Mining, Philadelphia, Pennsylvania, April 2014. 14. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Shanghai, China, December 2013. 15. 19th IEEE International Conference on High Performance Computing (HiPC), Bengaluru, India, December 2013. 16. 19th IEEE International Conference on Parallel and Distributed Systems (ICPADS), Seoul, South Korea, December 2013. 17. Algorithms track, ACM/IEEE Supercomputing Conference, Denver, Colorado, November 2013. 18. Applications track, 42nd International Conference on Parallel Processing (ICPP), Lyon, France, October 2013. 19. 1st ASE International Conference on Bigdata, Washington D.C., September 2013. 20. Applications track, 27th IEEE Parallel and Distributed Processing Symposium (IPDPS), Boston, Massachusetts, May 2013. 21. 19th IEEE International Conference on High Performance Computing (HiPC), Pune, India, December 2012. 22. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), November 2012. 23. 8th IEEE International Conference on eScience, Chicago, Illinois, October 2012. 24. ACM Bioinformatics, Computational Biology and Biomedicine (BCB) Conference, Orlando, Florida, October 2012. 25. 12th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP), Fukuoka, Japan, September 2012. 26. 8th International Symposium on Bioinformatics Research and Applications (ISBRA), Dallas, Texas, May 2012. 27. 18th IEEE International Conference on High Performance Computing (HiPC), Bengalooru, India, December 2011. 28. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Atlanta, Geor- gia, November 2011. 29. Algorithms Design and Parallelization track, 40th International Conference on Parallel Pro- cessing (ICPP), Taipei, Taiwan, September 2011. 30. 31st International Conference on Distributed Computing Systems (ICDCS), Minneapolis, Minnesota, June 2011. 31. 7th International Symposium on Bioinformatics Research and Applications (ISBRA), Central South University, China, May 2011. 32. IEEE International Conference on Computational Advances in Bio and Medical Sciences (ICCABS), Orlando, Florida, January 2011. 33. IEEE International Conference on Bioinformatics and Biomedicine (BIBM), Hong Kong, December 2010. 34. Algorithms and Applications track, 39th International Conference on Parallel Processing (ICPP), San Diego, California, September 2010. 35. ACM International Conference on Computing Frontiers (CF), Bertinoro, Italy, May 2010. 36. 6th International Symposium on Bioinformatics Research and Applications (ISBRA), Storrs, Connecticut, May 2010. 37. Algorithms and Bioinformatics track, 8th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA), Hammamet, Tunisia, May 2010. 38. 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS), Atlanta, Georgia, April 2010. 39. 2nd International Conference on Bioinformatics and Computational Biology (BICoB), Hon- olulu, Hawaii, March 2010. 40. 8th Asia Pacific Bioinformatics Conference (APBC), Bengaluru, India, January 2010. 41. Bioinformatics and Computational Biology track, 15th International Conference on Parallel and Distributed Systems (ICPADS), ShenZhen, China, December 2009. 42. 38th International Conference on Parallel Processing (ICPP), Vienna, Austria, September 2009. 43. 16th String Processing and Information Retrieval Symposium (SPIRE), Saariselka, Finland, August 2009. 44. 15th International Conference on Computing and Combinatorics (COCOON), Niagara Falls, New York, July 2009. 45. Sequence Analysis and Alignment Area Program Committee, 17th International Conference on Intelligent Systems for Molecular Biology (ISMB), Stockholm, Sweden, June 2009. 46. Protein Interactions and Networks Area Program Committee, 17th International Conference on Intelligent Systems for Molecular Biology (ISMB), Stockholm, Sweden, Stockholm, Swe- den, June 2009. 47. 2nd IEEE TCPP Ph.D. Forum, held in conjunction with 23rd International Parallel and Distributed Processing Symposium (IPDPS), Rome, Italy, May 2009. 48. 5th International Symposium on Bioinformatics Research and Applications (ISBRA), Ft. Lauderdale, Florida, May 2009. 49. Student Research Symposium, 15th IEEE International Conference on High Performance Computing (HiPC), Bangalore, India, December 2008. 50. 9th IEEE International Conference on Computational Science and Engineering, S˜aoPaulo, Brazil, June 2008. 51. 19th Annual Symposium on Combinatorial Pattern Matching (CPM), Pisa, Italy, June 2008. 52. 13th International Conference on Database Systems for Advanced Applications (DASFAA), New Delhi, India, March 2008. 53. IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN), Innsbruck, Austria, February 2008. 54. 7th International Symposium on Advanced Parallel Processing Technologies (APPT), Guangzhou, China, November 2007. 55. ACM/IEEE Supercomputing Conference (SC), Reno, Nevada, November 2007. 56. Parallel Bio-Computing (PBC) Workshop, Gdansk, Poland, September 2007. 57. 1st IEEE International Workshop on Service Oriented Technologies for Biological Databases and Tools (SOBDAT), Salt Lake City, Utah, July 2007. 58. IEEE International Symposium on Bioinformatics and Life Science Computing (BLSC), Ni- agara Falls, Canada, May 2007. 59. International Conference on Computing: Theory and Applications (ICCTA), Kolkata, India, March 2007. 60. 3rd SIAM Workshop on Combinatorial Scientific Computing (CSC), Costa Mesa, California, February 2007. 61. ISCA International Conference on Parallel and Distributed Computing Systems (PDCS), San Francisco, California, September 2006. 62. 35th International Conference on Parallel Processing (ICPP), Columbus, Ohio, August 2006. 63. IEE International Conference on Computational Science and its Applications (ICCSA), Glas- gow, U.K., May 2006. 64. IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN), Innsbruck, Austria, February 2006. 65. 12th IEEE International Conference on High Performance Computing (HiPC), Goa, India, December 2005. 66. ACM/IEEE Supercomputing Conference (SC), Seattle, Washington, November 2005. 67. 5th IEEE International Symposium on Bioinformatics and Bioengineering (BIBE), Minneapo- lis, Minnesota, October 2005. 68. 6th International Workshop on Advanced Parallel Processing Technologies (APPT), Hongkong, October 2005. 69. ISCA International Conference on Parallel and Distributed Computing Systems (PDCS), Las Vegas, Nevada, September 2005. 70. Workshop on Parallel Bio-Computing, Poznan, Poland, September 2005. 71. 5th ACM SIGKDD Workshop on Data Mining in Bioinformatics, Chicago, Illinois, August 2005. 72. International Workshop on High Performance Computing in Medicine and Biology (HiP- COMB), Japan, July 2005. 73. IEE International Conference on Computational Science and its Applications (ICCSA), Sin- gapore, May 2005. 74. International Conference on Computational Science (ICCS), Atlanta, Georgia, May 2005. 75. IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN), Innsbruck, Austria, February 2005. 76. International Conference on Parallel and Distributed Computing Systems (PDCS), San Fran- cisco, California, September 2004. 77. 4th ACM SIGKDD Workshop on Data Mining in Bioinformatics, Seattle, Washington, Au- gust 2004. 78. International Conference on Computational Science (ICCS), Krak´ow,Poland, June 2004. 79. International High Performance Computing Symposium, Winnipeg, Manitoba, Canada, May 2004. 80. IASTED International Conference on Parallel and Distributed Computing and Networks (PDCN), Innsbruck, Austria, February 2004. 81. 10th IEEE International Conference on High Performance Computing (HiPC), Hyderabad, India, December 2003. 82. 3rd ACM SIGKDD Workshop on Data Mining in Bioinformatics, Washington D.C., August 2003. 83. International Conference on Computational Science (ICCS), Melbourne, Australia and St. Petersburg, Russia, June 2003. 84. Workshop in Bioinformatics, 19th International Conference on Data Engineering, Bangalore, India, March 4, 2003. 85. ACM/IEEE Supercomputing Conference (SC), Baltimore, Maryland, November 2002. 86. 4th Conference on Computational Biology and Genome Informatics (CBGI), Raleigh, North Carolina, March 2002. 87. IEEE International Symposium on Cluster Computing and the Grid, Brisbane, Australia, May 2001. 88. International Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA), Las Vegas, Nevada, June 1999. 89. Panel member, Parallel Systems across industry, academia and national laboratories, Rio Grande ACM Meeting, El Paso, Texas, March 27, 1998.

Proposal Panels and Reviews 2019 NSF MRI Panel 2017 USDA Plant Genome Databases Panel 2017 NSF SBIR/STTR Panel 2014 NSF SSE Panel 2014 NSF XPS Panel 2014 NSF PRAC Panel 2014 NIH SBIR Panel 2014 Canadian Discover Frontiers Initiative Review Committee 2012 NSF/NIH Bigdata Panel 2012 NIH SBIR mail reviews 2012 NSF Information and Intelligent Systems (IIS) Panel 2012 NIH SBIR mail reviews 2012 NSF Computing Research Instrumentation (CRI) Panel 2012 NSF iPlant Review Panel 2011 NIH SBIR mail reviews 2011 NIH Bio Data Management and Analysis (BDMA) Study Section 2011 Greek Ministry of Education 2011 NSF Information and Intelligent Systems (IIS) Panel 2010 Norway Research Council 2010 NSF Software Development for Cyber Infrastructure (SDCI) Panel 2009 DOE SciDac Center Review Panel 2009 NSF Teragrid Review Panel 2008 NSF Advances in Biological Informatics (ABI) Panel 2008 NSF Cluster Exploratory (CluE) Panel 2008 NIH Shared Instrumentation Grant Program 2008 NSF Cyber Discovery Initiative (CDI) Panel 2007 NIH Shared Instrumentation Grant Program 2007 NSF Career mail reviews 2006 NIH High Performance Computing Review Panel 2005 NSF Computer Systems Architecture Panel 2005 NIH Continued Development of Computational Biology Software Study Section 2005 NIH Continued Development of Computational Biology Software Study Section 2005 NSF Science and Engineering Information Integration Informatics (SEIII) Panel 2004 NIH Bio Data Management and Analysis Study Section 2003 NSF Advanced Computational Research (ACR) Panel 2000 NSF Advanced Computational Research (ACR) Panel 2000 NSF Operating Systems and Compilers (OSC) Panel 1998 NSF CISE Research Instrumentation (RI) Panel XI. Institutional Service

Service at Georgia Institute of Technology

University Committees 1. Chair, Internal Advisory Group on High Performance Computing, July 2016 – 2018.

2. Co-Chair, Strategic Initiative in Data Engineering and Science, July 2014 – June 2016.

3. Member, Vice Provost Graduate Education & Faculty Affairs (VPGEFA) Review Committee, Jan 2014 – June 2016.

College Committees 1. Member, College of Computing Cabinet (as CSE Interim Chair), 2019 – 2020.

2. Member, College Reappointment, Promotion, and Tenure Committee, 2014 – 2019, 2020 – present.

3. Member, Machine Learning Chair Search Committee, 2015 – 2016.

4. Member, Machine Learning Task Force, 2014 – 2016.

5. Member, College of Computing advisory committee, 2013 – 2014.

School Committees 1. Chair, Strategic Visioning Committee, 2020-present.

2. Member, Faculty Recruiting Committee, 2013 – 2019.

3. Member, Honors and Awards Committee, 2017 – 2018.

4. Chair, School Reappointment, Promotion, and Tenure Committee, 2014 – 2017, 2018 – 2019, 2020 – present.

5. Chair, Honors and Awards Committee, 2015 – 2017.

6. Chair, School Chair Advisory Committee, 2013 – 2016.

7. Member, Graduate Student Admissions Committee, 2013 – 2015.

Service at Iowa State University

Administrative Roles 1. Chair, Bioinformatics and Computational Biology graduate program, August 2005 – July 2007.

2. Associate Chair for Research and Graduate Education, January 2003 – June 2006. 3. Associate Chair, Bioinformatics and Computational Biology graduate program, August 2003 – July 2005.

University Planning Conferences 1. Faculty roles in the information age: challenges and opportunities, Faculty Spring Conference, Newton, Iowa, March 31 - April 1, 2000.

University Committees 1. Member, Biotechnology council, 2011 – 2014. 2. Member, High Performance Computing Steering Committee, 2011 – 2013. 3. Member, Search Committee for Vice President for Research and Economic Development, 2008. 4. Member, Welfare and Benefits Committee of the Faculty Senate, 2008 – 2009, 2011 – 2012. 5. Member, University Budget Advisory Committee, 2007 – 2010. 6. Member, Research Computing Council, 2007 – 2011. 7. Member, University group for visioning grand challenges in the fundamental areas of physical and biological sciences, 2007. 8. Member, Provost task force on High Performance Computing, 2006. 9. Member, Advisory Group to the Information Technology Task Force. 10. Member, Computing Research and Information Technology Committee (CRIT). 11. Member, Director Search Committee for Laurence H. Baker Center for Bioinformatics and Biological Statistics, 2000 – 2001.

College Committees 1. Member, Budget and Planning Advisory Council, 2009 – 2012. 2. Member, Honors and Awards Committee, 2009 – 2012. 3. Member, committee to investigate biology courses in engineering education, 2000.

Interdepartmental Committees 1. 2001 – 2009, Member, Supervisory Committee for Bioinformatics and Computational Biology graduate program. 2. 2000 – Member, Policy Committee for Laurence H. Baker Center for Bioinformatics and Biological Statistics 3. 1999 – 2000 Member, Admissions Committee for graduate program in Bioinformatics and Computational Biology Departmental Committees 1. 2007 – 2011, Promotion and Tenure Committee 2. 2007 – 2009, Chair, Strategic Planning Committee 3. 2006 – Present, ECpE Honors and Awards Committee 4. 2003 – 2006, ECpE Administrative Committee 5. 2003 – 2006, Chair, Graduate Committee 6. 2005 – 2006, Promotion and Tenure Committee 7. 2003, Chair Search Committee 8. 2001 – 2004, Member, Faculty Search Committee 9. 2001 – 2003, Member, Promotion and Tenure Committee 10. 2001, Chair, Qualifying Examination Review Committee 11. 2000, DEO Evaluation Committee 12. 2000 – 2001, Distinguished Colloquium Series 13. 2000 – 2001 Member, Curriculum Committee 14. 1999 – 2000 Member, Graduate Committee

Service at New Mexico State University 1. 1997 – 1999 Chair, Qualifying Examinations 2. 1997 – 1999 Member, Undergraduate Curriculum Committee 3. 1997 – 1999 Member, Graduate Curriculum Committee 4. 1997 – 1999 Member, Awards Committee 5. 1998 – 1999 Member, Molecular Biology Curriculum Committee. 6. 1997 Judge, ACM local programming competition. 7. 1998 Contributed Problems to ACM local programming competition.

Service at Syracuse University 1. 1994 – 1996 Chair, Computer Architecture and Operating Systems Comprehensive Examina- tion. 2. Designed a new set of requirements for the Ph.D. program in Computer and Information Science at Syracuse University in response to a review by the State of New York.