<<

Biographical Sketch — ChengXiang

University of Illinois Office: (217) 244-4943 Department of Computer Science Home: (217) 356-1292 2116 Thomas Siebel Center for Computer Science [email protected] 201 N. Goodwin Ave., Urbana, IL 61801 http://www.cs.uiuc.edu/homes/czhai/ PROFESSIONAL PREPARATION Nanjing University,Nanjing, ComputerScience B.S. 1984 Nanjing University,Nanjing,China ComputerScience M.S. 1987 Nanjing University,Nanjing,China ComputerScience Ph.D. 1990 Carnegie Mellon Univ., Pittsburgh, PA Computational Linguistics M.S. 1995 Carnegie Mellon Univ., Pittsburgh,PA LanguageandInfo. Technologies Ph.D. 2002 APPOINTMENTS University of Illinois at Urbana-Champaign,Urbana,IL AssociateProfessor 8/2008- University of Illinois at Urbana-Champaign,Urbana,IL AssistantProfessor 8/2002-8/2008 Carnegie Mellon University,Pittsburgh,PA SeniorResearchProgrammer 11/2000-7/2002 Clairvoyance Corporation, Pittsburgh, PA Research Scientist 1/1997-11/2000 Nanjing University,Nanjing,China ResearchAssociate 9/1990-7/1993 SELECTED AWARDS AND HONORS

2011 ACM CIKM 2011 Best Student Paper Award (1 out of 917 international submissions) 2011 HP Innovative Research Award 2010 IBM Faculty Award 2010 UIUC Rose Award for Teaching Excellence 2009 ACM Distinguished Scientist (1 of the 58 named in 2009 by ACM) 2008 Sloan Research Fellowship (1 of the 118 recipients in 2008) 2007 ACM KDD 2007 Best Student Paper Award Runner-Up (1 out of 573 international submissions) 2006 ACM KDD 2006 Best Student Paper Award Runner-Up (1 out of 531 international submissions) 2006 Invited Participant of NAE’s 2006 U.S. Frontiers of Engineering Symposium (1 of 82 participants selected nationally) 2005 2004 Presidential Early Career Award for Scientists and Engineers (PECASE) 2004 National Science Foundation 2004 CAREER Award 2004 ACM SIGIR 2004 Best Paper Award (1 out of 273 international submissions) FIVE MOST CLOSELY RELATED PUBLICATIONS

1. Yanen , , Parikshit Sondhi, Lui Sha, Chengxiang Zhai. Reconstructing Missing Signals in Multi- Parameter Physiologic Data by Mining the Aligned Contextual Information, Proceedings of Computing in Cardiology Conference 2010, 2010. 2. P. Sondhi, J. Sun, C. Zhai, R. Sorrentino, M. S. Kohn, Leveraging Medical Thesauri and Physician Feedback for Improving Medical Literature Retrieval for Case Queries, Journal of the American Medical Informatics Association , to appear. 3. P. Sondhi, M. Gupta, C. Zhai and J. Hockenmaier. Shallow Information Extraction from Medical Forum Data, Proceedings of COLING 2010, pages 1158-1166, 2010. 4. Jing Jiang, ChengXiang Zhai. A Two-Stage Approach to Domain Adaptation for Statistical Classifiers, Pro- ceedings of CIKM 2007, pages 401-410. 5. Jing Jiang and ChengXiang Zhai. Instance Weighting for Domain Adaptation in NLP, Proceedings of ACL 2007, pp. 264-271.

1 FIVE OTHER PUBLICATIONS

1. ChengXiangZhai and John Lafferty. A study of smoothingmethods for language models applied to information retrieval, ACM Transactions on Information Systems, 22(2), April 2004, pp. 179-214. (over 1,200 total citations in Google Scholar for this journal version and a conference version of the paper as of Dec. 2011) 2. Hui , Tao, and ChengXiang Zhai. A formal study of information retrieval heuristics, Proceedings of ACM SIGIR 2004, 2004, pp. 49-56. Best Paper Award. 3. Yuanhua Lv, ChengXiang Zhai. Lower Bounding TF Normalization, Proceedings of ACM CIKM 2011, 2011, pp. 7-16. Best Student Paper Award 4. Qiaozhu , Dong , Hong Cheng, Jiawei , and ChengXiang Zhai. Generating semantic annotations for frequent patterns with context analysis, Proceedings of ACM KDD 2006, 2006, pp. 337-346. Best Student Paper Award Runner-Up. 5. Duo , ChengXiang Zhai, Jiawei Han. Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases, Proceedings of the SDM 2009, 2009, pp. 1123-1134. Best of SDM 09 SYNERGISTIC ACTIVITIES

• Program chair (IR): ACM SIGIR 2009, NAACL HLT 2007, ACM CIKM 2004. • Associate Editor: ACM Transactions on Information Systems, Information Processing & Management • Area program chair/coordinator: ACM SIGIR 2010, 2008,2006; WWW 2011, WSDM 2011, ACL 2006, HLT/NAACL 2006, HLT/EMNLP 2005. • Keynote speaker: ACM SIGIR 2011, ICTIR 2011, AIRS 2010, 6th Dutch-Belgian Information Retrieval Work- shop, 2006. • Tutorial chair: ACM SIGIR 2007. • Tutorial instructor: NAACL HLT 2007, ACM SIGIR 2006, ACM SIGIR 2005, HLT/NAACL 2004. • Program committee Member: regularly serving on program committees for all the major conferences in in- formation retrieval (SIGIR, CIKM), data mining (KDD), natural language processing (ACL), machine learning (ICML, NIPS). • Grant proposal panel/reviewer: NSF panel and external reviewer, Engineeringand Physical Sciences Research Council (EPSRC), UK; US-Israel Binational Science Foundation. • Major developer of Lemur, a toolkit for Language Modeling and Information Retrieval, which has been dis- tributed to the research community, and has been used by many groups in the world for both research and education. • Created the text information management undergraduate course and the bioinformatics undergraduate course in the Computer Science Department at UIUC COLLABORATORS (within the past 48 months)

Kevin (UIUC), Roxana Girju (UIUC), Jiawei Han (UIUC), Rong (Michigan State), Xinghua (Med- ical Univ. of South Carolina), Gene Robinson (UIUC), Sandra Rodriguez-Zas (UIUC), Dan Roth (UIUC), Bruce Schatz (UIUC), Luo (Purdue), Saurabh Sinha (UIUC), Richard Sproat (UIUC)

GRADUATE ADVISORS

John Lafferty (CMU), David Evans (Clairvoyance Corp.), Jiafu (Nanjing Univ.), Guoliang (Nanjing Univ.)

GRADUATE ADVISEES

Hui Fang, Tao Tao, Xuehua Shen, Jing Jiang, Qiaozhu Mei, Xuanhui

2