Curriculum Vita

Byung S. Lee

CURRENT AFFILIATION Professor, Department of Computer Science (primary) and Department of Electrical and Biomedical Engineering (secondary), College of Engineering and Mathematical Sciences, University of Vermont, Burlington, VT 05405, USA Phone: (802)656-1919. Fax: (802)656-0696. Email: [email protected]. Home page: http://www.cems.uvm.edu/~bslee

EDUCATION Ph.D. /Computer Science ( Systems), , Palo Alto, CA, January 1991. Dissertation: Instantiating Objects from a Remote Relational Database through Views. (Advisor: Gio Wiederhold, Computer Science) M.S. Electrical Engineering (Communication Systems), KAIST, Daejon, South Korea, February 1982. Thesis: Implementation of a Multi-rate Speech Digitizer. (Advisor: Chong-Kwan Un, Electrical Engineering) B.S. Electronics, Seoul National University, Seoul, South Korea, February 1980.

EMPLOYMENT September 1999–present: Department of Computer Science, University of Vermont, Burlington, VT, 05405 USA. (Assistant Professor; Associate Professor; Professor) June 1998– August 1998: Computer Science Department, Dartmouth College, Hanover, NH 03755-3510 USA. (Visiting Assistant Professor) September 1993–August 1999: Graduate Programs in Software Engineering, University of St. Thomas, 2115 Summit Avenue, St. Paul, MN 55105 USA. (Assistant Professor) October 1992–August 1993: Datacom Global Communications, Inc., Princeton, NJ, 08544 USA. (Supervisor) December 1990–October 1992: Bell Communications Research (now Telcordia Technologies), 444 Hoes Lane, Piscataway, NJ 08854-4157 USA. (Member of Technical Staff) January 1990–August 1990: Hewlett-Packard Research Laboratories, 1501 Page Mill Road, Palo Alto, CA 94305 USA. (Practical Trainee) June 1986–December 1990: Computer Systems Laboratory, Stanford University, Palo Alto, CA 94305 USA. (Graduate Research Assistant) March 1980–July 1985: Technology Research Center, Goldstar Electric (now LG Communications), Anyang, South Korea. (Engineer; Principal Engineer and Team Lead)

TEACHING September 1999 – present: (University of Vermont) o CS64: Discrete mathematics (undergraduate students; required). o CS121: Computer Organization (undergraduate students; required). o CS124: Data Structures (undergraduate students; required). o CS204: Database Systems (upper-level undergraduate and first year graduate students; elective). o CS224: Algorithm Design and Analysis (upper-level undergraduate and first-year graduate students; elective for undergraduate and required for graduate). o CS331/295: Advanced Database Systems (CS331 for graduate students, CS295 for undergraduate students; elective). June 1998 - August 1998: (Dartmouth College) o CS37: Computer Architecture (undergraduate students; required). September 1993 – June 1999: (University of St. Thomas) o CSIS530: Database Management System and Design (graduate students; required). o CSIS531: Database Management Concepts and Applications (graduate students; required) – Equivalent to CS530, designed for more application-oriented, less technology-oriented students. o CSIS532: Distributed Database Management Systems (graduate students; elective). o CSIS544: Object-Oriented (graduate students; elective).

RESEARCH PROJECTS September 1999 – present: (University of Vermont) o Trajectory analysis of signals from IOT: methods and applications – example applications include environmental watershed monitoring and smart building occupant co-work pattern analysis. o Anomaly detection over data streams: methods and applications – example applications include patient health monitoring, environmental health monitoring, and civil infrastructural health monitoring. o Online social network data analytics: inter-user influence modeling, topical influence modeling, hashtag clustering, geo- social network mining. o Event processing: event stream processing and complex event processing; causal modeling and query processing over event streams. o Stream query processing: continuous aggregation join queries over data streams, distributed join query optimization, adaptive-size reservoir sampling over data streams, spatiotemporal join processing over continuous location data stream, temporal query processing over data streams. o Approximate query evaluation using forecasting techniques: selectivity estimation in query processing; QoS-driven data aggregation in sensor networks, periodic pattern mining from streaming time series. o Predictive modeling in databases: self-tuning cost-modeling of user-defined functions; workload-aware multidimensional histograms. o XML: XML element numbering; large scale XML query processing using information retrieval techniques. o Information retrieval: combining document rankings from multiple search engines; Boolean text search query optimization. o Data mining: data clustering using a multidimensional index; mining partial periodic correlations from time series. o Temporal aggregations using a multidimensional index. o Object-relational databases: nested object selectivity; partial rollback schemes. o Web caching: time-to-Live (TTL) determination. o Approximate ad-hoc query support for scientific simulation mesh data: data model and query language; system architecture. September 1993 – June 1999: (University of St. Thomas) o Full-text indexing systems: a standard generalization markup language (SGML) benchmark testing using Oracle Context and Open Text systems. o Object-oriented databases: feasibility assessment as a repository for SGML documents; object class normalization in a schema design; refined object schema mapping from Enhanced Entity-Relationship (EER) schema. November 1992 - July 1993: (Datacom Global Communications) o Electronic Data Interchange (EDI): development of EDI systems for MS-DOS, Stratus/VOS, and UNIX operating systems; development of EDI Gateway prototype with shared memory architecture. December 1990 - October 1992: (Bell Communications Research; currently Telcordia Technologies) o Object-oriented database management systems (OODBMSs): development of a telephony benchmark suite for evaluating OODBMSs; assessing the features of SIM (a semantic DBMS) and Ithasca (an OODBMS). o Heterogeneous distributed database integration: building a uniform access interface to remote databases in Oracle, Ingres, Sybase, and RDB. August 1985 - December 1990: (Stanford University) o Remote data accesses: development of a Commonlisp language interface to Iris object-oriented database system (for Hewlett-Packard Palo Alto research center) and to Sybase DataServer (for Stanford knowledge systems project); development of an Interlisp interface to a remote SunUnify relational database server (for Stanford University medical expert systems project). o Expert systems: performance analysis of the blackboard control architecture (BB-1) – an opportunistic knowledge-based expert system – for Stanford Knowledge Systems Laboratory. March 1982 - July 1985: (Goldstar Electric Company; currently LG Communications) o Army battery automation (in collaboration with Agency for Defense Development): development of combat communication and control (C3) systems including 155mm howitzer battery firing data calculator, digital message device, and ground data unit; tactical fire control computer system. March 1980 - February 1982: (Korea Advanced Institute of Science and Technology) o Digital signal processing and voice coding: development of a multi-rate speech digitizer.

STUDENT/POSTDOC RESEARCH SUPERVISION Postdoctoral research o Sang-Pil Kim, Finding Twitter Users Compatible with a New Article. October 2015 – March 2017. o Zhen He, Cost and Selectivity Modeling of User-Defined Functions. March 2003 – November 2004. Doctoral dissertation o Ali Javed, Spatiotemporal trajectory analysis from hydrological storm event data. February 2018 – present. o Saurav Acharya, Incremental Causal Network Construction over Event Streams. September 2009 – October 2014 (graduated). o Sasi Kunta, Cellular Automata Based Event Stream Processing. September 2008 – May 2010 (deceased). o Mohammed Al-Kateb, Adaptive-Size Reservoir-Based Sampling and Temporal Coalescing over Data Streams. January 2005 – May 2011 (graduated). o Tri Tran, Efficient Evaluation of Join Queries over Data Streams. July 2004 – October 2010 (graduated). Master’s thesis o Ali Javed, A Hybrid Approach to Semantic Hashtag Clustering in Social media, June 2015 – May 2016 (graduated). o Qiang (AJ) Jing, Event Detection in Binary Sensor Networks. (Co-advised with Professor Sean Wang). Spring 2005 – Spring 2007. o Dennis Fuchs, A Quantized Histogram for Multidimensional Selectivity Estimation. September 2003 – May 2004 (graduated). o Songtao Jiang, Modeling the Cost of Spatial Search Operators Using Nonparametric Regression. September 2002 – October 2003 (graduated). o Jiangyan He, Combined Relevance Ranking of Documents. April 2002 – May 2005 (graduated). o Li Chen, QoS Multicast Routing and Protection Planning in Optical Networks. (Co-advised with Professor Xue). June 2001 – May 2002 (graduated). o Vinod Kannoth, Regression-Based Cost Modeling of User-Defined Functions in Object-Relational Database Management Systems. June 2000 – May 2001 (graduated). o Kwok Yu, Object-Oriented Databases for SGML Document Repository. January 1998 – May 1999 (graduated). o Michael R. Olson, SGML Benchmark Application on Objectstore Object-Oriented DBMS. August 1995 – December 1995 (graduated). Master’s project o Jack Houk, Mobile ECG Anomaly Detection Using Long Short-Term Recurrent Neural Network, June 2018 – present. o Yuhang Lin, Continuous Detection of Abnormal Heartbeats from ECG Using Online Outlier Detection, June 2017 – May 2018 (graduated). o Prajwal Shrestha, Effect Event Prediction Using a Cause-Effect Event Pair Abstraction Tree based on Semantic Web, September 2016 – May 2017 (graduated). o Mamata Hegde, The Feasibility of Life Log Inquiries on GPS Trace Data. January 2008 – October 2008 (graduated). o Eugene Somdahl, Prototype for an Interactive Wild Plant Identification and Use Field Guide. October 1997 – March 1999 (graduated). o Jeanne Hunt, Traffic Control database. (Sponsored by Center for Transportation Studies, Minneapolis, MN.) March 1998 – December 1998 (graduated). TM o Mark Dosdall, Development of COM Interfaces for OBJECTS Software. (Sponsored by 3Objects, Inc., Eden Prairie, MN.) December 1996 – June 1997 (graduated). o Kenneth Perttula, Application of OLAP Technology in Decision Support Systems in a Managed Care Environment. (Sponsored by United Healthcare, Edina, MN.) July 1996 – February 1997 (graduated). o Parvez Mukadam, Automotive Data Identification and Retention (ADIR) System. (Sponsored by 3M, St. Paul, MN.) September 1995 – May 1996 (graduated). o Dan Madsen, Resource Tracking Databases on Microsoft Access. (Sponsored by NCR Corporation, Edina, MN.) September 1994 – May 1995 (graduated). Practical training o Li Chen, Modeling the Costs of User-Defined Functions, Optional Practical Training (OPT) research. September 2002 – August 2003. Independent study o Josiah Witt, Community-Based Social Network System – A Proof of Concept, Fall2018. o Poonima Shetty, Utilizing Temporal Structure for Causal Modeling. Summer 2012 – Fall 2012. o Saurav Acharya, Predictive Real-Time Causal Query Processing over Event Streams. Summer 2012. o Ahmed Hamed, A Scientific Workflow for Biodiversity Risk Monitoring. Spring 2010. o Sean Marchetti, Development of a Wireless Real-Time Market Data Solution. (Sponsored by Amerada Hess Corporation, New York, NY.) Fall 2003. o Jason Storer, Development of a Music Library System. (Sponsored by WRUV, Burlington VT.) Spring 2003. o Jonathan Sullivan and David VanHorn, Generating the Cost Functions of Text Search Functions. Spring 2002 – Fall 2002. o Vinod Kannoth, Adaptive Cost-Modeling of User-Defined Functions. Fall 2001. o Li Chen, Object Queries on Simulation Mesh Data. Winter 2000 – Spring 2001. o Michelle Kurdika, Development of Brio Support and Goal Hierarchy Databases. (Sponsored by EMC Corporation, Hopkinton MA.) Summer 2000. o Mukesh Agarwal, Middleware Technology Assessment. September 1997 – December 1997. o Carlotta Persaud, Sybase SQL Server Study. June 1997 – August 1997. o David Zimmerman, Microsoft SQL Server Study. February 1996 – May 1996. Undergraduate research o Joshua Rothenberg, RFID Programming for Vermont Transportation Signage Management System, Fall 2018 o Joshua Childs, Database Development for Vermont Transportation Signage Management System, Fall 2018 – Spring 2019. o Alexander Swanson, Building a Reddit AI Agent to Increase Awareness about the Humanitarian Crisis in Puerto Rico, Spring 2018.

PUBLICATIONS Journals (peer reviewed) o Ali Javed and Byung Suk Lee, Hybrid Semantic Clustering of Hashtags, Online Social Networks and Media, Volume 5, Elsevier, March 2018, pp. 23-36 o Saurav Acharya, Byung Suk Lee, Paul Hines, Causal Prediction of Top-K Event Types over Real-Time Event Streams, The Computer Journal, Oxford Journals, Volume 60, Issue 11, November 2017, pp. 1561-1581. o Ali Javed and Byung Suk Lee, Sense-Level Semantic Clustering of Hashtags, Communications in Computer and Information Science, Volume 656, Springer, 08 March 2017, pp. 1-16. o Saurav Acharya and Byung Suk Lee, Enhanced Fast Causal Network Inference over Event Streams, Transactions on Large-Scale Data- and Knowledge-Centered Systems XVII, Lecture Notes in Computer Science 8970, Springer, January 2015, pp. 45-73. o Yang-Sae Moon and Byung Suk Lee, “Safe MBR-Transformation in Similar Sequence Matching,” Information Sciences, Elsevier, Volume 270, 20 June 2014, pp. 28-40. o Saurav Acharya and Byung Suk Lee, “Incremental Causal Network Construction over Event Streams,” Information Sciences, Elsevier, Volume 261, March 2014, pp. 32-51. o Mohammed Al-Kateb and Byung Suk Lee, “Adaptive Stratified Reservoir Sampling over Heterogeneous Data Streams,” Information Systems Journal, Elsevier, Volume 39, January 2014, pp. 199-216. o Mohammed Al-Kateb and Byung Suk Lee, “Load Shedding for Temporal Queries over Data Streams,” Journal of Computer Science and Engineering, KIISE, Volume 5, Number 4, December 2011, pp.294-304. o Tri Minh Tran and Byung Suk Lee, “Distributed Adaptive Windowed Stream Join Processing,” International Journal of Distributed Systems and Technologies, IGI Global, Volume 2, Issue 2, April-June 2011, pp.59-81. o Mohammed Al-Kateb and Byung Suk Lee, “Temporal Coalescing on Window Extents over Data Streams,” IEICE Transactions on Information and Systems, IEICE Press, Volume E94-D, Number 3, March 2011, pp.489-503. o Jeong-Hoon Lee, Kyu-Young Whang, Hyo-Sang Lim, Byung Suk Lee, and Jun-Seok Heo, “Progressive Processing of Continuous Range Queries in Hierarchical Wireless Sensor Networks,” IEICE Transactions on Information and Systems, IEICE Press, Volume E-93.D, Number.7, July 2010, pp.1832-1847. o Tri Minh Tran and Byung Suk Lee, "Distributed Stream Join Query Processing with Semi-joins," Distributed and Parallel Databases, Springer, Volume 27, Issue 3, June 2010, pp.211-254. o Tri Minh Tran and Byung Suk Lee, “Transformation of Continuous Aggregation Join Queries over Data Streams,” Journal of Computing Science and Engineering, KIISE, Volume 3, Number 1, March 2009, pp. 27-58. o Wook-Shin Han, Jaehwa Kim, Byung Suk Lee, Yufei Tao, Ralf Rantzau, and Volker Markl, “Cost-Based Predictive Spatio-Temporal Join,” IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Science Press, Volume 21, Issue 2, February 2009, pp. 220-233. o Zhen He, Byung Suk Lee, and X. Sean Wang, “Aggregation in Sensor Networks with a User-Provided Quality of Service Goal,” Information Sciences, Elsevier, Volume 178, Issue 9, May 2008, pp. 2128-2149. o Zhen He, X. Sean Wang, Byung Suk Lee, and Alan C.H. Ling, “Mining Partial Periodic Correlations in Time Series,” Knowledge and Information Systems, Springer, Volume 15, Number 1, April 2008, pp 31-54. o Zhen He, Byung Suk Lee, X. Sean Wang, “Proactive and Reactive Multi-dimensional Histograms for Selectivity Estimation”, Journal of Systems and Software, Elsevier, Volume 81, Issue 3, March 2008, pp. 414-430. o Joon-Ho Woo, Byung Suk Lee, Min-Jae Lee, Kyu-Young Whang, and Woong-Kee Loh, “Temporal Aggregation Using a Multi-dimensional Index,” Journal of Database Management, IGI Global (formerly Idea Group Inc.), Volume 18, Issue 2, April-June 2007, pp. 62-80. o Dennis Fuchs, Zhen He, and Byung Suk Lee, “Compressed Histograms with Arbitrary Bucket Layouts for Selectivity Estimation,” Information Sciences, Elsevier, Volume 177, Issue 3, February 2007, pp. 680-702. o Songtao Jiang, Byung Suk Lee, and Zhen He, “The Cost Modeling of Spatial Operators Using Nonparametric Regression,” Information Sciences, Elsevier, Volume 177, Issue 2, January 2007, pp. 607-631. o Vinod Kannoth, Byung Suk Lee, and Jeff Buzas, “Statistical Cost-Modeling of Financial Time Series Functions,” Journal of Computers and Applications, ACTA Press, Volume 28, Number 3, 2006, pp. 181-188. o Young-Ho Park, Kyu-Young Whang, Byung Suk Lee, and Wook-Shin Han, “Efficient Evaluation of Partial Match Linear Path Expressions on Large-Scale Heterogeneous XML Documents Using Information Retrieval Techniques,” Journal of Systems and Software, Elsevier, Volume 79, Number 2, February 2006, pp. 180-190. o Won-Young Kim, Byung Suk Lee, and Kyu-Young Whang, “Partial Rollback in Object-Oriented/Object-Relational Database Management Systems with Dual Buffer,” Journal of Information and Software Technology, Elsevier, Volume 48, Issue 2, February 2006, pp. 121-132. o Zhen He, Byung Suk Lee, and Robert Snapp, “Self-Tuning Cost Modeling of User-Defined Functions in an Object- Relational DBMS,” ACM Transactions on Database Systems, ACM Publications, Volume 30, Issue 3, September 2005, pp. 812-853. o Byung Suk Lee, Li Chen, Jeff Buzas, and Vinod Kannoth, “Regression-Based Self-Tuning Modeling of Smooth User- Defined Function Costs for an Object-Relational Database Management System Query Optimizer,” The Computer Journal, Oxford University Press, Volume 47, Number 6, November 2004, pp. 673-693. o Jae-Joon Hwang, Kyu-Young Whang, Yang-Sae Moon, and Byung-Suk Lee, “A Top-down Approach for Density-Based Clustering Using Multidimensional Indexes,” Journal of Systems and Software, Elsevier, Volume 73, Number 1, September 2004, pp. 169-180. o Joon-Ho Woo, Byung Suk Lee, Min-Jae Lee, Jae-Gil Lee, and Kyu-Young Whang, “Transformation-Based Temporal Aggregation Using Order-Based Buffer Replacement Strategy,” Journal of Computer Systems Science and Engineering, CRL Publishing, Volume 19, Number 5, September 2004, pp. 3-9. o Byung Suk Lee and Ron Musick, “MeshSQL: the Query Language for Simulation Mesh Data,” Information Sciences, Elsevier, Volume 158, Issues 1-4, January 2004, pp. 177-202. o Byung Suk Lee, Terence Critchlow, Ghaleb Abdulla, Chuck Baldwin, Roy Kamimura, Ron Musick, Robert Snapp, and Nu Ai Tang, “The Framework for Approximate Queries on Simulation Data,” Information Sciences, Elsevier, Volume 157, Issues 1-4, December 2003, pp. 3-20. o Wook-Shin Han, Ki-Hoon Lee, and Byung Suk Lee, “An XML Storage System for Object-Oriented/Object-Relational DBMSs,” Journal of Object Technology, Volume 2, Number 3, May-June 2003, pp. 113-126. o Byung Suk Lee, Robert R. Snapp, Ron Musick, and Terence Critchlow, “Metadata Models for Ad Hoc Queries on Terabyte-Scale Scientific Simulations,” Journal of Brazilian Computer Society, Brazilian Computer Society, Volume 8, Number 1, July 2002, pp 5-15. o Sang Kyun Cha, Kihong Kim, Byung Suk Lee, Changbin Song, Sangyong Hwang, and Yongsik Kwon, “MEADOW: a Middleware for Efficient Access to Multiple Geographic Databases through OpenGIS Wrappers,” Software: Practice & Experience, John Wiley & Sons, Volume 32, Number 4, April 2002, pp. 377-402. o Byung Suk Lee, “OODB Design with EER,” Journal of Object-Oriented Programming, SIGS Publications, March 1996, pp. 61 - 64. o Byung Suk Lee and Gio Wiederhold, “Efficiently Instantiating View-Objects from Remote Relational Databases”, The VLDB Journal, VLDB Endowment, Volume 3, Number 3, July 1994, pp. 289 - 323. o Byung Suk Lee and Gio Wiederhold, “Outer Joins and Filters for Instantiating Objects from Relational Databases through Views”, IEEE Transactions on Knowledge and Data Engineering, IEEE Computer Society, Volume 6, Number 1, February 1994, pp. 108 - 119. o Gio Wiederhold, Peter Rathmann, Thierry Barsalou, Byung Suk Lee, and Dallan Quass, “Partitioning and Composing Knowledge”, Information Systems, Elsevier, Volume 15, Number 1, March 1990, pp. 61-72. o Byung Suk Lee, Hwang Soo Lee, Byung Cheol Shin, and Chong Kwan Un, “Implementation of a Multi-rate Speech Digitizer”, IEEE Transactions on Communications, IEEE Communications Society, June 1983, pp. 775-783.

Journals (editor reviewed) o Byung Suk Lee, “Normalization in OODB Design,” SIGMOD Record, ACM Special Interest Group on Management of Data, Volume 24, Number 3, September 1995, pp. 23 - 27.

Journals (invited) o Byung Suk Lee, “Object-Oriented Databases: Systems and Standards,” in “Object Technology. A Virtual Round Table”, IEEE Computer, October 1995, pp. 64 - 65.

Conferences (peer reviewed) o Yuhang Lin, Byung Suk Lee, and Daniel Lustgarten, Continuous Detection of Abnormal Heartbeats from ECG Using Online Outlier Detection, Proceedings of the 5th International Conference on Information Management and Big Data, Communications in Computer and Information Science, Springer, September 2018. o Prajwal Shrestha, Byung Suk Lee, and James P. Bagrow, Predicting an Effect Event from a New Cause Event Using a Semantic Web Based Abstraction Tree of Past Cause-Effect Event Pairs, Proceedings of the 4thd Annual International Symposium on Information Management and Big Data (SIMBig), Lima, Peru, September 4-6, 2017. o Sang-Pil Kim and Byung Suk Lee, Are You a Compatible User? -- Compatibility of a Microblog User with a News Article, Proceedings of the 5th World Conference on Information Systems and Technologies, Vol.3 pp. 193-204, Porto Santo Island, Madeira, Portugal, April 11-13, 2017 in Volume 571 of the Advances in Intelligent Systems and Computing Series, Springer. rd o Ali Javed and Byung Suk Lee, “Sense-Level Semantic Clustering of Hashtags in Social Media”, Proceedings of the 3 International Symposium on Information Management and Big Data (SIMBig), September 1-3, 2016. o Daehoon Kim, Jae-Gil Lee, and Byung Suk Lee, Topical Influence Modeling via Topic-Level Interests and Interactions on Social Curation Services, Proceedings of the 32nd IEEE International Conference on Data Engineering (ICDE), May 16-20, 2016, Helsinki, Finland., pp. 13-24 o Saurav Acharya and Byung Suk Lee, “Fast Causal Network Inference over Event Streams,” Proceedings of the15th International Conference on Data Warehousing and Knowledge Discovery( DaWaK), Prague, Chez Republic, August 26-29, 2013. o Ahmed Abdeen Hamed, Byung Suk Lee, and Anne Thessen, “Ecosystems Monitoring: An Information Extraction and Event Processing Scientific Workflow,” Proceedings of the IEEE 6th World Congress on Services (SERVICES): IEEE 4th International Workshop on Scientific Workflows (SWF), Miami, Florida, USA, July 5, 2010. o Mohammed Al-Kateb and Byung Suk Lee, "Stratified Reservoir Sampling over Heterogeneous Data Streams," Proceedings of the 22nd International Conference on Scientific and Statistical Database Management (SSDBM), Heidelberg, Germany, June 30-July 2, 2010. o Gyejeong Kim, Kyu-Young Whang, Min-Soo Kim, Byung Suk Lee, Hyo-Sang Lim, and Ki-Hoon Lee, “Incremental Clustering Crawler for Community-Limited Search,” Proceedings of the 2nd International Conference on the Applications of Digital Information and Web Technologies (ICADIWT), August 4-6, 2009, London, United Kingdom. o Tri Tran and Byung Suk Lee, “Transformation of Continuous Aggregation Join Queries over Data Streams,” Proceedings of the 10th International Symposium on Spatial and Temporal Databases (SSTD07) (Lecture Note in Computer Science, Volume 4605), July 16-18, 2007, Boston, MA, U.S.A. pp. 330-347. o Tri Tran, Byung Suk Lee, and Matthew Bovee, "Why Not Semijoins for Streams, When Distributed?", Proceedings of the Second International Conference on Digital Telecommunications (ICDT), July 1-6, 2007, Silicon Valley, U.S.A., Number 27 (CD). (Best paper award). o Mohammed Al-Kateb, Byung Suk Lee, X. Sean Wang, “Adaptive-Size Reservoir Sampling over Data Streams,” Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), Banff, Canada, July 9-11, 2007, Number 22 (CD). o Mohammed Al-Kateb, Byung Suk Lee, X. Sean Wang, “Reservoir Sampling over Memory-Limited Stream Joins,” Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM), Banff, Canada, July 9-11, 2007, Number 23 (CD). o Young-Ho Park, Kyu-Young Whang, Byung Suk Lee, and Wook-Shin Han, “Evaluation of Partial Match Queries for XML Documents Using Information Retrieval Techniques,” Proceedings of the 10th International Conference on Database Systems for Advanced Applications (DASFAA) (Lecture Notes in Computer Science, Volume 3453), Beijing, China, April 17-20, 2005, pp. 95-112. o Wook-Shin Han, Ki-Hoon Lee, Byung Suk Lee, Won-Sik Kim, “An XML Storage System Using Object-Relational DBMSs: Syntax, Semantics, and Implementation,” Proceedings of the International Conference on Information and Knowledge Engineering (IKE), Las Vegas, Nevada, U.S.A., June 21-24, 2004, pp. 358-361. o Zhen He, Byung Suk Lee, and Robert R. Snapp, “Self-tuning UDF Cost Modeling Using the Memory Limited Quadtree,” Proceedings of the 9th International Conference on Extending Database Technology (EDBT), Heraklion, Crete, Greece, March 14-18, 2004, pp 513-531. o R. Kamimura, G. Abdulla, C. Baldwin, T. Critchlow, B. Lee, I. Lozares, R. Musick, and N. Tang, “Use of Numerical Models as Data Proxies for Approximate Ad-Hoc Query Processing,” Proceedings of the 7th Joint Conference on Information Sciences (JCIS), Research Triangle Park, NC, U.S.A.., September 26 – 30, 2003, pp. 588-593. o Byung S. Lee, “Preventing Cache Overflows in an Object-Oriented Database Management System with the Object- Descriptor Architecture,” Proceedings of the 7th Joint Conference on Information Sciences (JCIS), Research Triangle Park, NC, U.S.A., September 26 – 30, 2003, pp. 410-413. o Sangyong Hwang, Keunjoo Kwon, Sang K. Cha, and Byung Suk Lee, “Performance Evaluation of Main-Memory R-tree Variants,” Proceedings of the 8th International Symposium on Spatial and Temporal Databases (SSTD), Santorini island, Greece, July 24-27, 2003, pp. 10–27. o Jeong-Joon Lee, Kyu-Young Whang, Byung Suk Lee, and Ji-Woong Chang, “An Update Risk-Based TTL Estimation in Web Caching,” Proceedings of the 3rd ACM International Conference on Web Information Systems Engineering (WISE), Singapore, December 12-14, 2002, pp. 21-29. o Won-Young Kim, Kyu-Young Whang, Byung Suk Lee, Young-Koo Lee, and Ji-Woong Chang, “Partial Rollback in Object-Oriented/Object-Relational Database Management Systems,” Proceedings of the 11th International Conference on Information and Knowledge Management (CIKM), McLean, VA, U.S.A., November 4-9, 2002, pp. 316-323. o Li Chen, Guoliang Xue, and Byung Suk Lee, “A Delay-Scaling Multicast Algorithm with Multiple QoS Criteria,” Proceedings of the 6th International Conference on Computer Science and Informatics (CSI), Durham, NC, U.S.A., March 8 - 14, 2002, pp. 319-323. o Byung Suk Lee, Robert R. Snapp, Ron Musick, and T. Critchlow, “Ad hoc Query Support for Very Large Scientific Data: the Metadata Approach,” Proceedings of the 16th Brazilian Symposium on Databases (SBBD), Rio de Janeiro, Brazil, October 1-3, 2001, pp.199-212. (An extended version published in Journal of Brazilian Computer Society.) o Ghaleb Abdulla, Chuck Baldwin, Terence Critchlow, Roy Kamimura, Byung Suk Lee, Ida Lozares, Ron Musick, Robert Snapp, and Nu Ai Tang, “Approximate Ad-hoc Query Engine for Simulation Data,” Proceedings of the 1st ACM+IEEE Joint Conference on Digital Libraries (JCDL), Roanoke, VA, USA, June, 2001, pp. 255-256. o Byung Suk Lee, Robert Snapp, and Ron Musick, “Toward a Query Language on Simulation Mesh Data: an Object- Oriented Approach,” Proceedings of the 7th International Conference on Database Systems for Advanced Applications (DASFAA), Hong Kong, April 18-20, 2001, pp. 242-249. o Kwok Y. Yu, Byung Suk Lee, and Michael R. Olson, “The Scalability of an Object Descriptor Architecture OODBMS,” Proceedings of the 3rd International Database Engineering and Application Symposium (IDEAS), August 2-4, 1999, Montreal, Canada, pp. 370-379. th o Michael R. Olson and Byung S. Lee, “Object Databases for SGML Document Management,” Proceedings of the 30 Hawaiian International Conference on Systems Sciences (HICSS), Volume III, Maui, HI, U.S.A., January 7-11, 1997, pp. 39-48. th o Byung Suk Lee, Witold Litwin, and Gio Wiederhold, “Implicit Joins in the Structural Data Model,” Proceedings of the 15 International Computer Software and Application Conference (COMPSAC), Tokyo, Japan, September 1991, pp. 357- 364.

Conferences (editor reviewed) o X. Sean Wang, Byung Suk Lee, and Firooz Sadjadi, “AKSED: Adaptive Knowledge-Based System for Event Detection Using Collaborative Unmanned Aerial Vehicles,” Proceedings of the SPIE Defense and Security Symposium, Orlando (Kissimmee), Florida U.S.A., April 17-21, 2006. o Gio Wiederhold, Thierry Barsalou, Byung Suk Lee, Niki Siambela, and Walter Sujanski, “Use of Relational Storage and a Semantic Model to Generate Objects: the Penguin Project,” in Database '91: Merging Policy, Standards and Technology, The Armed Forced Communications and Electronics Association, Fairfax, VA, U.S.A., June 1991, pp. 503- 515.

Conference proceedings o Byung Suk Lee., Proceedings of the 2008 International Workshop on Scalable Stream Processing System, SSPS 2008, Nantes, France, March 29, 2008.

PRESENTATIONS o “Sense-Level Semantic Clustering of Hashtags in Social Media”, SIMBig (Skype), September 2, 2016 (with Ali Javed). o “Scalable Validation of Data Streams –Background and Introduction”, Uppsala University, Uppsala, Sweden, August 17, 2016. o “Causality over Event Streams, Data Mining Lab”, KAIST, Daeduk, South Korea, July 15, 2016. o “Scalable Validation of Data Streams – Background and Introduction”, Uppsala University, Uppsala, Sweden, August 17, 2016 (invited) o “Fast Causal Network Inference over Event Streams,” Database and Multimedia Lab, Department of Computer Science, KAIST, Daejon, South Korea, June 17, 2013. o “Data Stream Processing Research,” Computer Science Research Day, University of Vermont, Burlington, Vermont, December 8, 2011. o “Biodiversity Risk Assessment: Event Processing + Information Extraction Approach”, Marine Biology Laboratory, Woods Hole, Massachusetts, February 4, 2010 (with Ahmed Abdeen Hamed). o “Complex Event Processing for Healthcare Applications,” GE Healthcare, South Burlington, Vermont, November 4, 2009. o “DOME Complex Event Processing Engine,” DOME Project Meeting, Department of Computer Science, University of Vermont, Burlington, Vermont, September 4, 2009 (with Sasi Kunta). o “Complex Event Processing for Disease Outbreak Monitoring in Environment,” DOME-CEP Kickoff Meeting, Medical Education Center, University of Vermont, Burlington, Vermont, August 6, 2009. o “QoS-Driven Aggregation in a Sensor Network,” US-Korea Conference on Science and Technology, San Jose, California, August 15, 2008. o “Indexing in Wireless Sensor Networks,” Sensor Networks Work Study Group Seminar, University of Vermont, Burlington, Vermont, October 30, 2007 (with Mohammed Al-Kateb). o “Recent Research in Database Systems: Aggregation join query transformation over data streams, Adaptive-size reservoir sampling over data stream, and Distributed data stream join query optimization,” Computer Science Research Day, University of Vermont, Burlington, Vermont, August 23, 2007. o “Transformation of Continuous Aggregation Join Queries over Data Streams,” International Symposium on Spatial and Temporal Databases, Boston, Massachusetts, July 17, 2007 (with Tri Tran). o “QoS Driven Aggregation in a Sensor Network,” Computer Science Seminar Series, Department of Computer Science, University of Vermont, Burlington, Vermont, October 16, 2006. o “QoS-Driven Aggregation in a Sensor Network,” Advanced Information Technology Research Center, Korea Advanced Institute of Science and Technology, Yuseong, South Korea, July 24, 2006. o “Continuous Aggregation Join Queries Over Data Streams,” Global Convention of Scientists and Engineers (GCSE 2006), Convention and Exhibition Center (COEX), Seoul, South Korea, July 19, 2006 (poster). o “AKSED: Adaptive Knowledge-Based System for Event Detection Using Collaborative Unmanned Aerial Vehicles,” Defense and Security Symposium, Orlando (Kissimmee), Florida, April 19, 2006. o “Efficient Processing of Window-Based Aggregation Join Queries over Data Streams using Query Transformations,” Vermont EPSCoR Conference, Burlington, Vermont, August 15, 2005 (poster with Tri Tran). o “Aggregation in Sensor Networks Driven by a User-Provided Combined Objective of Lifetime and Error,” Sensor Networks Work Study Group Seminar, University of Vermont, Burlington, Vermont, January 27, 2005 (with X. Sean Wang). o “Boolean Text Query Optimization,” Department of Computer Science, University of Vermont, Burlington, Vermont, March 28th, 2005. o “Context-Aware Multimedia Content Management in an Embedded Database System: Some Preliminary Thoughts,” Samsung Advanced Institute of Technology, Kiheung, South Korea, December 11, 2004. o “A Top-Down Approach for Density-Based Clustering Using Multidimensional Indexes,” Department of Computer Science, University of Vermont, Burlington, Vermont, January 24, 2003 o “Preventing Cache Overflows in an OODBMS with the Object-Descriptor Architecture,” the 7th International Conference on Computer Science and Informatics, Cary, North Carolina, September 28, 2003. o “Modeling the Execution Cost of a User Defined Function,” DoE EPSCoR Conference, Albuquerque, New Mexico, June 3, 2003. o “The Framework of AQSim,” Science Data Centers Symposium, University of Maryland, College Park, Maryland, May 30, 2003. o “An Update-Risk Based Approach to TTL Estimation in Web Caching,” Department of Computer Science, University of Vermont, Burlington, Vermont, January 24, 2003. o “Modeling and Querying Scientific Simulation Mesh Data,” NewDB Workshop, Advanced Information Technology Research Center, Korea Advanced Institute of Science and Technology, Daeduk, South Korea, June 29, 2001. o “AQSim Framework,” Advanced Information Technology Research Center, Korea Advanced Institute of Science and Technology, Daeduk, South Korea, June 22, 2001. o “Modeling and Querying Scientific Simulation Mesh Data,” (abstract) DOE-NSF Joint Workshop, Brookhaven National Laboratory, Long Island, New York, May 30-31, 2001. o “Toward a Query Language on Simulation Mesh Data: an Object-Oriented Approach,” the 7th International Conference on Database Systems for Advanced Applications, Hong Kong, April 19, 2001. o “The Scalability of an Object Descriptor Architecture OODBMS,” the 3rd International Database Engineering and Application Symposium, Montreal, Canada, August 4, 1999. o “The Scalability of an Object Descriptor Architecture OODBMS,” Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, California, May 19, 1999. o “OODB Design with EER,” Software Engineering Group, Guidant Corporation, St. Paul, Minnesota, November 24, 1998. o “Object Databases for SGML Document Management,” the 30th Hawaiian International Conference on Systems Sciences, Maui, Hawaii, January 9, 1997. o “Databases: Objects and Relations,” Today’s Business Software Challenge Seminar Series, School of Business, University of St. Thomas, St. Paul, Minnesota, April 10, 1995. o “Integrating Databases with Object-Oriented Programs,” Quantitative Method and Computer Science Panel Discussion on Object-Oriented Technology, Computer Science Department, University of St. Thomas, St. Paul, Minnesota, October 19, 1994 o “Outer Joins and Filters for Instantiating Objects from Relational Databases through Views,” Computer Science Colloquium, University of Minnesota, Minneapolis, Minnesota, September 26, 1994. o “Object-Oriented Databases in the Client-Server Architecture,” Korean Scientists and Engineers Association Lecture, University of Minnesota, Minneapolis, Minnesota, March 28, 1994. o “A Telephony Benchmark Suite for OODBMSs,” Computer Science Department Seminar, Rensselaer Polytechnic Institute, Troy, New York, June 23 1992. o “Implicit Joins in the Structural Data Model,” the 15th International Computer Software and Application Conference, Tokyo, Japan, September 13, 1991. o “Efficiency in Instantiating Objects from Relational Databases through Views,” Center for Integrated Facility Engineering Industrial Forum, Stanford University, Palo Alto, California, November 12, 1989. o “KSYS: the Knowledge System,” Stanford Computer Forum, Stanford University, Palo Alto, California, October 1, 1988. o “WATCH: Learning by Apprentice,” Knowledge Systems Lab Retreat, Lagunita, California, August 27, 1987.

GRANTS/CONTRACTS Extramural o Radio Frequency Identification Technology for Transportation Signage Management. Vermont State Transportation Agency: September 1, 2018 – February 29, 2020. Co-Principal Investigator. Principal Investigator: Tian Xia. ($87,165) o Automation of Goodrich Health and Usage Monitoring System. Goodrich Corporation, Vergennes, Vermont: June 9, 2008 – May 14, 2010. Principal Investigator. ($117,105 total in six new or renewed contracts for supporting one of my PhD students). o A Framework for Optimal Approximate Query Evaluation Based on Workload Forecasting. National Science Foundation (Program 04-500 Information and Data Management): August 1, 2004 – July 31, 2007 (no cost extension until July 31, 2009). Principal Investigator. Co-Investigators: Zhen He and X. Sean Wang. ($480,000) o Generating Cost Models of AQSim Functions. Department of Energy (Program 00-14 EPSCoR National DOE Lab. Partnership): January 1, 2002 – December 31, 2004 (no cost extension until December 31, 2006). Principal Investigator. Co-Investigators: Jeff Buzas and Robert Snapp. ($439,917)

Intramural (including Vermont EPSCoR) o Energy-Efficient Data Storage and Retrieval in a Large-Scale Wireless Sensor Network, Vermont EPSCoR (Graduate Research Assistantship and Pilot Project), January 1, 2008 – June 30, 2008. Principal Investigator. ($15,000 for GRA and $16,537 for Pilot Project) o GPS Data Processing with a Multifaceted Spatiotemporal Hierarchy, Vermont EPSCoR Innovation Fund, January 1, 2008 – June 30, 2008. Principal Investigator. ($10,000) o Sensor Systems Research. Vermont Advanced Computing Center of the University of Vermont (Planning Grant): September 1, 2005 – May 31, 2006. Co-Investigator. Principal Investigator: X. Sean Wang. Other co-investigators: Jeff Frolik, Dryver Huston, Yuichi, Donna Rizzo. ($5,000) o Cost Modeling of User-Defined Functions for an Object-Relational Database Management System Query Optimizer. Vermont EPSCoR (Graduate Research Assistantship): June 1, 2004 – March 31, 2006. Principal Investigator. GRA: Tri Tran. ($50,000) o An Error Band Search of a Multi-Resolution Index Tree. Graduate College of the University of Vermont (Institutional Grant from University Committee on Research and Scholarship): May 1, 2002 – April 30, 2003. Principal Investigator. ($3,950) o A Scalability Testing of Querying Text Documents. Office of the Provost at the University of Vermont (SUGR/FAME): January 1, 2002 – December 31, 2002. Graduate Faculty Mentor. Student: Jiangyan He. ($2,400 + $1,000 student scholarship) o Multi-level, Multi-dimensional Clustering Approach to Efficient Data Retrieval from Large Scientific Mesh Data. Graduate College of the University of Vermont (Institutional Grant from University Committee on Research and Scholarship): January 1, 2000 – December 31, 2000. Principal Investigator. ($3,600) o Object-oriented Database Management System as an SGML Document Repository. Faculty Development Center of the University of St. Thomas (Research Assistance Grant): June 1, 1996 – August 31, 1996. Principal Investigator. ($5,000) o Development of a Pedagogical Toolset for Engineering Projects. Faculty Development Center of the University of St. Thomas (Teaching Enhancement Grant): June 1, 1995 – August 31, 1995. Principal Investigator. ($5,000)

INTELLECTUAL PROPERTIES o Mohammed Al-Kateb, Byung Suk Lee, and X. Sean Wang, ``Systems and Methods for Reservoir Sampling of Streaming Data and Stream Joins'', US Patent Application No. PCT/US08/63028, Regular, , University of Vermont (granted) (provisional application on May 8, 2007, full application on May 8, 2008; issued as Patent No. 8,392,381 on March 5, 2013). o Byung Suk Lee, Firooz Sadjadi, and X. Sean Wang, ``Multi-Stage Configuration of Situation-Specific Runtime-Agent Code'', US Patent Application No. 11/758,790, Regular, United States, University of Vermont (provisional application on August 8, 2006, full application on June 6, 2007). o Byung Suk Lee and Jongho Lea, "GPS Data Processing with a Multifaceted Spatiotemporal Hierarchy", University of Vermont Docket No. 333 (filed on December 27, 2007). o Ron Musick, Terence Critchlow, Kevin Durrengerger, Roy Kamimura, Ida Lozares, Deborah Walker, Byung Lee, and Robert Snapp, ``Fast Approximate Ad Hoc Query Support for Large-Scale Computational Science Data,'' Department of Energy Patent Docket No. S-95370, Lawrence Livermore National Laboratory Docket No. IL-10724 (filed on August 28, 2000).

UNIVERSITY SERVICES

Department o Computer Science Graduate Committee. September 1, 2008 – present. Role: chair and program coordinator until August 31, 2010, member from September 1, 2010 – August 31, 2014, chair and program coordinator from September 1, 2014 – May 31, 2017. o Computer Science Seminar Series. September 1. 2002 – August 31, 2008 and September 1, 2009 – August 31, 2012. Role: coordinator o Computer Science Curriculum Committee. September 1, 1999 – present. Role: member. o Computer Science Subcommittee on Intelligent Systems. April 1, 2008 - September 30, 2008. Role: member. o Comprehensive Written/Oral Examination Committees. May 2003 – present. Role: chair or member. o Computer Science teaching load survey. February 1, 2007 – February 15, 2007. Role: member. rd th o Computer Science Research Day, 3 (2004), 12 (2014). Role: organizer. o Faculty Search Committees. Fall 2002 – spring 2003, fall 1999 – spring 2000, and fall 2015 – spring 2016. Role: member. o Knowledge and Data Engineering Group coordination. Fall 2002 – fall 2004. Role: group member.

Colleges o Advisory Committee for Strategic Directions, College of Engineering and Mathematical Sciences, February 2016 – present. Role: member. o Faculty Standards Committee, College of Engineering and Mathematical Sciences. September 1, 2011 – August 31, 2014. Role: member. o Graduate College Program Coordination. September 1, 2008 – August 31, 2010, September 1, 2014 – present. Role: Computer Science program coordinator. o Ad-hoc Committee on College Bylaws, College of Engineering and Mathematical Sciences. Spring 2006. Role: member.

PROFESSIONAL SERVICES Journal Editorial Boards o Knowledge and Information Systems, Associate Editor, October 2008 – December 2014. o Journal of Computing Science and Engineering; Associate Editor, August, 2007 – present. o International Journal for Infonomics; Associate Editor, December 2003 – present.

Conferences rd o Senior Program Committee, 23 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2019) nd o Senior Program Committee, 22 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2018) rd o Program Committee, 33 Symposium on Applied Computing (SAC 2018). o Program Committee, 30th International Florida Society Conference (FLAIRS 2017) st o Senior Program Committee, 21 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2017) st o Program Committee, 32 Symposium on Applied Computing (SAC 2017). st o Program Committee, 31 Symposium on Applied Computing (SAC 2016). th o Program Committee, 30 Symposium on Applied Computing (SAC 2015). th o Program Committee, 29 Symposium on Applied Computing (SAC 2014). th o Program committee, 16 Asia Pacific Web Conference (APWeb 2014). th o Program Committee, 28 Symposium on Applied Computing (SAC 2013). th o Program committee, 15 Asia Pacific Web Conference (APWeb 2013). th o Program Committee, 27 Symposium on Applied Computing (SAC 2012). rd o Program Committee, 3 International Conference on Emerging Databases (EDB 2011). th o Program Committee, 26 Symposium on Applied Computing (SAC 2011). nd o Program committee, 2 International Conference on Emerging Databases (EDB 2010). th o Program committee, 19 International Conference on Information and Knowledge Management (CIKM 2010). th o Program committee, 12 Asia Pacific Web Conference (APWeb 2010). rd o Program committee, 26 International Conference on Data Engineering (ICDE 2010). rd o Program committee, 3 Workshop on Scalable Stream Processing Systems (SSPS 2009). nd o Organizer and Program Chair, 2 Workshop on Scalable Stream Processing Systems (SSPS 2008). rd o Program committee, 23 International Conference on Data Engineering (ICDE 2007). nd o Demo program committee, 32 International Conference on Very Large Data Bases (VLDB 2006) th o Program committee, 8 International Conference on Computer Science and Informatics (CSI 2005). th o Session organizer and chair, special session on predictive modeling in database and data mining, 8 International Conference on Computer Science and Informatics (CSI 2005). th o Program committee, 9 International Conference on Database Systems for Advanced Applications (DASFAA 2004). th o Publicity chair, 9 International Conference on Database Systems for Advanced Applications (DASFAA 2004). th o Session organizer and chair, special session on predictive modeling technology, 7 International Conference on Computer Science and Informatics (CSI 2003). st o Program committee, 21 International Conference on Conceptual Modeling (ER 2003). th o Program committee, 8 International Conference on Database Systems for Advanced Applications (DASFAA 2003). th o Publicity chair, 7 Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2003). th o Program committee, 19 International Conference on Data Engineering (ICDE 2003). th o Program committee, 11 International Conference on Information and Knowledge Management (CIKM 2002). th o Program committee, 20 International Conference on Conceptual Modeling (ER 2001). Funding Agencies o Panelist, NSF IIS proposal review panel, National Science Foundation, Washington, D.C., USA, June 2015. o Panelist, NSF CAREER proposal review panel, National Science Foundation, Washington, D.C., USA, November 2005. o Panelist, NSF ITR proposal review panel, Washington, D.C., USA, May 2003. o Panelist, DoE/NSF EPSCoR Conference panel on DoE EPSCoR Cool Science, Success Stories and Lessons Learned, Alberqueque, NM, USA, June 2-5, 2003.