Shounak Roychowdhury, Ph.D. 10213 Prism Dr., Austin, TX, 78726 || 650-504-8365 || email: [email protected]

Profile • Software development and research experience at Oracle and LG Electronics. • Deep understanding of data science methods: machine learning; probability and statistics. • 5 US patents and 40+ peer reviewed publications in international conferences and top refereed journals

Research Interests • Published research papers on computational intelligence, neural networks and fuzzy theory, numerical optimization, and natural language processing, and information theory.

Education • Ph.D. (Computer Engineering), University of Texas at Austin, Austin, TX, (Dec. 2013) o Dissertation: A Mixed Approach to Spectrum-based Fault Localization Using Information Theoretic Foundations. (Machine Learning in Software Engineering) • M.S. (Computer Science), University of Tulsa, Tulsa, OK, (May 1997) o Thesis: Encoding and Decoding of Fuzzy Rules

Patents • Chaos washing systems and a method of washing thereof (US Patent #5,560,230) • System and method for generating fuzzy decision trees (US Patent #7,197,504) • Method for extracting association rules from transactions in a database (U.S. Patent # 7,370,033) • Expediting K-means cluster analysis data mining using subsample elimination preprocessing (U.S. Patent # 8,229,876) • Bayes-like classifier with fuzzy likelihood (U.S. Patent # 8,229,875)

Computer Languages • Python, , C/C++, MATLAB, R, SQL, PL/SQL, Perl, Ruby, Tcl/Tk

Teaching Experience Adjunct Faculty Texas State University 2017- Present Professional Experience Hewlett Packard Enterprise, Austin, TX (Oct 2018 - present) Expert Technologist

• Executed software development processes for composable rack team of HPE’s OneView cloud management system. • Developed a Python-based system to test the scalability of OneView connections across multiple layers of Plexxi switches. • Scripted Python codes to analyze log files of OneView using granular representation of Bag-of-Words technique.

Oracle Corporation, Redwood Shores, CA and Austin, TX (Mar 1997-Sep 2018) Principal Member of Technical Staff (Oracle Database and Migration Tools) (2009-present)

[Database Cloud Services] • Amazon Redshift data migration to Oracle DataWarehouse Cloud • Integration of RUEI and Oracle Enterprise Manager (OEM) in the context of Database Schema Cloud Services. • Designed and coded Oracle VM statistics collection and reporting system with RESTful APIs (Jersey 2.0 and Apache Http) and Apex (Oracle Restful Services), 1

• Designed and coded orasql plugins in JRuby for Logstash, Elasticsearch and Kibana (ELK) for log analytics of Oracle Cloud. • Automated log analyzer of event logs, syslogs; and patent proposal of security mechanism for log data. • Analysis of database audit logs, correlation analysis of database instance metric logs. [Database Tools] Designed and implemented the following features for 4 major releases of SQL Developer (2.5, 3.0, 3.1, 4.0, 4.1) • Data masking and redaction feature of Oracle 12c R1. The feature was very well recognized at Oracle Open World-2013. • Coded and designed pan based MapViewer using Oracle DB’s Spatial data capabilities (Oracle 11g R1) for geospatial users, similar to Google Maps. • Coded Oracle Scheduler feature (Oracle 11g R1 and Oracle 11g R2), replacement of DBMS Jobs feature. Implemented in Java 5.0 Swing. • Enhanced and fixed bugs of the several command line features of SQLcl.

[Machine Learning Research] • Design and implemented data clustering algorithms for Oracle Spatial data. Improved K-means clustering algorithm by improving the discovery of better initialization points through sub-sampling algorithm. US patents on the algorithm. • Improved on the existing classifier design for spatial data clustering. • Design of optimal clusters using neighborhood graphs (MATLAB). • Design of uncertain-weighted clusters using association rules from large data sets (MATLAB) • Designed feature-based clustering technique using projective maximal stochastic information (MATLAB) • Designed feature removal algorithm using hierarchical granularity of proximal data. • Proposed of collaborative co-clustering using granular models. • Simulation of drift of regression model in long term datasets. • Word sense disambiguation algorithm using web sources for natural language processing • Extraction of word clusters based on word frequency and phrase frequency from textual data • Proposal of private key generation from multiple public keys using compressed private key index cluster

Senior Member of Technical Staff (Oracle Data Warehousing Group) (2001-2008) Worked on major features of 8 releases of Oracle Warehouse Builder (OWB) • Designed several functional and design specifications for dimension modeling of different versions and subcomponents (e.g. Tables, Views, Materialized Views, Dimension, Time Dimension, Cube, Maps, Models, Validation and Compilation units) of OWB till OWB11gR1 (2008). • Implemented OWB’s Dimensional API layers for Star and Snowflake Schema in Java/XML and Swing UI components (dimension, Time dimension, cube, mappings, models, and validation and compilation units). • Developed dimensional and relational portions of command line scripting language called OMB+ (in Tcl/Tk). • Designed an extensible OMB+ testing framework (Tcl/Tk) and Html. Over the years it became the main testing framework to test OWB infrastructure at the API level. • Implemented entire workflow of the time dimension; from the design and implementation of the related UI wizards (Java) to time data generators (PL/SQL) for fiscal and calendar years. Maintained generators for time data and its migration across releases. • Performance evaluation of Rational ROSE model changes for various releases and evaluate its effects on the performance of the new APIs. Also minimized the rate of JDBC calls and SQL statements to the metadata repository. • Improved persistence of Java objects in OWB metadata repository using Oracle Toplink.

Member of Technical Staff (Oracle Tools Group) (1997-2001) • Designed and developed Oracle 8 features for Query Builder/Browser of Developer/2000; Redesigned Query Builder Parser for Object-Oriented capabilities of Oracle 8. Implemented parser in LEX/YACC using C. Proposed computational methods for designing fuzzy queries. • Developed test plans for backend testing as well as front-end testing of Query Builder. Built and maintained bug fixes and stage of Browser/Query Builder for different releases. Fixed Query Builder/Browser backend bugs; Developed regression test suite for Win16 and enhancements of Win32; Developed Query Builder's regression system (OBTEST); Enhanced BENT (internal win32-based Version Control system)

2

• Developed a CGI-bin based Load Balancer for Oracle Forms and Oracle Application Web Server 2.0.4.

University of Tulsa, Tulsa, OK (Jan 1995 – Feb 1997) Graduate Research Assistant : Department of Computer Sciences • Applications of fuzzy sets and rough sets in database mining on relational databases. Involved in design and development of rules using principles of data clustering, rule induction, rule clustering and knowledge compression; derived heuristic knowledge in form associative induced fuzzy rules. • Data security and privacy for mined rules from databases.

LG Electronics Inc. Central Research Laboratory, Seoul, South Korea (Jan 1992 – Dec 1994) Research Engineer [Neural Networks, Fuzzy Systems, and Chaos Group] • Proposed new theoretical foundations on triangular-operators for fuzzy systems; developed new defuzzification methodologies for fuzzy controllers; worked on the theoretical development • Applications of Hebbian learning, supervised and competitive learning, radial basis fuzzy networks in consumer products like washing machines, air-conditioners, intelligent machines and embedded software designed and implemented a MS- Windows (Windows 3.1/SDK) and XMotif/Xview (UNIX) software for fuzzy control simulator. • Designed for Chaos Washing Machine whose prototype was shown at IFSA conference 1994. Korean TV Broadcasting Station (KBS1) aired my work on fuzzy systems at LG research center in May 1994. [Image Processing and Compression Group] • Researched holographic image processing and formation of moiré images (optical and holographic) for non-destructive surface regeneration and reconstruction with phase-shift methods; image generation/image reconstruction, image filters, edge detection and image enhancement; image compressions with RLE and DCT were also developed; developed and implemented image processing software packages (called analyzers) for filtering, compressing, decompressing, and segmentation (using FFT & DCT) of speckled/moiré images produced by holograms and optical systems respectively in Borland-C on DOS system (GoldStar 486).

TCIL-BellSouth Ltd. New Delhi, India (Aug 1991 – Dec 1991) Software Engineer • Developed a part of the GUI front-end for large telecom software using Intergraph's graphic language.

Indian Institute of Science, Bangalore, India, (Aug 1990 – May 1991) Research Assistant • Development of a simulation packages of stochastic Petri nets and Markov models.

Publications Book Chapters 1. S. Roychowdhury and J. Ghosh, “Robust Laplacian Eigenmaps Using Global Information, Manifold Learning Theory and Applications, Editor(s): Yunqian Ma, Yun Fu, CRC Publication, 2011. 2. S. Roychowdhury, “A survey on the theory of defuzzification,” Handbook of Granular Computing, Eds. W. Pedrycz, A. Skowron, and V. Kreinovich, 2006. 3. S. Roychowdhury, “An inquiry into the theory of defuzzification,” Granular Computing: an emerging paradigm, Studies in Fuzziness and Soft Computing, Physica-Verlag, Heidelberg, Germany, 2001

Refereed Journal Publications

1. S. Roychowdhury and W. Pedrycz, “Automatic Discovery of clusters by removing Noisy Data,” International Journal of Intelligent Systems, Vol 33, No. 9, 2018. 2. S. Roychowdhury, “Finding Association rules though lexicographic order computation,” Integrated computer-aided Engineering, Sept. 2002. 3. S. Roychowdhury and W. Pedrycz, “An alternative characterization of the fuzzy negation,” Soft Computing, May 2002.

3

4. S. Roychowdhury, and W. Pedrycz, "Modeling temporal functions with granular regression and fuzzy rules," Fuzzy Sets and Systems, 126, 2002, pp. 377-387, 2002 5. S. Roychowdhury and W. Pedrycz, “A survey of defuzzification techniques,” International Journal of Intelligent Systems, 16, 6, pp. 679-695, 2001 6. S. Roychowdhury and B. H. Wang, "Inconsistency in fuzzy rulebase: measure and optimization,” Int. J. of Fuzzy Logic and Intelligent Systems, vol. 1, no.1, pp. 75-80, June 2001. 7. S. Roychowdhury and B.H. Wang, “Connective generators for Archimedian triangular operators,” Fuzzy Sets and Systems, 94, pp. 367-384, 1998. 8. S. Sen, N. Arora and S. Roychowdhury, “Using limited information to enhance group stability,” Int. J. Human- Computer Studies, 48, pp. 69-82. 1998. 9. S. Roychowdhury, “New Triangular Operator Generators for Fuzzy Systems,” IEEE Transactions on Fuzzy Systems, Vol. 5, No. 2, pp. 189-198, May 1997. 10. S. Roychowdhury and B. H. Wang, “On generalized Hamacher families of triangular operators,” International Journal of Approximate Reasoning, 19, pp. 419-439, 1999. 11. S. Roychowdhury, B.H. Wang and S.K Ahn, “The radial defuzzification,” Int. J. General Systems, 28 (2-3), pp. 201-205, 1999. 12. S. Roychowdhury and B. H. Wang, “Cooperative neighbors in defuzzification,” Fuzzy Sets and Systems, 78, pp. 37-49, 1996. 13. S. Roychowdhury and B. H. Wang, “Composite generalization of Dombi class and a new family of T-operators using additive- product connective generator,” Fuzzy Sets and Systems, 66, pp. 329-346, 1994.

Refereed Conference Proceeding Publications 1. S. Roychowdhury and S. Khurshid, "Software fault localization using feature selection," In Proceedings of the International Workshop on Machine Learning Technologies in Software Engineering}, MALETS '11, pages 11--18, ACM, New York, NY, USA, 2011. 2. S. Roychowdhury and S. Khurshid, "A family of generalized entropies and its application to software fault localization," In Proceedings of IEEE International Conference on Intelligent Systems, IS '12, pages 368--373, 2012. 3. S. Roychowdhury, "Ensemble of feature selectors for software fault localization", In Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, SMC 2012, Seoul, Korea (South), October 14-17, pages 1351--1356, 2012. 4. S. Roychowdhury and S. Khurshid., "Localization of faults in software programs using Bernoulli divergences," In Proceedings of 2012 International Symposium of Information Theory and its Applications, ISITA 2012, pages 586--590, 2012. 5. S. Roychowdhury and S. Khurshid, “A novel framework for locating software faults using latent divergences," In Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part III, ECML PKDD'11, pages 49--64, Springer-Verlag, Berlin, Heidelberg, 2011. 6. S. Roychowdhury and J. Ghosh, “Robust Laplacian Eigenmaps Using Global Information, AAAI Publications, 2009 AAAI Fall Symposium Series. 7. S. Roychowdhury, “Bayes-like classifier using fuzzy likelihood,” Proceedings of the 2006 World Congress on Computational Intelligence, Vancouver, Canada, July, 2006. 8. S. Roychowdhury and W. Pedrycz, “Linguistic Association Rules,” Proceedings of the Ninth International Fuzzy Systems Association World Congress-NAFIPS, Vancouver, Canada, 2001 9. S. Roychowdhury, “Feature subset selection using granular information,” Proceedings of the Eighth International Fuzzy Systems Association World Congress-NAFIPS, Vancouver, Canada, 2001 10. T. A. Runkler and S. Roychowdhury, “Generating Decision Trees and Membership Functions by Fuzzy Clustering,” 7th European Congress on Intelligent Techniques & Soft Computing, Aachen Germany, September 13-16, 1999. 11. S Roychowdhury, “Computing with Granular Numbers,” Proceedings of the Eighth International Fuzzy Systems Association World Congress, Taipei, Taiwan, August 17 – 20, 1999. 12. S Roychowdhury, “Fuzzy Curve Fitting Using Least Square Principles,” Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, San Diego, California, October 11 – 14, 1998. 13. S Roychowdhury, Measuring Inconsistency in Fuzzy Rules, Proceedings of the IEEE International Conference on Fuzzy Systems Proceedings, Anchorage, Alaska, May 4 – 9, 1998. 14. H Cho, B. H. Wang and S Roychowdhury, Automatic Rule Generation for Fuzzy Controllers Using Genetic Algorithms: A Study on Representation Scheme and Mutation Rate, proceedings of the IEEE International Conference on fuzzy systems, pp. 1290-1295, 1998 15. S Roychowdhury and S. Shenoi, Fuzzy Rule Encoding Techniques, Proceedings of the Sixth IEEE International Conference on Fuzzy Systems, Volume II, Barcelona, Spain, July 1-5, 1997.

4

16. S Roychowdhury and S. Shenoi, "Compiling Fuzzy Rules for Knowledge Discovery'' Third Joint Conference on Information Sciences (Fifth International Conference on Fuzzy Theory & Technology) March 2-5, 1997 17. S Roychowdhury, N. Arora and S. Sen, Effects of local information on group behavior, Proceedings of the 13th National Conference on Artificial Intelligence and the 8th Innovative Applications of Artificial Intelligence Conference, 1996. 18. S Roychowdhury, N. Arora and S. Sen, Effects of local information on group behavior, 1996 AAAI Symposium, Stanford, California March 25-27, 1996. 19. S Roychowdhury and B.H. Wang, Hierarchical Cooperation of Neighbors in Defuzzification, Proceedings of 1995 IEEE International Conference on Fuzzy Systems, Yokohama, Japan, March 20 – 24, 1995. 20. S Roychowdhury, B.H. Wang and S.K. Ahn, “Radial Defuzzification Method”, Proceedings of IEEE Conference on fuzzy systems, pp. 1153-1158, 1994. 21. S Roychowdhury and B.H. Wang, “Another Class of Fuzzy Connectives in Fuzzy Neural Networks,” Proceedings of 1993 International Joint Conference on Neural Networks. 1993 22. S Roychowdhury and B.H. Wang, “The T-operators that model synergistic effect,” Proceedings of the 3rd Joint Conference & Exhibition on Artificial Intelligence, Neural Networks and Fuzzy Systems. 1993

Technical Report 1. S Roychowdhury, Y. Narahari, “Modeling of Stochastic Petri Nets,” Technical Report, Department of Computer Science and Automation, I.I.Sc., 1991

Professional Reviewer • IEEE Transactions of Fuzzy Systems (1996 -- current) • Journal of Fuzzy Sets and Systems (1994 – current) • Information Sciences (2002 – current) • Journal of Soft Computing (2015) • IEEE Transactions on Systems, Man and Cybernetics Part B: Cybernetics (1998 - current) • IEEE Transactions of Neural Networks (1996,1997) • IEEE Transactions of Control Systems Technology (1997-2000) • ASE 2011 • FUZZ-IEEE 2000, San Antonio, TX. • Advisory Member, International Natural Artificial System Organization, Canada, 1998-1999. • IEEE International Conference on Systems, Man and Cybernetics, San Diego, 1998. • AAAI Spring Symposium, Stanford University, 1996 • ICML 2006-2009 • ICONIP, 1994, Seoul, Korea.

Miscellany 1. Korean TV Broadcasting Station (KBS1) aired my work on fuzzy systems at LG Research Center, Seoul, Korea, May, 1994. 2. Best Session Presentation Award at “World Congress on Computational Intelligence (WCCI)”, Vancouver, Canada, July, 2006. 3. Research consultancy on mining Silicon Wafer yield data to SEMATECH, Austin, TX, 2007 4. Research consultant to a natural language processing (NLP) startup in Foster City, CA, 2000

5