San Jose State University SJSU ScholarWorks Master's Projects Master's Theses and Graduate Research Fall 2013 Pattern Discovery of Sequential Symbolic Data using Automata with an application to Author Identification Nikhil Kalantri San Jose State University Follow this and additional works at: https://scholarworks.sjsu.edu/etd_projects Part of the Computer Sciences Commons Recommended Citation Kalantri, Nikhil, "Pattern Discovery of Sequential Symbolic Data using Automata with an application to Author Identification" (2013). Master's Projects. 333. DOI: https://doi.org/10.31979/etd.uhdr-ae3z https://scholarworks.sjsu.edu/etd_projects/333 This Master's Project is brought to you for free and open access by the Master's Theses and Graduate Research at SJSU ScholarWorks. It has been accepted for inclusion in Master's Projects by an authorized administrator of SJSU ScholarWorks. For more information, please contact
[email protected]. Pattern Discovery of Sequential Symbolic Data using Automata with an application to Author Identification A Thesis Presented to The Faculty of the Department of Computer Science San José State University In Partial Fulfillment of the Requirements for the Degree Master of Science by Nikhil Kalantri December 2013 © 2013 Nikhil Kalantri ALL RIGHTS RESERVED SAN JOSE STATE UNIVERSITY The Designated Thesis Committee Approves the Thesis Titled Pattern Discovery of Sequential Symbolic Data using Automata with an application to Author Identification by Nikhil Kalantri APPROVED FOR THE DEPARTMENT OF COMPUTER SCIENCE SAN JOSÉ STATE UNIVERSITY December 2013 ________________________________________________________ Dr. T. Y. Lin, Department of Computer Science Date ________________________________________________________ Dr. Chris Tseng, Department of Computer Science Date ________________________________________________________ Mr. Amit Sant, Software Engineer at Apple Date ABSTRACT Author Identification is the process of identifying a piece of text to ascertain if it has an inherent writing style or pattern based on a certain author.