Master Thesis Computer Science Thesis no: MCS-2003:17 June 2003 Computer Forensic Text Analysis with Open Source Software Christian Johansson Department of Software Engineering and Computer Science Blekinge Institute of Technology Box 520 SE – 372 25 Ronneby Sweden This thesis is submitted to the Department of Software Engineering and Computer Science at Blekinge Institute of Technology in partial fulfillment of the requirements for the degree of Master of Science in Computer Science. The thesis is equivalent to 20 weeks of full time studies. Contact Information: Author: Christian Johansson E-mail:
[email protected] University advisor: Bengt Carlsson Department of Software Engineering and Computer Science Department of Internet : www.bth.se/ipd Software Engineering and Computer Science Phone : +46 457 38 50 00 Blekinge Institute of Technology Fax : + 46 457 271 25 Box 520 SE – 372 25 Ronneby ii Sweden ABSTRACT A computer forensic investigation is not only dependent on correct and flawless analysis of the given data, but also of this analysis being conducted within a reasonable amount of time and effort. Given the exponential growth in size of hard disk drives coupled with the fact that many phases in a computer forensic investigation are still performed manually, the compromise between accuracy and completeness is becoming more and more of a problem. This paper concentrates on the text analysis process within computer forensics, focusing on the use of open source software. It discusses and examines the different techniques used in the possible future streamlining of said process. Keywords: forensics, text analysis, natural language Acknowledgments The author would like to thank the following persons and institutions for the help and support they have given prior and during the writing of this thesis.