Canterbury Christ Church University’s repository of research outputs http://create.canterbury.ac.uk Copyright © and Moral Rights for this thesis are retained by the author and/or other copyright owners. A copy can be downloaded for personal non-commercial research or study, without prior permission or charge. This thesis cannot be reproduced or quoted extensively from without first obtaining permission in writing from the copyright holder/s. The content must not be changed in any way or sold commercially in any format or medium without the formal permission of the copyright holders. When referring to this work, full bibliographic details including the author, title, awarding institution and date of the thesis must be given e.g. Day, Ed (2018) The application of machine learning, big data techniques, and criminology to the analysis of racist tweets. Ph.D. thesis, Canterbury Christ Church University. Contact:
[email protected] THE APPLICATION OF MACHINE LEARNING, BIG DATA TECHNIQUES, AND CRIMINOLOGY TO THE ANALYSIS OF RACIST TWEETS. by Ed Day Canterbury Christ Church University Thesis submitted for the degree of Doctor of Philosophy 2018 I, Ed Day, confirm that the work presented in this thesis is my own. Where infor- mation has been derived from other sources, I confirm that this has been indicated in the work. Word Count: 85,994 Abstract Racist tweets are ubiquitous on Twitter. This thesis aims to explore the creation of an automated system to identify tweets and tweeters, and at the same time gain a theoretical understanding of the tweets. To do this a mixed methods approach was employed: ma- chine learning was utilised to identify racist tweets and tweeters, and grounded theory and other qualitative techniques were used to gain an understanding of the tweets’ content.