Automatic Discovery of Newsworthy Content 1
Total Page:16
File Type:pdf, Size:1020Kb
Running head: AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 1 How can automatic systems facilitate journalistic discovery of newsworthy content? An investigation into how editorial workflows can be automatically supported Sebastian Short Snr 2031037 Master's Thesis Communication and Information Sciences Specialisation Business Communication and Digital Media School of Humanities and Digital Sciences Tilburg University, Tilburg Supervisor: Dr. C.W.J. van Miltenburg Second Reader: Dr. E.J Krahmer August 2020 AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 2 Abstract This study investigated in the era of media convergence how editorial workflows can be supported by new tools. In addition, the presence of new technologies such as automated journalism and automatically written text was reasoned as a method to how such routines could be automatically supported, adding onto the emerging body of literature that has investigated the interactions between journalism, automated journalism and automatically written text. 12 interviews were conducted with editors at various news outlets, and those with knowledge of the editorial process. The results provided a current overview of sourcing routines, which consisted of using personal networks, searching online and the use of traditional sources to discover new content. In addition, they do show signs that automatic support can be facilitated by looking at what issues’ editorial teams face and how new tools can then integrate into what they currently do. Such integration however depends on factors such as quality of automatically written text and where new tools can be integrated into their current workflows. Facilitators and barriers to adoption of automated journalism are also discussed, which showed that the use of newer technologies within journalistic practice is still an ongoing and uncertain process. Keywords: automated journalism, editorial workflows, journalism, content discovery AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 3 Contents Introduction 5 The Content Exchange 8 Societal and Scientific Relevance 8 Literature Review 9 The Role of Convergence in the Changing of the News Gathering Process 9 The ‘first wave’ of ethnographic studies. The way newsworthy content is discovered 9 The ‘second wave’ of ethnographic studies. 9 Media convergence and its implications. 10 The sourcing and discovery of content online 14 Automated journalism: can news production be automated? 16 How can journalism be automated? 17 How can automation change the newsgathering and production process? 17 Automatically written texts and readability. 18 Addressing the strengths and limitations of automated journalism. 19 Method 22 Research Design 22 Participants 22 Materials 24 Procedure 26 Analysis 27 Results 28 The current state of content discovery 28 Current editorial routines 28 Use of personal networks. 28 Searching and monitoring information online. 29 Use of traditional sources. 30 Integration of automatic support 31 Factor 1: New tools must integrate into current routines. 31 Factor 2: New tools need to address existing problems. 32 Facilitators of adoption of automated journalism 33 Editors have an open attitude towards automated journalism. 33 Editors are willing to source automatically written content. 34 Barriers of adoption of automated journalism 35 News outlets are slow to react to technological change. 35 There is a need for more technological development. 36 Editors are stubborn. 37 AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 4 News outlets must balance their available resources. 38 Discussion & Conclusion 40 The state of current sourcing routines 41 Moving towards integration of automatic support 42 Addressing the facilitators and barriers to automated journalism 44 Limitations and recommendations for future research 46 References 50 Appendix A 57 Appendix B 58 Appendix C 70 Appendix D 77 Appendix E 84 Appendix F 92 Appendix G 102 Appendix H 112 Appendix I 122 Appendix J 131 Appendix K 139 Appendix L 150 Appendix M 160 Appendix N 174 Appendix O 177 Appendix P 178 Appendix Q 187 Appendix R 220 AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 5 Introduction Journalistic workflows have changed considerably in recent years due to the media convergence of news, where outlets must digitise their content, driven by technological change (Dal Zotto & Lugmayr, 2016). Media convergence has reconfigured the ways news outlets must operate to remain financially stable (Tameling & Broesma 2013), and has led to changes in the way content is discovered, meaning that stories can be quickly produced by re- using content on newswires (Boumans, Trilling, Vliegenthart & Boomgaarden, 2018; Welbers, 2018), or by simply finding a tweet (Broersma & Graham, 2013). One consequence on the other hand is that it has put pressure on journalists to quickly find and publish content before their competitors (Bunce, 2017), and as such it is a constantly flowing process. In a bid to be more efficient journalists turn to Wikipedia, Twitter or Google to source news (Broersma & Graham, 2013; Kruikemeier & Lecheler, 2016). In the Netherlands, there are high levels of reliance on the use of the news agency Algemeen Nederlands Persbureau (ANP), where outlets make use of its content to include in their own stories (Boumans, Trilling, Vliegenthart & Boomgaarden, 2018; Welbers, 2018). Reliance on finding quicker ways to source content is reasoned because of media convergence and the economic pressures that have arisen in the journalistic world (Saridou, Spyridou & Veglis, 2017). Changes in how news outlets operate and alternative ways of discovering content are just a few of the affect’s media convergence has had in the newsroom. Adapting to changes within newsrooms is having consequences for journalists, some even argue the quality of news is affected as many firms turn to quantity versus quality (Bunce, 2017), and debate on whether editorial and commercial interests should remain separate has resurfaced (Cornia, Sehl & Nielsen, 2020), where news content itself has changed, with blurred lines between commercial content for advertising purposes and editorial content for readers. This is then a sign that there are now norms being adopted by both editorial and commercial managers to AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 6 keep up with changing times (Cornia, Sehl & Nielsen, 2020). Therefore, it seems that news outlets are seeking new cost-effective ways to do discover newsworthy content, the exact methods of which are more uncertain. In the world of data driven journalism (Lewis & Westlund, 2015), it is established that tools are used in editorial rooms to monitor web traffic and track article metrics (Wang, 2017; Wu, Tandoc & Salmon, 2019a). What is then interesting to examine is another tool to add to an editors’ toolkit, something that could be used for content discovery. One method to achieve more cost-effective journalist practices can be seen via the use of Automated Journalism, whereby Artificial Intelligence and algorithms can automate some tasks and automatically write content (Beckett, 2019). Natural Language Generation systems, where texts can be automatically written are being used increasingly more for writing news articles (Caswell & Dörr, 2018; Wu, Tandoc & Salmon, 2019b). This then provides an alternative method for producing cost-effective journalism by reducing the cost of producing content itself. This is consequently freeing up time for journalists, where some claim they can focus more on quality journalism (Anderson, 2011; Primo & Zago, 2015), whereas others claim it puts their jobs at risk (Wu, et al, 2019; Schultz & Sheffer, 2017). The presence of automated journalism further highlights how technology is further integrating into journalistic work routines (Wu, et al., 2019b), and can be seen as a new technology that news outlets are adapting to (Beckett, 2019) following years of media convergence. Arguably what this literature does not address is how such practices could be then used to solve the problem to find cost-effective ways to discover content. Based on the assumptions that consumers are generally accepting of automatically written texts (Graefe, et al., 2018; Jung, et al., 2017; Wöker & Powell, 2018), then it does not seem too big of a step for outlets then to source content that is automatically written. It could then be assumed that if a new tool were AUTOMATIC DISCOVERY OF NEWSWORTHY CONTENT 7 to offer automatically written content, then this could go some way to assisting editors in discovering content more cost effectively, if editorial teams are open to it. Such an issue then raises further questions, such as how journalists discover content currently, and how this could be supported to assist journalistic workflows, moving away from the current research that addresses and acknowledges the changes in journalistic workflows over time (Bunce, 2017; Broersma & Graham, 2013; Kruikemeier & Lecheler, 2016), it then seems logical to investigate possible solutions in the sourcing and discovery of newsworthy content. If automated journalism is seen as promising (Anderson, 2011; Primo & Zago, 2015), it seems reasonable to assume that automatic support can be a potential solution for editors to discover new content. In addition, whether current editorial sourcing routines they feel are effective, considering the rise of automated journalism and automatically written text. Based on this, the following research questions are asked: RQ1: What methods do news editors use to help them find content? RQ2: To what extent can editorial