“The Rise and Fall of Overtourism”

Sebastian Ferrari, MSc.

Modul University Vienna

PhD Proposal Manuscript

Doctoral Supervisors:

Prof. Dr. Josef A. Mazanec

Prof. Dr. Karl Wöber

Author Note:

The student´s PhD has been funded by the Vienna Tourist Board.

Table of Contents 1 Research Questions ...... 9

2 Overtourism ...... 13

2.1 Dictionary Entries ...... 13

2.2 Industry Reports ...... 14

2.3 Academic Sources ...... 14

2.4 TourMIS 2019 ...... 16

2.5 Final Remarks ...... 18

3 Literature ...... 20

3.1 Mass Media ...... 20

3.1.1 Agenda-Setting ...... 20

3.1.2 Framing ...... 25

3.1.3 Mass Media in Tourism Research ...... 28

3.2 Tourism and Leisure ...... 29

3.2.1 Historical Excursus ...... 30

3.2.2 Resident Attitudes / Tourism Impacts ...... 31

3.2.3 (Tourism) Carrying Capacity ...... 37

3.3 Overtourism Measurement...... 43

3.3.1 State of the Art ...... 43

4 Methodology ...... 49

5 Empirical Part ...... 51

5.1 Study I ...... 51

5.1.1 Rationale for this Study ...... 51

5.1.2 Methodology ...... 53

5.1.3 Data Collection ...... 61

5.1.4 Preliminary Results ...... 62

Page 2 of 133

5.1.5 Limitations ...... 67

5.2 Study II...... 68

5.2.1 Rationale for this Study ...... 68

5.2.2 Methodology ...... 71

5.2.3 Data Collection ...... 73

5.2.4 Preliminary Results ...... 76

5.2.5 Limitations ...... 80

5.3 Study III ...... 81

5.3.1 Rationale for this Study ...... 81

5.3.2 Methodology ...... 91

5.3.3 Data Collection ...... 94

5.3.4 Preliminary Results ...... 95

5.3.5 Limitations ...... 97

6 Conclusion ...... 98

7 References ...... 99

8 Appendices ...... 128

8.1 Appendix 1 ...... 128

8.2 Appendix 2 ...... 128

8.3 Appendix 3 ...... 129

8.4 Appendix 4 ...... 129

8.5 Appendix 5 ...... 130

8.6 Appendix 6 ...... 131

8.7 Appendix 7 ...... 132

8.8 Appendix 8 ...... 132

Page 3 of 133

List of Figures Figure 1: Dissertation Structure // Overtourism History...... 11 Figure 2: Knowledge throughout Education (Might, 2010) ...... 12 Figure 3: Word Cloud (n=24)...... 16 Figure 4: Respondents´ (Dis-)Agreement (n=22-23) ...... 17 Figure 5: Triangular Relationship Based on Lippmann (1922) ...... 21 Figure 6: Plato´s Cave ...... 22 Figure 7: Frame-Changing Model (Chyi & McCombs, 2004, p. 25) ...... 24 Figure 8: Framing - A Drawing from the Internet...... 25 Figure 9: Agenda-Setting and Framing Theory (McCombs & Ghanem, 2001, p. 71) ...... 27 Figure 10 : Publications about Resident Attitudes ...... 32 Figure 11: Four Phases (Deery et al., 2012, p. 65) ...... 32 Figure 12: Spectrum of Visitor-Resident Interaction (Sharpley, 2014, p. 39) ...... 34 Figure 13: Publications about Carrying Capacity ...... 37 Figure 14: (Non-)Linear Relations between Use and Impact (McCool & Lime, 2001, p. 375) .. 40 Figure 15: Crowding Scale (Neuts & Nijkamp, 2012, p. 2140) ...... 42 Figure 16: Publications about Overtourism Measurement ...... 44 Figure 17: Topic Modeling (Blei, 2012, p. 78) ...... 54 Figure 18: Structural Topic Modeling (Schmiedel et al., 2019, p. 945) ...... 56 Figure 19: Computational Grounded Theory Adapted from Nelson (2020, p. 14) ...... 60 Figure 20: News Coverage over Time 2017 - 2020 (n = 602) ...... 63 Figure 21: Most frequently mentioned locations (n = 602) ...... 65 Figure 22: PRISMA Flow Diagram (Moher et al., 2009) ...... 75 Figure 23: Co-Authorship Network ...... 76 Figure 24: Co-Occurrence Graph of Author Keywords ...... 77 Figure 25: Co-Citation Network ...... 79 Figure 26: Arrivals and Overnights by Month in 2019 ...... 84 Figure 27: Box Map - Airbnbs per km2 by District (14th Jan 2020) ...... 88 Figure 28: Box Map – Airbnbs per ha by Census District (14th January 2020) ...... 89 Figure 29: Number of Responses by District in 2019 (n=3.657) ...... 91 Figure 30: Analysis Workflow ...... 93

Page 4 of 133

Figure 31: Responses by Quarters and Years ...... 94 Figure 32: Average Liking of the City without Tourism ...... 95 Figure 33: Sample BTM ...... 96 Figure 34: Potential Relationships between Crowding and Satisfaction (Wagar, 1964, p. 7) .... 128 Figure 35: Postcard with Sample Text ...... 132 Figure 36: Postcard without Sample Text ...... 133

List of Tables Table 1: Dictionary Definitions ...... 13 Table 2: Seminal Works ...... 31 Table 3: Present vs Previous Studies ...... 52 Table 4: Selected Newspapers ...... 61 Table 5: Present vs Previous Studies ...... 70 Table 6: Indicators by Category ...... 81 Table 7: Indicators by District ...... 87 Table 8: Reviews about Residents Attitudes and Tourism Impacts, Aspects ...... 128 Table 9: Distribution of News Articles (n=602) ...... 129 Table 10: Junk Words ...... 129 Table 11: Indicators for the City of Vienna, 2009-2019 ...... 130 Table 12: Indicators on District Level ...... 131

List of Equations Equation 1: Nominalist Definition (Popper, 1945, p. 9) ...... 18

Page 5 of 133

List of Abbreviations

AIEST International Association of Scientific Experts in Tourism

APA American Psychological Association

CEO Chief Executive Officer

Covid-19 Coronavirus disease 2019

CTM Correlated Topic Model

DOI Digital Object Identifier

DTM Dynamic Topic Model

LAC Limits of Acceptable Change

LDA Latent Dirichlet Allocation

NAS Network Agenda-Setting Model

NUTS Nomenclature of Territorial Units for Statistics

SET Social Exchange Theory

STM Structural Topic Model

UNESCO United Nations Educational, Scientific and Cultural Organization

UNWTO United Nations World Tourism Organization

Page 6 of 133

“When there's an elephant in the room, introduce him.” (Randy Pausch)

ollowing the advice by Prof. Pausch, the author would like to address the elephant in the room right at the outset. As a matter of fact, two elephants need introduction F in this case. The first one relates to the content of this dissertation. Indeed, one may question the significance of a dissertation about “overtourism” in the middle of a pandemic. Last year, the Coronavirus disease 2019 (Covid-19) virtually annihilated the global tourism industry. According to the World Tourism Organization (UNWTO) (2021), international tourist arrivals decreased by 73.9% compared to the previous year. Yet, being at rock bottom can be either a blessing or a curse. As Martin Luther King once said, “Every crisis has both its dangers and its opportunities. Each can spell either salvation or doom.” On the one hand, there are several initiatives to rebuild tourism in a more sustainable manner. Indeed, next to top-down impulses from institutions in the public sector, such as the UNWTO, there have been bottom-up initiatives by companies in the private sector, such as 'Tourism Declares'. On the other hand, the behaviour of people tames one's hopes for a better tomorrow. For instance, Westcott and Culver (2020) reported huge crowds at various places in China. Similarly, Morris (2020) reported huge crowds at various beaches in the United Kingdom. Time will show which path tourism will take when rising from its ashes. Be that as it will, it is immaterial to the significance of this dissertation. To be sure, a return to business-as-usual would make the findings of this dissertation 'hotter'. However, a sustainable rebirth of tourism would by no means belittle knowledge of how a tourism-related phenomenon has been depicted in the news, of how useful the revival of old issues has been in academia, or of how locals experience their destination without tourism. In other words, the findings of the research conducted within the framework of this dissertation will be valuable either way. The second elephant in the room relates to the format of this dissertation. Indeed, the author´s decision to write a monograph, as opposed to publishing three articles, might be regarded as swimming against the stream. After all, the saying “publish or perish” is not without its reasons. Having said that, the author decided to write a monograph for two reasons. First, this format enables him to deal with the subject matter in depth. The detailed discussion of topic modeling will be an example thereof (pp. 54-61). Also, a monograph allows a few unrequired - yet insightful

Page 7 of 133

- digressions, such as the one about the “Allegory of the Cave” by Plato (pp. 22-23). Second, the flowery writing style of the author fits this format better, as it is not subject to some of the guidelines imposed by journals, such as word counts and the like. This, in turn, may come to the advantage of the reader too, as she or he may find the read more entertaining this way. This manuscript is structured as follows. The first section will introduce the three research questions addressed in this dissertation and highlight their significance. The second part will discuss the key concept of this research – that is, “overtourism”. The third section will review the scholarly literature that lays the foundation for the three empirical studies conducted within the framework of this project. The fifth part will introduce their methodology – albeit only briefly, as each study will then be discussed in depth in the sixth section. Here, the author will also present a few sample analyses to whet the reader´s appetite for the final dissertation. Finally, the author would like to point out that he cited all sources to the best of his knowledge according to the 7th edition of the manual of the American Psychological Association (APA) (2020). Specifically, the software “Citavi” (Swiss Academic Software GmbH, 2021) has been used for reference management.

Page 8 of 133

1 Research Questions

“The whole is greater than the sum of the parts” (Aristotle)

Overtourism has made the headlines in the last few years. Indeed, journalists have diligently reported about the occurrence of this phenomenon, particularly in Europe (e.g. Coldwell, 2017; Henley, 2020). In point of fact, news media is the birthplace of overtourism. Indeed, this word was accidently created by the Chief Executive Officer (CEO) of the news outlet 'Skift' (Ali, 2018), which later trademarked it (Skift, Inc., 2018). Considering the origins of overtourism, it comes as a surprise that only a handful of studies have dealt with its depiction in the news (Clark & Nyaupane, 2020; Pasquinelli & Trunfio, 2020; Phi, 2020). This, in turn, begs the question: “How has overtourism been portrayed in news media?” Using structural topic modeling (Roberts et al., 2014) as a point of departure rather than one of arrival in answering this question fits into the framework of Computational Grounded Theory proposed by Nelson (2020). In spite of being only a drop in the ocean, this approach makes a step forward in overcoming the long-standing "artificial dichotomy of inductive versus deductive reasoning" (Mazanec, 2009, p. 320). Moreover, given the role of news media in image formation (Gartner, 1994) and destination selection (Stepchenkova & Eales, 2011), knowledge of how a tourism-related phenomenon has been portrayed in newspapers might be of interest to Destination Management Organizations (DMOs). In fact, this retrospective analysis could provide them with knowledge that allows them to step in and mould the public discourse on other occasions in the future. Following its debut in the news, overtourism then transitioned to academia. Koens et al. (2018) ascribe the rapid uptake of this term by the scholarly community more to its currency than to its meaning. In fact, to date, there is no universally valid definition of “overtourism” (Avond et al., 2019; Capocchi et al., 2019; Capocchi et al., 2020; Nepal & Nepal, 2019). Interestingly, there has been a heated debate about this term in the academic community. By now, the general consensus seems to be that overtourism is indeed “old wine in new bottles” (Dredge, 2017). In this regard, Perkumienė and Pranskūnienė (2019) and Capocchi et al. (2020) differentiated between the word and the issue, and maintained that although the former has only appeared recently the latter actually dates back a long time. Koens et al. (2018) and Benner (2020) considered this word a hypernym for the unfavourable effects of tourism. Milano, Novelli, and Cheer (2019b) argued

Page 9 of 133

that this word denotes “a renewed interest in the adverse impacts of tourism” (Milano, Novelli, & Cheer, 2019b, p. 355). This, in turn, raises the question of what the added value of reigniting such long-standing discussion is. In other words: “How useful has research on overtourism been?” Establishing the usefulness of this research stream fulfils "the significant need for every mature field of knowledge to understand itself" (Tribe & Liburd, 2016, p. 44). Even though it would beyond doubt be premature to consider research on overtourism as such, an interim assessment of this interdisciplinary field of inquiry is worthwhile. In fact, the temporary timeout imposed by the outbreak of Covid-19 might just be the proverbial “calm before the storm” to be exploited for evaluating scholarship on overtourism. As this field of inquiry developed, scholars naturally shifted their attention from the conceptualisation to the measurement of overtourism. Regrettably, though, a review of this research shows that, for the most part, overtourism measurement still hinges on 'old' indicators. In fact, indicators like tourism density and tourism intensity, date back to research conducted by scholars of the International Association of Scientific Experts in Tourism (AIEST) in the mid-20th century (i.e. Markos, 1949; Sundt, 1950). This, in turn, suggests that – like in research on resident attitudes towards tourism – “more limited progress has been made than the volume of research might suggest (Sharpley, 2014, p. 39). Be that as it may, last year, the outbreak of Covid-19 razed the global tourism industry to the ground, thereby putting the measurement of tourism pressure on hold. Indeed, according to the UNWTO (2021), international tourist arrivals dropped by 73.9% in 2020. Thus, bearing in mind that “it is mostly loss that teaches us the worth of things” (Schopenhauer), one may wonder: How have residents experienced their destination without tourism? Insights into the experience of the complete absence of tourism on the part of residents is critical for DMOs, which are charged with the task of spearheading the rebuilt of tourism from scratch. To sum up, the author aims to answer the three following questions in this dissertation:

1. How has overtourism been portrayed in news media? 2. How useful has research on overtourism been? 3. How have residents experienced their destination without tourism?

Page 10 of 133

Interestingly, by addressing these three questions in that specific order, this dissertation follows the rise and fall of the phenomenon under scrutiny. Indeed, moving from the portrayal of overtourism in the newspapers (Study I) to an assessment of its usefulness in research (Study II) parallels the transition of “overtourism” from news to academia. In a similar vein, rounding off with the perception of the complete absence of tourism (Study III) parallels the recent fall of overtourism and, conversely, the emergence of its diametrical opposite – “undertourism”.

Academia • Coinage and • Occurrence of popularisation of opposite scenario: "overtourism" • Development "undertourism" of a pseudo-new research area

News Covid-19

Figure 1: Dissertation Structure // Overtourism History

Moreover, and perhaps more importantly, the findings of the three studies conducted to answer these three questions will achieve a balance between theoretical meaningfulness and practical relevance, overall1.

1 Since this dissertation is a monograph, its contribution shall be assessed as a whole.

Page 11 of 133

At this point, the author would like to make two last observations – one for the laymen and one for the experts. In fact, on one hand, the former may wonder what the significance of focusing on such a narrow topic is. Well, this is precisely what a PhD is about! Turning to the visual explanation of this degree by Might (2010) might help making this point (Figure 2). In short, assuming that the circle represents mankind's knowledge, a PhD essentially consists in advancing that frontier a tiny bit (Might, 2010).

→ This Dissertation!

Figure 2: Knowledge throughout Education (Might, 2010)

The experts, on the other hand, may criticise the one or the other aspect of this dissertation. Thus, it might be worth bearing in mind that the whole point of a PhD is essentially “to show that you’re able to do good enough research by yourself” (Rugg & Petre, 2004, p. 18). With this in mind, the reader is now welcome to proceed to the next section, in which the key concept of this dissertation – “overtourism” – will be thoroughly discussed.

Page 12 of 133

2 Overtourism

In this section, definitions of overtourism are critically reviewed. To this end, non- academic and academic sources were scoured for definitions. Mindful of the exhortation to “avoid becoming giants of definition while remaining midgets of explanation” (Mazanec, 2009, p. 320), the purpose of this section is to highlight the shortcomings of these definitions rather than to provide a comprehensive review thereof.

2.1 Dictionary Entries

Naturally, one would first look up the definition of overtourism in a dictionary. This, however, will be of little help. In fact, this term has only entered the MacMillan and the Cambridge dictionaries and is waiting to be accepted in the Collins dictionary (Table 1Error! Reference source not found.). While this dearth of definitions underscores the recent nature of overtourism, it comes as a surprise in light of the momentum this phenomenon has gained. Be that as it may, these dictionary definitions are volume-based in that they focus heavily on the number of tourists. However, as will be shown later, this is only one of the tip of the iceberg.

Dictionary Definition

Collins “the phenomenon of a popular destination or sight becoming overrun with tourists in an unsustainable way” (Dickinson, 2018)

MacMillan “a situation where the number of tourists visiting a place causes significant problems” (MacMillan Education Limited, 2019)

“the situation when too many people visit a place on holiday, Cambridge so that the place is spoiled and life is made difficult for the people who live there” (Cambridge University Press, n.d.)

Table 1: Dictionary Definitions

Page 13 of 133

2.2 Industry Reports

Dissatisfied with such reductionist definitions, one might then turn to industry reports for more nuanced explanations. Thus, for instance, the European Union defined overtourism as:

“the situation in which the impact of tourism, at certain times and in certain locations, exceeds physical, ecological, social, economic, psychological, and/or political capacity thresholds” (Peeters et al., 2018, p. 15)

On one hand, this definition acknowledges the specificity of overtourism – that is, the fact that it affects particular areas in specific moments. On the other hand, it refers to limits, which suggests the existence of numerical ceilings beyond which things go south. Alternatively, the World Tourism Organization defined overtourism as:

“the impact of tourism on a destination, or parts thereof, that excessively influences perceived quality of life of citizens and/or quality of visitors experiences in a negative way” (UNWTO et al., 2018, p. 6)

This definition is two-sided as well. In fact, while it recognizes the adverse effects of tourism on both residents and tourists, the use of the unquantifiable adverb 'excessively' implicitly hints at the existence of a tipping point.

2.3 Academic Sources

Eventually, one might look for a definition in academic publications. Some scholars have acknowledged that a universally accepted definition of overtourism does not exist to date (Avond et al., 2019; Capocchi et al., 2019; Capocchi et al., 2020; Nepal & Nepal, 2019). This comes as no surprise, considering that researchers do not even agree on an aspect as basic as its spelling. In fact, while the term is mostly spelt without hyphen (overtourism), it is occasionally also written with hyphen (over-tourism). For instance, in the publications examined in Study II, the former spelling

Page 14 of 133

variant has – on average2 – been used 92% of the time. The lack of agreement among scholars is also nicely illustrated by the use of separate interpretations in the recent book edited by Ribeiro de Almeida et al. (2020). For instance, in Chapter 5, overtourism is defined as "tourism impact that immensely and negatively affects residents’ life and/or tourists’ experience" (Lai et al., 2020, p. 93). However, in Chapter 12, it is defined as "the subjective belief of residents that there are too many visitors" (Rejón-Guardia et al., 2020, p. 236). An occasionally cited definition is the one by reporter Dave Richardson, who allegedly considered overtourism as “any destination suffering the strain of tourism” (Richardson, 2017; as cited in Huettermann et al., 2019; Koh & Fakfare, 2019; Séraphin et al., 2018). However, a careful reading of the original article raises doubts about the attribution of this interpretation. In any case, this alleged definition could neither be any more vague nor any less informative. Therefore, it will not be considered in this dissertation. Another occasionally quoted definition is the one published by Milano et al. (2018) in the semi-academic outlet “The Conversation”. They considered overtourism as:

"the excessive growth of visitors leading to crowding in areas where residents suffer the consequences of temporary and seasonal tourism peaks leading to permanent changes to their lifestyles with denied access to local amenities and a general loss of well-being" (Milano et al., 2018)

This definition acknowledges the dynamic component of overtourism – that is, the evolution of visitor numbers over time. At the same time, it ascribes the repercussions of overtourism to temporally limited circumstances. In reality, though, overtourism is also about the incessant presence of visitors. In proverbial words, constant dripping wears away the stone. Moreover, Nepal and Nepal (2019) defined overtourism as "a post-mass tourism phenomenon in which some destinations have transitioned from a state of ‘mass’ to a state of ‘over’" (Nepal & Nepal, 2019, p. 3), In spite of its appeal, this concise definition implies that overtourism is merely an extension of mass tourism. This, in turn, suggests the existence of a

2 Refers to the average of the spelling variant's relative usage in title, abstract and keywords.

Page 15 of 133

tipping point beyond which the phenomenon changes its name. In his editorial, Singh (2018) explicitly distinguished overtourism from mass tourism.

2.4 TourMIS 2019

Next to secondary sources, one could engage in primary data collection to define overtourism. Indeed, professionals might offer valuable input in this regard. Thus, the author conducted a survey at the 15th TourMIS Workshop, which was held at Modul University Vienna on September 12-13, 2019. The initial sample consisted of 47 questionnaires. Following the removal of ineligible and incomplete surveys (n=22), the final sample consisted of 25 questionnaires. Respondents were asked to formally define overtourism. Almost all of them answered this question (n=24). Four answers were corrected for minor mistakes (e.g. quaility → quality). The results are summarized by means of a word cloud (Figure 3). The mention of the terms ‘local’ (n=8) and ‘visitor’ (n=6) suggest that these are the two main parties involved. The occurrence of the stems ‘mani’ (n=6) and ‘overcrowd’ (n=5) lend support to the hypothesis that density and crowding play a major role in overtourism. Finally, the mention of the term ´quality´ (n=5) hints at the relationship between overcrowding and the nature of one´s experience.

Figure 3: Word Cloud (n=24)

Page 16 of 133

Thus, the answers suggest a linear mechanism of overtourism: the excessive concentration of people – both tourists and residents - in space (n=6) or time (n=6) negatively affects either party´s quality of experience. The behaviour of visitors, however, was hardly mentioned. In fact, only one respondent acknowledged the type of tourism as a potential problem. This comes as a surprise, as several cities in Europe suffer from misbehaving visitors (e.g. Amsterdam, Budapest, Prague), and indicates that respondents view overtourism from a quantitative rather than a qualitative perspective. Interestingly, respondents also mentioned destination management (n=4), which suggests that DMOs might become the scapegoats bearing the blame for the lamentable state of affairs. The respondents were further asked to express their dis(-)agreement with six statements about overtourism. The mean scores are summarized in the radar chart below (Figure 4). On one hand, they tend to disagree with the statement that overtourism is mainly a European phenomenon (Ø = 1.78). This shows that they are aware that destinations in other continents suffer from it, too (e.g. Boracay Island). On the other hand, they tend to agree with the statement that overtourism is not only about tourism (Ø = 4.05). This indicates that they understand that there is more to it than meets the eye.

.9 3 3

.3 .8

4 .8

Figure.0 4: Respondents´ (Dis-)Agreement (n=22-23) 3 Page 17 of 133 .8

2.5 Final Remarks

In sum, it seems that scholars have not yet agreed on one single definition and that professionals have a rather linear interpretation of overtourism. A few final consideration are in order. To begin, in the second volume of "The Open Society and its Enemies", Popper (1945) argues that the nominalist definition has to be viewed backwards. In other words, the 'defining formula' needs to precede the 'defined term' (Equation 1).

⏟푦표푢푛푔 푑표푔 = 푝푢푝푝푦⏟ 퐷푒푓푖푛푖푛푔 푓표푟푚푢푙푎 퐷푒푓푖푛푒푑 푡푒푟푚 Equation 1: Nominalist Definition (Popper, 1945, p. 9)

In the case of overtourism, scholars have devoted considerable attention to the 'defined term'. If one considers that, like “tourismphobia”, this term is only one of the "new arbitrary shorthand labels" (Popper, 1945, p. 13) in tourism research, the extent of the discussion surrounding its definition might seem exaggerated. Moreover, the suitability of the term itself could be called into question. According to Koens et al. (2018), this word originated from the news and made it into research more due to its currency than its meaning. It has grown to be an umbrella term for the adverse effects of tourism. However, it does not capture their heterogeneity. In fact, it wrongly implies that the effects of tourism are homogeneous. Hence, they recommend a more impartial expression, such as 'visitor pressure' (Koens et al., 2018, p. 9). Furthermore, the prefix over- is inappropriate for two reasons. First, it gives the whole word a negative connotation. The MacMillan Dictionary defines this prefix as “too much” (MacMillan Education Limited, n.d.). To be sure, one may argue that the sentiment of the whole word depends on the polarity of the term that follows the prefix, such as over-optimistic (positive) and over-pessimistic (negative). However, as Terence once said, “too much of anything is bad”. Thus, the connotation of words containing this prefix tends to be negative. Second, it creates the wrong impression that overtourism is solely about (too much) tourism. Koens et al. (2018) argue that occurrences in fields other than tourism are to some extent responsible for matters associated with it. For instance, the advent of online retail and the resulting fleets of delivery vans flooding the streets exacerbate the impression that the city is overcrowded (Koens et al., 2018). The

Page 18 of 133

emergence of “temporary migrants“ (Maitland & Newman, 2009a, p. 13), such as expats and students, complicates matters further. Indeed, blurring the lines between 'residents' and 'tourists', the rise of this group of people poses a challenge for DMOs. The Vienna Tourist Board, for instance, has responded to this trend by undertaking a paradigm shift from 'tourism' to 'visitor economy' in their most recent strategy (Kettner et al., 2021).

Page 19 of 133

3 Literature

As overtourism is a “multi-dimensional” (Koens et al., 2018) and “multi-faceted” (Capocchi et al., 2019) phenomenon, it shall come as no surprise that the literature review of this dissertation taps into different domains. The purpose of this section is to introduce the literature on which the three empirical studies will be based. In the first part (3.1), the author will introduce two theories from mass media research: agenda-setting and framing. In the second section (3.2), he will review the two fields of inquiry from which research on overtourism has allegedly emerged: tourism impacts and carrying capacity. In the third part (3.3), the author will survey the scarce body of literature on the measurement of this phenomenon.

3.1 Mass Media

3.1.1 Agenda-Setting

“It may not be successful much of the time in telling people what to think, but it is stunningly successful in telling its readers what to think about.” (Cohen, 1963, p. 13)

The origins of the agenda-setting theory can be traced back to the book "Public Opinion" by Lippmann (1922). In particular, in the introduction titled "The World Outside and the Pictures in our Heads", this author argued that people have indirect knowledge of their surroundings, which they consider to be veracious. He substantiated his claim with a few examples. For instance, the glorification and the condemnation of important figures are both based on the positive and negative perception the public has of them, respectively. Lippmann (1922) posited that people create mental representations of things ('fictions'), which they cannot sense or feel first-hand ('realities'). These conceptions, in turn, dictate people's actions in the real world (Figure 5). Thus, people operate in a setting, whose complexity requires them to rely on indirect knowledge. In sum, he asserted that any person “makes for himself a trustworthy picture inside his head of the world beyond his reach” (Lippmann, 1922, p. 29).

Page 20 of 133

Reality

Consequent Mental Action Representation

Figure 5: Triangular Relationship Based on Lippmann (1922)

Interestingly, Lippmann (1922) alluded to the allegory of the cave by Plato. In the seventh book of "The Republic", this philosopher wrote about prisoners in a cavern (Figure 6). They are bound with shackles and can only look forward (A). There is a barrier at their back (B) and farther behind it a fire is burning (C). Between the two, there is an elevated path (D). People move on it holding some items, whose shapes are then projected to the side of the cave in front of the prisoners (E) (Figure 6). Only being able to catch sight of the shades of the items, they mistake the former for the latter. In other words, for the prisoners "the truth would be literally nothing but the shadows of the images" (Plato, ca. 375 B.C./2012, p. 228)3.

3 The allegory does not finish here. In fact, Plato then writes about what happens when a prisoner is freed (Plato (ca. 375 B.C./2012). Since a detailed account of the whole allegory would exceed the scope of this review, the interested reader is directed to Book VII of "The Republic" for the whole story.

Page 21 of 133

B E

Figure 6: Plato´s Cave4

The parallel between the allegory of the cave and the agenda-setting theory is striking. The public (shackled prisoners) can only see the shadow of what the media (moving people) choose to publish (items held). At first sight, this analogy seems to capture the essence of the agenda-setting theory. On second thought, however, there is more to it than meets the eye. Thus, in what follows this framework will be concisely reviewed.

3.1.1.1 First Level Agenda-Setting (Objects)

Building on the work of Walter Lippman, McCombs and Shaw (1972) examined the role media played in the presidential elections of 1968. Specifically, they compared what citizen in Chapel Hill considered to be central matters with what the news they consumed actually reported. In short, they found a high degree of correspondence between the points highlighted by the media and the ones deemed critical by the people. Interestingly, this association holds irrespective of the

4 http://webspace.ship.edu/cgboer/PlatosCave.gif

Page 22 of 133

political orientation of citizen and of the political focus of news. This finding renders the alternative explanation of 'selective attention' improbable, and, conversely, lends weight to the agenda-setting role of news media. Far from intending to make any causal claims, these authors concluded that “the evidence that voters tend to share the media´s composite definition of what is important strongly suggests an agenda-setting function of mass media” (McCombs & Shaw, 1972, p. 184). Their study was the spark that ignited future research, as evidenced by the 13,800+ citations received to date, according to Google Scholar (29th March 2021), as well as by the recent creation of the 'The Agenda Setting Journal'.

3.1.1.2 Second Level Agenda-Setting (Attributes)

Two and a half decades later, McCombs et al. (1997) went a step further. They realised that agenda setting research need not be limited to items ('objects'). Rather, it could be extended to the features of items ('attributes'). Indeed, drawing on the notion of framing, they argued that the inclusion of some and the exclusion of other aspects determine people's mental representation of a given matter. Thus, this new avenue of inquiry shifts the focus of attention from the significance of items to that of their features. They referred to it as "the second level of agenda setting" (McCombs et al., 1997, p. 704). These authors put their amended framework to the test on two elections in Spain in 1995. In short, they investigated whether media's depiction of the contestants during the run moulded people's perception of them after the vote. Suffice it to say, they found some agreement between the two. Thus, far from meaning to make any causal claims, they built on Bernhard C. Cohen in concluding that "the news media not only tell us what to think about, they also tell us how to think about it" (McCombs et al., 1997, p. 716). Less than a decade later, Chyi and McCombs (2004) argued that news media perpetuates an item by stressing different aspects of it as time goes by ('frame-changing'). Drawing on framing literature, they noted a lack of generally applicable features ('frames'), which, in turn, restricts analysis possibilities. Hence, they set out to develop a framework that can be adopted irrespective of the item under scrutiny. In this undertaking, they focussed on the spatial (vertical axis) and temporal (horizontal axis) aspects of reporting (Figure 7).

Page 23 of 133

Figure 7: Frame-Changing Model (Chyi & McCombs, 2004, p. 25)

Chyi and McCombs (2004) put their framework through its paces. They investigated changes in reporting of the rampage at a high school by one news outlet (n=170) with regard to the two features of their framework – space and time. Concerning the former, they observed an expansion from a narrow (particulars) to a broad (criminality) scope. Regarding the latter, they noticed a switch from a backward-looking (circumstances) to a forward-oriented (precautions) perspective. Interestingly, these two frames are inter-related: while the narrow range goes hand in hand with the retrospective angle, the broad range goes arm in arm with the prospective angle (Chyi & McCombs, 2004).

3.1.1.3 Third Level Agenda-Setting (Networks)

Even though not used in this study, for the sake of completeness the existence of a third agenda-setting version ought to be mentioned. Vu et al. (2014) put forward the Network Agenda Setting (NAS) model, which posits that news media may impart the significance of inter- connections between items ('objects') and features ('attributes') together. In other words, they asserted that news media is “capable of telling us what and how to associate” (Vu et al., 2014, p. 669).

Page 24 of 133

3.1.2 Framing

The roots of framing theory can be traced back to the seminal essay by Goffman (1974) and the empirical work by Kahneman and Tversky (1984). However, for the purposes of this review, going back to the work of Entman (1993) may suffice. Lamenting the lack of a universal conceptualisation of 'framing', he defined this act as “to select some aspects of a perceived reality and make them more salient in a communicating text” (Entman, 1993, p. 52). In addition, he argued that frames have four roles. First, they establish an issue. Second, they find its origins. Third, they make appraisals. Fourth, they propose solutions (Entman, 1993). Since then, a plethora of publications have been published on this subject, as evidenced by the 18.700+ citations, according to Google Scholar (29th March 2021). A drawing frequently associated with news media on the Internet visualizes framing quite well (Figure 8). In reality, a victim is running away from a perpetrator. However, the news frames this situation in such a way that – in the eyes of the audience – the perpetrator is the victim, and vice versa. Another, more contemporary, example would be news coverage about COVID-19. Naturally, the reporting of absolute numbers (e.g. '500') captures more attention than that of relative percentages (e.g. '0,03%').

Figure 8: Framing - A Drawing from the Internet5

5 http://indiafacts.org/wp-content/uploads/2015/03/image0021.jpg

Page 25 of 133

Having reviewed the agenda-setting and framing theories separately, it is now time to connect these two conceptual frameworks. McCombs and Ghanem (2001) devoted an entire book chapter to this6. Thus, their work can be adduced to this end. They argue that second level agenda- setting and framing go hand in hand. In short, they claimed that:

“framing is the construction of an agenda with a restricted number of thematically

related attributes in order to create a coherent picture of a particular object”

(McCombs & Ghanem, 2001, p. 70)

McCombs and Ghanem (2001) considered frames as features of items and classified them according to their content as well as to their scope (Figure 9). Regarding the former, they differentiated between rational ('cognitive') and emotional ('affective') features. Concerning the latter, they distinguished between narrow ('micro') and broad ('macro') range. The authors argued that frames lie rather at the right end of the spectrum. They went a step further and drew a distinction between facets ('aspects') and main topics ('central themes'), and deemed the last interpretation to be more beneficial (McCombs & Ghanem, 2001).

6 The chapter is titled “The Convergence of Agenda Setting and Framing”.

Page 26 of 133

Figure 9: Agenda-Setting and Framing Theory (McCombs & Ghanem, 2001, p. 71)

Having said that, the lines between agenda-setting and framing remain quite blurred. In the author´s impression, there simply seem to be two separate school of thoughts. Be that as it may, 'framing' has undoubtedly become the more popular term. Weaver (2007) examined the evolution of these two concepts7 in the database 'Communication Abstracts' between 1971 and 2005. In short, he found that the usage of agenda-setting had modestly increased (n=+39), whereas that of framing had skyrocketed (n=+163). He suggested that the popularity of the latter term could be ascribed to its equivocality and inclusiveness (Weaver, 2007).

7 For the sake of completeness, it should be mentioned that Weaver's (2007) analysis included the concept of 'priming' as well. Not being of relevance to the purposes of this dissertation, the reader was spared this detail in the text.

Page 27 of 133

In sum, the author is well aware of the subtle difference between these two concepts. However, what is relevant to the purposes of this analysis is the common ground they share – that is, their preoccupation with the manner in which items are portrayed by the media (Takeshita, 2006; Weaver, 2007). Hence, even though the one or the other scholar will most certainly criticise this, for the purposes of this dissertation, framing will be interpreted rather narrowly, and will thus be used interchangeably with attribute agenda-setting in the remainder of this dissertation.

3.1.3 Mass Media in Tourism Research

Finally, it is worth spending a few words on the use of the agenda-setting and framing theories in tourism research. The number of studies adopting the former can be counted on the fingers of one hand. The article by Schweinsberg et al. (2017) is a notable example thereof. In contrast, the number of studies adopting the latter framework requires more than one hand to be counted. A few recent examples include the article by Hansen (2020) and that by Leung et al. (2019). Taken together, the combined use of second-level agenda setting and framing theory in tourism ventures into a little-known territory.

Page 28 of 133

3.2 Tourism and Leisure

An aspect that has provided for discussion is the question of whether the neologism "overtourism" actually represents something new or whether it simply rehashes old topics. The title of an article on LinkedIn, which was probably one of the sparks that ignited the debate, put it quite elegantly: "Overtourism: old wine in new bottles?" (Dredge, 2017). To begin, the origins of this term remain uncertain. Rafat Ali, the CEO of the news outlet Skift, proclaimed himself as the inventor of this term (Ali, 2018). However, an online search suggests otherwise. In fact, looking for "overtourism" in Nexis® Uni, reveals that this word was actually first used by Celia Herron on August 6, 1981 (Herron, 1981). To the best of the author´s knowledge, no scholar has remarked this to date. Some personalities deny the existence of this phenomenon. One, for instance, has equated "overtourism" with "overcrowding" (see e.g. Karantzavelou, 2020). In so doing he has mistaken a part for the whole. In fact, the quantitative aspect is only the tip of the iceberg. In a way, this reminds of the differentiation between density and crowding made by Stokols (1972). In his seminal work, he wrote:

"Density is viewed as a necessary antecedent, rather than a sufficient condition, for the experience of crowding" (Stokols, 1972, p. 275)

Most authors simply limit themselves to mentioning their stand on this debate. For instance, Koens et al. (2018) argued that the matters subsumed under this word go back a long way and then briefly review the literature on the effects of tourism. However, to the best of the author´s knowledge, only Capocchi et al. (2020) and Wall (2020) went a step further and actually devoted an entire publication to making their point. In their research letter, Capocchi et al. (2020) drew a distinction between overtourism as a word and as an occurrence. They examined the references of academic and non-academic documents about overtourism (n=29) and found that the authors of these documents cite research dating all the way back to the 1970s, such as Doxey (1975, as cited in Capocchi et al., 2020) (n=3) and Pizam (1978, as cited in Capocchi et al., 2020) (n=2). Accordingly, they concluded that while as a word overtourism is unheard of, as an occurrence it is well-known.

Page 29 of 133

In his perspective article, Wall (2020) traced the roots of overtourism back to carrying capacity. Initially, research on this subject was conducted by leisure scholars in rural areas in North America. Later, their work was picked up by tourism scholars, who extended it to urban destinations in Europe under the masquerade of overtourism. Thus, like Capocchi et al. (2020), Wall (2020) also differentiated between overtourism as a word and as an occurrence, albeit not as explicitly. However, unlike Capocchi et al. (2020), he did not substantiate his claims with empirical evidence – these are solely based on his expertise. In sum, these two studies suggest that one must tell apart the term and the phenomenon. The former is new, whereas the latter is old. In addition, they demonstrate that little attention has been paid to determining the roots of overtourism empirically.

3.2.1 Historical Excursus

Koens et al. (2018) argued that overtourism goes back a long way and urged other scholars “not to let this work go to waste” (Koens et al., 2018, p. 10). Not even two years later, Wall (2020) lamented that tourism scholars have taken up the concepts of carrying capacity “without full appreciation of their origin and history” (Wall, 2020, p. 213). As the author does not want to make himself guilty of the same mistake, in what follows the history of overtourism will be briefly reviewed. To this end, seminal works were identified by sifting through the bibliographies of selected publications. When inspecting the 'older' references of the chosen documents (i.e. 1900-2000), one comes across renowned classics, such as The Economics of Welfare (Pigou, 1930; as cited in Nepal & Nepal, 2019) and methodological milestones, such as A Paradigm for Developing Better Measures of Marketing Constructs (Churchill, 1979; as cited in Martín Martín et al., 2018). Most importantly, though, one stumbles upon influential works in tourism. Table 2 shows five frequently cited 'old' publications.

Page 30 of 133

Publication Topic Times Cited

Butler (1980) Tourism Area Life Cycle 37 Doxey (1975) Irridex Model 34 O'Reilly (1986) Tourism Carrying Capacity 14 Pizam (1978) Social Impacts of Tourism 9 Wagar (1964) Carrying Capacity 5

Table 2: Seminal Works

As can easily be seen from Table 2, overtourism seems to spring from research on resident attitudes / tourism impacts and (tourism) carrying capacity. The fact that both 'capacity' and 'resident' occurred over 200 times in the references lends further weight to this claim. Thus, these two streams of research will be concisely reviewed in the next subsections.

3.2.2 Resident Attitudes / Tourism Impacts

The presence of two distinct subjects in the heading of this section might irritate the one or the other reader and even raise the suspicion that the author will be mixing apples and oranges in what follows. Hence, it shall be pointed out that the decision to treat these topics together was made based on the assertation that the former subject has mainly been studied in tandem with the latter (Rasoolimanesh & Seyfi, 2020). Reviewing the scholarly literature on resident attitudes would require a strenuous effort. In fact, this is one of the most extensively investigated fields in tourism (McGehee & Andereck, 2004). Accordingly, conducting a comprehensive review of this subject "would be a difficult, if not impossible task" (Sharpley, 2014, p. 42). Hence, a slightly unconventional approach was adopted – that is, existing reviews and commentaries on this subject were analysed (Figure 10). To identify such publications, a backward reference search was performed starting from the recent viewpoint article by Rasoolimanesh and Seyfi (2020). The reader could now criticise that the quality of this review depends directly on the quality of the reviews selected. Thus, the recognition that existing reviews offer an outstanding insight into theories and methods adopted in this field (Hadinejad et al., 2019) might allay, or even clear, such doubt.

Page 31 of 133

Figure 10 : Publications about Resident Attitudes

Since it has just been mentioned that this topic has extensively been investigated, the reader might be surprised that the timeline above only includes reviews from the last decade. To be sure, earlier reviews were also identified, such as the ones by Yen and Kerstetter (2009), Harrill (2004), and Easterling (2004). However, Deery et al. (2012) argued that – save for the latter – these tend to have a specific emphasis. That said, the review by Easterling (2004) was excluded too, as it is relatively dated. There seems to be no agreement on exactly how long this topic has been investigated for. Almeida García et al. (2015) argued that it has been studied for over three decades, while Sharpley (2014) claimed that it goes back over three and a half decades. In any case, this subject has been investigated for a long time. Deery et al. (2012) divided this stream of research into four phases (Figure 11). In the first one, fundamental notions were explained. The second phase saw the emergence of conceptual frameworks, such as the Tourism Area Life Cycle by Butler (1980; as cited in Deery et al., 2012). The third one witnessed the appearance of measurement tools, such as the Tourism Impact Scale by Ap and Crompton (1998, as cited in Deery et al., 2012). In the fourth phase, these were put through their paces and fine-tuned.

Figure 11: Four Phases (Deery et al., 2012, p. 65)

Page 32 of 133

In retracing the history of this stream of research, Deery et al. (2012) cited almost only journal articles. It shall be mentioned, however, that the publication of articles was accompanied all along by that of books, such as The Golden Hordes (Turner & Ash, 1975), Tourism, the Good, the Bad, and the Ugly (Rosenow & Pulsipher, 1979), The Holidaymakers (Krippendorf, 1986), and Coping with Tourists (Boissevain, 1996). Overall, this stream of research paints a relatively coherent picture. Almeida García et al. (2015) claimed that while scholarly inquiry has generated consistent results about locals' appraisal of the economic effects of tourism, it has yielded mixed findings about their evaluation of the social and environmental ones. Generally, the former is positive (e.g. employment), whereas the latter are negative (e.g. congestion) (Almeida García et al., 2015). Thus, it seems that the adverse social and environmental effects offset, or even outweigh, the beneficial economic ones. Tourism scholarship seems to have followed that order. Indeed, Sharpley (2014) argued that the realization of the social and environmental disadvantages of tourism moderated the initial excitement for its economic advantages. A brief inspection of the reviews mentioned in Figure 10 suffices to identify three common limitations of research on resident attitudes and tourism impacts. In what follows these will be discussed. However, for the sake of readability, the author will not list all reviews for each aspect in an in-text citation parenthesis. Instead, a summary table of which publication mentions what point is provided in Table 8 in Appendix 1. First, some authors have noted the limited use of theory (Appendix 1Error! Reference source not found.). In their longitudinal review (n=140), Nunkoo et al. (2013) found that less than half of the publications (n=64; ≈46%) were 'theoretical'. At the same time, they observed that over time scholars have more and more embraced theory. Accordingly, they anticipated that this upward-rising tendency would persist in the future. Their bright prediction has recently been confirmed. In their systematic review (n=90), Hadinejad et al. (2019) found that more than half of the publications (n=50; ≈56%) were 'theoretical'. Moreover, there seems to be unanimous agreement on the fact that Social Exchange Theory (SET) is the most frequently used theoretical framework. In fact, without exception, in each review mentioned in Figure 10 the authors argued this to be case. The widespread adoption of this theory might be traced back to its advocacy by John Ap (e.g. Ap, 1992). Be that as it may, both the longitudinal and the systematic review mentioned above substantiate the currency of this theory.

Page 33 of 133

Nunkoo et al. (2013) found that a bit more than half (n=36; ≈56%) and Hadinejad et al. (2019) found that a little less than half (n=29; ≈45%) of the 'theoretical' publications adopted this framework, respectively. In spite of its widespread adoption, SET is not without its criticisms. Sharpley (2014) argued that it assumes a deliberate interaction between visitors and residents. This, however, need not always be the case. Inspired by the work of Krippendorf (1987; as cited in Sharpley, 2014), he put forward a spectrum of possible contacts between visitors and residents (Figure 12). At the one end, there is conscious trade. At the other end, there is inadvertent coexistence. The latter scenario reminds of the poetic description of overtourism by Singh (2018), who argued that it occurs “when local people cannot walk on the street without rubbing shoulders with crowd of tourists” (Singh, 2018, p. 415).

Figure 12: Spectrum of Visitor-Resident Interaction (Sharpley, 2014, p. 39)

In other words, Sharpley (2014) argued that SET is only applicable to the left half of the spectrum (Figure 12). In addition, he claimed that this model has been read in a superficial way - that is, disapproval of tourism by locals has been said to result from an unfavourable cost-benefit analysis on their part. Yet, in reality such straightforward logic is rarely adopted. Thus, he concluded that SET is ill-suited for this field of study (Sharpley, 2014). To be sure, one could find many more criticisms of this framework in the literature. However, this would turn what is meant to be a concise excursus into a tedious digression.

Page 34 of 133

Overall, SET seems to have been exhausted. Rasoolimanesh and Seyfi (2020) noted that in view of its shortcomings and its long-standing prevalence, scholars have attempted to integrate it with other frameworks and to amend it. The systematic review by Hadinejad et al. (2019) corroborates the adoption, albeit marginal, of alternative models. Indeed, they found that academics have employed, inter alia, social representation (n=4; ≈6%) and institutional theory (n=3; ≈5%). A variant of the latter has also been used in one of the publications identified in Study II (i.e. Taş Gürsoy, 2019). Be that as it may, Hadinejad et al. (2019) encouraged researchers to thoroughly deal with frameworks from other areas of inquiry, especially that of psychology. Second, some authors have observed the prevalence of quantitative methods (Appendix 1Error! Reference source not found.). Once again, the longitudinal and systematic reviews corroborate these claims. Nunkoo et al. (2013) found that over two thirds of the publications reviewed (n=101; ≈72%) employed a quantitative method. Hadinejad et al. (2019) went a step further and draw a distinction between data collection and data analysis. They found that a quantitative method was adopted in over four fifths of the publications (n=75, ≈83%) for both data collection and analysis8. Needless to say, the prevalence of quantitative methods is not without its consequences. In fact, some authors have ascribed the lamentable state of research to such preponderance. Most notably, Deery et al. (2012) claimed that scholarly inquiry has managed to discover numerous effects of tourism but has failed to unravel the rationale behind their (un-)favourable appraisal on the part of locals. They borrowed the onion model of culture from Rousseau (1990; as cited in Deery et al., 2012) – whose description would exceed the scope of this discussion – to show that the catalogue of effects generated by quantitative inquiry merely constitutes the outer skin. Accordingly, they advocated employing qualitative methods to scratch beneath the surface and transition from the description to the explanation of effects (Deery et al., 2012). Building on their work, Sharpley (2014) got to the heart of the matter, stating that academic inquiry “tends to describe what residents perceive, but not necessarily explain why” (Sharpley, 2014, p. 42). Third, some authors have noted the loose pertinence of research to reality (Appendix 1). Sharpley (2014) argued that the scholarly work does not mirror the rising importance of tourism

8 Figure to be verified with the article's lead author.

Page 35 of 133

in emerging states. In fact, he found that a great number of investigations has been conducted in the countryside in North America. Accordingly, he lamented that places whose economies hinge on tourism have considerably been neglected. Hadinejad et al. (2019) observed a chasm between academia and industry as well. At country level, they found that the lion's share of investigations has been performed in the United States (21%) and in China (10%). Inversely, at continent level, they found that most investigations have been carried out in Asia (30%) and in North America (26%). Having said that, they argued that official statistics indicate states in Europe, such as France, to be more appropriate study locations (Hadinejad et al., 2019). To be sure, the adoption of a one- dimensional figure, such as tourist arrivals, could be questioned. In fact, a two-dimensional indicator, such as tourism intensity, might be more telling. Nevertheless, the idea of using 'objective' figures to evaluate the need for studying the 'subjective' perception of locals sounds reasonable. Thus, overall, this review of reviews paints a tragic picture of research on resident attitudes and tourism impacts. In fact, it suggests that it has merely scratched the surface. Some authors put these disappointing findings quite elegantly. For instance, Deery et al. (2012) asserted that this field of study is stuck "in a state of arrested development" (Deery et al., 2012, p. 65). In addition, Sharpley (2014) maintained that “more limited progress has been made than the volume of research might suggest” (Sharpley, 2014, p. 39). However, on the bright side, this brief historical excursus offers two main take-home lessons. First, it demonstrates the value of systematic reviews. In fact, such investigations allow scholars to substantiate their claims with 'hard' evidence. This, in turn, lends weight to the work performed in this dissertation (Study II). Second, it raises several questions. In fact, this historical discussion makes one wonder whether research on overtourism has made itself guilty of the same 'mistakes' as that on resident attitudes and tourism impacts, or whether it has learnt from them. Specifically:

o Have scholars embedded their research in conceptual frameworks? o Have qualitative / mixed methods been used more than quantitative ones? o Has research been conducted in destinations in urgent need of solutions?

Page 36 of 133

3.2.3 (Tourism) Carrying Capacity

In a similar vein to the one about resident attitudes and tourism impacts, a review of carrying capacity in leisure and tourism would be a gargantuan undertaking. Hence, here too, a backward reference search was performed starting from the recent perspective articles by Wall (2020) and Butler (2019) to identify seminal works in the field (Figure 13). Thus, in what follows these works will be reviewed in chronological order, with a focus on the added content of each publication to avoid repetition.

Figure 13: Publications about Carrying Capacity

The roots of research on carrying capacity can be traced back to the monograph by Wagar (1964). He noted that even though often referred to in leisure research, carrying capacity lacks a recognized definition. For the aims of his publication, he interpreted it as “the level of recreational use an area can withstand while providing a sustained quality of recreation” (Wagar, 1964, p. 3). Yet, quality can be preserved at various degrees of use, as a balance is found at each point. Hence, carrying capacity has to be tied to administration goals, which serve to determine an acceptable grade of quality. In the case of leisure, the greater aim is to ensure visitor satisfaction, which stems from the achievement of wants. Thus, Wagar (1964) identified a set of motives for leisure and explores the influence of density9 on quality – not empirically, but conceptually – by sketching potential relations for the established purposes (Figure 34 in Appendix 2). For instance, if one longed for isolation, the presence of others would be detrimental (Figure 34-I). In contrast, if one yearned for company, the presence of others would be beneficial (Figure 34-K). In sum, he found

9 Wagar (1964) actually used the term 'crowding'. However, since he actually referred to 'density', the latter term is used. For thorough explanations of the difference between density and crowding, please read Stokols (1972) and Rapoport (1975).

Page 37 of 133

that the influence of density on quality hinges on the motives for leisure. Accordingly, carrying capacity is essentially about normative evaluations. A decade later, Wagar (1974) criticised the word 'capacity'. In fact, it implies that grounds for restricting use lie in the features of an area rather than in its influence on recreation quality. Also, this word conceals the fundamental difference between positive (is) and normative (should) matters. In this regard, he noted that determining degrees of use involves deciding what resulting alterations are admissible – and this is an evaluation of the latter kind, which has to be made by a person. Accordingly, Wagar (1974) recommended replacing 'carrying capacity' with expressions that help keep the attention on administration goals, such as 'use limits'. With regard to the latter, he advocated the adoption of a holistic perspective – that is, to assess their impacts on an entire territory rather than on solely part of it. Moreover, he argued that use restrictions are merely one of the measures people may put up with it in exchange for a satisfactory experience. Other trade- offs include admission charges and behavioural rules. Thus, administrative matters come with intricate challenges, which render “any search for an impersonal carrying capacity formula totally unrealistic” (Wagar, 1974, p. 278). Afterwards, Stankey and McCool (1984) performed a critical assessment of carrying capacity in leisure research drawing on the review article by Graefe et al. (1984, as cited in Stankey & McCool, 1984). They argued that this notion was not originally meant to explain the relation between contentment and number of people met, and identify six reasons for its feeble nature. First, visitors deliberately engage in leisure pursuits that yield enjoyment ('self-selection'). Second, when an area undergoes alteration, it gets visited by a different type of crowd ('displacement'). Third, visitors pursue leisure activities for several reasons. Thus, when confronted with circumstances unfavourable to the achievement of one objective, they give weight to another purpose attainable in that situation ('reducing dissonance'). Fourth, contentment is measured on a single-item scale, which requires respondents to summarise good and bad features into one figure. Such operationalization is at odds with the multi-faceted nature of events ('psychometric additivity'). Fifth, running into other people need not necessarily be unwelcome. In fact, visitors engage in leisure activities for reasons other than isolation, too. Thus, the influence of chance meetings on enjoyment hinges on the significance of one's purpose in a given situation ('saliency'). Sixth and last, contentment is influenced by visitors' liking for ('preferences') and anticipations of ('expectations') certain degrees of use. Moreover, and more importantly, Stankey and McCool

Page 38 of 133

(1984) claimed that the relation between number of people met and contentment is actually immaterial from an administrative point of view. Thus, they advocated moving from extent of use to aspired circumstances and put forward the 'Limits of Acceptable Change' (LAC) model10. In tourism research, the definition of carrying capacity by the UNWTO has grown to be the established one – in spite of falling under 'grey literature'. This institution defined carrying capacity as:

"the maximum number of people that may visit a tourist destination at the same time, without causing destruction of the physical, economic and sociocultural environment and an unacceptable decrease in the quality of visitors’ satisfaction" (UNWTO, 1981, p. 5)

The two underlined sets of words show that this interpretation of carrying capacity makes itself vulnerable to the same criticisms as that in leisure research. In fact, it implies that an excessive amount of people (extent of use) will lead to inadmissible reduction of enjoyment (normative evaluation). Be that as it may, the expression 'tourism carrying capacity' is also often associated with O'Reilly (1986). In his short article, this scholar acknowledged sub-dimensions of carrying capacity that had previously received little attention, such as the socio-cultural one. He argued that each has its own ceiling, and notes these may be at variance with one another. Finally, he stated that "capacity cannot be used as an absolute limit but as a means to identify critical thresholds which need attention" (O'Reilly, 1986, p. 258). Shortly before the turn of the millennium, Lindberg et al. (1997) discussed three shortcomings of carrying capacity. First, it offers limited direction for concrete application, as it hinges on normative standards that are either vague or impractical. Second, it gives the impression of being a value-less notion when yardsticks are by nature value-laden. Third, it places emphasis on degrees of use although administration goals are about aspired circumstances. Having said that, they identified prerequisites for carrying capacity to be valuable, such as accord on normative matters and authority to impose use restrictions. They found that these are seldom fulfilled,

10 For a detailed description of the LAC model, please read Stankey et al. (1985).

Page 39 of 133

wherefore they asserted that the “effective application of the traditional carrying capacity is difficult, if not impossible” (Lindberg et al., 1997, p. 463). Finally, they went a step further, and suggested that the expression itself ought to be dismissed, for it causes confusion. Shortly after the turn of the millennium, McCool and Lime (2001) published a critical review of carrying capacity. They noted that the relation between degree of use and extent of effect need not be linear (Figure 14). For instance, curve A shows a scenario in which little use suffices to swiftly bring about large effects. This would imply that adverse consequences can only be avoided by prohibiting use entirely, and that substantial cutbacks in use would be required to mitigate them. Alternatively, curve C shows a scenario in which large effects only occur after considerable use. This would suggest that sites have an inherent capacity, whose exceedance results in the drastic worsening of circumstances (McCool & Lime, 2001). The reasoning of scholars currently investigating the carrying capacity of urban destinations (e.g. Bertocchi et al., 2020; Tokarchuk et al., 2020) seems to be in keeping with the rationale underlying this last scenario.

Figure 14: (Non-)Linear Relations between Use and Impact (McCool & Lime, 2001, p. 375)

Moreover, McCool and Lime (2001) commented on seven aspects of carrying capacity. First, any degree of use has some sort of effect. Second, carrying capacity is frequently interpreted as an upper limit above which things go south. However, as adverse impacts are inevitable (1st

Page 40 of 133

point), such arithmetic interpretation is impractical and ill-suited. Also, the buildout of tourism is about compromises. For instance, people may put up with environmental drawbacks (e.g. pollution) in exchange for economic benefits (e.g. employment). Third, the relation between degree of use and extent of effect is ambiguous, as a multitude of factors influences it (Figure 14). Fourth, the determination of a carrying capacity unavoidably leads to the imposition of restrictions – and this comes with considerable allocative implications. Fifth, carrying capacity confounds positive (is) with normative (should) matters – a criticism already voiced by Wagar (1974). Sixth, it implies that systems are steady when in reality the opposite is the case. Seventh, and last, carrying capacity is about setting the right focus, namely establishing reasonable circumstances, rather than excessive degrees of use, for a given place. Thus, they asserted that “the concept of a tourism and recreation carrying capacity maintains an illusion of control when it is a seductive fiction, a social trap, or a policy myth” (McCool & Lime, 2001, p. 386). Accordingly, they recommended abandoning it and adopting fundamentally different models, such as LAC, instead. Needless to say, this stream of research is not without its criticisms. In the author´s opinion, two of them are worth mentioning. To begin, leisure research in North America has been carried out by a relatively small group of researchers, which includes Jerri J. Vaske, Lori B. Shelby, Thomas A. Heberlein, and Alan R. Graefe. Having a second name is not the only characteristic these authors share. A brief look at the bibliographies of their publications suffices to suggest that they rarely cited sources outside their own field. Yet, crowding has been studied in disciplines other than leisure, too. For instance, in retail one can find conceptual (e.g. B. B. Anderson, 1976; Harrell & Hutt, 1976; Eroglu & Harrell, 1986) as well as empirical (e.g. Harrell et al., 1980; Eroglu & Machleit, 1990) articles on this subject. In addition, these leisure researchers have mostly employed the single-item scale by Heberlein and Vaske (1977, as cited in Shelby et al., 1989). This instrument is inherently flawed, as it allows for various shades of ‘crowded’ but precludes any nuances of ‘uncrowded’ (Figure 15). In spite of that, they abide by the old exhortation “to use this scale in other studies” (Shelby et al., 1989, p. 288).

Page 41 of 133

Figure 15: Crowding Scale (Neuts & Nijkamp, 2012, p. 2140)

The contribution of this historical excursus is twofold. First, it shows that the determination of a numerical carrying capacity in cities would raise several questions. For instance, who would decide on the threshold (e.g. scientists vs politicians)? Moreover, and more importantly, what would people do if they found it (e.g. restrict access, impose taxes)? Thus, the reduction of tourism carrying capacity to a single number seems to remain a mirage. This, in turn, begs the question of whether – in spite of the lessons this stream of research has taught – scholars investigating overtourism have fallen under the spell of the 'magic number'. Or, more formally:

o Do scholars further pursue the quest for a numerical carrying capacity?

Page 42 of 133

3.3 Overtourism Measurement

Like the phenomenon of overtourism, the measurement of tourism pressure is “old wine in new bottles” (Dredge, 2017). In fact, it may already have turned into vinegar. Around the middle of the past century, members of the International Association of Scientific Experts in Tourism (AIEST) were debating on the measurement of tourism. Markos (1949) noted the limited meaning of absolute figures in national tourism statistics. Hence, he suggested the adoption of a common denominator, namely each country's respective area. He named it “le degré d'intensité du tourisme“ (Markos, 1949, p. 132). Sundt (1950) argued that this measure is not well suited for countries with either a very small (e.g. Monaco) or a very large (e.g. Australia) area. Accordingly, he proposed using the resident population as common denominator. He referred to this measure as “l'intensité touristique” (Sundt, 1950, p. 4). Around seventy years later, scholars are still discussing the denominators of these two indicators, albeit under the masquerade of 'overtourism' (e.g. Amore et al., 2020; Weber et al., 2019). This seems to suggest that, with respect to the measurement of tourism pressure, the scholarly community may have remained 'none the wiser' (Ap, 1990, p. 615) after all.

3.3.1 State of the Art

The author finds that little attention has been paid to the measurement of 'overtourism' - an aspect noted by De La Calle-Vaquero et al. (2020), too. This comes as no surprise. In fact, it is natural for scholars to address the measurement of a phenomenon after having dealt with its conceptualisation. Deery et al. (2012), for instance, found this to be the case with research on the social effects of tourism. Since overtourism is a relatively recent phenomenon at the time of writing (2020-2021), only a handful of publications entirely dedicated to its measurement could be identified. In what follows, these will be reviewed in chronological order (Figure 16). Interestingly, this mirrors the evolution of overtourism. Indeed, this buzz word has transitioned from media to academia (Koens et al., 2018). In a similar way, research using this term has been published first in industry reports (2017-2019), then in book chapters (2019-2020), and finally in journal articles (2020-2021).

Page 43 of 133

Figure 16: Publications about Overtourism Measurement

Two words of caution are in order before delving into the literature review. First, the author would like to emphasize that only studies about 'overtourism' have been examined. Thus, publications in which scholars attempted to establish a destination's carrying capacity, such as the research note by Tokarchuk et al. (2020) or the journal article by Bertocchi et al. (2020), will only be acknowledged. Second, the author would like to point out that the latest research on this subject published in the 'Special Issue on Measuring and Monitoring Overtourism' in the Journal of Travel & Tourism Marketing (Wattanacharoensil & Weber, 2020) has not been reviewed, yet. It will, however, be integrated into the final dissertation.

3.3.1.1 Industry Reports

The first three seminal publications about overtourism measurement are non-academic (Figure 16). To begin, Guevara Manzo et al. (2017) came up with nine indicators. Of these, two are about the significance of tourism and seven about the implications of overcrowding. The authors computed these indicators for several cities (n=68), which they grouped into five percentiles for comparison11 (Guevara Manzo et al., 2017). Some of the measures calculated in this dissertation (Study III) will be compared against these percentiles, as done, for instance, by Alcalde Garcia et al. (2018). Next, Peeters et al. (2018) conducted a large-scale study at regional level (NUTS 2)12. They identified eight measures as indicative of overtourism hazard. Of these, five are about relative shares and three about proximity to certain infrastructure. Their study built in several respects on the one by Guevara Manzo et al. (2017). For instance, the authors adopted

11 1st → 5th percentile: risk = high → low (Guevara Manzo et al., 2017) 12 NUTS: Nomenclature of Territorial Units for Statistics

Page 44 of 133

two measures from that study, namely 'tourism share of GDP' and 'air transport seasonality' (Peeters et al., 2018, pp. 74–75). In addition, since the data is skewed and kurtic, the authors classified the regions into quintiles, too. To be exact, though, they reversed their polarity13 (Peeters et al., 2018, p. 44). That said, it cannot be emphasized enough, that in both reports the categories indicate varying degrees of potential hazard of overcrowding (Guevara Manzo et al., 2017) and overtourism (Peeters et al., 2018), respectively – not varying extents of actual occurrence of either phenomenon. In both publications, the authors pointed this out explicitly. These two studies are not without criticism. Peeters et al. (2018) argued that the approach by Guevara Manzo et al. (2017) has four shortcomings. First, the authors only considered urban destinations. Second, examining solely places where overtourism is felt hinders the determination of limits beyond which this phenomenon gets perceptible. Third, data for cities tends to lack completeness and comparability. Fourth and last, the authors mixed measures about antecedents with those about consequences of overtourism. In addition, Peeters et al. (2018) also acknowledged the limitations of their own study. Most notably, they could not identify critical thresholds for their measures, as these fluctuate a great deal across regions. Instead, they put forward a tentative catalogue of questions (n=10), whose answers indicate the extent to which a place is exposed to overtourism (Peeters et al., 2018, p. 79). Finally, Weber et al. (2019) brought it all together by calculating traditional statistics, measures from both reports, and alternative figures for a few destinations (n=9). Most importantly, though, they conveyed the importance of the denominator in two-dimensional indicators, such as tourism density and intensity (Weber et al., 2019). In this regard, Guevara Manzo et al. (2017) used the surface containing the twenty most popular sights as denominator for tourism density. Amore et al. (2020) were a little more conservative and used the surface of just the ten most popular sights to compute such measure. As for tourism intensity, the latter used the number of residents in the 'city' instead of in the 'larger urban agglomeration' as denominator (Amore et al., 2020, p. 122). This distinction brings to mind that made by Eurostat between 'city' 'functional urban area', and 'greater city' (Kotzeva et al., 2019, p. 14). It also evokes the differentiation made by TourMIS between 'city area only' and 'greater city area' (TourMIS, n.d.–a).

13 1st → 5th percentile: risk = low → high (Peeters et al., 2018)

Page 45 of 133

3.3.1.2 Academic Publications

In academia, four publications about overtourism measurement could be identified. Of these, two are book chapters (De La Calle-Vaquero et al., 2020; Visentin & Bertocchi, 2019) and two are journal articles (Amore et al., 2020; Perles-Ribes et al., 2020).

3.3.1.2.1 Book Chapters

Visentin and Bertocchi (2019) thoroughly examined the state of tourism in Venice. In particular, they analysed the infiltration of tourism into non-touristic areas of the city. To this end, they calculated the ratio of beds in vacation rentals (i.e. Airbnb) over beds in traditional lodging establishments (e.g. hostel), by district. Even though they found high values for less frequented districts, they argued that vacation rentals are only half the story. In fact, these are still reconvertible. Thus, they examined the commercial offering of the districts. To this end, they computed the ratio of alimentary over non-alimentary stores by district. Rather unsurprisingly, they found a high value for the city's most popular district: San Marco. Moreover, and perhaps more interestingly, they drew a connection between the two analyses and discovered that while traditional lodging establishments go hand in hand with non-alimentary stores, vacation rentals go arm in arm with alimentary stores. This result suggests different consumption patterns on the part of the visitors. In sum, they concluded that overtourism is “irreversibly changing the balance of the city´s economic landscape” (Visentin & Bertocchi, 2019, p. 19). Their conclusion echoes the stream of research on 'new tourism areas' (Maitland & Newman, 2009b), and also reminds of the words by Magnus Enzensberger, who stated that “the tourist destroys what he seeks by finding it". In fact, it is safe to say that, eventually, more and more visitation will turn the once authentic place into a touristic hotspot. This, in turn, will lead visitors to swarm to another 'off the beaten path' area, thereby furthering this vicious cycle. De La Calle-Vaquero et al. (2020) performed a comprehensive review of research on overtourism measurement. They drew a distinction between tourism activity and tourism specialisation indicators. The former merely measure the quantity of tourism (e.g. number of overnights). Hence, they are not well suited to reveal tourism pressure. Nevertheless, examining their evolution (growth) and distribution (seasonality) over time enhances their value. The latter

Page 46 of 133

link an element concerning tourism with one regarding the destination (e.g. tourism intensity). Moreover, and more importantly, these authors acknowledged that the measurement of overtourism faces the same challenges as the determination of carrying capacity – that is, “the inability to establish values based on which many visitors can be considered too many” (De La Calle-Vaquero et al., 2020, p. 319). On a side note, De La Calle-Vaquero et al. (2020) acknowledged big data as an additional data source. In spite of understanding its appeal, in the author's opinion this type of data is hard to 'institutionalise' for two reasons. First, using data from a company makes DMOs dependent on them. This, in turn, is not reliable in the long run. For instance, Kádár (2014) analysed tourists' consumption of Budapest with data from Flickr. The decline of this platform would have left DMOs relying on its data empty-handed. Hence, the author suggests enjoying the advent of big data with due caution.

3.3.1.2.2 Journal Articles

Amore et al. (2020) attempted to create a compound measure of overtourism. To begin, they computed traditional statistics in an innovative way. In the case of tourism density, they replaced the surface of the destination with that containing the ten most popular sights on TripAdvisor. In other words, they simply changed the denominator. In the case of tourism intensity, they were a little bolder and changed the numerator, too. That is, in addition to selecting the number of residents in the city instead of in the metropolitan area for the denominator, they also substituted the number of overnights with that of museum visitors in the numerator (Amore et al., 2020). While their choices are justified, they are not ground-breaking. In fact, other authors have already calculated tourism density using other areas (Guevara Manzo et al., 2017; Weber et al., 2019). Similarly, TourMIS gives users the possibility to select statistics for 'city area only' or for 'greater city area' (TourMIS, n.d.–a). Be that as it may, Amore et al. (2020) computed four measures for a few cities in Europe (n=15). They then standardised them14 and, finally, treated the simple mean as compound measure for overtourism.

14 Z-score standardisation by Amore et al. (2020): z = (x- x̅ ) / s

Page 47 of 133

Amore et al. (2020) adopted a purely quantitative – and thus allegedly objective – approach. Indeed, they explicitly stated to have favoured this over the subjective perspective (Amore et al., 2020, p. 121). In contrast, Perles-Ribes et al. (2020) prided themselves on appreciating the subjective nature of overtourism. In short, they operationalised overtourism as membership in the 'Network of Southern European Cities against Touristification'. The authors considered deliberately being part of this association a sign of a destination's concern for the adverse effects of tourism. Accordingly, they argued that the adoption of this measure differentiates theirs from other studies (Perles-Ribes et al., 2020). This line of reasoning seems a little far-fetched, though. In fact, the decision to join such association may be motivated by other factors, such as political reasons. In this regard, Perles-Ribes et al. (2020) only contemplated one alternative hypothesis, namely that of member destinations being particularly responsive to the phenomenon under scrutiny. Thus, they examined whether members of this network score higher on selected dimensions from Peeters et al. (2018). Otherwise, however, the authors did not attempt to rule out other surrogate options. Be that as it may, they modeled overtourism as a function of a destination's competitiveness as well as of its pressure from and dependence on tourism. They computed that model using different machine learning methods, first separately and then in conjunction15 (Perles- Ribes et al., 2020).

15 Methods adopted by Perles-Ribes et al. (2020): Logit Model, Naïve Bayes, Support Vector Machines, (Classification and Regression Tree).

Page 48 of 133

4 Methodology

This dissertation includes three studies. In this section, their methodologies will be presented, albeit only briefly. In-depth descriptions will be provided in the respective sections. In the first study, the author will analyse news articles about overtourism. To be more precise, content from the following five newspapers will be examined: Guardian, Independent, Financial Times, Times, and Telegraph. These are the main 'broadsheets' in the United Kingdom (Oxford Royale Academy, n.d.). Structural topic modeling (Roberts et al., 2014) will be performed to identify the 'attributes' (agenda-setting), or 'frames' (framing), used in the portrayal of this phenomenon. Most importantly, though, this method will only be a point of departure – not one of arrival. In line with Computational Grounded Theory (Nelson, 2020), topic modeling will solely be employed at the beginning of the analytical journey to let the words speak for themselves. The findings will then be fine-tuned and validated empirically (Nelson, 2020). While the successful application of this inductive-deductive framework will only be a drop in the ocean, it makes one step forward in overcoming the long-standing "artificial dichotomy of inductive versus deductive reasoning" (Mazanec, 2009, p. 320). In the second study, the author will examine academic publications about overtourism. More precisely, a bibliometric analysis will be conducted. This investigation will consist of two parts. In the first one (descriptive), the landscape of research on overtourism will be mapped. Then, bibliographic metadata will be summarised and topic modeling will be performed on the abstracts of selected documents. In the second part (inferential), a citation analysis will be carried out to ascertain the origins of overtourism research empirically. This knowledge, in turn, will enable the author to evaluate the “usefulness” of this stream of research. Here, this will be judged according to i) whether it has made itself vulnerable to the same criticisms of its precursors, and ii) its boldness. In other words, these analyses will establish whether overtourism research has advanced our knowledge or whether it has left us "none the wiser" (Ap, 1990, p. 615) after all. In the third study, selected indicators of tourism pressure will be computed for Vienna (Austria) to demonstrate the limited significance of allegedly objective measures. Accordingly, the author will undertake a leap to subjective indicators. In view of COVID-19, this will lead him to the diametrical opposite of overtourism – that is, “undertourism”. Specifically, the experience of the destination without tourism on the part of residents will be examined. Moreover, and perhaps more importantly, the author had already asked residents how they would imagine their destination

Page 49 of 133

without tourism before the outbreak of the pandemic. Back then, he posed this question out of a flash of inspiration, which, in turn, was motivated by the given that “mostly it is loss which teaches us about the worth of things” (Arthur Schopenhauer). In fact, never would he have thought that such apocalyptic scenario could become reality. Well, COVID-19 proved him wrong! Be that as it has been, in this study, residents´ hypothetical imagination of their destination without tourism (2018-2019) will be compared with their actual experience thereof (2020-2021). In the author´s opinion, this represents the unique contribution of this last study.

Page 50 of 133

5 Empirical Part

5.1 Study I

5.1.1 Rationale for this Study

In spite of the attention overtourism has received from the media, to the best of the author´s knowledge, in only three studies have researchers investigated how this phenomenon is depicted in the news. In what follows their shortcomings will be pinpointed. The author would like to emphasise that the aim is not to bash other researchers, but rather to highlight the added value of this study. To begin, Phi (2020) carried out content analysis on news articles in English. Her research suffers from three limitations. First, since the term "overtourism" was reportedly invented in 2016 (Ali, 2018), it is rather surprising that she found articles dating all the way back to 2008. This might be due to the fact that the platform she used disregards hyphens (LexisNexis, n.d.). If not get to the bottom of such odd results, she could have at least addressed them. Second, she did not provide concrete information about the sources of the articles. In fact, she only vaguely referred to them as "leading international/national newspaper outlets" (Phi, 2020, p. 2093). Such imprecise indication is of little help to anyone wishing to replicate her study. Third, her analytical effort was probably rather limited, as she fed the articles to a software, whose slogan used to be “text in insight out” (Leximancer, n.d.). Next, Pasquinelli and Trunfio (2020) carried out a thematic narrative analysis on news articles in English. Their study is not bulletproof either. First, they collected articles from pro- tourism outlets, such as Adventure Travel News, and from neutral outlets, such as The Independent. Thus, they mixed apples and oranges. If not considered, this should at least have been addressed in the analysis. Second, they inferred topics from the appearance of the most common terms together with "overtourism". This reminds of the approach adopted by Capocchi et al. (2020), who derived topics from the most common keywords of academic publications. Be that as it may, such inference seems a little far-fetched. In fact, there are more appropriate techniques to identify topics, such as Latent Dirichlet Allocation (Blei et al., 2003) and Structural Topic Modeling (Roberts et al., 2014). The latter will be used in this study. Finally, Clark and Nyaupane (2020) performed inductive and deductive content analysis on news articles in English. Before critically reviewing their article, a word of praise is indicated.

Page 51 of 133

In fact, these two authors are the only ones to have adopted a conceptual framework from mass media – that is, framing theory by Entman (1993, as cited in Clark & Nyaupane, 2020). Having said that, their study differs in a few fundamental respects from the present one. To begin, they identified destinations linked with overtourism first, and then looked for articles about them. In this study, the reverse approach will be adopted. In addition, they applied ex ante defined frames. In this study, the frames, or rather the topics, will be determined ex post – thereby letting 'the text speak for itself'. Finally, like Pasquinelli and Trunfio (2020), the authors analysed articles from different types of outlets. However, unlike these two scholars, they took this into consideration during the analysis. In contrast, in this study, only articles from broadsheet newspapers will be considered. In sum, this study differs in several respects from the ones mentioned above. In a nutshell, it boasts a larger sample (n>300), covers the whole lifecycle of overtourism (2016-2020), adopts a more sophisticated method (structural topic modeling), and, most importantly, borrows theory from another discipline. Hence, the added value of this research. Similarities and differences between the present and the three aforementioned studies are summarized in Table 3.

Pasquinelli and Trunfio Clark and Nyaupane Criterion Phi (2018) Present Study (2020) (2020)

overtourism, Keyword(s) overtourism > overtourism over-tourism

Sample Size n = 202 n = 56 n = 85 n > 300

Article Language English English English English

Time Period 2008 - 2018 05/2018 - 08/2018 - 06/2016 - 06/2020

news media outlets, Article Source established newspapers online news broadsheet news credible publications

thematic narrative inductive and deductive structural topic Method content analysis analysis content analysis model

agenda-setting and Theory - - framing theory framing theory

Software Leximancer NVivo - R (Studio)

Table 3: Present vs Previous Studies

Page 52 of 133

5.1.2 Methodology

5.1.2.1 (Structural) Topic Modeling

Much of the research conducted on topic models builds on the work by Blei et al. (2003). In fact, in this publication the authors put forward the Latent Dirichlet Allocation (LDA). Their work has been a catalyst for further research, as evidenced by the 36.800+ citations received thus far, according to Google Scholar (30th March 2021). Following the example by Blei et al. (2003), the meaning of central terms is explained right at the outset. Thus, in an effort to bring the description of this method closer to the subject of this examination, what they named 'document' and 'corpus' will here be denoted as 'article' (news) and 'collection' (dataset), respectively. In contrast to Blei et al. (2003), LDA will only be explained conceptually, though. Hence, the curious reader is redirected to their original paper for technical details, such as the calculation of posterior distributions and the like. LDA is defined as "a generative probabilistic topic model" (Blei et al., 2003, p. 997). Its fundamental proposition is that articles display several topics (Blei, 2012). This sounds reasonable. Indeed, an article about overtourism may primarily deal with resident attitudes, but could also touch upon other subjects, such as gentrification and degrowth. Specifically, LDA posits that a collection of articles is formed by a fixed set of topics, each of which comprises a given set of words. Thus, the creation of an article consists of two steps. In the first one, the constellation of topics is determined (1). In the second one, every word is allocated to a topic (2A) and selected from that topic's pool of words (2B) (Blei, 2012). This creation process is shown in Figure 17.

Page 53 of 133

(1)

(2B) (2A)

(2B) Figure 17: Topic Modeling (Blei, 2012, p. 78)

Blei (2012) made a very interesting observation in this regard – that is, the articles are revealed, whereas the constellation of topics is concealed. Accordingly, topic models aim to derive the invisible from the visible, thus "reversing the generative process" (Blei, 2012, p. 79). As mentioned above, LDA has been a springboard for further research. Blei et al. (2003) praised it for its versatility. Almost a decade later, Blei (2012) observed that several modifications have been developed. For instance, Blei and Lafferty (2006a) acknowledged one shortcoming of LDA, namely its inability to account for associations between topics. Therefore, they developed the Correlated Topic Model (CTM), which, as the name suggests, takes such inter-relations into account. Specifically, this shortcoming is ascribed to the independence postulate of the Dirichlet distribution. The CTM circumvents this impractical restriction by working with a logit-normal distribution instead. They put this topic model through its paces and found that it outperforms LDA in both exploration and prediction (Blei & Lafferty, 2006a). In addition, Blei and Lafferty (2006b) noted that the exchangeability postulate of LDA is often ill-suited. In fact, in various collections

Page 54 of 133

of records16, topics change with the passing of time (e.g. academic publications). Therefore, these authors advanced the Dynamic Topic Model (DTM). As the name suggests, this variant takes into consideration the temporal development of topics. In short, it does so by splitting a collection of records into time spans and deriving the topics of an interval from those of the previous one. The resulting topics are then used to create the records of that interval. The authors put the DTM to the test and found that it yields good results (Blei & Lafferty, 2006b). To be sure, there are countless other alterations of LDA. However, reviewing them would exceed the scope of this section. The CTM could be quite useful to identify overtourism themes. Indeed, several overtourism-related issues are interconnected, such as vacation rentals and urban gentrification. The DTM, in contrast, would be less helpful. In fact, as overtourism has only been studied for a few years, the temporal sequence is only of marginal relevance. Nevertheless, it might become more important in the future. For instance, if scholars were to analyse news coverage about overtourism in a few years, they would be well advised to divide their datasets into before, during, and after COVID-19. Be that as it may, the method of choice for this study is the Structural Topic Model (Roberts et al., 2014). Even though their publication has 'only' received 900+ citations, according to Google Scholar (30th March 2021), their work is a gateway to further research, too17. Roberts et al. (2016) emphasize that they have created STM as a generic topic model for social science researchers, because the development of ad hoc ones would otherwise be an extremely onerous task for most such scholars. According to Roberts et al. (2014), the value of STM lies in its ability to integrate relevant variables into the determination of the popularity and the composition of topics. Broadly speaking, STM belongs to the family of 'unsupervised models'. As such, it lets topics emerge from words a posteriori. More specifically, STM falls into the sub-family of 'mixed- membership models'. As such, it lets articles display several topics. Thus, in short, STM can be defined as “a mixed membership model for the analysis of documents with meta-information” (Roberts et al., 2016, p. 1000). STM is only described theoretically here. For technical details, the reader is redirected to Roberts et al. (2016). Thus, according to Roberts et al. (2014), like every

16 The more general term ´record´ is deliberately used here to remind the reader that topic modeling need not be restricted to text – an aspect stressed by its inventors, too (Blei et al. (2003). 17 To be precise, this method had already been introduced by Roberts et al. (2013).

Page 55 of 133

other topic model, STM consists of two building blocks (Figure 18). In the first one, the share of words that can be allocated to any one topic is established for every single article ('topical prevalence'). In the second one, the sets of words that have the highest probability of arising from any one topic are determined ('topical content'). Unlike in every other topic model, though, in STM, both can be influenced by characteristics of the articles ('covariates') (Roberts et al., 2014).

Figure 18: Structural Topic Modeling (Schmiedel et al., 2019, p. 945)

Finally, the author is concerned about the interpretability of the topics. In fact, it has previously been noted that “latent topics are not amenable to substantive interpretation” (Dickinger et al., 2017, p. 812). Hence, an attempt will be made to perform topic modeling with 'word embeddings'. This being only an aspiration of the author, a thorough discussion of that technique would be premature. In short, words are placed in a vector, which contains information about its context (Žižka et al., 2019). The linguistic concept underpinning this approach – that is, the distributional hypothesis by Harris (1954) – will be discussed in the dissertation. For the time being, it shall suffice to put it in Firthian terms: “You shall know an object by the company it keeps!” (Firth, 1962, p. 179). Even though, there are some relevant packages in Python, such as lda2Vec (Moody, 2016) and Top2Vec (Angelov, 2020), this attempt will be performed in R. The reason for

Page 56 of 133

this is quite simple: the author does not master the other programming language. Having said that, the word2vec (Wijffels, 2020c) package in R will be used to create the word embeddings. This relies on the homonymous algorithm by Mikolov et al. (2013).

5.1.2.2 Suitability of the Method

The reader may wonder why the author chose to use 'topic modeling' to analyse the 'frames' (framing), or 'attributes' (agenda-setting), of overtourism in news media. After all, these seem to be two conceptually distinct entities. In computer science, a topic is "a distribution over a fixed vocabulary" (Blei, 2012, p. 78). In communication research, a frame is "a central organizing idea or story line that provides meaning to an unfolding strip of events" (Gamson & Modigliani, 1987, p. 143). Turning to the field of linguistics helps unveil the common ground between these two entities. According to the Cambridge Dictionary of Linguistics, topics have an effect on the register used, which includes – inter alia – words (Brown & Miller, 2013). Put simply, if newspapers were to emphasize the conflict between residents and tourists, they would probably use words like 'angry' and 'backlash', whereas if they were to stress the sharing economy they would probably use words like 'ban' and 'rentals'. Hence, the connection between the two entities. Some scholars have explicitly addressed the suitability of topic modeling to the identification of frames. DiMaggio et al. (2013) argued that topics could be regarded as frames. Ylä-Anttila et al. (2018) went a step further and claimed that three conditions have to be fulfilled for this to be the case. First, framing needs to be considered as inter-concept associations. Second, the text to be placed under scrutiny needs to be about one sole matter. Third, the resulting topic model needs to be validated (Ylä-Anttila et al., 2018). In this study, McCombs and Ghanem's (2001) conceptualisation of framing is adopted (1st condition). In addition, the corpus to be examined consists only of news articles about 'overtourism' (2nd condition). Finally, the identified topic model will be subject to validation (3rd condition). Furthermore, even though the use of topic modeling for the identification of frames is a relatively new ground, a few applications can be found. Scholars have for instance done this on the subject of the Refugee Crisis (e.g. Heidenreich et al., 2019) and Covid-19 (e.g. Yu et al., 2020). Thus, overall, this brief discussion suggests that topic modeling is suitable for identifying frames, or attributes, in thematically restricted news articles.

Page 57 of 133

5.1.2.3 Some Thoughts

The reader may observe that the author could have opted for supervised machine learning, as done by Burscher et al. (2014) for instance. However, Nicholls and Culpepper (2020) noted that such approach comes with the risk of succumbing to a particular bias – that is:

“there is a danger of falling into the drunk under the streetlight syndrome, in which we focus on the points we can see under the streetlight of supervised methods, and failing to explore the darker terrain of unconsidered and unexpected potential frames.” (Nicholls & Culpepper, 2020, p. 3)

In addition, these authors put k-means clustering, exploratory factor analysis, and structural topic modeling to the test and found that the latter one outperforms the former two. In more detail, they concluded that this method works well, albeit only for thematically restricted content (Nicholls & Culpepper, 2020). Since this study only examines articles about one subject – overtourism – such constraint should not pose a problem. The reader may also question the role of the researcher in such approach. According to Grimmer and Stewart (2013), by no means does automated text analysis make scholars redundant. Rather, they continue to play an important role throughout the entire investigation (Grimmer & Stewart, 2013). This point of view is shared by other scholars, too. For instance, DiMaggio et al. (2013) argued that since the software has no knowledge of the text it processes, it is their own reading of the topics that imbues them with meaning. Accordingly, they consider topic models a point of departure rather than one of arrival (DiMaggio et al., 2013). In sum, the literature seems to suggest that, at last, "the burden of making sense of the results is still on the researcher" (Jacobi et al., 2016, p. 103). In addition to the role of the scholar, the reader may also question the role of theory in such approach. Indeed, the advent of big data has raised doubts about the need for theory. Most notably, the CEO of Wired, Chris Anderson, published a provocative article titled “The End of Theory” (C. Anderson, 2008). His piece of writing was the spark that ignited the debate about the role of theory in science, as evidenced by the plethora of scholars citing his work (e.g. Boyd & Crawford, 2012). In this regard, Mazanec (2020) argued that research is never free from theory, as “we are bound to hypothesize if we like it or not” (Mazanec, 2020, p. 5).

Page 58 of 133

The 'Computational Grounded Theory' framework introduced by Nelson (2020) nicely rounds up the two previous paragraphs. In short, this is a methodological workflow that consists of three phases (Figure 19). In the first one ('Pattern Detection'), the analyst uses software tools to bring the text into a more intelligible format. This might unveil aspects that had not been thought of. In addition, the analysis is documented and can therefore easily be replicated. In the second phase ('Pattern Refinement'), the analyst can simply inspect the most relevant documents of each topic instead of having to browse through all of them. This selective engagement with the data guarantees the trustworthiness of its reading as well as the completeness of its coverage. In the third stage ('Pattern Confirmation'), the analyst examines the generalisability of his or her results with other text mining tools. This empirically validates – or better said, 'corroborates'18 – the findings from the two previous phases. In sum, this framework is "a three-step approach that combines inductive grounded theory with deductive quantitative tests" (Nelson, 2020, p. 5). Interestingly, the clever combination of rivalling research strategies (inductive vs deductive) and approaches (qualitative vs quantitative) calls into question the usefulness of these long-standing dichotomies. This, however, shall by no means overshadow another noteworthy aspect of this framework, namely its focus on replicability. The latter is of utmost importance, especially considering the heatedly debated 'reproducibility crisis' of the scientific realm19.

18 See Popper (1934/1959). 19 See e.g. Baker (2016).

Page 59 of 133

2. Pattern Refinement

• Inductive • Deductive

• e.g. STM • Guided deep reading • e.g. NLP

1. Pattern Detection 3. Pattern Confirmation

Figure 19: Computational Grounded Theory Adapted from Nelson (2020, p. 14)

Thus, the author will perform structural topic modeling on news articles to detect patterns, fine-tune these by carefully reading the most representative ones for each topic, and then corroborate these empirically by means of other text mining tools (Figure 19).

5.1.2.4 Applications in Tourism

A search for prior applications of structural topic models in tourism research was performed by sifting through the publications in tourism journals that cited Roberts et al. (2014). The underlying assumption was that if someone used this method, she or he would also reference its developers. Relatively few publications were identified. Nevertheless, a few comments can be made. To begin, STM has mainly been used to analyse reviews – be those of restaurants (e.g. Wen et al., 2020), airlines (e.g. Stamolampros et al., 2019) or hotels (e.g. Hu et al., 2019). To the best of the author´s knowledge, this method has not been used to examine news media in tourism research to date. Thus, the adoption of STM for the analysis of news articles represents a methodological contribution to tourism research.

Page 60 of 133

5.1.3 Data Collection

A search was performed in Nexis® Uni for news articles in English containing the term "overtourism" published between 1st January 2016 and 31st December 2020. The variant "overtourism" was deliberately not used, as this platform disregards hyphens (LexisNexis, n.d.). The search was also limited to the following five sources: Guardian, Independent, Financial Times, Times, and Telegraph (n=602). These include different editions (e.g. The Sunday Times) as well as sister outlets (e.g. The Observer). The detailed breakdown is provided in Table 9 in Appendix 3. The list of articles was exported as .xlsx file from this platform for preliminary analysis. For the dissertation, duplicates and irrelevant articles will be removed. A few words about the choice of newspaper is in order. According to the Oxford Royale Academy (n.d.), the five selected outlets belong to the so-called “broadsheets” and each has its own political inclination. As can be seen from Table 4, these five papers cover the whole political spectrum. At the same time, the distribution of articles leans toward the right (≈ 51%). Having said that, since political orientation is not of central interest to this investigation, this slight imbalance shall not be a cause for concern.

Political Source Articles21 Orientation20 Guardian left-wing 46

Independent centre-left 113

Financial Times centrist 33

Times centre-right 104

Telegraph right-wing 306

Total 602 Table 4: Selected Newspapers

20 According to Oxford Royale Academy (n.d.). 21 Approximate number of articles before removal of duplicates.

Page 61 of 133

From the list exported from Nexis® Uni only the following information was kept: headline, publisher and date. Even though topic modeling could be performed on the titles of the articles, doing so on their bodies allows a much 'richer' analysis. Therefore, for the dissertation, the full text of each relevant article will be manually copy-pasted into Excel. At first glance, this may seem like an incredibly tedious task. On closer examination, though, the two alternatives – setting up a web crawler and using an Application Programming Interface (API) – are not appreciably less demanding, at least for a social scientists. Hence, the author´s decision.

5.1.4 Preliminary Results

Even though the first phase of the analysis consists in letting the data speak for itself, the mind of the author is no blank slate. In fact, as Prof. Mazanec asserted over a decade ago and corroborated last year, human beings "cannot not hypothesize" (Mazanec, 2009, p. 320, 2020, p. 5). Having read the articles by Clark and Nyaupane (2020), Pasquinelli and Trunfio (2020) and Phi (2020), this is even more so the case. Thus, the author will embed his a priori expectations, along with their underlying rationale, in the subsequent presentation of preliminary results.

5.1.4.1 Descriptive Results

In this subsection, basic descriptive aspects are presented. One such feature will be the temporal distribution of news coverage about overtourism. The three studies reviewed earlier (Clark & Nyaupane, 2020; Pasquinelli & Trunfio, 2020; Phi, 2020) offer little insight into the temporal distribution of news articles. Understandably22, none of them has considered the entire lifecycle of news coverage about overtourism. Therefore, it comes as no surprise that they have not resorted to any theory for the temporal distribution of news articles. In contrast, this study

22 Since the articles by Clark and Nyaupane (2020), Pasquinelli and Trunfio (2020) and Phi (2020) have been published online on 4th August 2020, 7th May 2020 and 21st May 2019, respectively, their data collection must have taken place before those dates. Hence, the rationale for 'Understandably'.

Page 62 of 133

examines news coverage about overtourism from its start (2016) to its end (2020)23, wherefore a reference to a conceptual framework is indicated. The temporal distribution of the unfiltered headlines retrieved from Nexis® Uni (n=602) was summarised by quarter and plotted by means of a line graph in Excel. The visualisation suggests that news coverage about overtourism might have followed the issue-attention cycle by Downs (1972) (Figure 20). Another aspect worth noting is the fact that the peaks in 2018 and 2019 occurred before (Q2) and during (Q3) the summer, respectively. This mirrors the findings of Guizi et al. (2020), who observed a similar pattern. They suggested that the adverse effects of tourism might become more visible in this time (Guizi et al., 2020). Finally, in the dissertation, an attempt will be made to identify potential explanations for the observed peaks (e.g. notable events).

0 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 Q1 Q2 Q3 Q4 2017 2018 2019 2020

Figure 20: News Coverage over Time 2017 - 2020 (n = 602)

23 By no means does the author claim that overtourism has ended. On the contrary, he believes that the current paralysis of the tourism industry due to Covid-19 – and thus its absence in news media – is only temporary.

Page 63 of 133

The other descriptive aspect will be the geographic spread of news coverage about overtourism. Scholars seem to agree on the reported geography and nature of overtourism. Phi (2020) claimed that this phenomenon tends to occur in urban destinations. Pasquinelli and Trunfio (2020) substantiated this claim empirically. They examined the concomitance of the most common terms with 'overtourism' and found a value of 82% for 'city'. In addition, Phi (2020) found that the most frequently cited cities are Barcelona (n=65) and Venice (n=44). Pasquinelli and Trunfio (2020) also identified 'Barcelona' (74%) and 'Venice' (56%) as destinations often connected to 'overtourism'. Clark and Nyaupane (2020) adopted an entirely different approach. They established which places are most often linked to overtourism through an online search first, and then gathered news articles about them. Nevertheless, 6 of the 17 locations identified are cities (≈35%). A 'quick and dirty ' way to identify the most popular destinations is to check the list of headlines against a geographic database. In this case, the titles of the articles were compared against the basic database from SimpleMaps.com (n.d.) using the COUNTIF function in Excel. While this approach is unable to detect some non-urban destinations, such as islands (e.g. Boracay, Santorini, Skye, etc), it provides a 'good enough' overview of the spatial distribution of news coverage about overtourism. The results of this exercise (Figure 21) show the bias inherent in restricting the dataset to newspapers in English. In fact, Edinburgh occupying the fifth place might more likely be due to the language of the articles than to actual tourism pressure. This, in turn, raises the question of whether the most frequently mentioned destinations are also the ones suffering the most from overtourism. This will be verified in the dissertation by taking tourism intensity as a proxy for tourism pressure24.

24 The measurement of overtourism will be discussed in more detail in Study III.

Page 64 of 133

50 45 40 35 30 25 20 15 10 5 0

Figure 21: Most frequently mentioned locations (n = 602)

5.1.4.2 Topics / Frames

In this subsection, the portrayal of overtourism in the news will be examined. As such, it constitutes the core of this exploratory analysis. The findings of the previous research yield insights into the actual content of news articles. To begin, Phi (2020) found that the lion's share of coverage deals with the volume of visitors. Hence, she argued that “the current public media discourse on this issue remains rather simplistic” (Phi, 2020, p. 2095). Moreover, Pasquinelli and Trunfio (2020) found that news media places special emphasis on the amount of visitors. Contrasting this portrayal with scholarly research, they realised that the numerical dimension is only half the story, though (Pasquinelli & Trunfio, 2020). Thus, the author expects news coverage about overtourism to mainly revolve around the volume of tourism. This, in turn, would be a simplistic as well as reductionist depiction of this 'multidimensional' (Koens et al., 2018) and 'multi-faceted' (Capocchi et al., 2019) phenomenon. If this were the case, news media would make itself vulnerable to the criticism of spreading “false knowledge”. Moreover, previous studies have portrayed overtourism as a clash between visitors and residents. Phi (2020) found that news media depicts the former as the ones creating the problem and the latter as the ones suffering from it. Pasquinelli and Trunfio (2020) found that news media neglects the contribution of locals to the occurrence of this phenomenon. These findings suggest

Page 65 of 133

the adoption of the 'conflict' frame (Semetko & Valkenburg, 2000). On a side note, the contemporariness of the guest-host dichotomy is questionable. In fact, Maitland and Newman (2009a) asserted that local inhabitants “increasingly behave like tourists in their own city” (Maitland & Newman, 2009a, p. 14). Tourism scholars – particularly those focusing on overtourism as an urban phenomenon – have acknowledged this (e.g. Koens et al., 2018; Novy, 2019). Thus, the author expects the news coverage under scrutiny to portray overtourism as a conflict in which residents are victims and visitors perpetrators. Furthermore, it would be surprising if news media had not found a whipping boy. After all, as Eisenhower said, “the search for a scapegoat is the easiest of all hunting expeditions”. Phi (2020) found that news media puts the blame on local stakeholders rather than on the sector's growth. In a similar vein, Pasquinelli and Trunfio (2020) found that news media holds public administrations accountable. Together, these findings suggest the adoption of the 'attribution of responsibility' frame (Semetko & Valkenburg, 2000). On another side note, this ascription of guilt can be traced back to a statement by the now former Secretary-General of the UNWTO, Taleb Rifai. Indeed, he declared that “growth is not the enemy; it´s how we manage it that counts” (World Tourism Organization, 2017). This statement indirectly puts the blame for the state of affairs on local stakeholders while pledging allegiance to the growth paradigm (Novy & Colomb, 2019). Thus, the author expects news coverage to put the blame on local authorities. As the three paragraphs above nicely demonstrate, the scarce body of knowledge on the portrayal of overtourism in the news suffices to generate a few expectations a priori. Thus, the analysis will be carried out with these in mind. Apropos, a thumbnail sketch thereof is in order. In the dissertation, text pre-processing will be performed with the tm package (Feinerer et al., 2008) in R. More precisely, the filtered Excel file will be converted into a corpus. Then, upper-case characters will be converted into lower-case ones, and non-alphabetic characters (numbers and punctuation) as well as stop words (English) and junk words (custom) will be removed. Next, a Document-Term Matrix (DTM) will be created. As mentioned earlier, the Structural Topic Model (Roberts et al., 2014) will be preferred over the Latent Dirichlet Allocation (Blei et al., 2003). Accordingly, the analysis will be performed with the stm package (Roberts et al., 2019) in R. The topics will be chosen according to the criteria of 'cohesion' and 'exclusivity', as recommended by Roberts et al. (2014). Moreover, since these authors emphasize the importance of personal evaluation, selected articles will be attentively read by the researcher. Finally, the identified topics

Page 66 of 133

will be ratified by a senior scholar familiar with topic modeling (Prof. Mazanec), and validated empirically. Overall, this procedure will be in line with the tripartite 'Computational Grounded Theory' framework by Nelson (2020) presented earlier.

5.1.5 Limitations

This study is not without its limitations. To begin, the author will only consider news from the United Kingdom. Future studies could also consider news from the United States, such as the New York Times and the Washington Post. Indeed, it would be interesting to make a comparison between English and American news coverage of overtourism. In addition, the author will only consider news articles published in English. As noted in the preliminary analysis, this restriction might cause some bias. Hence, future research could consider news in other languages, such as Italian and Dutch. Indeed, the preliminary analysis showed that Venice and Amsterdam are two destinations frequently mentioned in connection with overtourism. Finally, even though the author will attempt performing structural topic modeling with word embeddings, there is still room for improvement as regards the methodology.

Page 67 of 133

5.2 Study II

5.2.1 Rationale for this Study

The second study of this dissertation is a bibliometric analysis of research on overtourism. At first glance, only one such study seems to have been published on this subject (Barbieri da Rosa et al., 2020). On closer examination, though, three systematic reviews bearing some resemblance to a bibliometric analysis can be identified (Agyeiwaah, 2019; Carvalho et al., 2020; Veríssimo et al., 2020). Thus, one may wonder what the benefit of this study could possibly be. Hence, in what follows the methodologies of the aforementioned publications will be critically reviewed to show how this study overcomes their shortcomings. The bibliometric analysis conducted by Barbieri da Rosa et al. (2020) is the only investigation that is labelled as such. Their work has a few shortcomings. To start with, there is an inconsistency between the different sections of their book chapter. In fact, in the methodology, they used the plural form two times when claiming to have looked for “the terms of overtourism” (Barbieri da Rosa et al., 2020, p. 46) ; which suggests the adoption of multiple keywords. However, in the conclusion, the use of the singular form in their statement to have looked for “only the search term overtourism” (Barbieri da Rosa et al., 2020, p. 46) indicates the adoption of a single keyword. Moreover, and more importantly, considering that their data collection took place in March 2019, the small sample size (n=24) seems rather improbable. The systematic literature review by Veríssimo et al. (2020) is introduced as a systematic review, yet the analysis is described as bibliometric. At first sight, their study seems to bear a striking resemblance to the present one. In fact, they, too, performed a bibliometric analysis and visualised results with VOSViewer (van Eck & Waltman, 2010). However, on second thought, one realises that their study differs in one fundamental respect from the present investigation – that is, they performed a review of studies about 'tourismphobia' AND 'overtourism' (Veríssimo et al., 2020). In the author's opinion, in spite of being closely related, these are two distinct concepts. The former indicates “the fear, aversion or social rejection that local citizens in a destination feel towards tourists”, whereas the latter denotes “the subjective belief of residents that there are too many visitors” (Rejón-Guardia et al., 2020, p. 236). Thus, one may conclude that looking for both terms in their systematic review, Veríssimo et al. (2020) ended up mixing apples and oranges.

Page 68 of 133

The mapping literature review by Carvalho et al. (2020) is not without its flaws. The description of their literature search is somewhat confusing. In the text, they claimed to have looked for words other than 'overtourism', which they vaguely referred to as “synonyms, singular, plural and derivative words” (Carvalho et al., 2020, p. 18). Expecting to find these words in the table, the reader is then disappointed to discover that none is actually mentioned there. Moreover, Carvalho et al. (2020) only retained heavily cited publications. This criterion is not particularly sensible, as more recent publications (e.g. 2019) will naturally have less citations than older ones (e.g. 2018). In other words, the adoption of this criterion leads to the a priori exclusion of potentially valuable studies. Furthermore, Carvalho et al. (2020) reviewed grey literature, too (e.g. magazines). In contrast, in this study, only peer-reviewed articles are considered, as this format of knowledge dissemination is deemed to be the 'gold standard' in academia. The systematic literature review by Agyeiwaah (2019) has a few limitations as well. The description of her methodology is imprecise. She used the abbreviation "e.g." when listing the keywords and platforms used (Agyeiwaah, 2019, p. 101). Since this expression, by definition, indicates a non-exhaustive set of instances, the reader cannot be sure that the keywords and platforms mentioned are the only ones actually used. Moreover, as she used the abbreviation "i.e." earlier on the same page (Agyeiwaah, 2019, p. 101), the reader would assume that she is aware of the difference between these two expressions. The description of her methodology is also incomplete. Agyeiwaah (2019) specified neither in which field(s) she looked for the keywords nor whether she looked for them together or separately. Finally, the set of publications resulting from her search seems implausible. In fact, it is virtually impossible that using a "list of broad key search terms" (Agyeiwaah, 2019, p. 101), she obtained so few journal articles (n=17). Overall, this study differs in three respects from the ones reviewed above. First, they only covered scientific inquiry about overtourism until 2019. Accordingly, the entire the body of research produced on this subject in 2020 has not been considered in their analysis. This, in turn, explains the difference in sample size. In fact, while in the four studies it ranged from 15 to 84, in this one it is expected to lie around 150. Second, the four studies relied on relatively simple methods. Barbieri da Rosa et al. (2020) conducted a bibliometric analysis, Carvalho et al. (2020) performed a content analysis, and Veríssimo et al. (2020) carried out both. Building on the latter, in this study the author will perform a bibliometric analysis and topic modeling on peer-reviewed publications about overtourism. Third, and most importantly, the objective of the four reviews was

Page 69 of 133

– albeit each with a different focus – to provide an overview of research on overtourism. In contrast, this study aims to assess the added value of “overtourism”. The differences between the four studies reviewed in this section and the present one are summarised in Table 5.

Barbieri da Rosa Verissimo et Carvalho et Agyeiwaah Present et al. (2020) al. (2020) al. (2020) (2019) Study25

Review Bibliometric Systematic literature Mapping literature Systematic literature Bibliometric Type analysis review review review analysis

Tague-Sutckiffe Method Tranfield et al. (1992) and - > > by (2003) Garousi (2015)

Keywords "Overtourism" "over-?tourism", "overtourism" and 'list of broad key "over-?tourism" searched (Boolean operators) "touris(-)?mphobia" variations thereof search terms (e.g.)'

Data March 2019 June 2019 May-Oct 2019 - May 2021 collected

'e.g. ScienceDirect, B-on, ProQuest, Scopus, Databases Scopus, Web of WOS, Scopus WOS, Scopus Scopus, Web of Science, searched Science, Google WOS, Emerald Dimensions scholar'

Period 'no year limit 1998-2018 → 2019 1997-2019 2016-2020 considered was used'

Articles n = 24 n = 53 n = 84 n = 15 n ≈ 150 reviewed

Bibliometric Bibliometric Method Bibliometric Content analysis, Synthesis analysis, Topic used analysis analysis Content analysis modeling

VOSviewer, Software - Nvivo 12 Plus - VOSviewer, R Histcite

Table 5: Present vs Previous Studies

25 At the moment (Q1-2021). Elements may be subject to change.

Page 70 of 133

5.2.2 Methodology

The analysis conducted in this study consists of two parts. In the first one (descriptive), bibliographic metadata (e.g. authorship) will be summarised to map the scientific landscape of research on overtourism. In addition, topic modeling (Blei, 2012) will be performed on the abstracts of the identified publications to establish the facets of this kaleidoscopic phenomenon. The combination of these two methods is not unprecedented, as evidenced – for instance – by the work of De Battisti et al. (2015) and that of Figuerola et al. (2017). In the second part (inferential), a citation analysis will be conducted to determine whether overtourism is in fact “old wine in new bottles” (Dredge, 2017). Knowledge of its origins in academia, in turn, will enable an informed judgement of whether overtourism research has overcome the shortcomings of its predecessors. And, what´s more, it will provide a baseline against which to evaluate the “boldness” of this stream of research.

5.2.2.1 Bibliometrics

In the author´s opinion, the history of bibliometrics is quite intriguing. This term is generally attributed to Pritchard (1969). Dissatisfied with the term 'statistical bibliography', this scholar advanced the term 'bibliometrics' - which he defined as "the application of mathematics and statistical methods to books and other media of communication" (Pritchard, 1969, p. 349). A few years later, Da Fonseca (1973) remarked that the latter had neglected literature in other languages. In fact, Otlet (1934) titled one of his sections "Le Livre et la Mesure. Bibliométrie." (Otlet, 1934, p. 13). Be that as it may, the roots of bibliometrics go back even further. In fact, such type of analysis can be traced back – albeit under a different name – to the work by Cole and Eales (1917), who examined documents about animal anatomy between 1543 and 1860 (n=6,436). The terminological debate does not end here, though. In fact, the two related terms "scientometrics" and "informetrics" cause further confusion. Suffice it to say, at least for the moment, that while these terms share some common ground, they are not one and the same (Hood & Wilson, 2001). A search for bibliometric studies in tourism journals shows that in this field such type of investigation has only gained momentum in the last few years (2018-2020). Some researchers have focused on a specific journal, such as Travel and Tourism Marketing (Mulet-Forteza et al., 2018) and Tourism Geographies (Merigó et al., 2019). Other scholars have concentrated on a particular

Page 71 of 133

topic, such as sustainable tourism (Moyle et al., 2021; Ruhanen et al., 2015) and forecasting (Liu et al., 2019; Zhang et al., 2020). The former investigations are 'journal-focused', whereas the latter ones are 'theme-focused' (Koseoglu et al., 2016). The bibliometric analysis performed in this study falls under the last-mentioned category, as it focuses on academic publications about overtourism.

5.2.2.2 Descriptive Part

In the first part of the analysis, the author will examine the identified publications to map the body of research on overtourism. To the best of the authors knowledge, only Barbieri da Rosa et al. (2020) and Veríssimo et al. (2020) have conducted a bibliometric analysis about overtourism, so far. However, they examined the metadata of a relatively small number of papers (n=24 and n=53, respectively). In this study, both bibliographic data (e.g. authors) and raw text (e.g. abstracts) will be inspected. Concerning the latter, Capocchi et al. (2019) inferred topics from keywords. In the author's opinion, this is a bit of a stretch. Therefore, topic modeling (Blei, 2012) will be performed on the abstracts of the selected publications instead. The reader is redirected to Section 5.1.2.1 for the description of this method. It might be worth mentioning, though, that terms belonging to the academic jargon, such as 'methodology' and 'findings', will be treated as “junk words” (Table 10 in Appendix 4). The choice of limiting the analysis to the abstract of the articles might be questioned. Hence, it only seems fair to spend a few words on its rationale. In short, the author´s decision was motivated by the significance of this section. Landes (1951) asserted that “in terms of the market reached, the abstract is the most important part of the paper” (Landes, 1951, p. 1660). In view of the plethora of papers published, scholars use it as a screening tool to decide which ones to dedicate their attention to (Pitkiiz, 1987). Previous research backs this choice, too. For instance, Mazanec (2017) analysed 858 abstracts published in Annals of Tourism Research between 1975 and 2015. In addition, Kirilenko and Stepchenkova (2018) examined 8,890 abstracts published in Annals of Tourism Research, Journal of Travel Research, and Tourism Management between 1974 and 2017.

Page 72 of 133

5.2.2.3 Inferential Part

In the second part, the author will conduct a citation analysis of the selected publications to ascertain whether overtourism is in fact “old wine in new bottles” (Dredge, 2017). While several scholars have expressed their view on the age of this phenomenon, to the best of the author´s knowledge, Capocchi et al. (2020) are the only ones who have pursued an empirical investigation of this matter. However, they examined the bibliographies of a rather small number of papers (n=29) (Capocchi et al., 2020). Veríssimo et al. (2020) also inspected the bibliographies of selected documents. While their sample was almost twice as large (n=53), it included articles about both 'over(-)tourism' and 'tourism(-)phobia' (Veríssimo et al., 2020). The present analysis builds on these two studies, but boasts a larger sample (n=149) and covers a longer period (2017-2020). On a more technical note, the author will conduct the citation analysis with a dedicated software. Specifically, the following ones will be considered:

A. Bibexcel (Persson et al., 2009) B. bibliometrix (Aria & Cuccurullo, 2017)26 C. SciMAT (Cobo et al., 2011) D. VOSViewer (van Eck & Waltman, 2010)

An up-to-date comparison of these, and other, software can be found in Moral-Muñoz et al. (2020).

5.2.3 Data Collection

The keywords selected for this study are "overtourism" and "over-tourism". Synonyms were consciously not used for two reasons. First, in the author´s opinion, there is no synonym for overtourism. In fact, terms such as 'mass tourism' or 'tourism-phobia' have, albeit only slightly, different meanings. Second, one of the objectives of this study is to determine whether overtourism

26 Package in R.

Page 73 of 133

is "old wine in new bottles" (Dredge, 2017). This can only be achieved by analysing solely the literature that uses this specific term. These two keywords were searched for in Scopus and Web of Science. In view of the high number of duplicates, performing the search in other databases was deemed redundant. The identified documents were filtered. As the term itself was invented in June 2016 (Ali, 2018), documents with an earlier publication date came as somewhat of a surprise. However, a brief inspection revealed the reason for the appearance of such results – that is, the platforms also looked for the search term without a hyphen, thereby generating irrelevant results. Hence, only documents published between 2017 and 2020 were kept (Scopus=247, Web of Science=225). Next, the search was restricted by document type: only articles, reviews, editorials, and other short formats – such as research notes and letters – were maintained (Scopus=208, Web of Science=189). After that, the search was limited to documents published in English (Scopus=201, Web of Science=180). The refined results lists were then exported from Scopus and Web of Science as comma- and tab- delimited files, respectively. Next, the two files were imported into Microsoft Excel. Only non-duplicate records were kept (n=209). Publications from journals that publish in a language other than English were deleted (n=15), as were those without a Digital Object Identifier (DOI) (n=4). Further, publications that were not directly related to overtourism were removed (n=41). These included articles that were completely unrelated to overtourism, such as those in which the expression "over tourism" had been used. They also included papers that were only partly related to overtourism, such as those in which one of the two search terms had been marginally mentioned. Thus, the final dataset consists of 149 documents. The steps of this filtering process are visualised in Figure 22.

Page 74 of 133

Scopus Web of Science (n = 201) (n = 180)

Records after duplicates removed (n = 209)

Records screened Records excluded (n = 209) (n = 19)

Full-text articles Full-text articles assessed for eligibility excluded, with reason (n = 190) (n = 41)

Studies included in synthesis (n = 149)

Figure 22: PRISMA Flow Diagram (Moher et al., 2009)

Page 75 of 133

5.2.4 Preliminary Results

5.2.4.1 Descriptive Part

An example of the descriptive bibliometric analysis might be the co-authorship network shown in Figure 23, which was created with VOSviewer (van Eck & Waltman, 2010). This visualisation allows to rapidly identify teams of scholars researching overtourism. For instance, Joseph Cheer, Claudio Milano and Marina Novelli constitute such a group. They edited the special issue 'Overtourism and Tourismphobia' (Milano, Novelli, & Cheer, 2019b) as well as the book 'Overtourism: Excesses Discontents and Measures in Travel and Tourism' (Milano, Cheer, & Novelli, 2019). In addition, they published an article about degrowth (Milano, Novelli, & Cheer, 2019a). Be that as it may, this is only one example of how such an exploratory visualisation may help the analyst detect bibliographic patterns. Further analysis would for instance demonstrate that Hugues Séraphin is the most prolific author (n=10). At the same time, it would show that his publications are mostly co-authored (Ø = 3,7 authors) and rather short (Ø = 8,3 pages).

Figure 23: Co-Authorship Network27

27 Specifications: network visualization / co-authorship: authors → fractional counting / min docs = 2 / weights = links.

Page 76 of 133

An example of the descriptive text analysis could the co-occurrence graph of author keywords shown in Figure 24, which was also created using VOSviewer (van Eck & Waltman, 2010). The presence of both spelling variants of 'over(-)tourism' highlights the need of specifying synonyms prior to the analysis. This can easily be done using the thesaurus function in VOSviewer (see van Eck & Waltman, 2020). For instance, "social carrying capacity" and "tourism carrying capacity" could be subsumed under "carrying capacity"28.

Figure 24: Co-Occurrence Graph of Author Keywords29

28 Strictly speaking, these terms are not interchangeable. However, if one is only interested in the relations between overarching concepts, this methodological step is in order. 29 Specifications: overlay visualization / co-occurrence: author keywords → fractional counting / min. occ. key word: 2 / weights: occurrences & scores: avg. pub. year.

Page 77 of 133

The colouring in Figure 24 shows the timing of the terms. Thus, this visualisation suggests that the academic discourse on overtourism has evolved over time. To give an example: both spelling variants of 'tourism(-)phobia' are shown in violet, whereas 'tourism intensity' is shown in lime (Figure 24)30. This suggests that scholars dealt first with the conceptualisation and then with the measurement of overtourism. Such sequence would mirror the evolution of research on tourism impacts as outlined by Deery et al. (2012). Be that as it may, the nature of the text under scrutiny calls for the adoption of DTM (Blei & Lafferty, 2006b) or, alternatively, the use of time as covariate in STM (Roberts et al., 2014). Finally, while no sample analysis of topic modeling is presented here, a preliminary reading of the literature suggests the presence of the following topics31:

➢ Carrying capacity (e.g. Bertocchi et al., 2020; Tokarchuk et al., 2020) ➢ Degrowth (e.g. Higgins-Desbiolles et al., 2019; Milano, Novelli, & Cheer, 2019a) ➢ Resident attitudes (e.g. Kuščer & Mihalič, 2019; Muler Gonzalez et al., 2018) ➢ Social media (e.g. Alonso-Almeida et al., 2019; Bourliataux-Lajoinie et al., 2019) ➢ Urban tourism (e.g. Novy, 2019; Novy & Colomb, 2019) ➢ Vacation rentals (e.g. Celata & Romano, 2020; Sarantakou & Terkenli, 2019)

5.2.4.2 Inferential Part

An example of the second part of the analysis might be the co-citation network shown in Figure 25, which was created with Bibexel (Persson et al., 2009) and Pajek (Batagelj & Mrvar, 2020). This visualisation allows for two comments. First, the presence of older references, such as Emerson (1976) (Social Exchange Theory), Butler (1980) (Tourism Area Life Cycle), and O'Reilly (1986) (Tourism Carrying Capacity), lends some weight to the claim that overtourism is indeed

30 The software allows to interactively browse through the graph (van Eck and Waltman, 2020), which considerably facilitates its interpretation. 31 The expected topics are listed in alphabetical order.

Page 78 of 133

"old wine in new bottles" (Dredge, 2017). This would be in line with the results of the empirical investigation by Capocchi et al. (2020) and of the bibliometric analysis by Veríssimo et al. (2020).

Figure 25: Co-Citation Network32

Second, it shows the challenge of dealing with bibliographic data. The presence of duplicates, such as the literature review by Capocchi et al. (2019), shows that tiny differences in referencing suffice for the same work to be counted as two separate records. In this case, the discrepancy lay in the inclusion versus omission of the publication's issue number. Thus, it seems that after all, “the devil is in the detail”. Since the author expects to analyse 6,000+ references, 'harmonising' them will be a resource-intensive task. Cobo et al. (2011) noted the inaccuracy of data extracted from bibliographic platforms. Accordingly, they considered the preparation of the data to be a task of paramount importance (Cobo et al., 2011).

32 Co-citation network: n > 4, Layout: Kamada-Kawai (Free).

Page 79 of 133

5.2.5 Limitations

Having critically reviewed several studies, it only seems fair to acknowledge the own imperfections. It is probably best to start by addressing "the elephant in the room" – that is, the fact that the author only searched for 'over(-)tourism'. While the adoption of only one keyword may raise some eyebrows, this was consciously done to identify the body of research using this particular term. Future research could collect academic articles employing related – yet distinct – terms, such as 'tourismphobia' or 'touristification', and compare them to the ones using 'over(-)tourism'. The filtering process was relatively strict, too. In fact, publications in other languages, such as Spanish and German, were excluded. Future research could make a comparison between scholarly research in different languages. Finally, as suggested by Mazanec (2017), it would be really intriguing to compare rejected manuscripts against published articles about overtourism. What could the body of knowledge on this subjects have looked like? Would it have advanced our understanding further or would it have left us "none the wiser" (Ap, 1990, p. 615)?

Page 80 of 133

5.3 Study III

5.3.1 Rationale for this Study

The purpose of this section is to show the limitations of allegedly 'objective' and 'granular' indicators. To this end, a thorough analysis of (over-)tourism in Vienna will be conducted. Bearing in mind that “les chiffres absolus ne disent pas grande chose” (Markos, 1949, p. 132), one- dimensional measures, such as overnights, will not be discussed at length. However, for the sake of completeness, they are mentioned in Table 11 in Appendix 5Error! Reference source not found.. That said, this section is structured according to the classification by De La Calle-Vaquero et al. (2020) (Table 6).

Category (Sample) Indicators

Magnitude absolute number of arrivals and overnights

(average) growth rate of arrivals and overnights, Dynamics population decline at destination

Seasonality highest / lowest month, length of high season

Intensity tourism intensity

Density tourism density

Table 6: Indicators by Category

5.3.1.1 Magnitude

The first two indicators are arrivals and overnights. In 2019, Vienna recorded 7.926.768 arrivals33 and 17.604.573 overnights34 (TourMIS, n.d.–b). By themselves, these one-dimensional statistics are not particularly meaningful. Or better said, they “have only a comparative value” (De

33 Arrivals in all forms of paid accommodation in city area only 34 Bednights in all forms of paid accommodation in city area only

Page 81 of 133

La Calle-Vaquero et al., 2020, p. 309). According to the latest benchmarking report by European Cities Marketing (2020), Vienna ranks eighth in terms of total overnights. A direct comparison with the cities preceding it would be misleading, though. In fact, it would be like comparing apples and oranges – or, rather, watermelons and cherries. For instance, with a population of 3.664.371 inhabitants and an area of 891,1 km2 (Amt für Statistik Berlin-Brandenburg, n.d.), Berlin is roughly twice as big as Vienna. Hence, two-dimensional measures ought to be considered.

5.3.1.2 Dynamics

In this subsection, the temporal evolution of one-dimensional indicators is mainly presented. The underlying reason is to follow the logic of De La Calle-Vaquero et al. (2020), who argue that taking into consideration this aspect enhances the informativeness of elementary measures. Accordingly, the temporal evolution of two-dimensional indicators is presented in their respective subsections. Concerning demand, the number of arrivals has grown by 81% and that of overnights by 79% between 2009 and 2019 (TourMIS, n.d.–b). The average yearly growth rate stands at roughly 6% for both statistics (TourMIS, n.d.–b). Interestingly, this figure is higher than the one Visentin and Bertocchi (2019) found for Venice between 2009 and 2017 (4,4%). Another indicator is the growth of air transport (Peeters et al., 2018), which is computed as the yearly percentage increase of air passengers. In the present study, the “passengers carried (arrival)” figure35 (Eurostat, n.d.) was used to calculate this measure. The number of people transported to Vienna International Airport by plane in 2019 (15.776.153) increased by 75,1% with respect to 2009 (9.009.250). The average yearly growth rate over this period is 5,9%. Regarding supply, the number of hotels and pensions grew from 400 in 2009 to 422 in 2019 (MA 23, n.d.–d). This corresponds to an increase of 5,5%. The number of beds rose from 50.911 in 2009 to 68.200 in 2019 (MA 23, n.d.–b), which represent an increase of 34%. These two different growth rates suggest the opening of large properties. This is also evidenced by the average number of beds per hotel, which went from 127,3 in 2009 and to 161,6 in 2019. Since several authors have associated overtourism with Airbnb (e.g. Martín Martín et al., 2018; Sarantakou &

35 Figure excludes direct transit passengers.

Page 82 of 133

Terkenli, 2019), examining the evolution of vacation rentals would be worthwhile, too. Having said that, since this type of accommodation is relatively recent, the analysis possibilities are rather limited. Indeed, for Vienna, data is only available as of 2015 (Inside Airbnb, n.d.). Finally, Visentin and Bertocchi (2019) identified population shrinkage as a symptom of overtourism. More precisely, they noted that the centre's population has decreased by 74% between 1950 and 201536. In Vienna, the population increased from 1.627.566 in 1961 to 1.714.227 in 2011 (MA 23, n.d.–a). This represents an overall increase of 5,3%. At first sight, this figure seems more reassuring than that for Venice. At a closer look, however, considerable differences between the districts can be detected. For instance, the population of the 1st district – the city’s touristic hotspot – dropped from 32.243 in 1961 to 16.374 in 2011 (MA 23, n.d.–a). This corresponds to a decrease of 49%. In contrast, the population of the 22nd district – an area of expansion – rose from 57.137 in in 1961 to 161.419 in 2011 (MA 23, n.d.–a). This marks an increase of 183%.

5.3.1.3 Seasonality

Seasonality has been given a great deal of attention by tourism researchers. Guevara Manzo et al. (2017) and Peeters et al. (2018) both adopted air transport statistics to this end. Specifically, they computed the ratio of the peak over the trough month in terms of 'arriving seats' (Guevara Manzo et al., 2017) and 'passengers' (Peeters et al., 2018). As mentioned above, in the present study, the “passengers carried (arrival)” figure (Eurostat, n.d.) has been used instead. For 2019, dividing the top (August) over the bottom (January) month yields a value of 1,76. In the author´s opinion, however, it is not so much the severity of the temporal concentration but rather its duration that matters in urban destinations. For instance, Visentin and Bertocchi (2019) observed that 11 of 12 months are regarded as high season in Venice. Accordingly, they argued that the adverse effects of tourism “affect the entire historical city unrelentingly throughout the year” (Visentin & Bertocchi, 2019, p. 34). In the case of Vienna, arrivals and overnights were above average in 8 of 12 months in 2019 (Figure 26). Such unabated tourism

36 To be exact, they only mention absolute numbers. The percentage change has been calculated by the author on that basis.

Page 83 of 133

pressure most likely generates a feeling of restlessness and, consequently, a sense of impotence in the local population.

2.000.000

1.750.000

1.500.000

1.250.000 n - Arrivals 1.000.000 n - Overnights 750.000 Ø - Arrivals 500.000 Ø - Overnights

250.000

Figure 26: Arrivals and Overnights by Month in 2019

5.3.1.4 Intensity

One of the two most popular indicators of tourism pressure is tourism intensity. According to Eurostat, it is computed as the number of overnights with respect to the destination’s population (Kotzeva et al., 2019, p. 151). However, some variations do exist. For instance, Guevara Manzo et al. (2017) replaced overnights with arrivals for this measure. In the present study, the conventional calculation of this indicators has been adopted. In 2019, Vienna had a high intensity (9,3) (Appendix 5). A comparison of this value with the results by Peeters et al. (2018) highlights its extreme. The intensity figure lies slightly below the upper threshold of the fourth quintile (9,58) (Peeters et al., 2018). Given the importance of temporal evolution, it is worth noting that tourism intensity rose from 5,9 in 2009 to 9,3 in 2019. This represents an increase of 58% (Appendix 5). Furthermore, several authors have linked overtourism to budget airlines (e.g. Dodds & Butler, 2019; Visentin & Bertocchi, 2019). Thus, it might be worthwhile determining the intensity of low-cost travel. The share of passengers traveling with budget airlines to and from Vienna International Airport grew from 23,7% in 2018 to 31,6% in 2019 (Flughafen Wien AG, 2020,

Page 84 of 133

p. 35). This constitutes an increase of 33,3%. Thus, a qualitative analysis of air transport seems to be desirable.

5.3.1.5 Density

The other most popular indicator of tourism pressure is tourism density. According to Eurostat, it is computed as the number of overnights with respect to the destination’s surface (Kotzeva et al., 2019, p. 148). Here, too some variations do exist. For instance, Guevara Manzo et al. (2017) restricted the surface of the destination to the one containing the twenty most popular sights for this measure. In the present study, the conventional calculation of this indicator has been adopted. In 2019, Vienna had a very high density (42.434) (Appendix 5). Here, too, contrasting this value with the results by Peeters et al. (2018) highlights its extreme. The density figure lies somewhat above the upper threshold of the fifth quintile (37.290) (Peeters et al., 2018). Moreover, tourism intensity rose from 23.725 in 2009 to 42.434 in 2019. This represents an increase of 79% (Appendix 5). The fact that this growth rate is much larger than that of tourism intensity (58%) can easily be explained by the fact that – unlike the population – the surface of a destination remains constant.

5.3.1.6 Others

Finally, Peeters et al. (2018) also considered the proximity to cruise harbours and airports as well as World Heritage Sites (WHS) as potential catalysts for overtourism. Vienna has one harbour (Schiffstation Wien) and is close to two airports (Vienna International Airport; Bratislava Airport)37. Technically, there are only two WHSs in a radius of 30 km: the historic centre and the complex of Schönbrunn. However, three other sites are relatively near: the cultural landscapes of Neusiedlersee (≈50km) and Wachau (≈70km) as well as the railway in Semmering (≈80km)38 (UNESCO, n.d.). Furthermore, it shall be mentioned that the city centre was placed on the 'List of World Heritage in Danger' in July 2017 because of the renovation project of the Hotel

37 Strictly speaking, both airports lie beyond the boundaries of the city. 38 Approximate beeline distances estimated using luftlinie.org.

Page 85 of 133

InterContinental 39 (UNESCO, 2017). Drawing on the opinion of the former director of the UNESCO World Heritage Centre, Francesco Bandarin, the director of the Vienna Tourist Board, Norbert Kettner, argued that the revocation of such status would not have major repercussions on a mature destination like Vienna (Wien Tourismus, 2016). To be sure, his statement made some waves. However, the 6% (n= 970.767) and 7% (n= 1.121.076) increase in overnights in 2018 and 2019, respectively, suggest that he may have been right after all. In conclusion, the author would like to make a final remark. As mentioned in the theoretical part earlier (Section 3.3), the measurement of overtourism is "old wine in new bottles". Having said that, in his research, the author uncovered a few rather aged indices – such as Defert’s tourist function index (Defert, 1954). Rather surprisingly, though, these have not been mentioned in any of the publications reviewed earlier (Figure 16). At the same time, they seem to have found some adoption by scholars in South-Eastern Europe (e.g. Marković et al., 2017; Štefko et al., 2018). The reasons for this remain unclear.

5.3.1.7 Granularity: Fool's Gold

Towards the end of their chapter on overtourism measurement, Peeters et al. (2018) call for more granular data. Specifically, they assert that:

“…it is advisable to develop a guideline for tourism statistics to be gathered at the NUTS 3 level because the NUTS 2 level is too coarse to successfully develop such an early warning tool…” (Peeters et al., 2018, p. 78)

In the particular case of Vienna, the surface (414,82 km2) and population (1.911.191) of NUTS-2 (AT13) and NUTS-3 (AT130) are the same (Statistik Austria, 2020). Hence, a shift from one level to the other yields no additional information. Furthermore, Weber et al. (2019) affirm that “disaggregated data is needed that includes the spatial and temporal distribution of visitors”

39 More about this here: www.heumarkt-neu.at.

Page 86 of 133

(Weber et al., 2019, p. 94). Following these calls for finer-grained analyses, in this subsection the author would like to offer a word of caution to scholars wishing to act on these exhortations.

5.3.1.7.1 District Level ('Bezirk)

Visentin and Bertocchi (2019) analysed the state of tourism in Venice on district level. Following their example, selected indicators have been computed on district level ('Bezirk') for Vienna. Table 7 shows the three districts with the highest values for tourism density and intensity. The scores of each district (n=23) can be found in Table 12 in Appendix 6Error! Reference source not found.. Rather unsurprisingly, the 1st district ranks first. The rest of the podium is not jaw- dropping either. In fact, the 7th and 6th districts occupy the second and third place, respectively.

Area Tourism Density Tourism Intensity

Vienna (overall) 42.434 9,3

1st district 1.087.441 191,3

7th district 752.353 37,5

6th district 498.779 22,8

Vienna (median) 55.064 6,1 Table 7: Indicators by District

Indicators could be mapped for better interpretation. For instance, the density of vacation rentals by district can be shown with a 'box map'40 (Figure 27). The 6th district has the highest number of listings per square kilometre (≈181), closely followed by the 6th (≈172) and the 5th (≈168) district. The 4th (≈149), 1st (≈135) and the 8th (≈133) district follow at some distance. The scores of each district are provided in Appendix 6 and a methodological note on the calculation of these figures is offered in Appendix 7. Bearing in mind how this density figure is computed, it comes as no surprise that its upper quartile is dominated by rather small districts (Ø = 1,8 km2). At

40 i.e. spatial equivalent of a box plot (Anselin, 2020).

Page 87 of 133

the same time, this leaves one wondering what may lie behind the seemingly unremarkable state of larger district (e.g. 2nd district). This will be examined in the next subsection.

Figure 27: Box Map - Airbnbs per km2 by District (14th Jan 2020)

5.3.1.7.2 Census District ('Zählbezirk')

At first sight, the results between tourism density and tourism intensity are rather similar (Appendix 6). However, on closer examination, a few dissimilarities spring to the eye. For instance, the 2nd district ranks fifth in terms of intensity (20,9) but only tenth in terms of density (113.836) (Appendix 6). This can easily be explained by its relatively large area (19,2km2) (MA 18, n.d.). Intrigued by such disparities, one could go a step further and try to compute indicators per census district ('Zählbezirk'). The curious researcher will soon be disappointed, though. In fact, traditional tourism statistics are not available at such level of granularity in the case of Vienna. The reason for this is that in some census districts there are so few properties that one could theoretically figure out which one the statistics refer to – and this, in turn, would violate the Federal Statistics Act for confidentiality reasons (J. Urlesberger, personal communication, March 23, 2020). However, all is not lost. In fact, it is still possible to calculate 'non-traditional' indicators on such level, albeit with some effort. The number of vacation rentals per census district is an example thereof (Figure 28). The step-by-step description of how these figures were calculated can be found in Appendix 7. This analysis yields interesting results. For instance, the 2nd district has the highest

Page 88 of 133

absolute number of listings (n≈569). With a surface of approximately 19,2 km2, it only has a density of 29,6 Airbnbs per km2, though. Consequently, it did not stand out in the analysis conducted on district level earlier (Figure 27). In contrast, the analysis on census district level shows considerable variations within districts. For instance, in the 2nd district, the density of Airbnbs ranges from 0 (Zählbezirk 210) to 2,3 (Zählbezirk 207) listings per hectare41. A look at the map helps explain this intra-district variation. The former area is the port ('Hafen'), whereas the latter one is an area that has been developing recently ('Stuwerviertel').

89 listings ≈ 2,3 38 hectares

Figure 28: Box Map – Airbnbs per ha by Census District (14th January 2020)

In sum, this small exercise demonstrates that reportedly 'granular' analyses may be fool's gold. Accordingly, tourism professionals are advised not let themselves be blinded by the apparent lustre of such findings. Rather, bearing in mind that "all that glitters is not gold", they are encouraged to scratch beneath the surface. In fact, since overtourism occurs in specific places, spatial analyses have to be conducted at the most detailed level possible – as permitted by law. To the best of the author's knowledge, a state-of-the-art example in this regard is the 'tourism observatory' by the city of Buenos Aires in Argentina.

41 The scale was shifted to hectare, as square kilometre would not be very meaningful at such level of analysis.

Page 89 of 133

5.3.1.8 Shifting the Focus

To be sure, one could now attempt to combine some of the above-mentioned measures into a compound index, as attempted by Amore et al. (2020). However, such focus on numbers might cause one to not see the wood for the trees – that is, one might easily get lost in details and become oblivious to the fact that, after all, these are only numbers. More than half a century ago, the founding father of market psychology, Bernt Spiegel, asserted that market reality ultimately boils down to consumer perception. To be more precise, he wrote:

“Die Realität im sozialen Feld ist alleine das Phänomenale, das unmittelbar Angetroffene; nicht aber, jedenfalls nicht allein, die nackte objektive Beschaffenheit, die uns im Leben wohl kaum einmal völlig gesondert begegnet.“ (Spiegel, 1961, p. 30)42

It follows that these allegedly 'objective' indicators are only half the story. Accordingly, the next logical step would be to relate these indicators to the subjective perception of tourism by residents. After all, overtourism has been defined as “the subjective belief of residents that there are too many visitors” (Rejón-Guardia et al., 2020, p. 236). Thus, one could for instance compare the density of vacation rentals in a particular district with the perception thereof by its inhabitants43. This would allow bridging the gap between the objective and the subjective dimensions of overtourism. However, in the case of the resident survey of the Vienna Tourist Board, quotas have only been set with respect to age and gender (Wien Tourismus, 2020), wherefore the number of responses fluctuates a great deal across districts. For example, in 2019, 12,6% of the respondents (n=459) were living in the 22nd district and only 0,9% (n=32) of them were living in the 1st district (Figure 29). Such variation makes statistical – let alone meaningful – comparisons difficult.

42 Quoting Spiegel (1961) has been an arbitrary choice. In fact, for the distinction between perception and reality, the author could also have cited other 'authors', such as Plato (Allegory of the Cave) and Kant (Noumenal vs Phenomenal). 43 e.g. with data from the resident survey of the Vienna Tourist Board.

Page 90 of 133

459

363 348

270 215 185 192 198 169 165 137 110 114 109 95 109 78 81 70 63 48 47 32

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

Figure 29: Number of Responses by District in 2019 (n=3.657)

5.3.2 Methodology

5.3.2.1 The Origins

In 2018, the term overtourism was on everyone's lips. It was even in the running for that year's “Word of the Year” (Oxford University Press, n.d.). Having learnt that “mostly it is loss that teaches us the worth of things”(Schopenhauer), when seeing the extensive news coverage of protests against tourism, the author wondered what residents would actually do without it. Thus, upon his recommendation, the Vienna Tourist Board included the following two questions in their ongoing survey of resident attitudes towards tourism:

- How would you like the city without tourists? (0-100) - And how would you imagine life and the city without tourists? (text)

At that time, the idea of a deserted city was an unimaginable scenario. This year, the outbreak of Covid-19 turned this dystopian vision into reality. Nevertheless, since “in the middle of difficulty lies opportunity” (Einstein), upon the author's recommendation, the Vienna Tourist Board then asked residents about their experience of the city (and the life) without tourism. Specifically, the two questions above were altered as follows:

Page 91 of 133

- How did you like the city without tourists? (0-100) - How did you experience life and the city without tourists? (text)

This way, a comparison between the hypothetical and actual loss of tourism can be made. Moreover, upon recommendation of a senior employee of the Vienna Tourist Board, respondents had to answer the last question in form of a post card. That person, in turn, had been inspired by the work of Tussyadiah and Miller (2020), who adopted the 'letters from the future' method. In short, the idea behind the adoption of this method was to elicit richer responses. Finally, the post card format was adopted both with and without sample text in the third and fourth quarter of 2020, respectively (Appendix 8).

5.3.2.2 Hypotheses

In this section, the research framework and its underlying hypotheses are presented. The analysis will consist of two 'blocks'. In the first one, a comparison is made between residents' imagination of the city without tourists in the second quarters of 2018 and 2019 (Figure 30-A). Tourism has continued to grow between these two periods (July 2018 - March 2019), as evidenced, for instance, by the average monthly increase of 7,7% in the number of overnights44 with respect to the previous year (TourMIS, n.d.–b). This, in turn, is then reflected in measures of tourism pressure. Tourism intensity, for example, was 8,6% higher compared to the previous year (TourMIS, n.d.–b). Hence, it seems reasonable to assume that as 'objective' indicators of tourism pressure increase, the outlook of a city without tourists becomes more appealing. Thus, the first hypothesis reads as follows:

H1: As 'objective' measures of overtourism increase, the prospect of a city without tourism becomes more appealing.

44 Bednights in all forms of paid accommodation in city area only.

Page 92 of 133

In the second block, the imagination of the city without tourists by respondents in the third and fourth quarters of 2019 is compared with the experience thereof by respondents in the third and fourth quarters of 2020 (Figure 30-B). Here, the author assumes that once residents realise what they have lost, they appreciate what they had. More formally:

H2: Residents' hypothetical liking is higher than their actual liking of the city without tourism.

The analysis of the numerical indications by the respondents (0 →100) will be complemented by an examination of their answers to the open-ended questions (text). Specifically, these will be examined by means of topic modeling. However, unlike in Study I, it would not be prudent to use STM here. In fact, Yan et al. (2013) identified sparsity as an obstacle to the application of traditional topic models (e.g. LDA) to brief documents (e.g. tweets). They ascribed this to the fact that the traditional algorithms operate at the document level. Accordingly, they developed an alternative model that generates topics from pairs of words in the entire corpus. They named this the 'Biterm Topic Model' (BTM) (Yan et al., 2013). This algorithm is implemented in the BTM package (Wijffels, 2020a) in R. The presence vs absence of the sample text in the postcard will be taken into consideration during this analysis. Figure 30 visualizes the two 'blocks' of the analysis. The answers given last year (2020) need to be taken with a grain of salt, though. In fact, external circumstances, such as the 'hard' lockdown in November 2020, might have distorted the results. To the extent possible, aspects like this will be considered in the analysis.

Figure 30: Analysis Workflow

Page 93 of 133

5.3.3 Data Collection

The data under scrutiny was collected in form of an online survey by Manova GmbH on behalf of Wien Tourismus. The ‘accessible population’ (Trochim & Donnelly, 2006) consisted of people residing in Vienna aged between 18 and 70. ‘Proportional quota sampling’ (Trochim & Donnelly, 2006) was adopted by age and gender. The collected data was cleaned and weighted by age, gender, district, and education (Wien Tourismus, 2020). Figure 31 provides an overview of the responses by quarters and years.

1000 912 915 910 918 920 873 900 851 871 865

800 709 700 605 600 511 500 Closed 400 Open 300

200

100

0 Q2 Q2 Q3 Q4 Q3 Q4 2018 2019 2019 2020 Imagination vs Imagination Imagination vs Perception

Figure 31: Responses by Quarters and Years45

45 Q3-2020 is about a third smaller than the other quarters due to a mishap on the part of the market research company.

Page 94 of 133

5.3.4 Preliminary Results

5.3.4.1 Closed Questions

For the purposes of this sample analysis, responses to the closed question were averaged by quarter. The comparison between the imagined liking of the city without tourism in the second quarters of 2018 and 2019 is rather unspectacular. In fact, this value only increased by 11% (Figure 32). In contrast, the comparison between the imagined and experienced liking of the city without tourism in the third and fourth quarters of 2019 and 2020 is striking. This value increased by 76% and 65%, respectively (Figure 32). This preliminary result suggests a rejection of H2.

70 63,2 59 60

40 36 35,8 34,1 30,8 30

0 Q2 Q2 Q3 Q4 Q3 Q4 2018 2019 2019 2020 Imagination vs Imagination Imagination vs Experience

Figure 32: Average Liking of the City without Tourism

5.3.4.2 Open Questions

For the purposes of this sample analysis, the responses to the open-ended question in the second quarter of 2018 (n=851) were examined. The data was pre-processed with the tm package (Feinerer et al., 2008). Words that indicate a negation were consciously retained (e.g. 'nicht'), as Covid-19 is about the absence of 'things' (e.g. 'nicht überlaufen'). The text was then annotated with

Page 95 of 133

the udpipe package (Wijffels, 2020b). A pre-trained model was used for this (i.e. german-hdt)46. Finally, a Biterm Topic Model was performed with five topics using the BTM package (Wijffels, 2020a). This being a demonstrative exercise, special attention was paid neither to the number nor to the evaluation of topics.

Figure 33: Sample BTM

A brief look at the graph (Figure 33) suggests that the BTM delivers what it promises. Indeed, it is safe to say that the topics are more interpretable than they would have been if generated with traditional algorithms – even more so considering the limited length of the answers (Ø ≈ 7 words). Topic 1 includes three frequently used adjectives: empty ('leer'), quiet ('ruhig'), and boring ('langweilig')47. Respondents seem to be well aware of what would go missing without tourism. Topic 2 mentions the economic aspect ('fehlen', 'einnahmequellen') and topic 3 mentions the reduced offering ('weniger', 'Angebot'). Interestingly, topic 4 could be interpreted as almost suggesting that tourism is part of the Viennese DNA. A closer examination of this topic is desirable.

46 Information about this model can be found here.

47 This is because the author set background=TRUE when building the model.

Page 96 of 133

Nevertheless, respondents also appear to see some benefit in the absence of tourism. Topic 5 mentions more housing space ('mehr', 'Wohnraum'). Thus, overall, Figure 33 suggests that while residents see the advantages of tourism, they are not completely blind to its disadvantages, either. At this point, the reader might be perplexed by the dissonance between the scores of the closed-ended and the text of the open-ended questions. In fact, the high liking of the city without tourism in the third and fourth quarters of 2020 (Figure 32) stands in stark contrast to the awareness of this sector's benefits displayed in the second quarter of 2018 (Figure 33). This incongruence cannot be left uncommented. On one hand, the results in Figure 32 suggest that residents enjoy having the city to themselves. This seems plausible. In fact, after a decade of unabated growth48, tourism in Vienna was at its record high in 201949. It only seems fair that locals breathe a sigh of relief when getting a break from such 'tour de force'. On the other hand, the results in Figure 33 indicate that the downsides of tourism's success have made themselves felt. This, in turn, begs the question of how the attitude of residents will develop in the long run. Will rising unemployment rates change their mind? Perhaps not right away. Yet, in the author's opinion, at the latest when the façades of the city's iconic buildings start crumbling, their enthusiasm will fade in favour of regret.

5.3.5 Limitations

Like most studies, this one is not without its limitations either. To begin, the respondents' answers might have been influenced by a myriad of other factors. This could especially have been the case in 2020. In fact, the consequences of the pandemic (e.g. sudden unemployment) as well as the measures adopted to counter them (e.g. 'hard' lockdown) may, to some extent, have distorted the results. In addition, the critical reader may view the open-ended questions as “double-barreled” (Babbie, 2008, pp. 273–275).

48 Average yearly growth rate of arrivals and overnights ≈ 6%. 49 Arrivals = 7.926.768; Overnights = 17.604.573.

Page 97 of 133

6 Conclusion

To sum up, in this manuscript, the author has offered the reader a comprehensive glimpse of what their journey into the rise and fall of overtourism will look like. In Study I, the analysis of newspaper articles will show how overtourism has been portrayed in the news. The use topic modeling will allow traveling beyond the light of the known into the darkness of the yet unknown themes of this discourse. Moreover, the corroboration of the discoveries with other text mining techniques will take one more brick off the iron curtain separating inductive and deductive approaches. In Study II, the analysis of journal articles will map the landscape of research on overtourism. The citation analysis will then determine the roots of this phenomenon, thereby contributing to the debate about its novelty with empirical evidence. In addition, evaluating the usefulness of overtourism research will allow a critical reflection on tourism scholarship. In Study III, the Pindaric flight to its diametrical opposite – “undertourism” – will bring the reader back to the present. Understanding the limitations of the objective and granular indicators of tourism pressure gives some food for thought on what is worth measuring. Most importantly, though, insights into the residents´ experience of their destination without tourism will yield information that is essential for shaping travel and tourism in a sustainable manner.

Page 98 of 133

7 References

Agyeiwaah, E. (2019). Over-tourism and sustainable consumption of resources through sharing:

the role of government. International Journal of Tourism Cities, 6(1), 99–116.

https://doi.org/10.1108/IJTC-06-2019-0078

Alcalde Garcia, J., Guitart Casalderrey, N., Pitarch Mach, A., & Vallvé Fernández, Ó. (2018). De

la turismofobia a la convivencia turística: el caso de Barcelona. Análisis comparativo con

Ámsterdam y Berlín. Journal of Tourism Research, 8(2), 25–34.

https://revistes.ub.edu/index.php/ara/article/view/21980

Ali, R. (2018, August 14). The Genesis of Overtourism: Why We Came Up With the Term and

What’s Happened Since. Skift. https://skift.com/2018/08/14/the-genesis-of-overtourism-why-

we-came-up-with-the-term-and-whats-happened-since/

Almeida García, F., Balbuena Vázquez, A., & Cortés Macías, R. (2015). Resident’s attitudes

towards the impacts of tourism. Tourism Management Perspectives, 13, 33–40.

https://doi.org/10.1016/j.tmp.2014.11.002

Alonso-Almeida, M.‑M., Borrajo-Millán, F., & Yi, L. (2019). Are Social Media Data Pushing

Overtourism? The Case of Barcelona and Chinese Tourists. Sustainability, 11(12), 3356.

https://doi.org/10.3390/su11123356

American Psychological Association. (2020). Publication manual of the American Psychological

Association (7th ed.): The official guide to APA style (7th ed.). https://doi.org/10.1037/0000165-

000

Amore, A., Falk, M., & Adie, B. A. (2020). One visitor too many: assessing the degree of

overtourism in established European urban destinations. International Journal of Tourism Cities,

6(1), 117–137. https://doi.org/10.1108/IJTC-09-2019-0152

Page 99 of 133

Amt für Statistik Berlin-Brandenburg. (n.d.). Statistiken [Data Table]. Retrieved February 19, 2021,

from https://www.statistik-berlin-brandenburg.de/Statistiken/inhalt-statistiken.asp

Anderson, B. B. (Ed.) (1976). Advances in Consumer Research Volume 3. Association for

Consumer Research.

Anderson, C. (2008, June 23). The End of Theory: The Data Deluge Makes the Scientific Method

Obsolete. WIRED. https://www.wired.com/2008/06/pb-theory/

Angelov, D. (2020). Top2Vec: Distributed Representations of Topics. arXiv.

https://arxiv.org/abs/2008.09470

Anselin, L. (2020). GeoDa Workbook: Basic Mapping - Box Map.

https://geodacenter.github.io/workbook/3a_mapping/lab3a.html#box-map

Anselin, L., Syabri, I., & Kho, Y. (2006). GeoDa: An Introduction to Spatial Data Analysis.

Geographical Analysis, 38(1), 5–22. https://doi.org/10.1111/j.0016-7363.2005.00671.x

Ap, J. (1990). Residents’ perceptions research on the social impacts of tourism. Annals of Tourism

Research, 17(4), 610–616. https://doi.org/10.1016/0160-7383(90)90032-M

Ap, J. (1992). Residents’ perceptions on tourism impacts. Annals of Tourism Research, 19(4), 665–

690. https://doi.org/10.1016/0160-7383(92)90060-3

Aria, M., & Cuccurullo, C. (2017). bibliometrix : An R-tool for comprehensive science mapping

analysis. Journal of Informetrics, 11(4), 959–975. https://doi.org/10.1016/j.joi.2017.08.007

Avond, G., Bacari, C., Limea, I., Séraphin, H., Gowreesunkar, V., & Mhanna, R. (2019).

Overtourism: a result of the Janus-faced character of the tourism industry. Worldwide

Hospitality and Tourism Themes, 11(5), 552–565. https://doi.org/10.1108/WHATT-06-2019-

0039

Page 100 of 133

Babbie, E. R. (2008). The basics of social research (4th ed.). Thomson Wadsworth.

Baker, M. (2016). Is there a Reproducibility Crisis? Nature, 533, 452–454.

https://doi.org/10.1038/533452a

Barbieri da Rosa, L. A., Martins-Rodrigues, M. C., Pentiado Godoy, T., Damke, L. I., &

Gomes, C. M. (2020). Methods of Published Articles on Overtourism: Analysis and Reflections

of Scopus and Web of Science (1998-2018). In C. Ribeiro de Almeida, A. Quintano, M.

Simancas, R. Huete, & Z. Breda (Eds.), Handbook of Research on the Impacts, Challenges, and

Policy Responses to Overtourism (pp. 37–59). IGI Global. https://doi.org/10.4018/978-1-7998-

2224-0.ch003

Batagelj, V., & Mrvar, A. (2020). Pajek - Program for Large Network Analysis (Version 5.11)

[Computer software]. http://mrvar.fdv.uni-lj.si/pajek/

Benner, M. (2020). The Decline of Tourist Destinations: An Evolutionary Perspective on

Overtourism. Sustainability, 12(9), 3653. https://doi.org/10.3390/su12093653

Bertocchi, D., Camatti, N., Giove, S., & van der Borg, J. (2020). Venice and Overtourism:

Simulating Sustainable Development Scenarios through a Tourism Carrying Capacity Model.

Sustainability, 12(2), 512. https://doi.org/10.3390/su12020512

Blei, D. M. (2012). Probabilistic topic models. Communications of the ACM, 55(4), 77–84.

https://doi.org/10.1145/2133806.2133826

Blei, D. M., & Lafferty, J. D. (2006a). Correlated topic models. In Y. Weiss, B. H. Schölkopf, & J.

C. Platt (Eds.), Advances in neural information processing systems 18: Proceedings of the 2005

conference. MIT Press.

Page 101 of 133

Blei, D. M., & Lafferty, J. D. (2006b). Dynamic topic models. In W. W. Cohen & A. Moore (Eds.),

Proceedings of the 23rd international conference on Machine learning (pp. 113–120).

Association for Computing Machinery. https://doi.org/10.1145/1143844.1143859

Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine

Learning Research, 3, 993–1022.

Boissevain, J. (Ed.). (1996). Coping with Tourists: European Reactions to Mass Tourism.

Berghahn Books.

Bourliataux-Lajoinie, S., Dosquet, F., & Del Olmo Arriaga, J. L. (2019). The dark side of digital

technology to overtourism: the case of Barcelona. Worldwide Hospitality and Tourism Themes,

11(5), 582–593. https://doi.org/10.1108/WHATT-06-2019-0041

Boyd, D., & Crawford, K. (2012). Critical Questions for Big Data. Information, Communication

and Society, 15(5), 662–679.

Brown, E. K., & Miller, J. E. (2013). The Cambridge Dictionary of Linguistics. Cambridge

University Press. https://doi.org/10.1017/CBO9781139049412

Burscher, B., Odijk, D., Vliegenthart, R., de Rijke, M., & de Vreese, C. H. (2014). Teaching the

Computer to Code Frames in News: Comparing Two Supervised Machine Learning Approaches

to Frame Analysis. Communication Methods and Measures, 8(3), 190–206.

https://doi.org/10.1080/19312458.2014.937527

Butler, R. W. (1980). The Concept of a Tourist Area Cycle of Evolution: Implications for

Management of Resources. The Canadian Geographer/Le Géographe Canadien, 24(1), 5–12.

https://doi.org/10.1111/j.1541-0064.1980.tb00970.x

Butler, R. W. (2019). Tourism carrying capacity research: a perspective article. Tourism Review,

75(1), 207–211. https://doi.org/10.1108/TR-05-2019-0194

Page 102 of 133

Cambridge University Press. (n.d.). overtourism. Cambridge Dictionary. Retrieved January 12,

2021, from https://dictionary.cambridge.org/dictionary/english/overtourism

Capocchi, A., Vallone, C., Amaduzzi, A., & Pierotti, M. (2020). Is ‘overtourism’ a new issue in

tourism development or just a new term for an already known phenomenon? Current Issues in

Tourism, 23(18), 2235–2239. https://doi.org/10.1080/13683500.2019.1638353

Capocchi, A., Vallone, C., Pierotti, M., & Amaduzzi, A. (2019). Overtourism: A Literature Review

to Assess Implications and Future Perspectives. Sustainability, 11(12), 3303.

https://doi.org/10.3390/su11123303

Carvalho, F. L., Guerreiro, M., & Matos, N. (2020). Overtourism: A Systematic Review of

Literature. In C. Ribeiro de Almeida, A. Quintano, M. Simancas, R. Huete, & Z. Breda (Eds.),

Handbook of Research on the Impacts, Challenges, and Policy Responses to Overtourism

(pp. 12–36). IGI Global. https://doi.org/10.4018/978-1-7998-2224-0.ch002

Celata, F., & Romano, A. (2020). Overtourism and online short-term rental platforms in Italian

cities. Journal of Sustainable Tourism, 1–20. https://doi.org/10.1080/09669582.2020.1788568

Chyi, H. I., & McCombs, M. E. (2004). Media salience and the process of framing:: Coverage of

the Columbine school shootings. Journalism and Mass Communication Quarterly, 81(1), 22–

35. https://doi.org/10.1177/107769900408100103

Clark, C., & Nyaupane, G. P. (2020). Overtourism: An Analysis of Its Coverage In the Media by

Using Framing Theory. Tourism Review International, 24(2-3), 75–90.

https://doi.org/10.3727/154427220X15845838896314

Cobo, M. J., López-Herrera, A. G., Herrera-Viedma, E., & Herrera, F. (2011). Science mapping

software tools: Review, analysis, and cooperative study among tools. Journal of the American

Page 103 of 133

Society for Information Science and Technology, 62(7), 1382–1402.

https://doi.org/10.1002/asi.21525

Cohen, B. C. (1963). The Press and Foreign Policy. Princeton University Press.

Coldwell, W. (2017, August 10). First Venice and Barcelona: now anti-tourism marches spread

across Europe. The Guardian. https://www.theguardian.com/travel/2017/aug/10/anti-tourism-

marches-spread-across-europe-venice-barcelona

Cole, F. J., & Eales, N. B. (1917). The History of Comparative Anatomy: Part I. - A Statistical

Analysis of the Literature. Science Progress, 11(44), 578–596.

https://www.jstor.org/stable/43426882

Da Fonseca, E. N. (1973). Bibliografia Estatística e Bibliometria: Uma Reivindicação de

Prioridades. Ciência Da Informação, 2(1). http://revista.ibict.br/ciinf/article/view/19

De Battisti, F., Ferrara, A., & Salini, S. (2015). A decade of research in statistics: a topic model

approach. Scientometrics, 103(2), 413–433. https://doi.org/10.1007/s11192-015-1554-1

De La Calle-Vaquero, M., García-Hernández, M., Mendoza de Miguel, S., & Ferreiro-Calzada, E.

(2020). In Search of Overtourism Indicators in Urban Centres. In C. Ribeiro de Almeida, A.

Quintano, M. Simancas, R. Huete, & Z. Breda (Eds.), Handbook of Research on the Impacts,

Challenges, and Policy Responses to Overtourism (pp. 302–324). IGI Global.

https://doi.org/10.4018/978-1-7998-2224-0.ch016

Deery, M., Jago, L., & Fredline, L. (2012). Rethinking social impacts of tourism research: A new

research agenda. Tourism Management, 33(1), 64–73.

https://doi.org/10.1016/j.tourman.2011.01.026

Defert, P. (1954). Essai de localisation touristique. The Tourist Review, 9(3), 110–125.

https://doi.org/10.1108/eb059744

Page 104 of 133

Dickinger, A., Lalicic, L., & Mazanec, J. A. (2017). Exploring the generalizability of discriminant

word items and latent topics in online tourist reviews. International Journal of Contemporary

Hospitality Management, 29(2), 803–816. https://doi.org/10.1108/IJCHM-10-2015-0597

Dickinson, G. (2018). Overtourism: New Word Suggestion. Collins Dictionary.

https://www.collinsdictionary.com/submission/19794/Overtourism

DiMaggio, P., Nag, M., & Blei, D. M. (2013). Exploiting affinities between topic modeling and

the sociological perspective on culture: Application to newspaper coverage of U.S. government

arts funding. Poetics, 41(6), 570–606. https://doi.org/10.1016/j.poetic.2013.08.004

Dodds, R., & Butler, R. W. (2019). The phenomena of overtourism: a review. International

Journal of Tourism Cities, 5(4), 519–528. https://doi.org/10.1108/IJTC-06-2019-0090

Downs, A. (1972). Up and down with ecology – the “issue-attention cycle”. Public Interest, 28,

38–50.

Doxey, G. V. (1975). A causation theory of visitor-resident irritants: Methodology and research

inferences. In The impact of tourism: Sixth annual conference proceedings (pp. 195–198).

Travel Research Association.

Dredge, D. (2017, September 13). “Overtourism” Old wine in new bottles? LinkedIn Pulse.

https://www.linkedin.com/pulse/overtourism-old-wine-new-bottles-dianne-dredge

Easterling, D. S. (2004). The Residents’ Perspective in Tourism Research: A Review and Synthesis.

Journal of Travel & Tourism Marketing, 17(4), 45–62. https://doi.org/10.1300/J073v17n04_05

Emerson, R. M. (1976). Social Exchange Theory. Annual Review of Sociology, 2, 335–362.

https://doi.org/10.1146/annurev.so.02.080176.002003

Entman, R. M. (1993). Framing: Toward Clarification of a Fractured Paradigm. Journal of

Communication, 43(4), 51–58. https://doi.org/10.1111/j.1460-2466.1993.tb01304.x

Page 105 of 133

Eroglu, S. A., & Harrell, G. D. (1986). Retail crowding: Theoretical and strategic implications.

Journal of Retailing, 62(4), 346–363.

Eroglu, S. A., & Machleit, K. A. (1990). An empirical study of retail crowding: Antecedents and

consequences. Journal of Retailing, 66(2), 201–221.

(2020). The European Cities Marketing Benchmarking Report: 16th Official Edition 2019-2020.

European Cities Marketing.

Eurostat. (n.d.). Air passenger transport by main airports in each reporting country (avia_paoa)

[Data Set]. Retrieved February 19, 2021, from

https://appsso.eurostat.ec.europa.eu/nui/show.do?dataset=avia_paoa&lang=en

Feinerer, I., Hornik, K., & Meyer, D. (2008). Text Mining Infrastructure in R. Journal of Statistical

Software, 25(5). https://doi.org/10.18637/jss.v025.i05

Figuerola, C. G., García Marco, F. J., & Pinto, M. (2017). Mapping the evolution of library and

information science (1978–2014) using topic modeling on LISA. Scientometrics, 112(12),

1507–1535. https://doi.org/10.1007/s11192-017-2432-9

Firth, J. R. (1962). A Synopsis of Linguistic Theory, 1930-1955. In J. R. Firth, W. Haas, M.

Halliday, W. S. Allen, R. H. Robins, R. K. Prigg, F. R. Palmer, J. Carnochan, & T. F. Mitchell

(Eds.), Studies in Linguistic Analysis (pp. 1–32). Basil Blackwell.

Flughafen Wien AG. (2020). Annual Report 2019.

https://www.viennaairport.com/jart/prj3/va/uploads/data-

uploads/Konzern/Investor%20Relations/Geschaeftsberichte/VIE_GB_2019_en.pdf

Gamson, W. A., & Modigliani, A. (1987). The Changing Culture of Affirmative Action. In R. G.

Braungart & M. M. Braungart (Eds.), Research in Political Sociology (3rd ed., pp. 137–177).

JAI Press.

Page 106 of 133

Gartner, W. C. (1994). Image Formation Process. Journal of Travel & Tourism Marketing, 2(2-3),

191–216. https://doi.org/10.1300/J073v02n02_12

Goffman, E. (1974). Frame analysis: An essay on the organization of experience. Harvard

University Press.

Grimmer, J., & Stewart, B. M. (2013). Text as Data: The Promise and Pitfalls of Automatic

Content Analysis Methods for Political Texts. Political Analysis, 21(3), 267–297.

https://doi.org/10.1093/pan/mps028

Guevara Manzo, G., Ruggles-Brise, O., Turner, R., Constantin, M., Dichter, A., Köpke, S.,

Lim, C., & Seitzman, N. (2017). Coping with Success: Managing Overcrowding in Tourism

Destinations. World Travel & Tourism Council, McKinsey & Company.

https://www.mckinsey.com/~/media/McKinsey/Industries/Travel%20Logistics%20and%20Inf

rastructure/Our%20Insights/Coping%20with%20success%20Managing%20overcrowding%2

0in%20tourism%20destinations/Coping-with-success-Managing-overcrowding-in-tourism-

destinations.pdf

Guizi, A., Breda, Z., & Costa, R. (2020). How are overtourism and host–guest relationships

portrayed by the Portuguese print media? International Journal of Tourism Cities, 6(1), 215–

232. https://doi.org/10.1108/IJTC-06-2019-0081

Hadinejad, A., Moyle, B. D., Scott, N., Kralj, A., & Nunkoo, R. (2019). Residents’ attitudes to

tourism: a review. Tourism Review, 74(2), 150–165. https://doi.org/10.1108/TR-01-2018-0003

Hansen, T. (2020). Media framing of Copenhagen tourism: A new approach to public opinion

about tourists. Annals of Tourism Research, 84, 102975.

https://doi.org/10.1016/j.annals.2020.102975

Page 107 of 133

Harrell, G. D., & Hutt, M. D. (1976). Buyer Behavior under Conditions of Crowding: An Initial

Framework. In B. B. Anderson (Ed.), Advances in Consumer Research Volume 3 (pp. 36–39).

Association for Consumer Research.

Harrell, G. D., Hutt, M. D., & Anderson, J. C. (1980). Path Analysis of Buyer Behavior under

Conditions of Crowding. Journal of Marketing Research, 17(1), 45–51.

https://doi.org/10.2307/3151115

Harrill, R. (2004). Residents’ Attitudes toward Tourism Development: a Literature Review with

Implications for Tourism Planning. Journal of Planning Literature, 18(3), 251–266.

https://doi.org/10.1177/0885412203260306

Harris, Z. S. (1954). Distributional Structure. WORD, 10(2-3), 146–162.

https://doi.org/10.1080/00437956.1954.11659520

Heidenreich, T., Lind, F., Eberl, J.‑M., & Boomgaarden, H. G. (2019). Media Framing Dynamics

of the ‘European Refugee Crisis’: A Comparative Topic Modelling Approach. Journal of

Refugee Studies, 32(Special Issue 1), 172-182. https://doi.org/10.1093/jrs/fez025

Henley, J. (2020, January 25). Overtourism in Europe’s historic cities sparks backlash. The

Guardian. https://www.theguardian.com/world/2020/jan/25/overtourism-in-europe-historic-

cities-sparks-backlash

Herron, C. (1981, August 6). Ansel Adams; Protecting the Environment with a Camera. The

Christian Science Monitor.

https://www.csmonitor.com/1981/0806/080660.html#:~:text=The%20wilderness%20and%20

mountains%20have,stomping%20ground%20throughout%20his%20life.&text=The%20work

s%20of%20Ansel%20Adams,preserving%20their%20continent%27s%20wilderness%20areas.

Page 108 of 133

Higgins-Desbiolles, F., Carnicelli, S., Krolikowski, C., Wijesinghe, G., & Boluk, K. (2019).

Degrowing tourism: rethinking tourism. Journal of Sustainable Tourism, 27(12), 1926–1944.

https://doi.org/10.1080/09669582.2019.1601732

Hood, W. W., & Wilson, C. S. (2001). The literature of bibliometrics, scientometrics, and

informetrics. Scientometrics, 52(2), 291–314. https://doi.org/10.1023/A:1017919924342

Hu, N., Zhang, T., Gao, B., & Bose, I. (2019). What do hotel customers complain about? Text

analysis using structural topic model. Tourism Management, 72, 417–426.

https://doi.org/10.1016/j.tourman.2019.01.002

Huettermann, M., Thimm, T., Hannich, F., & Bild, C. (2019). Requirements for future digital

visitor flow management. Journal of Tourism Futures, 5(3), 241–258.

https://doi.org/10.1108/JTF-03-2019-0023

Inside Airbnb. (n.d.). Get the Data: Vienna, Vienna, Austria [Data File]. Retrieved February 11,

2021, from http://insideairbnb.com/get-the-data.html

Jacobi, C., van Atteveldt, W., & Welbers, K. (2016). Quantitative analysis of large amounts of

journalistic texts using topic modelling. Digital Journalism, 4(1), 89–106.

https://doi.org/10.1080/21670811.2015.1093271

Kádár, B. (2014). Measuring tourist activities in cities using geotagged photography. Tourism

Geographies, 16(1), 88–104. https://doi.org/10.1080/14616688.2013.868029

Kahneman, D., & Tversky, A. (1984). Choices, values, and frames. American Psychologist, 39(4),

341–350. https://doi.org/10.1037/0003-066X.39.4.341

Karantzavelou, V. (2020, February 17). Dimitrios Buhalis (Bournemouth University): “There is no

Overtourism - only badly managed tourism”. TravelDailyNews.

Page 109 of 133

https://www.traveldailynews.com/post/dimitrios-buhalis-bournemouth-university-there-is-no-

overtourism-only-badly-managed-tourism

Kettner, N., Kolesa, D., Kohl, J., Költringer, C., Tschöll, P., Zettel, A., Antalobsky, E., Bartik, H.,

Hofinger, J., & Lutter, J. (Eds.). (2021). Shaping Vienna: Vienna Visitor Economy Strategy 2025.

Vienna Tourist Board. https://strategie-2025.cdn.prismic.io/strategie-2025/d9c9e350-f5f3-

4917-9348-9162a2e81fe3_Toursimusstrategie_Strategie2025_EN_korr.pdf

Kirilenko, A. P., & Stepchenkova, S. (2018). Tourism research from its inception to present day:

Subject area, geography, and gender distributions. PloS One, 13(11), Article e0206820.

https://doi.org/10.1371/journal.pone.0206820

Koens, K., Postma, A., & Papp, B. (2018). Is Overtourism Overused? Understanding the Impact

of Tourism in a City Context. Sustainability, 10(12), 4384. https://doi.org/10.3390/su10124384

Koh, E., & Fakfare, P. (2019). Overcoming “over-tourism”: the closure of Maya Bay. International

Journal of Tourism Cities, 6(2), 279–296. https://doi.org/10.1108/IJTC-02-2019-0023

Koseoglu, M. A., Rahimi, R., Okumus, F., & Liu, J. (2016). Bibliometric studies in tourism.

Annals of Tourism Research, 61, 180–198. https://doi.org/10.1016/j.annals.2016.10.006

Kotzeva, M., Brandmüller, T., & Önnerfors, Å. (Eds.). (2019). Eurostat regional yearbook: 2019

edition. Eurostat. https://doi.org/10.2785/1522

Krippendorf, J. (1986). Die Ferienmenschen: Für ein neues Verständnis von Freizeit und Reisen.

Deutscher Taschenbuch-Verlag.

Kuščer, K., & Mihalič, T. (2019). Residents’ Attitudes towards Overtourism from the Perspective

of Tourism Impacts and Cooperation–The Case of Ljubljana. Sustainability, 11(6), 1823.

https://doi.org/10.3390/su11061823

Page 110 of 133

Lai, H. K., Pinto, P., & Pintassilgo, P. (2020). Overtourism: Impacts on Residents’ Quality of Life

– Evidence From Macau. In C. Ribeiro de Almeida, A. Quintano, M. Simancas, R. Huete, & Z.

Breda (Eds.), Handbook of Research on the Impacts, Challenges, and Policy Responses to

Overtourism (pp. 75–93). IGI Global. https://doi.org/10.4018/978-1-7998-2224-0.ch005

Landes, K. K. (1951). A Scrutiny of the Abstract. Bulletin of the American Association of

Petroleum Geologists, 35(7), 1660–1680.

Leung, X. Y., Xue, L., & Wen, H. (2019). Framing the sharing economy: Toward a sustainable

ecosystem. Tourism Management, 71, 44–53. https://doi.org/10.1016/j.tourman.2018.09.021

Leximancer. (n.d.). Text In Insight Out. Retrieved April 21, 2020, from

https://info.leximancer.com/.

LexisNexis. (n.d.). Developing a Search (with Terms and Connectors). Retrieved January 24, 2021,

from http://www.lexisnexis.com/help/global/AU/en_AU/gh_terms.asp

Lindberg, K., McCool, S., & Stankey, G. (1997). Rethinking carrying capacity. Annals of Tourism

Research, 24(2), 461–465. https://doi.org/10.1016/S0160-7383(97)80018-7

Lippmann, W. (1922). Public opinion. Macmillan.

Liu, H., Liu, Y., Wang, Y. , & Pan, C. (2019). Hot topics and emerging trends in tourism forecasting

research: A scientometric review. Tourism Economics, 25(3), 448–468.

https://doi.org/10.1177/1354816618810564

MA 18. (n.d.). Stadtgebiet nach Nutzungsklassen und Bezirken 2020 [Data Table]. Stadt Wien.

Retrieved September 3, 2020, from

https://www.wien.gv.at/statistik/lebensraum/tabellen/nutzungsklassen-bez.html

MA 21. (n.d.). Zählbezirksgrenzen Wien [Data File]. Stadt Wien. Retrieved September 3, 2020,

from https://www.data.gv.at/katalog/dataset/stadt-wien_zhlbezirksgrenzenwien

Page 111 of 133

MA 23. (n.d.–a). Bevölkerung seit 1869 Wien [Data File]. Stadt Wien. Retrieved September 3,

2020, from https://www.data.gv.at/katalog/dataset/091a085f-2652-429f-8dde-c69199440ddf

MA 23. (n.d.–b). Gästebetten in Wien nach Betriebskategorien 2000 bis 2019 [Data Table]. Stadt

Wien. Retrieved September 3, 2020, from

https://www.wien.gv.at/statistik/wirtschaft/tabellen/betten-betriebskat-zr.html

MA 23. (n.d.–c). Gästeübernachtungen nach Bezirken 2007 bis 2019 [Data Table]. Stadt Wien.

Retrieved September 3, 2020, from https://www.wien.gv.at/statistik/wirtschaft/tabellen/uebern-

bezirk-zr.html

MA 23. (n.d.–d). Hotels und Pensionen in Wien nach Betriebskategorien 2000 bis 2019 [Data

Table]. Stadt Wien. Retrieved September 3, 2020, from

https://www.wien.gv.at/statistik/wirtschaft/tabellen/hotels-betriebskat-zr.html

MA 41. (n.d.). Bezirksgrenzen Wien [Data File]. Stadt Wien. Retrieved September 3, 2020, from

https://www.data.gv.at/katalog/dataset/2ee6b8bf-6292-413c-bb8b-bd22dbb2ad4b

MacMillan Education Limited. (n.d.). over-: Definition 1. macmillan dictionary. Retrieved

February 7, 2021, from

https://www.macmillandictionary.com/dictionary/british/over_3?q=over-

MacMillan Education Limited. (2019). overtourism. Open Dictionary.

https://www.macmillandictionary.com/dictionary/british/overtourism

Maitland, R., & Newman, P. (2009a). Developing world tourism cities. In R. Maitland & P.

Newman (Eds.), World Tourism Cities: Developing tourism off the beaten track (pp. 1–21).

Routledge.

Maitland, R., & Newman, P. (Eds.). (2009b). World Tourism Cities: Developing tourism off the

beaten track. Routledge.

Page 112 of 133

Markos, B. (1949). Analyse de l’intensité touristique à la lumière de la statistique touristique

internationale. The Tourist Review, 4(3), 131–133. https://doi.org/10.1108/eb059642

Marković, S., Perić, M., Mijatov, M., Doljak, D., & Žolna, M. (2017). Application of tourist

function indicators in tourism development. Journal of the Geographical Institute “Jovan

Cvijić” SASA, 67(2), 163–178. https://doi.org/10.2298/IJGI1702163M

Martín Martín, J. M., Guaita Martínez, J. M., & Salinas Fernández, J. A. (2018). An Analysis of

the Factors behind the Citizen’s Attitude of Rejection towards Tourism in a Context of

Overtourism and Economic Dependence on This Activity. Sustainability, 10(8), 2851.

https://doi.org/10.3390/su10082851

Mazanec, J. A. (2009). Unravelling Myths in Tourism Research. Tourism Recreation Research,

34(3), 319–323. https://doi.org/10.1080/02508281.2009.11081606

Mazanec, J. A. (2017). Determining Long-Term Change in Tourism Research Language With

Text-Mining Methods. Tourism Analysis, 22(1), 75–83.

https://doi.org/10.3727/108354217X14828625279771

Mazanec, J. A. (2020). Hidden theorizing in big data analytics: With a reference to tourism design

research. Annals of Tourism Research, 83, 102931.

https://doi.org/10.1016/j.annals.2020.102931

McCombs, M. E., & Ghanem, S. I. (2001). The Convergence of Agenda Setting and Framing. In

S. D. Reese, O. H. Gandy, & A. E. Grant (Eds.), Framing Public Life: Perspectives on media

and our understanding of the social world (pp. 67–82). Routledge.

McCombs, M. E., Llamas, J. P., Lopez-Escobar, E., & Rey, F. (1997). Candidate Images in

Spanish Elections: Second-Level Agenda-Setting Effects. Journalism & Mass Communication

Quarterly, 74(4), 703–717. https://doi.org/10.1177/107769909707400404

Page 113 of 133

McCombs, M. E., & Shaw, D. L. (1972). The Agenda-Setting Function of Mass Media. Public

Opinion Quarterly, 36(2), 176–187. https://doi.org/10.1086/267990

McCool, S. F., & Lime, D. W. (2001). Tourism Carrying Capacity: Tempting Fantasy or Useful

Reality? Journal of Sustainable Tourism, 9(5), 372–388.

https://doi.org/10.1080/09669580108667409

McGehee, N. G., & Andereck, K. L. (2004). Factors Predicting Rural Residents’ Support of

Tourism. Journal of Travel Research, 43(2), 131–140.

https://doi.org/10.1177/0047287504268234

Merigó, J. M., Mulet-Forteza, C., Valencia, C., & Lew, A. A. (2019). Twenty years of Tourism

Geographies: a bibliometric overview. Tourism Geographies, 21(5), 881–910.

https://doi.org/10.1080/14616688.2019.1666913

Might, M. (2010). The illustrated guide to a Ph.D. http://matt.might.net/articles/phd-school-in-

pictures/

Mikolov, T., Sutskever, I., Chen, K., Corrado, G., & Dean, J. (2013). Distributed Representations

of Words and Phrases and their Compositionality. arXiv. https://arxiv.org/abs/1310.4546

Milano, C., Cheer, J. M., & Novelli, M. (Eds.). (2019). Overtourism: Excesses, discontents and

measures in travel and tourism. CABI. https://doi.org/10.1079/9781786399823.0000

Milano, C., Novelli, M., & Cheer, J. M. (2019a). Overtourism and degrowth: a social movements

perspective. Journal of Sustainable Tourism, 27(12), 1857–1875.

https://doi.org/10.1080/09669582.2019.1650054

Milano, C., Novelli, M., & Cheer, J. M. (2019b). Overtourism and Tourismphobia: A Journey

Through Four Decades of Tourism Development, Planning and Local Concerns [Editorial].

Tourism Planning & Development, 16(4), 353–357.

Page 114 of 133

Moher, D., Liberati, A., Tetzlaff, J., Altman, D. G., & PRISMA Group (2009). Preferred reporting

items for systematic reviews and meta-analyses: the PRISMA statement. PLoS Medicine, 6(7),

e1000097. https://doi.org/10.1371/journal.pmed.1000097

Moody, C. E. (2016). Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec.

arXiv. https://arxiv.org/abs/1605.02019

Moral-Muñoz, J. A., Herrera-Viedma, E., Santisteban-Espejo, A., & Cobo, M. J. (2020). Software

tools for conducting bibliometric analysis in science: An up-to-date review. El Profesional De

La Información, 29(1), Article e290103. https://doi.org/10.3145/epi.2020.ene.03

Morris, S. (2020, July 31). English beaches packed despite Covid-19 social distancing plea. The

Guardian. https://www.theguardian.com/uk-news/2020/jul/31/england-beaches-packed-

despite-covid-19-social-distancing-plea

Moyle, B. D., Moyle, C., Ruhanen, L., Weaver, D., & Hadinejad, A. (2021). Are we really

progressing sustainable tourism research? A bibliometric analysis. Journal of Sustainable

Tourism, 29(1), 106–122. https://doi.org/10.1080/09669582.2020.1817048

Muler Gonzalez, V., Coromina, L., & Galí, N. (2018). Overtourism: residents’ perceptions of

tourism impact as an indicator of resident social carrying capacity - case study of a Spanish

heritage town. Tourism Review, 73(3), 277–296. https://doi.org/10.1108/TR-08-2017-0138

Mulet-Forteza, C., Martorell-Cunill, O., Merigó, J. M., Genovart-Balaguer, J., & Mauleon-

Mendez, E. (2018). Twenty five years of the Journal of Travel & Tourism Marketing: a

bibliometric ranking. Journal of Travel & Tourism Marketing, 35(9), 1201–1221.

https://doi.org/10.1080/10548408.2018.1487368

Nelson, L. K. (2020). Computational Grounded Theory: A Methodological Framework.

Sociological Methods & Research, 49(1), 3–42. https://doi.org/10.1177/0049124117729703

Page 115 of 133

Nepal, R., & Nepal, S. K. (2019). Managing overtourism through economic taxation: policy

lessons from five countries. Tourism Geographies. Advance online publication.

https://doi.org/10.1080/14616688.2019.1669070

Neuts, B., & Nijkamp, P. (2012). Tourist crowding perception and acceptability in cities: An

Applied Modelling Study on Bruges. Annals of Tourism Research, 39(4), 2133–2153.

https://doi.org/10.1016/j.annals.2012.07.016

Nicholls, T., & Culpepper, P. D. (2020). Computational Identification of Media Frames: Strengths,

Weaknesses, and Opportunities. Political Communication. Advance online publication.

https://doi.org/10.1080/10584609.2020.1812777

Novy, J. (2019). Urban tourism as a bone of contention: four explanatory hypotheses and a caveat.

International Journal of Tourism Cities, 5(1), 63–74. https://doi.org/10.1108/IJTC-01-2018-

0011

Novy, J., & Colomb, C. (2019). Urban Tourism as a Source of Contention and Social

Mobilisations: A Critical Review. Tourism Planning & Development, 16(4), 358–375.

https://doi.org/10.1080/21568316.2019.1577293

Nunkoo, R., Smith, S. L. J., & Ramkissoon, H. (2013). Residents’ attitudes to tourism: a

longitudinal study of 140 articles from 1984 to 2010. Journal of Sustainable Tourism, 21(1), 5–

25. https://doi.org/10.1080/09669582.2012.673621

O’Reilly, A. M. (1986). Tourism carrying capacity: Concept and issues. Tourism Management,

7(4), 254–258. https://doi.org/10.1016/0261-5177(86)90035-X

Otlet, P. (1934). Traité de Documentation: Le Livre sur le Livre, Théorie et Pratique. Editiones

Mundaneum.

Page 116 of 133

Oxford Royale Academy. (n.d.). Black and White and Read All Over: A Guide to British

Newspapers. Retrieved January 24, 2021, from https://www.oxford-royale.com/articles/a-

guide-to-british-newspapers/

Oxford University Press. (n.d.). Word of the Year 2018: Shortlist. Retrieved February 16, 2021,

from https://languages.oup.com/word-of-the-year/2018-shortlist/

Pasquinelli, C., & Trunfio, M. (2020). Overtouristified cities: an online news media narrative

analysis. Journal of Sustainable Tourism, 28(11), 1805–1824.

https://doi.org/10.1080/09669582.2020.1760871

Peeters, P., Gössling, S., Klijs, J., Milano, C., Novelli, M., Dijkmans, C., Eijgelaar, E.,

Hartman, S., Heslinga, J., Isaac, R., Mitas, O., Moretti, S., Nawijn, J., Papp, B., & Postma, A.

(2018). Research for TRAN Committee - Overtourism: impact and possible policy responses.

European Parliament, Policy Department for Structural and Cohesion Policies.

https://doi.org/10.2861/919195

Perkumienė, D., & Pranskūnienė, R. (2019). Overtourism: Between the Right to Travel and

Residents’ Rights. Sustainability, 11(7), 2138. https://doi.org/10.3390/su11072138

Perles-Ribes, J. F., Ramón‐Rodríguez, A. B., Moreno‐Izquierdo, L., & Such‐Devesa, M. J. (2020).

Machine learning techniques as a tool for predicting overtourism: The case of Spain.

International Journal of Tourism Research, 22(6), 825–838. https://doi.org/10.1002/jtr.2383

Persson, O., Danell, R., & Schneider, J. W. (2009). How to use Bibexcel for various types of

bibliometric analysis. In F. Åström, R. Danell, B. Larsen, & J. W. Schneider (Eds.), Celebrating

Scholarly Communication Studies: A Festschrift for Olle Persson at his 60th Birthday (Special

volume of the e-zine of the ISSI, vol. 05-S June 2009, pp. 9–24).

Page 117 of 133

Phi, G. T. (2020). Framing overtourism: a critical news media analysis. Current Issues in Tourism,

23(17), 2093–2097. https://doi.org/10.1080/13683500.2019.1618249

Pitkiiz, R. M. (1987). The Importance of the Abstract. Obstetrics & Gynecology, 70(2), 267.

Pizam, A. (1978). Tourism’s Impacts: The Social Costs to the Destination Community as Perceived

by Its Residents. Journal of Travel Research, 16(4), 8–12.

https://doi.org/10.1177/004728757801600402

Plato. (2012). The Republic ((B. Jowett, Trans.)) (2nd ed.). Andrews UK. (Original work published

ca. 375 B.C.)

Popper, K. R. (1945). The Open Society and Its Enemies Volume II: The High Tide of Prophecy:

Hegel, Marx, and the Aftermath. Routledge.

Popper, K. R. (1959). The Logic of Scientific Discovery. Hutchinson. (Original work published

1934)

Pritchard, A. (1969). Statistical Bibliography or Bibliometrics? Journal of Documentation, 25(4),

348–349.

QGIS Development Team. (2020). QGIS Geographic Information System (Version 3.10.9)

[Computer software]. https://qgis.org/

R Core Team. (2020). R: A Language and Environment for Statistical Computing (Version 4.02)

[Computer software]. R Foundation for Statistical Computing. Vienna, Austria. https://www.r-

project.org/

Rapoport, A. (1975). Toward a Redefinition of Density. Environment and Behavior, 7(2), 133–158.

https://doi.org/10.1177/001391657500700202

Page 118 of 133

Rasoolimanesh, S. M., & Seyfi, S. (2020). Residents’ perceptions and attitudes towards tourism

development: a perspective article. Tourism Review, 76(1), 51–57. https://doi.org/10.1108/TR-

11-2019-0461

Rejón-Guardia, F., Marković, N., & García-Sastre, M. A. (2020). The Development of a Scale to

Measure Tourism-Phobia: An Exploratory Case of Residents in Majorca. In C. Ribeiro de

Almeida, A. Quintano, M. Simancas, R. Huete, & Z. Breda (Eds.), Handbook of Research on

the Impacts, Challenges, and Policy Responses to Overtourism (pp. 217–236). IGI Global.

https://doi.org/10.4018/978-1-7998-2224-0.ch012

Ribeiro de Almeida, C., Quintano, A., Simancas, M., Huete, R., & Breda, Z. (Eds.). (2020).

Handbook of Research on the Impacts, Challenges, and Policy Responses to Overtourism. IGI

Global. https://doi.org/10.4018/978-1-7998-2224-0

World Tourism Organization. (2017, August 15). Tourism: growth is not the enemy; it’s how we

manage it that counts [Press release]. https://www.unwto.org/archive/global/press-

release/2017-08-15/tourism-growth-not-enemy-it-s-how-we-manage-it-counts

Roberts, M. E., Stewart, B. M., & Airoldi, E. M. (2016). A Model of Text for Experimentation in

the Social Sciences. Journal of the American Statistical Association, 111(515), 988–1003.

https://doi.org/10.1080/01621459.2016.1141684

Roberts, M. E., Stewart, B. M., & Tingley, D. (2019). stm : An R Package for Structural Topic

Models. Journal of Statistical Software, 91(2). https://doi.org/10.18637/jss.v091.i02

Roberts, M. E., Stewart, B. M., Tingley, D., & Airoldi, E. M. (2013, December 10). The Structural

Topic Model and Applied Social Science [Paper presentation}. Advances in Neural Information

Processing Systems Workshop on Topic Models: Computation, Application, and Evaluation.

http://mimno.infosci.cornell.edu/nips2013ws/nips2013tm_submission_24.pdf

Page 119 of 133

Roberts, M. E., Stewart, B. M., Tingley, D., Lucas, C., Leder-Luis, J., Kushner Gadarian, S.,

Albertson, B., & Rand, D. G. (2014). Structural Topic Models for Open-Ended Survey

Responses. American Journal of Political Science, 58(4), 1064–1082.

https://doi.org/10.1111/ajps.12103

Rosenow, J. E., & Pulsipher, G. L. (1979). Tourism: The good, the bad, and the ugly. Century

Three Press.

Rugg, G., & Petre, M. (2004). The Unwritten Rules of PhD Research. Open University Press.

Ruhanen, L., Weiler, B., Moyle, B. D., & McLennan, C. J. (2015). Trends and patterns in

sustainable tourism research: a 25-year bibliometric analysis. Journal of Sustainable Tourism,

23(4), 517–535. https://doi.org/10.1080/09669582.2014.978790

Sarantakou, E., & Terkenli, T. S. (2019). Non-Institutionalized Forms of Tourism Accommodation

and Overtourism Impacts on the Landscape: The Case of Santorini, Greece. Tourism Planning

& Development, 16(4), 411–433. https://doi.org/10.1080/21568316.2019.1569122

Schmiedel, T., Müller, O., & vom Brocke, J. (2019). Topic Modeling as a Strategy of Inquiry in

Organizational Research: A Tutorial With an Application Example on Organizational Culture.

Organizational Research Methods, 22(4), 941–968.

https://doi.org/10.1177/1094428118773858

Schweinsberg, S., Darcy, S., & Cheng, M. (2017). The agenda setting power of news media in

framing the future role of tourism in protected areas. Tourism Management, 62, 241–252.

https://doi.org/10.1016/j.tourman.2017.04.011

Semetko, H. A., & Valkenburg, P. M. (2000). Framing European politics: A Content Analysis of

Press and Television News. Journal of Communication, 50(2), 93–109.

https://doi.org/10.1111/j.1460-2466.2000.tb02843.x

Page 120 of 133

Séraphin, H., Sheeran, P., & Pilato, M. (2018). Over-tourism and the fall of Venice as a destination.

Journal of Destination Marketing & Management, 9, 374–376.

https://doi.org/10.1016/j.jdmm.2018.01.011

Sharpley, R. (2014). Host perceptions of tourism: A review of the research. Tourism Management,

42, 37–49. https://doi.org/10.1016/j.tourman.2013.10.007

Shelby, B., Vaske, J. J., & Heberlein, T. A. (1989). Comparative analysis of crowding in multiple

locations: Results from fifteen years of research. Leisure Sciences, 11(4), 269–291.

https://doi.org/10.1080/01490408909512227

SimpleMaps.com. (n.d.). World Cities Database - Basic [Data File]. Retrieved January 31, 2021,

from https://simplemaps.com/data/world-cities

Singh, T. (2018). Is over-tourism the downside of mass tourism? [Editorial]. Tourism Recreation

Research, 43(4), 415–416. https://doi.org/10.1080/02508281.2018.1513890

Skift, Inc. (2018). Overtourism(Registration No. 5494076). U.S. Patent and Trademark Office.

https://uspto.report/TM/87257024

Spiegel, B. (1961). Die Struktur der Meinungsverteilung im sozialen Feld: Das psychologische

Marktmodell. Huber.

Stamolampros, P., Korfiatis, N., Kourouthanassis, P., & Symitsi, E. (2019). Flying to Quality:

Cultural Influences on Online Reviews. Journal of Travel Research, 58(3), 496–511.

https://doi.org/10.1177/0047287518764345

Stankey, G. H., Cole, D. N., Lucas, R. C., Petersen, M. E., & Frissell, S. S. (1985). The limits of

acceptable change (LAC) system for wilderness planning (General technical report INT-176).

Ogden, UT. United States Department of Agriculture, Forest Service, Intermountain Forest and

Range Experiment Station.

Page 121 of 133

Stankey, G. H., & McCool, S. F. (1984). Carrying capacity in recreational settings: Evolution,

appraisal, and application. Leisure Sciences, 6(4), 453–473.

https://doi.org/10.1080/01490408409513048

Statistik Austria. (2020). Gliederung Österreichs in NUTS-Einheiten: Gebietsstand 1.1.2020.

https://www.statistik.at/wcm/idc/idcplg?IdcService=GET_PDF_FILE&RevisionSelectionMet

hod=LatestReleased&dDocName=023722

Statistik Austria, & MA 23. (n.d.). Bevölkerung nach Bezirken 2005 bis 2020 [Data Table]. Stadt

Wien. Retrieved September 3, 2020, from

https://www.wien.gv.at/statistik/bevoelkerung/tabellen/bevoelkerung-bez-zr.html

Štefko, R., Vašaničová, P., Litavcová, E., & Jenčová, S. (2018). Tourism Intensity in the NUTS III

Regions of Slovakia. Journal of Tourism and Services, 9(16), 45–59.

https://doi.org/10.29036/jots.v9i16.43

Stepchenkova, S., & Eales, J. S. (2011). Destination Image as Quantified Media Messages: The

Effect of News on Tourism Demand. Journal of Travel Research, 50(2), 198–212.

https://doi.org/10.1177/0047287510362780

Stokols, D. (1972). On the distinction between density and crowding: Some implications for future

research. Psychological Review, 79(3), 275–277. https://doi.org/10.1037/h0032706

Sundt, H.‑A. (1950). La statistique du tourisme: A propos de l’«intensité touristique». The Tourist

Review, 5(1), 4–7. https://doi.org/10.1108/eb059652

Swiss Academic Software GmbH. (2021). Citavi: Reference Management and Knowledge

Organization (Version 6.8) [Computer software]. https://www.citavi.com/en

Takeshita, T. (2006). Current Critical Problems in Agenda-Setting Research. International Journal

of Public Opinion Research, 18(3), 275–296. https://doi.org/10.1093/ijpor/edh104

Page 122 of 133

Taş Gürsoy, İ. (2019). Beauty and the Beast: A Fairy Tale of Tourismphobia. Tourism Planning &

Development, 16(4), 434–451. https://doi.org/10.1080/21568316.2019.1582086

Tokarchuk, O., Gabriele, R., & Maurer, O. (2020). Estimating tourism social carrying capacity.

Annals of Tourism Research, 86(3), 102971. https://doi.org/10.1016/j.annals.2020.102971

(n.d.–a). TourMIS. https://www.tourmis.info/

TourMIS. (n.d.–b). Cities [Data Set]. Retrieved February 19, 2021, from https://www.tourmis.info

Tribe, J., & Liburd, J. J. (2016). The tourism knowledge system. Annals of Tourism Research, 57,

44–61. https://doi.org/10.1016/j.annals.2015.11.011

Trochim, W. M. K., & Donnelly, J. P. (2006). The research methods knowledge base (3rd ed.).

Atomic Dog.

Turner, L., & Ash, J. (1975). The golden hordes: International tourism and the pleasure periphery.

Constable.

Tussyadiah, I., & Miller, G. (2020). Imagining the Future of Travel: Technology and Sustainability

Transitions. E-Review of Tourism Research, 17(5), 674–684.

https://journals.tdl.org/ertr/index.php/ertr/article/view/551

UNESCO. (n.d.). Austria: Properties inscribed on the World Heritage List. Retrieved February 16,

2021, from https://whc.unesco.org/en/statesparties/at

UNESCO. (2017, July 6). Historic Centre of Vienna inscribed on List of World Heritage in Danger

[Press release]. https://en.unesco.org/news/historic-centre-vienna-inscribed-list-world-

heritage-danger

Page 123 of 133

UNWTO. (1981). Report of the Secretary General on the General Programme of Work for the

Period 1980-1981 (A/4/12 Add. B.3.2.1.). https://www.e-

unwto.org/doi/epdf/10.18111/unwtogad.1981.1.un406362r557g40k

UNWTO. (2021). COVID-19 and Tourism | 2020: A year in review. https://www.unwto.org/covid-

19-and-tourism-2020

UNWTO, Centre of Expertise Leisure, Tourism & Hospitality, NHTV Breda University of Applied

Sciences, & NHL Stenden University of Applied Sciences. (2018). ‘Overtourism’? –

Understanding and Managing Urban Tourism Growth beyond Perceptions. World Tourism

Organization. https://doi.org/10.18111/9789284419999 van Eck, N. J., & Waltman, L. (2010). Software survey: VOSviewer, a computer program for

bibliometric mapping. Scientometrics, 84(2), 523–538. https://doi.org/10.1007/s11192-009-

0146-3 van Eck, N. J., & Waltman, L. (2020). VOSviewer Manual.

https://www.vosviewer.com/download/f-33t2.pdf

Veríssimo, M., Moraes, M., Breda, Z., Guizi, A., & Costa, C. (2020). Overtourism and

tourismphobia: A systematic literature review. Tourism, 68(2), 156–169.

https://doi.org/10.37741/t.68.2.4

Visentin, F., & Bertocchi, D. (2019). Venice: An analysis of tourism excesses in an overtourism

icon. In C. Milano, J. M. Cheer, & M. Novelli (Eds.), Overtourism: Excesses, discontents and

measures in travel and tourism (pp. 18–38). CABI.

https://doi.org/10.1079/9781786399823.0018

Page 124 of 133

Vu, H. T., Guo, L., & McCombs, M. E. (2014). Exploring “the World Outside and the Pictures in

Our Heads”: A Network Agenda-Setting Study. Journalism & Mass Communication Quarterly,

91(4), 669–686. https://doi.org/10.1177/1077699014550090

Wagar, A. J. (1964). The Carrying Capacity of Wild Lands for Recreation. Forest Science, 10(2),

1–24. https://doi.org/10.1093/forestscience/10.s2.a0001

Wagar, A. J. (1974). Recreational Carrying Capacity Reconsidered. Journal of Forestry, 72(5),

274–278. https://doi.org/10.1093/jof/72.5.274

Wall, G. (2020). From carrying capacity to overtourism: a perspective article. Tourism Review,

75(1), 212–215. https://doi.org/10.1108/TR-08-2019-0356

Wattanacharoensil, W., & Weber, F. (2020). Preface to Journal of Travel and Tourism Marketing:

special issue on measuring and monitoring overtourism. Journal of Travel & Tourism Marketing,

37(8-9), 871–872. https://doi.org/10.1080/10548408.2020.1840839

Weaver, D. H. (2007). Thoughts on Agenda Setting, Framing, and Priming. Journal of

Communication, 57(1), 142–147. https://doi.org/10.1111/j.1460-2466.2006.00333.x

Weber, F., Eggli, F., Meier-Crameri, U., & Stettler, J. (Eds.). (2019). Measuring Overtourism:

Indicators for overtourism: Challenges and opportunities. Lucerne University of Applied

Sciences and Arts. https://doi.org/10.5281/zenodo.3759178

Wen, H., Park, E., Tao, C.‑W., Chae, B., Li, X., & Kwon, J. (2020). Exploring user-generated

content related to dining experiences of consumers with food allergies. International Journal

of Hospitality Management, 85, 102357. https://doi.org/10.1016/j.ijhm.2019.102357

Westcott, B., & Culver, D. (2020, April 7). Chinese tourist sites packed as country comes out of

lockdown, but experts say risk still high. CNN. https://edition.cnn.com/2020/04/06/asia/china-

coronavirus-tourist-warning-intl-hnk/index.html

Page 125 of 133

Wien Tourismus. (2016, December 16). Kettner zu Wiens Weltkulturerbe: Qualitäts- statt Epochen-

Diskussion notwendig [Press release]. https://b2b.wien.info/de/presse/unternehmens-presse-

info/2016/kettner-zu-weltkulturerbe-20161216

Wien Tourismus. (2020). Vienna residents’ attitude towards tourism 2019.

https://b2b.wien.info/media/files-b2b/vtb-tourism-attitudes-2019.pdf

Wijffels, J. (2020a). BTM: Biterm Topic Models for Short Text: R package version 0.3.4.

https://CRAN.R-project.org/package=BTM

Wijffels, J. (2020b). udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and

Dependency Parsing with the ‘UDPipe’ ‘NLP’ Toolkit: R package version 0.8.5.

https://CRAN.R-project.org/package=udpipe

Wijffels, J. (2020c). word2vec: Distributed Representations of Words: R package version 0.3.3.

https://CRAN.R-project.org/package=word2vec

Yan, X., Guo, J., Lan, Y., & Cheng, X. (2013). A biterm topic model for short texts. In D. Schwabe,

V. Almeida, H. Glaser, R. Baeza-Yates, & S. Moon (Eds.), Proceedings of the 22nd international

conference on World Wide Web (pp. 1445–1456). Association for Computing Machinery.

https://doi.org/10.1145/2488388.2488514

Yen, I.‑Y., & Kerstetter, D. (2009). Residents’ View of Expected Tourism Impact, Attitude, and

Behavioral Intention. Tourism Analysis, 13(5), 545–564.

https://doi.org/10.3727/108354208788160531

Ylä-Anttila, T., Eranti, V., & Kukkonen, A. (2018). Topic Modeling as a Method for Frame

Analysis: Data Mining the Climate Change Debate in India and the USA. SocArXiv Papers.

https://doi.org/10.31235/osf.io/dgc38

Page 126 of 133

Yu, J., Lu, Y., & Muñoz-Justicia, J. (2020). Analyzing Spanish News Frames on Twitter during

COVID-19 — A Network Study of El País and El Mundo. International Journal of

Environmental Research and Public Health, 17(15), 5414.

https://doi.org/10.3390/ijerph17155414

Zhang, C., Wang, S., Sun, S., & Wei, Y. (2020). Knowledge mapping of tourism demand

forecasting research. Tourism Management Perspectives, 35, 100715.

https://doi.org/10.1016/j.tmp.2020.100715

Žižka, J., Dařena, F., & Svoboda, A. (2019). Text Mining with Machine Learning: Principles and

Techniques. CRC Press. https://doi.org/10.1201/9780429469275

Page 127 of 133

8 Appendices

8.1 Appendix 1

Limited Use Prevalence of Loose Geo. Publication of Theory Quant. Methods Relevance

Deery et al., 2012 ✓

Nunkoo et al., 2013 ✓ ✓

Sharpley, 2014 ✓ ✓ ✓

Almeida García et al., 2015 ✓ ✓

Hadinejad et al., 2019 ✓ ✓ Table 8: Reviews about Residents Attitudes and Tourism Impacts, Aspects

8.2 Appendix 2

Figure 34: Potential Relationships between Crowding and Satisfaction (Wagar, 1964, p. 7)

Page 128 of 133

8.3 Appendix 3

Category Outlet Quality Level Articles

The Guardian Morning Quality 44 Guardian The Observer Sunday Quality 2

The Independent - 93 Independent i-Independent Print Morning Quality 20

Financial Times Financial Times Morning Quality 33

The Times Morning Quality 28

Times The Sunday Times Sunday Quality 33

thetimes.co.uk - 43

The Daily Telegraph Morning Quality 48

Telegraph The Sunday Telegraph Sunday Quality 17

telegraph.co.uk - 241 Table 9: Distribution of News Articles (n=602)

8.4 Appendix 4

Junk Words

Paper, Article, Study, Purpose, Design, Methodology, Approach, Findings, Research, Limitations, Implications, Originality, Value

Table 10: Junk Words

Page 129 of 133

8.5 Appendix 5

Dynamics Category Indicator Change 2019 2009

Arrivals 7.926.768 4.385.529 +81%

Overnights 17.604.573 17.604.573 +79% Magnitude Hotels 422 400 +6%

Beds 68.200 50.911 +34%

highest / lowest month 1,7 (1,9) 2,0 (2,2) - Seasonality months above average 8 (8) 6 (6) -

Tourism intensity 9,3 5,9 +58%

Intensity Share of budget travel 31,6% - -

Airbnbs per hotel 17,3 - -

Tourism density 42.434 23.725 +79% Density Airbnbs per km2 17,6 - -

World heritage sites 2 2 -

Others Airports 1 1 -

Cruise harbour 1 1 -

Table 11: Indicators for the City of Vienna, 2009-201950

50 Figures and calculations available upon request as separate Excel file.

Page 130 of 133

8.6 Appendix 6

Tourism Tourism Airbnbs Airbnbs District Density Intensity per km2 per Hotel (2019)51 (2019)52 (2020) (2019-2020) 1. Innere Stadt 1.087.441 191,3 134,9 4,9 2. Leopoldstadt 113.836 20,9 29,6 13,5 3. Landstraße 237.836 19,2 68,4 16,9 4. Wieden 442.878 23,6 149,3 13,9 5. Margareten 303.480 11,0 167,5 33,7 6. Mariahilf 498.779 22,8 181,4 13,2 7. Neubau 752.353 37,5 172,9 9,6 8. Josefstadt 487.177 20,9 133,0 6,3 9. Alsergrund 201.391 14,2 125,0 13,3 10. Favoriten 57.613 9,0 9,8 10,4 11. Simmering 13.010 2,9 2,8 8,3 12. Meidling 28.473 2,4 25,7 26,0 13. Hietzing 7.682 5,4 1,2 5,0 14. Penzing 14.255 5,2 3,4 10,4 15. Rudolfsheim-F. 249.114 12,6 103,4 22,5 16. Ottakring 26.838 2,2 26,7 33,1 17. Hernals 30.717 6,1 15,6 25,4 18. Währing 4.145 0,5 22,1 140,0 19. Döbling 8.248 2,8 3,8 11,9 20. Brigittenau 55.064 3,6 42,2 48,2 21. Floridsdorf 1.254 0,3 0,8 9,3 22. Donaustadt 6.842 3,7 1,2 7,4 23. Liesing 2.344 0,7 0,7 2,6 Table 12: Indicators on District Level

51 = Overnights (MA 23 (n.d.–c) / Area (MA 18 (n.d.) 52 = Overnights (MA 23 (n.d.–c) / Population (Statistik Austria and MA 23 (n.d.)

Page 131 of 133

8.7 Appendix 7

The initial dataset consisted of 13.157 observations. First, the dataset was restricted to listings of entire apartments (8.743). Second, it was limited to listings with at least one review in the last year (6.632). Third, and last, only listings with a precise location were kept (5.343). This last step was taken, as Airbnb data is not entirely accurate. Indeed, its anonymisation entails a degree of inaccuracy of up to 150 metres as well as the dispersal of rentals in the same edifice to the nearby environment (Inside Airbnb, n.d.). In sum, the dataset was reduced by 59%. The number of listings per district was calculated in R (R Core Team, 2020) and that per census district in QGIS (QGIS Development Team, 2020). These figures were then attached to the shapefile of the district (MA 41, n.d.) and that of the census district (MA 21, n.d.) boundaries, respectively. Finally, the calculation of density figures (raw rate) and the corresponding visualisations (box map) were performed directly in GeoDa (Anselin et al., 2006).

8.8 Appendix 8

Figure 35: Postcard with Sample Text

Page 132 of 133

Figure 36: Postcard without Sample Text

Page 133 of 133