Research Article

Communication & Sport 1-19 ª The Author(s) 2019 Anything Can Happen Article reuse guidelines: sagepub.com/journals-permissions in Women’s , DOI: 10.1177/2167479519890571 or Can It? An Empirical journals.sagepub.com/home/com Investigation Into Bias in Sports Journalism

Alice N. Tejkalova1 and Ladislav Kristoufek1

Abstract The claim that “anything is possible in women’s sports” frequently employed by both sports journalists and general audiences highlights the widespread perception of a seemingly uncontested truth about female athletes and their (in)ability to perform consistently at peak levels in comparison to male athletes. We focus on this treat- ment of female athletes in the world of women’s tennis and contest the “common sense” and “experience” justifications of the unpredictability in women’s sports with actual data to reveal clear media bias. Utilising a database of the Association of Tennis Professionals and Women’s Tennis Association tournaments dating back to the late 1960s and covering approximately 225,000 fully described matches, we examine the “anything can happen in women’s tennis” assumption through logistic regression, focusing on the effect of rank differential on the winning probability in the match while controlling for other factors (tournament type and stage, court surface, age differential, and elite players). The results are rather shocking. The women’s matches do not show higher instability or lower predictability at all, but rather the contrary—the men’s matches show lower dependence on the rank difference. The results are robust as checked for data sets of the year 2000 onwards and those including only special events such as Grand Slams.

1 Faculty of Social Sciences, Charles University, , Czech Republic

Corresponding Authors: Alice N. Tejkalova and Ladislav Kristoufek, Faculty of Social Sciences, Charles University, Smetanovo nabrezi 6, Prague 1, 110 01, Czech Republic. Emails: [email protected]; [email protected] 2 Communication & Sport XX(X)

Keywords tennis, gender bias, sports journalism, empirical analysis, logistic regression

It is indecent that spectators should be exposed to the risk of seeing the body of a woman being smashed before their eyes. Besides, no matter how toughened a sports- woman may be, her organism is not cut out to sustain certain shocks. Her nerves rule her muscles, nature wanted it that way. de Coubertin in Fuller (2008, p. 5)

The above view of the female athlete’s body as inherently offensive and biologically doomed by founder of the modern Olympic Games Baron Pierre de Coubertin may be more than 100 years old but still resonates in the minds of many sports journalists and experts. Even though women now participate in the vast majority of Olympic and other sports, and an abundance of analytic literature and recommendations how to treat women’s sports and female athletes exist, women’s athletic and competitive ability continues to be underestimated by both men and women alike. The assump- tion that gender overarchingly determines the degree of an athlete’s physical per- formance and psychological stability is still very much present. The pejorative phrase “anything is possible in women’s sports” and its variations, used by sports journalists as well as general audiences, reflects a widespread percep- tion of a seemingly uncontested truth about female athletes and their (in)ability to perform consistently at peak levels in comparison to male athletes. This unequal treatment is often supported with appeals to “common sense” or references to past competitions of which the author or speaker has had personal experience, as Bruce (2012) describes:

Sports journalists’ decisions appear to be based on historical precedent and tradition (thus privileging established male sports), anecdotal evidence and intuition (based on their experiences and ideological beliefs rather than critical public feedback or research) and their belief that a predominantly male audience is not interested in women’s sport for its own sake. (pp. 130–131)

One such example of media bias against femaleathletesisfoundintheworldof women’s tennis where the “anything can happen” attitude has a particular traction. “The unpredictability of ladies with a racket is a well-known thing. Women have much bigger swings in their performance compared to men. It is not a reproach. It is simply the truth. It has been and it will always be,” states the author of “Pl´ıˇskov´a and her first title: her way again showed how ‘crazy’ women’s tennis is” ((mzk), 2019) on the Czech tennis player Karol´ına Pl´ıˇskov´a. A similar state- ment, “women’s tennis sometimes lacks logic,” is found in an article about another top Czech player, Petra Kvitov´a, in which the author asserts: Tejkalova and Kristoufek 3

The duel of Petra Kvitov´a and Julia Putinceva from Kazakhstan will be written in history as one of the most unbelievable duels and as the definition of women’s tennis. You can win any match but you can also lose it. (ram TN.cz, 2018)

Such claims are certainly not local excesses connected only to Czech tennis, but they can be found in global outlets as well. The general view of women’s tennis as unstable is also expressed in social media discussions, for example, on Reddit in a post titled “Why is women’s tennis so unpredictable?”:

I usually just watch Wimbledon and maybe 1 or 2 other events a year but find the men’s game really easy to follow because it’s pretty much the same guys on top but with women’s tennis I always feel lost like so much has changed in the last few months. (EdwardBigby, 2017)

Similarly, individual evidence gathered for an opinion article describing the instability of women’s tennis results was published in The Economic Times (Kalra, 2018) or when Perrota (2017) argued for the Los Angeles Times why with Serena Williams pregnant and Maria Sharapova injured: “The scrappy group of women who remain is an enthralling reminder that sometimes, when everything seems to go wrong, the tennis season becomes wild unpredictable and oh-so-fun”. However, this approach is not new, as the years in which the quoted statements were published could suggest. Tandon (2010) discussed on ESPN.com “the WTA unpredictability” and related it to various factors, including inconsistency in performance and lack of the players’ resistance to stress. “Anything is pos- sible in women’s tennis,” said the coach of the Czech women’s national team, Jan Kukal, before the match of his team with in 2013 (Plaˇsil & Marek, 2013). In some cases, female players themselves adopt the “common shared knowledge” stance on the unpredictable nature of their sport and perpetuate the belief in their statements, like Karol´ına Pl´ıˇskov´a did in this case, “I know a lot of people who are more likely to watch women’s tennis because there can happen more surprises. Men’s matches are more stereotypical” (Nˇemy´, 2018). Anderson (2008) described it as the acceptance of hegemonic masculinity’s (Messner, 1988) perspective. As one of the most popular and most bet on sports around the world, tennis has especially detailed data available for essentially all matches played at the profes- sional level decades into the past. We thus aim to test the common sense and “experience” justifications for the claim that female tennis players are less predict- able and less results-stable than their male counterparts. To do so, we utilise a large database going back to the late 1960s and listing approximately 225,000 matches together for Association of Tennis Professionals (ATP) and Women’s Tennis Asso- ciation (WTA) tournaments. We test the hypothesis of the “anything can happen” assumption utilising logistic regression focusing on the effect of the rank differential on the final outcome of the match. We show that no “anything can happen” effect in 4 Communication & Sport XX(X) women’s tennis exists compared to men’s tennis, but rather, to the exact contrary— the results of men’s tennis turn out to be less predictable than the women’s with respect to the rank differential of players. The same results are shown for both data sets starting from 2000 and 1968, as well as those of only special events such as tournaments or the Olympics. These empirically robust results thus cast important doubt on the accuracy of perceptions of the unpredictability of women’s tennis in comparison to the men’s game.

Literature Review Women’s Sport in Mediasport Sports and mediasport (Wenner, 1998) are widely believed to be very gendered institutions and spaces where men’s hegemony and dominance have been reinforced by many strategies, for example, strengthening the position of male athletes or “normality” of men’s sport and the “otherness” of women’s sport, and marginalising the achievements of female athletes being among the most frequent ones (Anderson, 2008; Bernstein, 2002; Bissel, 2006; Messner, 1988; Vincent, 2004; Yip, 2018). We can find multiple content analyses focusing on the comparison of women’s and men’s sports coverage in all types of media, including the newest online media outlets, presenting very similar results: Women’s sports are underrepresented and at the same time the coverage of female athletes centres more often on the soft news features of their performance (e.g., clothes, visual impression, sexualization or infantilization of the athlete’s image as well as details about personal lives) over actual athletic competence (Bernstein, 2002; Domeneghetti, 2018; Kian, Bernstein, & McGuire, 2013; Kian, Fink, & Hardin, 2011; Konjer, Mutz, & Meier, 2019; Vincent, 2004; Vincent, Pedersen, Whisenant, & Massey, 2007). However, some authors have suggested the emergence of a new, “neutral” treat- ment of women’s sports by the media focused on plain information about events. This “boring” coverage may be the result of journalists’ fears of being accused of sexism (Cooky, Messner, & Musto, 2015). Suggestions of how to better cover women’s sports include hiring and retaining journalists without prejudices, chiefly with the aim to refer to female athletes in an attractive but not preoccupied way (Cooky et al., 2015). This can be also reflected in what Bruce (2016) called the discourse of the pretty and powerful, meaning that athletes’ femininity is pronounced in the media coverage but within athletes’ rules and under their control. That is something made possible by the extension of the Internet and social media, through which the athletes can directly spread the narra- tives and pictures they like. And last, but not least, recommendations also suggest discovering whether the ubiquitously invoked anecdotal evidence and common sense justification of the variability of women’s athletic performance have any factual basis, as we do in this article, and if not, then reflecting it in its media coverage. Tejkalova and Kristoufek 5

Media Coverage of Female Tennis Players At the 1900 Olympic Games in Paris, Charlotte Cooper became the first ever female Olympic champion (Fuller, 2018), with lawn tennis, later tennis, being one of the few “acceptable” sports for women. Although tennis holds a special position with both the women’s and men’s games enjoying similar amounts of media coverage, especially as the biggest events share the same venues (Bissell, 2006; Bruce, 2012; Domeneghetti, 2018; Fuller, 2018; Kian et al., 2013; Kian & Clavio, 2011; Kian, Fink, & Hardin, 2011), the balanced quantity of media coverage does not translate into equal quality (Bissell, 2006; Quayle et al., 2019; Vincent et al., 2007; Yip, 2018). Women’s tennis remains “the other,” while “the standard” of tennis is the men’s game (Messner, Duncan, & Jensen, 1993; Vincent, 2004). Quotations from the content analyses of women’s tennis media coverage from the beginning of the 21st century document explicit objectification of female players’ bodies and a sexualization one can hardly believe made it to publication, such as this description from a Daily Mail article, analysed by Vincent (2004):

At the end of the spectrum, there’s Scary Spice in the person of Venus Williams, 6ft 2in of sinuous black muscle. At the other end, there’s Baby Spice in the tasty shape of Anna Kournikova, long blonde tresses plaited into a pigtail, breasts trussed up in the designer sports bra she richly endorsed. Between these extremes there sprawls a sultan’s seraglio of other vixens, all tanned and trained to appeal the most jaded appetites. (p. 443)

More than 15 years later, Quayle et al. (2019) speaks not only about typical media stereotyping of female tennis players through fixation on physical appearance and dependence on traditionally “feminine” descriptors to evaluate performance. They also acknowledge the increase in media stereotyping by omission, whereby a com- plete lack of evaluative description of female athletic effort not seen in the male domain perpetuates antiquated notions of gender difference (Quayle et al., 2019). Omission may also be caused by journalists’ self-censorship embedded in the uncer- tainty of what is and is not acceptable to be said about women players. Regardless, following the tradition of de Coubertin’s view previously referenced here, female tennis players are regularly labelled with negative descriptors regarding their mental stability and force. While male tennis players are acknowledged as tough warriors, fighting until the very end, women are “more likely to be framed as failures due to some combination of nervousness, lack of confidence, lack of being ‘comfortable’, lack of aggression, and lack of stamina” (Messner et al., 1993, p. 130). “[M]entality is constructed as a weakness in the women’s matches,” adds Quayle et al. (2019, p. 15). These findings resonate with the results of analyses by Bissell (2006), Vincent (2004), and Yip (2018) that also found female players to be represented as more psychologically fragile and less mentally resilient. In the past 15 years, this style of reporting can be partially influenced by the strongly mediatized rivalries between the two male stars Roger Federer and Rafael 6 Communication & Sport XX(X)

Nadal. Their matches have been almost worshipped by both the fans and media, a tendency even extending into scholarly articles, for example, “Why Roger Federer is a GOAT: An account of sporting genius” (Higgins, 2018). There was no narrative like that about women’s tennis in the same time.

Studies Analysing the Differences in Female and Male Tennis Players’ Performance Our article is not the first to compare the media coverage of men’s and women’s tennis. However, to the best of our knowledge, it is the first to use sports data analysis only, as opposed to any kind of media analysis or interviews with media professionals, to examine one of the most widespread media narratives perpetuating the inferiority of female tennis players. Several studies from the field of psychological, behavioural, or economic sciences (De Paola & Scoppa, 2017; Jetter & Walker, 2015; Magnus & Klaasen, 2010; Wozniak, 2012) analysed tennis performance data in relation to gender dif- ferences but on smaller samples (in all cases except for Jetter and Walker, 2015, even significantly smaller and limited data sets). While analysing the results of the final sets of Wimbledon matches from 1992 to 1995, Magnus and Klaasen (2010) found a significant difference between women’s and men’s competitions, stating that “men are closely equal in quality” (p. 467) meaning that there is a higher probability of an upset in the result of the match (when presupposing the higher ranked player should win) in men’s competition. This finding is in strict opposition to anecdotal and common sense evidence presented in the media by journalists, coaches, and players. Wozniak (2012) and Jetter and Walker (2015) focused on the “hot-hand effect,” analysing any differences between women and men in influences of the previous matches on their competitive performance. Wozniak (2012) revealed gender differ- ences and similarities: “[B]oth men and women are more likely to enter tournaments after doing well, but women are affected by their last tournament while men’s previous performance can affect entry beyond the next tournament” (p. 158). The higher ability women also react positively to the results from their last tournament while men do not (Wozniak, 2012). Jetter and Walker (2015) found, in their exam- ination of the hot-hand effect, no gender differences at all, then went further to analyse the “clutch-player effect” and found that “top-ranked female players are at least as likely as males to perform well in important matches.” (p. 107) Contrary to those results, the most recent study of De Paola and Scoppa (2017) had the following findings:

[W]omen losing the first set are much likely to play poorly in the second set. This suggests that women are more discouraged when facing the pressure of falling behind and receiving negative feedback. The gender differential is stronger in high-stake matches. On the other hand, when players are tied in the third set we do not find any Tejkalova and Kristoufek 7

gender differences in players’ reactions suggesting that women are as able as men to handle pressure if they do not lag behind. (p. 444)

The deviation of De Paola and Scoppa (2017) from the other reported findings may be explained by their analysis of a limited sample, using best-of-three matches “from all the tournaments organized by the Association of Tennis Professionals (ATP) and by Women’s Tennis Association” (p. 446) from 2007 to 2014; thus their results should be considered within the particular parameters of the analysis. The empirical evidence of the “anything can happen” in women’s tennis is thus not as clear-cut as it might seem from the practices of some sport commentators. However, most of the reviewed studies limit their sample with respect to some specific features that could influence the results. To overcome such an issue and to provide a robust evidence in favour or against the claimed effect, we present a detailed analysis of the possible effect gender plays in the winning probabilities of tennis matches based on a large data set of almost 225,000 matches dating back to the late 1960s with no additional ad hoc restrictions on the sample.

Methods and Data Set There are multiple ways to define the “anything can happen” assumption, which is well illustrated in the previous section. In our analysis, we study the probability that the better player, that is, the player with the better rank at the start of the match wins the match and how it is affected by other factors. The rank differential between the higher ranked and the lower ranked player is our key variable of interest as it is expected that the greater the rank difference between the players, the greater the likelihood of the higher ranked player actually winning is as well. To do so, we utilise the broad data set of Jeff Sackmann (https://github.com/JeffSackmann), spe- cifically the ATP and WTA databases. Jeff Sackmann is the founder of TennisAb- stract that collects detailed data about tennis results and statistics and makes them freely available also for research purposes. In addition to the historical databases we study, the provided data cover point-by-point statistics of professional games for a more recent period after 2011. The data sets we examine go back to 1968, and we cover the period up to April 2018. Each record contains information about surface, draw size, tournament level, tournament date, match number, winner/loser seed, winner/loser name, winner/loser hand, winner/loser rank, winner/loser age, match score, tournament round, and a few more identification variables. After removing the observations with missing data on the variables of interest, the core sample contains 224,890 observations (matches ranging from futures for men and WTA international for women up to the Grand Slams and the Olympics), which is unpre- cedented in the topical literature. To find whether there is a distinction between the role of rank difference on the final probability of winning the match for men and women tennis players, we construct a logistic regression model with an explained variable being the better 8 Communication & Sport XX(X) ranked player winning the match. As explanatory variables, we include the rank difference as the main variable of interest and then we also include other control explanatory variables that are expected to play a role in the likelihood of winning a match, specifically the age difference between players, the tournament level, sur- face, round of the tournament, and whether at least one of the players is ranked in the Top 5, Top 10, or Top 100 of the respective ranking. The selection of control variables is not random and is based on prevailing expectations about tennis match dynamics and data availability. Specifically, one would expect that the better players have better focus during high-level tournaments such as the Grand Slams, the Olympics, the Masters, and similar, so that the probability of the better player winning the match would be on average higher for such types of tournaments. A similar interaction is expected for the phase of the tournament as the better players may be eliminated during the early rounds of the tournaments before they get into the rhythm while the worse players may be more motivated and surprise the field. Tennis court surfaces might play a role as well as one might speculate that for faster surfaces, there is a higher chance of worse players overperforming, or in other words, the variance of results might be higher. For very specialised court surfaces, specifically clay, one would assume there are players focusing on the specific type of court so this would influence the probability of winning as well. Probability of the better ranked player winning p is then modelled using the binary logistic regression as p log ¼ b þ b Age þ b Age2 þ b Rank þ b Rank2 þ b Special þ Z Top5 1 p 0 1 2 3 4 5 1 X5 X10 þ Z2 Top10 þ Z3 Top100 þ gi Surfacei þ dj Roundj; i¼1 j¼1 ð1Þ where Age represents the age difference between the higher ranked player and the lower ranked player, Rank is the rank difference between the two, Special represents a dummy variable that is equal to one if the tournament is of a special category (Grand Slam, Olympics, Fed Cup, , and variations of Players’ Champion- ships) and zero otherwise, Surface represents the set of court types (Carpet, Clay, Grass, Hard, and not specified or others), Round is a set of different round levels, and Top5, Top10, and Top100 are dummy variables equal to one if at least one of the players is ranked up to Rank 5, Rank 10, and Rank 100, respectively, and 0 other- wise. Variables Age and Rank are also included in their squared form as it is expected that the effect of age difference and rank difference will be diminishing. The model can be further expanded for detection of differences between men and women. Specifically, we are interested in the effect of the rank differential on the probability of winning the match. If the rank differential is more important for predicting this probability for men than for women, we consider men’s matches to be more stable and more predictable, that is, the “anything can happen” effect for Tejkalova and Kristoufek 9 women would be reflected in the rank differential playing a weaker role in the model for women. We thus extend the model in the following way: p log ¼ b þ y W þ b Age þ y AgeW þ b Age2 þ y Age2W þ b Rank 1 p 0 0 1 1 2 2 3 2 2 þ y3 RankW þ b4 Rank þ y4 Rank W þ b5 Special þ y5 Special

þ Z1 Top5 þ Z2 Top10 þ Z3 Top100 þ o1 Top5W þ o2 Top10W X5 X10 þ o3 Top100W þ gi Surfacei þ dj Roundj: i¼1 j¼1 ð2Þ The model thus adds partial effects (represented by the dummy variable W that is equal to one if we consider the women tennis match and zero otherwise) for the age differential, rank differential, special events, and the highly ranked players. Our main hypothesis that the men’s tennis matches are more stable and predictable with respect to the players’ quality than women’s tennis matches is represented by para- meters y3 and y4. If the “anything can happen in women’s tennis” effect is valid, the y3 parameter will be negative, that is, our null hypothesis is that y3 < 0 against the alternative that y3 0, which would suggest that the results in women’s tennis are in fact more predictable with respect to the rank differential than in men’s tennis. Value of the y4 parameter specifies the shape of the partial effect, and it will be discussed in more detail with the other results if found significant.

Results Before jumping to the estimates of the model of interest, we investigate the basic statistics of the analysed data set with respect to the men–women separation and focusing on the quantitative variables. In Table 1, we can see that the unconditional probability of the higher ranked player winning is very close for men and women, specifically around 66% and 67% for men and women, respectively, that is, slightly higher for women. Thus, without knowing any further details about the match, the probability of the better ranked player to win the match is around 67%. However, the standard deviation of 47% suggests high uncertainty of this average winning prob- ability and thus calls for further inspection. For the explanatory variables, we inspect the age difference and rank difference in more detail. The average age difference is rather small for both men and women (less than 1 year) but with a very high standard deviation above 5, which suggests potentially high information value for our model. The range of the age difference is rather wide, and we observe that the winner could be up 25 years or 28 years younger for men and women, respectively, or up to 26 or 27 years older for men and women, respectively. For the rank difference, we find more stable statistics with an average rank difference of 87 and 70 for men and women, respectively, and comparable 10 Communication & Sport XX(X)

Table 1. Descriptive Statistics of the Quantitative Variables. Variable Statistics All Men Women Better ranked wins Average 0.67 0.66 0.67 SD 0.47 0.47 0.47 Age difference Average 0.40 0.29 0.58 Median 0.41 0.28 0.63 SD 5.24 5.19 5.31 Minimum 28.28 25.06 28.28 Maximum 26.97 26.15 26.97 Rank difference Average 80.18 86.66 69.77 Median 46.00 48.00 42.00 SD 116.30 127.59 94.88 Minimum 1 1 1 Maximum 2,125 2,125 1,428 Observations 224,890 138,135 86,755

Figure 1. Quantiles of rank differences. standard deviations. The much lower median values suggest that there are several extreme observations that drag the average levels higher. The rank differences vary from 1 to 2,125 for men and from 1 to 1,428 for women. Overall, we have 224,890 recorded matches, split between 138,135 for men and 86,755 for women. The data set is thus quite well balanced. Even though the range of rank differences is found to be rather wide, a more detailed inspection of the rank differences quantiles (Figure 1) uncovers that around 80% of matches in the sample are between players who are not further than 100 ranks from each other. Going further, around 99% of the recorded matches show the Tejkalova and Kristoufek 11

Table 2. Model Estimates for the Whole Sample. Variable Estimate p Value Estimate p Value Intercept 6.906000 .7956 6.840000 .7975 Age 0.021370 <.0001 0.022780 <.0001 Age2 0.000029 .7962 0.000042 .7766 Rank 0.005169 <.0001 0.004635 <.0001 Rank2 0.000003 <.0001 0.000003 <.0001 Special 0.232200 <.0001 0.225300 <.0001 Surface 6.550600 <.0001 6.595200 <.0001 Round 11.798000 <.0001 11.997000 <.0001 Top5 0.444200 <.0001 0.413600 <.0001 Top10 0.573200 <.0001 0.548500 <.0001 Top100 0.455300 <.0001 0.467400 <.0001 W_Intercept 0.009195 .7717 W_Age 0.002521 .1680 W_Age2 0.000221 .3400 W_Rank 0.002312 <.0001 W_Rank2 0.000003 <.0001 W_Special 0.003835 .8693 W_Top5 0.079740 .1132 W_Top10 0.074720 .0337 W_Top100 0.021010 .4784 AIC 275,624 275,376 Null deviance 311,764 311,764 Residual deviance 275,578 275,578 Pseudo R2 .1161 .1161 Observations 224,890 224,890

Note. Estimated effects are shown up to the sixth decimal for better orientation, p values are shown up to the fourth decimal. Estimates for the rank differential are shown in bold. For model quality, Akaike information criterion (AIC), null, and residual deviances are shown as well as the respective pseudo R2. The model according to Equation 1 is shown on the left, and the model according to Equation 2 is shown on the right. For qualitative variables (surface and round), the F statistics are shown instead of the estimates. rank difference below 500. This is true for both men’s and women’s matches, and it will be taken into consideration for the final model interpretation and presentation. Estimated models for the whole sample are shown in Table 2. We find that most of the selected variables are statistically significant. In fact, for the model according to Equation 1, the left part of Table 2, only the squared effect of age is found insignificant; other variables are highly significant and the signs are of the expected value. The effect of age is negative which suggests that the older the player is compared to the opponent, the lower the probability of winning is. The insignif- icance of the squared form suggests that the effect is linear. A strong and statistically significant effect is found for the Special tournaments, which confirms the expec- tations that the better players focus more on the more important events. The effects 12 Communication & Sport XX(X)

Figure 2. Observed and model-implied probabilities of the higher ranked player winning. of the highest ranked players are also found to be statistically significant and strong. Surface and the stage of the tournament (Round) also play a significant role but we do not delve into details as these are not essential for our main hypothesis. Expanding the model to distinguish between the effects for men and women, we arrive at the model according to Equation 2 summarised in the right part of Table 2. There, we find that the “anything can happen in women’s tennis” hypothesis is strongly rejected, and in fact, the opposite result prevails—the effect of the rank differential is much higher for women compared to man, approximately by 50%. The other effects do not seem to be gender-dependent. Interestingly, the squared effects for the rank differential are negative and significant both for men and women which suggests concave partial effects, that is, the difference between having a rank Tejkalova and Kristoufek 13

Table 3. Model Estimates for the Sample From Year 2000 Onward. Variable Estimate p Value Estimate p Value Intercept 0.395600 .0674 0.418000 .0541 Age 0.015840 <.0001 0.017680 <.0001 Age2 0.000387 .0187 0.000379 .1301 Rank 0.005037 <.0001 0.004510 <.0001 Rank2 0.000003 <.0001 0.000002 <.0001 Special 0.299800 <.0001 0.342800 <.0001 Surface 6.644700 <.0001 6.172500 <.0001 Round 9.142400 <.0001 8.941000 <.0001 Top5 0.375100 <.0001 0.394200 <.0001 Top10 0.545400 <.0001 0.573800 <.0001 Top100 0.434600 <.0001 0.478100 <.0001 W_Intercept 0.043940 .3741 W_Age 0.003145 .2151 W_Age2 0.000018 .9575 W_Rank 0.001807 <.0001 W_Rank2 0.000003 <.0001 W_Special 0.088140 .0056 W_Top5 0.038770 .5775 W_Top10 0.054040 .2653 W_Top100 0.071010 .1229 AIC 133,005 132,947 Null deviance 148,900 148,900 Residual deviance 132,961 132,885 Pseudo R2 .1070 .1076 Observations 107,409 107,409

Note. Estimated effects are shown up to the sixth decimal for better orientation, p values are shown up to the fourth decimal. Estimates for the rank differential are shown in bold. For model quality, Akaike information criterion (AIC), null, and residual deviances are shown as well as the respective pseudo R2. The model according to Equation 1 is shown on the left, and the model according to Equation 2 on the right. For qualitative variables (surface and round), the F statistics are shown instead of the estimates. differential of 1 and 10 is higher than having a rank differential of 91 and 100. The negative estimate of y4 shows that this slowing down is faster for women. Such shape generally implies that at some point, in this case for some rank differential and higher, the partial effect declines. To put these results into a better perspective, we present Figure 2 that shows conditional probabilities of winning observed from the data (top) and probabilities implied by the model in Table 2 (bottom). As expected, the observed probabilities are much noisier than the ones implied by the model but most important patterns are evident. As discussed above, 99% of the matches are between players with a rank differential below 500 and we thus show only such matches in the figure. Both charts show that the probabilities for the women’s subsample are higher than the ones for men. For the observed conditional probabilities, this is transparent for rank 14 Communication & Sport XX(X)

Table 4. Model Estimates for the Sample From Year 2000 Onwards and Only for Special Events. Variable Estimate p Value Estimate p Value Intercept 0.910700 .0003 0.859700 .0006 Age 0.014040 <.0001 0.014890 .0001 Age2 0.000080 .8146 0.000106 .8427 Rank 0.005368 <.0001 0.004834 <.0001 Rank2 0.000003 <.0001 0.000002 <.0001 Surface 2.765400 .0168 2.656600 .0209 Round 17.955000 <.0001 16.901000 <.0001 Top5 0.565800 <.0001 0.656400 <.0001 Top10 0.678900 <.0001 0.670800 <.0001 Top100 0.623600 <.0001 0.665800 <.0001 W_Intercept 0.039360 .6667 W_Age 0.001171 .8262 W_Age2 0.000064 .9272 W_Rank 0.002123 <.0001 W_Rank2 0.000003 <.0001 W_Top5 0.173800 .1983 W_Top10 0.012540 .9001 W_Top100 0.099750 .2324 AIC 31,233 31,220 Null deviance 38,104 38,104 Residual deviance 31,191 31,162 Pseudo R2 .1814 .1822 Observations 27,486 27,486

Note. Estimated effects are shown up to the sixth decimal for better orientation, p values are shown up to the fourth decimal. Estimates for the rank differential are shown in bold. For model quality, Akaike information criterion (AIC), null and residual deviances are shown as well as the respective pseudo R2. The model according to Equation 1 is shown on the left, and the model according to Equation 2 is shown on the right. For qualitative variables (surface and round), the F statistics are shown instead of the estimates. differentials up to 100, then the probabilities become too noisy. For the model- implied probabilities, the pattern is apparent. The negative estimate of y4 is reflected by the curves getting closer with an increasing rank difference. For much higher rank differentials, the expected probability for women would fall below such probability for men. However, this would represent only a tiny portion of the data set (less than 0.1% of all matches). The model thus shows strong evidence for the non-existence of the “anything can happen in women’s tennis,” but rather quite the contrary. To make sure that the results are not driven by the sample reaching back to the late 1960s, we have performed the same analysis for two additional samples—one starting from the year 2000 and one starting from the year 2000 and covering only Special- level tournaments. The former is summarised in Table 3 and the latter in Table 4. The results confirm the findings for the whole sample analysis. For the 2000 onwards sample, we find that the effect of the rank differential is slightly lowered for Tejkalova and Kristoufek 15 women players as the additional effect is around 31% higher than for men, compared to the 50% comparison for the whole sample. When the 2000 onwards sample is limited to special events tournaments only, the additional effect increases to around 44%. The effect is thus more pronounced for the more important tournaments in favour of women in the shorter data set. The other effects captured by the model remain in the same logic as for the original model. Overall, we have found strong evidence for the non-existence of the “anything can happen in women’s tennis” effect. On the contrary, we have uncovered that the results in the ATP (men) are less predictable with respect to the rank difference between the players compared to the WTA (women) matches. The results are robust across different samples.

Discussion and Conclusions When comparing the literature on women’s tennis media coverage with the results of our analysis, one must wonder at the stark contrast between the empirical evidence and the interpretation of anecdotal evidence and individual experience. The ideolo- gical perspective of hegemonic masculinity (Anderson, 2008; Messner, 1988; Mess- ner et al., 1993) in both sport and sports journalism has succeeded in asserting a narrative of difference and unpredictability within women’s tennis, based in gen- dered notions of female psychological instability and physical inferiority. The results of our analysis show that in a very broad sample of almost 225,000 matches between 1968 and 2018, there is no support to be found for this narrative. In testing the narrative hypothesis of an existent or non-existent “anything can happen” effect, the transformation of the research question into an empirically testable hypothesis is essential. Here, we treat the “anything can happen” effect as the dependence or likelihood of the better ranked player to win the match with respect to the rank difference between the players, or more specifically, how the strength of this dependence differs for men and women tennis players. This reflects the prevalent attitude present in the media coverage and fan discussions, as the incapacity of an individual with a higher ranking to win the match is often inter- preted differently, depending on the player’s gender. In addition to the presented literature review focusing on media coverage and social media discussions, we perform two robustness checks to control for two phenomena—the prestigious tour- naments effect and the recent years effect. The former is applied to check the validity of the common perception that women tennis players break down more easily and frequently (compared to men) during the most important tournaments of the seasons. And the latter control is to see whether the specific situation of the tennis ladders after the year 2000, when both the ATP and WTA rankings were dominated by a small group of players (predominantly the Big Three of Federer, Nadal, and Djokovic for men and Serena Williams for women), might help to explain the differences between men’s and women’s match success rates with respect to their rankings. The period after 2000 also well represents the period when most sports 16 Communication & Sport XX(X) commentators actually remember the match histories and can base their commentary on personal experience and memory. However, both control samples confirm the results from the original large sample—There is no actual empirically valid “anything can happen” effect for women present. If anything, the men’s matches are less predictable than the ones of women. The results for the data set of special events correlate with the above-mentioned findings of Jetter and Walker (2015, p. 97) that top female players appear more likely to compete well in high stakes tournaments, even though Jetter and Walker (2015) considered only the matches from the Grand Slams while we have included the Olympic Games as well. The unpredictability of professional tennis, and any sports discipline for that matter, is a part of its attractiveness, and our analysis shows that a lot of uncertainty in both men’s and women’s tennis still exists that cannot be predicted by statistical models. However, there is a sharp distinction between the narrative of such uncer- tainty and unpredictability in men’s and women’s divisions. In the cases of surpris- ing results in men’s sports, journalists, fans, and some researchers generally talk about the “clashes of titans,” “more competitiveness,” and “more close equality in quality”; while in women’s sports, the unpredictability is more often attributed to female emotional instability and used as argument for the lower maturity and there- fore the lower status of women’s sport (Domeneghetti, 2018; Kian & Clavio, 2011; Magnus & Klaasen, 2010; Quayle et al., 2019). This can be also illustrated by the results of our analysis that, from a certain perspective, allow for interpreting the lower predictability in men’s tennis not as a sign of the “anything can happen” effect in men but as a sign of their “equality in quality.” However, such different perspec- tives and treatments pose a problem. What is perceived as the advantage in one case is dismissed as the weakness in the other and the pattern of assessment has remained unchanged. It is certainly a hard job to be a sports journalist and especially a commentator. One has to be quick-witted, always ready to say something, ideally interesting and meaningful at the same time. There is the necessity to distinguish the specifics of a single match or performance as well as to place it in its proper context, to be aware of unique aspects of a game or athlete as well as overall, general conditions which shape our appreciation and meaning of what we view. The media coverage of female athletes continues to be laden with sexism and double standards and when journalists struggle to avoid it, they often end up with another extreme—a boringly neutral, inactive, and uninteresting presentation in the case of women’s sports, as Bruce (2012), Cooky, Messner, and Musto (2015), and Quayle et al. (2019) pointed out. Our results lend strong credence to the conclusions of Bruce (2012, p. 131) that “rather than being gender neutral, many journalists are instead ‘gender blind(ed)’” and perpetuate gender stereotypes of female athletes as inferior, subject to emotion and physical weakness, when no such empirical evidence exists. Living in the era of a continuous (big) data collection in many areas of personal lives and society in general, one would hope that the sports coverage will utilise such data availability to present sportsmen and sportswomen without preconceptions. Tejkalova and Kristoufek 17

Declaration of Conflicting Interests The author(s) declared no potential conflicts of interest with respect to the research, author- ship, and/or publication of this article.

Funding The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was supported by the Faculty of Social Sciences, Charles University (PROGRES Q19).

ORCID iD Alice N. Tejkalova https://orcid.org/0000-0002-9150-6685

References Anderson, E. (2008). “I used to think women were weak”: Orthodox masculinity, gender segregation, and sport. Sociological Forum, 23, 257–280. Bernstein, A. (2002). Is it time for a victory lap? Changes in the media coverage of women in sport. International Review for the Sociology of Sport, 37, 415–428. Bissell, K. L. (2006). Game face: Sports reporters’ use of sexualized language in coverage of women’s professional tennis. In L. K. Fuller (Ed.), Sport, rhetoric and gender (1st ed., pp. 171–184). New York, NY: Palgrave Macmillan. Bruce, T. (2012). Reflections on communication and sport: On women and femininities. Communication & Sport, 1, 125–137. Bruce, T. (2016). New rules for new times: Sportswomen and media representation in the third wave. Sex Roles: A Journal of Research, 74, 361–376. Cooky, C., Messner, M. A., & Musto, M. (2015). “It’s dude time!” A quarter century of excluding women’s sports in televised news and highlight shows. Communication & Sport, 3, 261–287. De Paola, M., & Scoppa, V. (2017). Gender differences in reaction to psychological pressure: Evidence from tennis players. European Journal of Work and Organizational Psychology, 26, 444–456. Domeneghetti, R. (2018). “The other side of the net”: (Re)presentations of (emphasized) femininity during Wimbledon 2016. Journal of Policy Research in Tourism, Leisure and Events, 10, 151–163. EdwardBigby. (2017). Why is women’s tennis so unpredictable? Reddit. Retrieved from https://www.reddit.com/r/tennis/comments/6dzjru/why_is_womens_tennis_so_ unpredictable/ Fuller, L. K. (2018). Female Olympian and Paralympian events. Cham, Switzerland: Palgrave Macmillan. Higgins, J. (2018). Why Roger Federer is a GOAT: An account of sporting genius. Journal of the Philosophy of Sport, 45, 296–317. Jetter, M., & Walker, J. K. (2015). Game, set, and match: Do women and men perform differently in competitive situations? Journal of Economic Behavior and Organization, 119, 96–108. 18 Communication & Sport XX(X)

Kalra, G. (2018, January 24). Why women’s tennis is unpredictable. The Economic Times. Retrieved from https://economictimes.indiatimes.com/blogs/chip-and-charge/why- womens-tennis-is-unpredictable/ Kian, E. M., Bernstein, A., & McGuire, J. S. (2013). A major boost for gender equality or more of the same? The television coverage of female athletes at the 2012 London Olympic Games. The Journal of Popular Television, 1, 143–149. Kian, E. M., & Clavio, G. (2011). A comparison of online media and traditional newspaper coverage of the men’s and women’s U.S. Open tennis tournaments. Journal of Sports Media, 6, 56–84. Kian, E. M., Fink, J. S., & Hardin, M. (2011). Examining the impact of journalists’ gender in online and newspaper tennis articles. Women in Sport and Physical Activity Journal, 20, 3–21. Konjer, M., Mutz, M., & Meier, H. E. (2019). Talent alone does not suffice: Erotic capital, media visibility and global popularity among professional male and female tennis players. Journal of Gender Studies, 28, 3–17. Magnus, J. R., & Klaasen, F. J. G. M. (2010). The final set in a tennis match: Four years at Wimbledon. Journal of Applied Statistics, 26, 461–468. Messner, M. A. (1988). Sports and male domination: The female athlete as contested ideo- logical terrain. Sociology of Sport Journal, 5, 197–211. Messner, M. A., Duncan, M. C., & Jensen, K. (1993). Separating men from girls: The gendered language of televised sports. Gender & Society, 7, 121–137. (mzk). (2019, January 19). Pl´ıˇskov´a a jej´ı prvn´ı titul: Jej´ı cesta zase uk´azala, jak je zenskyˇ ´ tenis “ˇs´ıleny´” [Pl´ıˇskov´a and her first title: Her way again showed how “crazy” women’s tennis is]. Expres.cz. Retrieved from https://www.expres.cz/karolina-pliskova-brisbane- titul-zensky-tenis-fli-/sport.aspx?c¼A190106_185244_dx-sport_mzk Nˇemy´, M. (2018, September 28). Ot´azka sexismu v zensk´ˇ em tenisu: Ne, postar´ame se o sebe, shoduj´ıseˇcesk´e reprezentantky [A question of sexism in women’s tennis: No, we will take care of ourselves, agree the Czech national team members]. Lidovky.cz. Retrieved from https://www.lidovky.cz/sport/ostatni-sporty/otazka-sexismu-v-zenskem-tenisu-ne-postar ame-se-o-sebe-shoduji-se-ceske-reprezentantky.A180928_122023_ln-sport-ostatni_dak Perrota, T. (2017, July 3). Op-Ed: With Serena Williams out, women’s tennis is suddenly an unpredictable Wild West. Los Angeles Times. Retrieved from https://www.latimes.com/ opinion/op-ed/la-oe-perrotta-wimbledon-women-tennis-serena-williams-pregnant- 20170703-story.html Plaˇsil, J., & Marek, J. (2013, April 20). Oˇcek´av´am tˇesn´e a zaj´ımav´e utk´an´ı, ˇr´ık´aby´valy´ tenista a tren´er Jan Kukal [I suppose it will be a tight and interesting match, says former tennis player and a coach Jan Kukal]. irozhlas.cz. Retrieved from https://www.irozhlas.cz/sport_ tenis/fc-ocekavam-tesne-a-zajimave-utkani-rika-byvaly-tenista-a-trener-jan-kukal-_2013 04201020_jmarek Quayle, M., Wurm, A., Barnes, H., Barr, T., Beal, E., Fallon, M., ... Wei, R. (2019). Stereo- typing by omission and commission: Creating distinctive gendered spectacles in the tele- vised coverage of the 2015 men’s and women’s singles semi-finals and finals. International Review for the Sociology of Sport, 54, 3–21. Tejkalova and Kristoufek 19 ram TN.cz. (2018, March 10). Prostˇe zenskyˇ ´ tenis: Kvitov´a78vy´mˇen darovala a stejnˇe vyhr´ala! [Simply Women’s tennis: Kvitov´a donated 78 exchanges but won anyway!]. tn.cz. Retrieved from http://sport.tn.nova.cz/clanek/typicky-zensky-tenis-kvitova-80- vymen-darovala-a-stejne-vyhrala.html Tandon, K. (2010, May 12). How to handle the WTA unpredictability. ESPN.com. Retrieved from https://www.espn.com/sports/tennis/columns/story?columnist¼tandon_kamakshi& id¼5180343. Vincent, J. (2004). Game, sex and match: The construction of gender in British newspaper Coverage of the 2000 Wimbledon Championships. Sociology of Sport Journal, 21, 35–456. Vincent, J., Pedersen, P. M., Whisenant, W., & Massey, D. (2007). Analysing the print media coverage of professional tennis players: British newspaper narratives about female com- petitors in the Wimbledon championships. International Journal of Sport Management and Marketing, 2, 281–300. Wenner, L. (1998). Preface. In L. Wenner (Ed.), Media sport (1st ed., pp. xiii–xiv). New York, NY: Routledge. Wozniak, D. (2012). Gender differences in a market with relative performance feedback: Professional tennis players. Journal of Economic Behavior & Organization, 83, 158–171. Yip, A. (2018). Deuce or advantage? Examining gender bias in online coverage of profes- sional tennis. International Review for the Sociology of Sport, 53, 517–532.