Citizen Science Frontiers: Efficiency, Engagement, and Serendipitous
Total Page:16
File Type:pdf, Size:1020Kb
Citizen science frontiers: Efficiency, engagement, and serendipitous discovery with human–machine systems Laura Trouillea,b,1, Chris J. Lintottc, and Lucy F. Fortsond aDepartment of Citizen Science, The Adler Planetarium, Chicago, IL 60605; bCenter for Interdisciplinary Exploration and Research in Astrophysics, Northwestern University, Evanston, IL 60208; cDepartment of Physics, The University of Oxford, Oxford, OX1 3RH, United Kingdom; and dDepartment of Physics and Astronomy, The University of Minnesota-Twin Cities, Minneapolis, MN 55455 Edited by Youngmoo E. Kim, Drexel University, Philadelphia, PA, and accepted by Editorial Board Member Eva Tardos December 7, 2018 (received for review May 20, 2018) Citizen science has proved to be a unique and effective tool in large, diverse datasets. In this Perspectives piece, we describe in helping science and society cope with the ever-growing data our recent efforts and future considerations for designing a rates and volumes that characterize the modern research land- human–machine system optimized for “happy chance discov- scape. It also serves a critical role in engaging the public with ery” (a definition of serendipity that provided guidance for the research in a direct, authentic fashion and by doing so promotes Cybernetic Serendipity exhibit) that best takes advantage of the a better understanding of the processes of science. To take full efficiencies of the machine while acknowledging the complexity advantage of the onslaught of data being experienced across of human motivation and engagement. the disciplines, it is essential that citizen science platforms lever- What is now called citizen science—the involvement of the age the complementary strengths of humans and machines. This general public in research—has a long history. An early example Perspectives piece explores the issues encountered in designing is Edmund Halley’s study of timings during the 1715 total human–machine systems optimized for both efficiency and vol- solar eclipse, which included observations from a distributed, unteer engagement, while striving to safeguard and encourage self-organized group of observers (1). Works by refs. 2 and opportunities for serendipitous discovery. We discuss case stud- 3, among others, have linked modern-day efforts to their ies from Zooniverse, a large online citizen science platform, and 19th century antecedents, for example, highlighting the role show that combining human and machine classifications can effi- played by amateur networks of meteorological observers in ciently produce results superior to those of either one alone establishing that field of study (i.e., by 1900, more than 3,400 and how smart task allocation can lead to further efficiencies observers were contributing data to a network organized by in the system. While these examples make clear the promise of George Symons, producing data on a scale that could not be human–machine integration within an online citizen science sys- matched by the professional efforts of the time). In recent tem, we then explore in detail how system design choices can decades, citizen science has gained renewed prominence, inadvertently lower volunteer engagement, create exclusionary boosted in part by technological advances and digital tools practices, and reduce opportunity for serendipitous discovery. like mobile applications, cloud computing, and wireless and Throughout we investigate the tensions that arise when design- sensor technology which have enabled new modes of public ing a human–machine system serving the dual goals of carry- engagement in research (4) and facilitated research projects ing out research in the most efficient manner possible while that investigate questions from data at scales beyond the empowering a broad community to authentically engage in this professional research community’s resource capacity (5). research. Professional citizen science organizations have been created in Europe, Australia, and the United States. In the United States, citizen science j machine learning j human computing interaction j the Crowdsourcing and Citizen Science Act of 2015 was physical sciences j biological sciences introduced to encourage the use of citizen science within the federal government and, that same year, the first Citizen he 1968 Cybernetic Serendipity exhibition (www. Science Association (CSA) conference was held (although Tstudiointernational.com/index.php/cybernetic-serendipity- some consider the 2012 European Space Agency side event 50th-anniversary) was an early imagining and exploration of on citizen science the first CSA gathering). CitizenScience.gov computer-aided creative activity, play, and interplay. The exhibit, (https://www.CitizenScience.gov) currently lists over 400 active curated by Jasia Reichardt at the Institute of Contemporary Arts citizen science projects. Participation in citizen science today in London, examined the role of cybernetics in contemporary art ranges from hands-on data collection, tagging, analysis, and and included robots; algorithmically generated movies, poetry, research projects [e.g., iNaturalist.org (https://www.iNaturalist. and music; painting machines; and kinetic interactives. Several org) (research grade observations: https://www.gbif.org/dataset/ of the works featured chance as an important ingredient in the 50c9509d-22c7-4a22-a47d-8c48425ef4a7), eBird.org (https:// creative process, reflected in the priority given in the exhibition’s www.eBird.org) (6), and CitSci.org (https://www.CitSci.org) (7)] emphasis on machine-enabled serendipity. It is useful to reflect, 50 y later, whether machines have indeed enabled serendip- itous discovery, albeit in the realm of science rather than This paper results from the Arthur M. Sackler Colloquium of the National Academy of Sciences, “Creativity and Collaboration: Revisiting Cybernetic Serendipity,” held March the arts. 13–14, 2018, at the National Academy of Sciences in Washington, DC. The complete pro- We consider this concept in the context of online citizen sci- gram and video recordings of most presentations are available on the NAS website at ence projects. These projects, which massively share the task www.nasonline.org/Cybernetic Serendipity.y of data analysis among a crowd of volunteers, in many ways Author contributions: L.T., C.J.L., and L.F.F. performed research; and L.T., C.J.L., and L.F.F. exemplify the promise of those early ideas, providing a com- wrote the paper.y pelling modern example of the transformative power of the The authors declare no conflict of interest.y integration of human and machine effort. Online citizen sci- This article is a PNAS Direct Submission. Y.E.K. is a guest editor invited by the Editorial ence not only is a powerful tool for efficiently processing our Board.y growing data rates and volumes (the “known knowns”), but Published under the PNAS license.y also can function as a means of enabling serendipitous discov- 1 To whom correspondence should be addressed. Email: [email protected] ery of the “known unknowns” and the “unknown unknowns” Published online February 4, 2019. 1902–1909 j PNAS j February 5, 2019 j vol. 116 j no. 6 www.pnas.org/cgi/doi/10.1073/pnas.1807190116 Downloaded by guest on September 27, 2021 PAPER to contributing in-person data and participating in hands-on data Zoo project (31, 32), Zooniverse projects have led to over COLLOQUIUM analysis [e.g., the Denver Museum of Science Genetics of Taste 150 peer-reviewed publications, enabling significant contribu- Laboratory (8)] to a growing number of cocreated environmen- tions across many disciplines (see zooniverse.org/publications tal monitoring projects using low-cost sensors with community for the full list), e.g., in ecology (33–36), humanities (37, 38), members working in collaboration with researchers [e.g., the LA biomedicine (39), physics (40, 41), climate science (42, 43), and Watershed Project (https://www.epa.gov/urbanwaterspartners/ astronomy (31, 32, 44–46). These projects have established a diverse-partners-brownfields-healthfields-la-watershed)] to on- track record of online citizen science producing quality data line data-processing efforts, described in more detail below. for use by the wider scientific community. This paper pro- There has also been an explosion of citizen science efforts vides a compilation of lessons learned and questions raised carried out in classroom settings; for example, Sea-Phages around the integration of machine learning into online citi- (9), Small World Initiative (10), and the Genomic Education zen science based on the experiences from myriad projects on Partnership (11) provide standardized curricula for under- the Zooniverse platform. With each Zooniverse project high- graduate students to collect soil and other samples from their lighted, we reference the specific project URL and the relevant local environments, isolate the bacteria in them, annotate the citations. genomes, characterize them, and upload their results into The number of projects supported by Zooniverse has recently national databases. Over 300 universities participate annually in experienced rapid growth, an acceleration which is a result of these recently launched efforts, with dozens of peer-reviewed the launch in July 2015 of the free Project Builder Platform articles to date and a major impact on these fields of study (e.g., (https://www.zooniverse.org/lab) which enables anyone to build ref. 12). and deploy an online citizen science project at no cost, within Online citizen science, which has become a proven method