Global Research Report the Value of Bibliometric Databases: Data-Intensive Studies Beyond Search and Discovery
Total Page:16
File Type:pdf, Size:1020Kb
Feburary 2020 Global Research Report The value of bibliometric databases: Data-intensive studies beyond search and discovery Jonathan Adams, David Pendlebury and Martin Szomszor Author biographies Jonathan Adams is Chief Scientist at David Pendlebury is Head of Martin Szomszor is Director at the the Institute for Scientific Information Research Analysis at ISI. Since 1983 Institute for Scientific Information (ISI). He is also a Visiting Professor he has used Web of Science data to and has also held the role of Head at King’s College London, Policy study the structure and dynamics of of Research Analytics at ISI. He was Institute. In 2017 he was awarded research. He worked for many years named a 2015 top-50 UK Information an Honorary D.Sc. by the University with ISI founder Eugene Garfield. Age data leader for his work in creating of Exeter, for his work in higher With Henry Small, David developed the REF2014 impact case studies education and research policy. ISI’s Essential Science Indicators. database for the Higher Education Funding Council for England (HEFCE). Foundational past, visionary future The Institute for Today, as the ‘university’ of the • Carries out research to sustain, Scientific Information Web of Science Group, ISI both: extend and improve the knowledge base and • Maintains the foundational disseminates that knowledge ISI builds on the work of Dr. Eugene knowledge and editorial rigor upon to our colleagues, partners and Garfield – the original founder and which the Web of Science index and all those who deal with research a pioneer of information science. its related products and services in academia, corporations, Named after the company he are built. Our robust evaluation and funders, publishers and founded – the forerunner of the curation have been informed by governments via our reports Web of Science Group – ISI was research use and objective analysis and publications and at re-established in 2018 and serves as a for almost half a century. Selective, events and conferences. home for analytic expertise, guided structured and complete data in by his legacy and adapted to respond the Web of Science provide rich to technological advancements. insights into the contribution and value of the world's most impactful Our global team of industry- scientific and research journals. recognized experts focus on the These expert insights enable development of existing and researchers, publishers, editors, new bibliometric and analytical librarians and funders to explore approaches, whilst fostering the key drivers of a journal's value collaborations with partners and for diverse audiences, making academic colleagues across the better use of the wide body of global research community. data and metrics available. ISBN: 978-1-9160868-7-6 2 Executive summary In 2019, around 145,000 researchers The topical structure of such from, on average, 139 countries publications demonstrates the working across a diversity of research critical relevance of Web of Science disciplines interrogated the Web of literature to review health policy Science each day to explore research targets like cancer, women’s health information and discover key and cardiovascular disease, the literature to inform current research. management of medical outcomes, and the development of innovative In 1981 the Web of Science indexed methods and treatments. approximately 5 00,000 papers (substantive academic articles and Tracking such literature across time reviews) from 6,800 journals; this reveals the emergence of new expanded substantially to 2.5 million fields and helps inform the direction papers sourced from 21,300 journals of funding. Identifying frequent in 2019. This is a deep data resource authors in a select area reveals the for a wide range of analytic uses. distribution of expertise and supports comparative evaluation of activity There are, however, few studies of and outcomes for national policy and how the Web of Science is used as a institutional research management. bibliographic database other than for the purposes of search and discovery. Our analysis shows that Web of Science is the primary source of publication and citation data for the Our analysis majority of systematic research reviews across a broad range of disciplines and shows that Web about twice as many research management and evaluation studies as of Science is the any other source. Web of Science is the primary data source for such work primary source of in the USA, China, and most of western Europe. Countries where the Scopus publication and database was more frequently acknowledged include Iran and Italy. citation data for A key beneficiary of structured use of the majority of Web of Science bibliographic records are the biomedical researchers systematic who have an established and structured approach to accessing raw research reviews material for reviews that inform the development and current state of research topics that are critical to human health and disease control. 3 Introduction The Web of Science was The numbers of papers in the Web of The focus of research management interrogated every day in 2019 Science, and the spread of research it interest is, however, rarely pitched at by around 145,000 researchers covers, has expanded hugely over the this categorical level. It is much more across an average of 139 countries, last few decades. In 1981 it indexed likely to be ‘topical’ at a finer grain, representing the full spectrum about 500,000 papers (substantive within or between established journal of research disciplines in the academic articles and reviews) every categories. Research is increasingly natural and social sciences and, year from around 6,800 journals. interdisciplinary, as the focus of increasingly, in the humanities. Today, it indexes about 2.5 million research topics often spans traditional papers from about 21,300 journals disciplinary structures and cutting- Researchers use the Web of annually. The index houses around 20 edge solutions to major challenges Science to initiate new research million papers from researchers based draw on multiple fields. This means plans and help frame the questions in the USA or about 27 million papers that the information needed by both they want to answer; and then to sourced from researchers in the researchers and those tasked with search for and discover the key European Union: a deep data resource selecting, funding and evaluating literature that helps to support and for a wide range of analytic uses. the outcomes of research projects inform their current research. is likely to come from targeted searches, analyses and reports, to Perhaps surprisingly, there are few be of wide interest across a field. studies of the ways in which the bibliographic database is used The numbers of The concentrated mass of the by the research community for database has many tales to other purposes than for search and papers in the Web tell. Drawing together related discovery. Pringle (2008) discussed publications in a structured way the rising use of bibliographic of Science, and the provides the raw material for reviews metadata in research evaluation that reveal the development and and Schnell (2018) documented spread of research current state of broader research the historical place of the Web topics. Tracking literature across of Science as the first citation it covers, has time reveals the emergence of index for data analytics and new fields and may help to direct scientometrics. Most recently, Li, expanded hugely funding where it can be most usefully Rollins and Yan (2018) confirmed spent. Identifying the most frequent that the quantitative impact of over the last few authors in a select area reveals the Web of Science had not been the distribution of expertise and rigorously examined by scientific decades. supports comparative evaluation studies. They investigated the of activity and outcomes. ways in which this source was mentioned in a sample of 19,478 papers published between 1997- 2017 and analysed the distribution Publication records are structured across countries, institutions and in journal-based categories domains. This appears to have (originally designed to make been the first study to empirically searching easier) but now, are more investigate the documentation frequently employed as the basis on the use of the database. for comparative analytics. There are 254 detailed Web of Science journal In this report, we extend the studies categories (as of January 2020) and of Li et al (2018) and Schnell (2018) 21 broader categories in the Essential by focusing on the ways researchers Science Indicators. Citation analysis is are using Web of Science data to often carried out using the framework learn about the findings reported of Web of Science categories, where across the entire scientific and the close disciplinary relationship scholarly communication system between journals assigned to any and how this information can category means that the average support specific studies, especially citation count for all articles in that systematic reviews, and support category in a particular year is a better research management. meaningful global benchmark. 4 When, what and where? Our first step in this report is to look there has been more usage than the at the numbers of papers (i.e. both ‘normal’ level of academic search articles and reviews) published and discovery that would be required The data shows how each year and indexed in the Web to produce a research paper. of Science that also reference, in rapid the growth their titles, abstracts and key-words, The data shows how rapid the growth the analysis of data from the Web of such studies has been – compared of such studies has of Science or other widely-used, to the overall growth of the underlying global publication databases. database. Whereas acknowledgments been, compared to of database source were rare in Over the period 1999 through to studies using bibliographic data in the the overall growth mid-2019, when we extracted these 1990s, numbers rose to around 1,000 data, we found 51,120 papers that papers per year by 2010 and growth of the underlying explicitly referenced the use of since then has sky-rocketed.