University of Southampton Research Repository
Total Page:16
File Type:pdf, Size:1020Kb
Load more
Recommended publications
-
Google Scholar from Wikipedia, the Free Encyclopedia
Google Scholar From Wikipedia, the free encyclopedia Google Scholar is a freely accessible web search engine that indexes the full Google Scholar text or metadata of scholarly literature across an array of publishing formats and disciplines. Released in beta in November 2004, the Google Scholar index includes most peer-reviewed online academic journals and books, Web address scholar.google.com conference papers, theses and dissertations, preprints, abstracts, (http://scholar.google.com) technical reports, and other scholarly Type of site Bibliographic database literature, including court opinions and Registration Optional patents.[1] While Google does not publish the size of Google Scholar's Owner Google database, third-party researchers Launched November 20, 2004 estimated it to contain roughly 160 Current status Active million documents as of May 2014[2] and an earlier statistical estimate published in PLOS ONE using a Mark and recapture method estimated approximately 80-90% coverage of all articles published in English.[3] Google Scholar is similar in function to the freely available CiteSeerX and getCITED. It also resembles the subscription-based tools, Elsevier's Scopus and Thomson Reuters' Web of Science. Contents 1 History 2 Features and specifications 3 Ranking algorithm 4 Limitations and criticism 5 See also 6 References 7 External links History Google Scholar arose out of a discussion between Alex Verstak and Anurag Acharya,[4] both of whom were then working on building Google's main web index.[5][6] Their goal was to "make the world's problem solvers 10% more efficient" [7] by allowing easier and more accurate access to scientific knowledge. This goal is reflected in the Google Scholar's advertising slogan – "Stand on the shoulders of giants" – taken from a quote by Isaac Newton and is a nod to the scholars who have contributed to their fields over the centuries, providing the foundation for new intellectual achievements. -
Tipping Points: Cancelling Journals When Arxiv Access Is Good Enough
Tipping points: cancelling journals when arXiv access is good enough Tony Aponte Sciences Collection Coordinator UCLA Library ASEE ELD Lightning Talk June 17, 2019 Preprint explosion! Brian Resnick and Julia Belluz. (2019). The war to free science. Vox https://www.vox.com/the-highlight/2019/6/3/18271538/open- access-elsevier-california-sci-hub-academic-paywalls Preprint explosion! arXiv. (2019). arXiv submission rate statistics https://arxiv.org/help/stats/2018_by_area/index 2018 Case Study: two physics journals and arXiv ● UCLA: heavy users of arXiv. Not so heavy users of version of record ● Decent UC authorship ● No UC editorial board members 2017 Usage Annual cost Cost per use 2017 Impact Factor Journal A 103 $8,315 ~$80 1.291 Journal B 72 $6,344 ~$88 0.769 Just how many of these articles are OA? OAISSN.py - Enter a Journal ISSN and a year and this python program will tell you how many DOIs from that year have an open access version2 Ryan Regier. (2018). OAISSN.py https://github.com/ryregier/OAcounts. Just how many of these articles are OA? Ryan Regier. (2018). OAISSN.py https://github.com/ryregier/OAcounts. Just how many of these articles are OA? % OA articles from 2017 % OA articles from 2018 Journal A 68% 64% Journal B 11% 8% Ryan Regier. (2018). OAISSN.py https://github.com/ryregier/OAcounts. arXiv e-prints becoming closer to publisher versions of record according to UCLA similarity study of arXiv articles vs versions of record Martin Klein, Peter Broadwell, Sharon E. Farb, Todd Grappone. 2018. Comparing Published Scientific Journal Articles to Their Pre-Print Versions -- Extended Version. -
Microsoft Academic Search and Google Scholar Citations
Microsoft Academic Search and Google Scholar Citations Microsoft Academic Search and Google Scholar Citations: A comparative analysis of author profiles José Luis Ortega1 VICYT-CSIC, Serrano, 113 28006 Madrid, Spain, [email protected] Isidro F. Aguillo Cybermetrics Lab, CCHS-CSIC, Albasanz, 26-28 28037 Madrid, Spain, [email protected] Abstract This paper aims to make a comparative analysis between the personal profiling capabilities of the two most important free citation-based academic search engines, namely Microsoft Academic Search (MAS) and Google Scholar Citations (GSC). Author profiles can be very useful for evaluation purposes once the advantages and the shortcomings of these services are described and taken into consideration. A total of 771 personal profiles appearing in both the MAS and the GSC databases are analysed. Results show that the GSC profiles include more documents and citations than those in MAS, but with a strong bias towards the Information and Computing sciences, while the MAS profiles are disciplinarily better balanced. MAS shows technical problems such as a higher number of duplicated profiles and a lower updating rate than GSC. It is concluded that both services could be used for evaluation proposes only if they are applied along with other citation indexes as a way to supplement that information. Keywords: Microsoft Academic Search; Google Scholar Citations; Web bibliometrics; Academic profiles; Research evaluation; Search engines Introduction In November 2009, Microsoft Research Asia started a new web search service specialized in scientific information. Even though Google (Google Scholar) already introduced an academic search engine in 2004, the proposal of Microsoft Academic Search (MAS) went beyond a mere document retrieval service that counts citations. -
Emerging Technologies for Academic Libraries in the Digital Age CHANDOS INFORMATION PROFESSIONAL SERIES
Emerging Technologies for Academic Libraries in the Digital Age CHANDOS INFORMATION PROFESSIONAL SERIES Series Editor: Ruth Rikowski (email: [email protected]) Chandos’ new series of books are aimed at the busy information professional. They have been specially commissioned to provide the reader with an authoritative view of current thinking. They are designed to provide easy-to-read and (most importantly) practical coverage of topics that are of interest to librarians and other information professionals. If you would like a full listing of current and forthcoming titles, please visit our web site www.chandospublishing.com or contact Hannah Grace-Williams on email [email protected] or telephone number +44 (0) 1993 848726. New authors: we are always pleased to receive ideas for new titles; if you would like to write a book for Chandos, please contact Dr Glyn Jones on email [email protected] or telephone number +44 (0) 1993 848726. Bulk orders: some organisations buy a number of copies of our books. If you are interested in doing this, we would be pleased to discuss a discount. Please contact Hannah Grace-Williams on email [email protected] or telephone number +44 (0) 1993 848726. Emerging Technologies for Academic Libraries in the Digital Age LILI LI Chandos Publishing Oxford · England Chandos Publishing (Oxford) Limited TBAC Business Centre Avenue 4 Station Lane Witney Oxford OX28 4BN UK Tel: +44 (0) 1993 848726 Fax: +44 (0) 1865 884448 E-mail: [email protected] www.chandospublishing.com First published in Great Britain in 2009 ISBN: 978 1 84334 320 2 (paperback) 978 1 84334 323 3 (hardback) 1 84334 320 7 (paperback) 1 84334 323 1 (hardback) © LiLi Li, 2009 British Library Cataloguing-in-Publication Data. -
Scholarly Metrics Recommendations for Research Libraries: Deciphering the Trees in the Forest Table of Contents
EUROPE’S RESEARCH LIBRARY NETWORK Scholarly Metrics Recommendations for Research Libraries: Deciphering the Trees in the Forest Table of Contents 01 3D. Learn About Platforms Before Implementing Them In Services & 15 INTRODUCTION 03 Homogenize Different Sources ABOUT LIBER 3E. Work With Researchers To Build Awareness Of Benefits But Educate 16 1. DISCOVERY & DISCOVERABILITY 05 About Weaknesses of Scholarly Metrics 1A. Provide Contextual Information To Allow the Discovery of Related 05 3F. Expand Your Perspective When Developing Services; Avoid Single- 17 Work & Users Purpose Approaches 1B. Exploit Rich Network Structures & Implement Bibliometric Methods To 07 3G. Teach Colleagues New Skills & Library Goals & Services 17 Enable Discovery 1C. Encourage Sharing Of Library Collections Under Open Licenses 08 4. RESEARCH ASSESSMENT 19 4A. Establish Appropriate Goals for Assessment Exercises Before Selecting 19 2. SHOWCASING ACHIEVEMENTS 09 Databases & Metrics; Be Transparent About Use & Interpretation 2A. Incentivize Researchers To Share Scholarly Works, Promote Achievements 09 4B. Use Different Data Sources to Include Various Scholarly Works & 20 Online & Engage With Audiences Disciplinary Communication & Publication Cultures 2B. Encourage Researchers To Showcase Scientific Contributions & Monitor 11 4C. Rely on Objective, Independent & Commonly-Accepted Data Sources 21 Impact To Provide Sound & Transparent Scholarly Metrics 4D. Avoid Using Composite Indicators & Conflating Different Aspects 21 3. SERVICE DEVELOPMENT 13 Of Scholarly Works & Impact; Don’t Lose The Multifaceted Nature of 3A. Join Forces With Stakeholders To Provide Services Reusing Existing 13 Metrics & Distort Interpretation Resources, Tools, Methods & Data 3B. Value Various Levels of Engagement; Favour Standardized, Well-Established 15 23 CONCLUSION & CALL FOR ACTION Practices & Easy-To-Use Tools 25 REFERENCES 3C. -
Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and Opencitations’ COCI: a Multidisciplinary Comparison of Coverage Via Citations
✅ The content of this manuscript has been peer-reviewed and accepted for publication at Scientometrics (ISSN 1588-2861) Cite as: Martín-Martín, A., Thelwall, M., Orduna-Malea, E., & Delgado López-Cózar, E. (2021). Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: A multidisciplinary comparison of coverage via citations. Scientometrics, 126(1), 871-906. https://doi.org/10.1007/s11192-020-03690-4 Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations’ COCI: a multidisciplinary comparison of coverage via citations Alberto Martín-Martín 1 , Mike Thelwall 2 , Enrique Orduna- 3 1 Malea , Emilio Delgado López-Cózar Abstract (also in Spanish, Chinese) Introduction New sources of citation data have recently become available, such as Microsoft Academic, Dimensions, and the OpenCitations Index of CrossRef open DOI-to-DOI citations (COCI). Although these have been compared to the Web of Science (WoS), Scopus, or Google Scholar, there is no systematic evidence of their differences across subject categories. Methods In response, this paper investigates 3,073,351 citations found by these six data sources to 2,515 English-language highly-cited documents published in 2006 from 252 subject categories, expanding and updating the largest previous study. Results Google Scholar found 88% of all citations, many of which were not found by the other sources, and nearly all citations found by the remaining sources (89%-94%). A similar pattern held within most subject categories. Microsoft Academic is the second largest overall (60% of all citations), including 82% of Scopus citations and 86% of Web of Science citations. In most categories, Microsoft Academic found more citations than Scopus and WoS (182 and 223 subject categories, respectively), but had coverage gaps in some areas, such as Physics and some Humanities categories. -
Citeseerx: 20 Years of Service to Scholarly Big Data
CiteSeerX: 20 Years of Service to Scholarly Big Data Jian Wu Kunho Kim C. Lee Giles Old Dominion University Pennsylvania State University Pennsylvania State University Norfolk, VA University Park, PA University Park, PA [email protected] [email protected] [email protected] ABSTRACT access to a growing number of researchers. Mass digitization par- We overview CiteSeerX, the pioneer digital library search engine, tially solved the problem by storing document collections in digital that has been serving academic communities for more than 20 years repositories. The advent of modern information retrieval methods (first released in 1998), from three perspectives. The system per- significantly expedited the process of relevant search. However, spective summarizes its architecture evolution in three phases over documents are still saved individually by many users. In 1997, three the past 20 years. The data perspective describes how CiteSeerX computer scientists at the NEC Research Institute (now NEC Labs), has created searchable scholarly big datasets and made them freely New Jersey, United States – Steven Lawrence, Kurt Bollacker, and available for multiple purposes. In order to be scalable and effective, C. Lee Giles, conceived an idea to create a network of computer AI technologies are employed in all essential modules. To effectively science research papers through citations, which was to be imple- train these models, a sufficient amount of data has been labeled, mented by a search engine, the prototype CiteSeer. Their intuitive which can then be reused for training future models. Finally, we idea, automated citation indexing [8], changed the way researchers discuss the future of CiteSeerX. -
Overview of the Altmetrics Landscape
Purdue University Purdue e-Pubs Charleston Library Conference Overview of the Altmetrics Landscape Richard Cave Public Library of Science, [email protected] Follow this and additional works at: https://docs.lib.purdue.edu/charleston Part of the Library and Information Science Commons An indexed, print copy of the Proceedings is also available for purchase at: http://www.thepress.purdue.edu/series/charleston. You may also be interested in the new series, Charleston Insights in Library, Archival, and Information Sciences. Find out more at: http://www.thepress.purdue.edu/series/charleston-insights-library-archival- and-information-sciences. Richard Cave, "Overview of the Altmetrics Landscape" (2012). Proceedings of the Charleston Library Conference. http://dx.doi.org/10.5703/1288284315124 This document has been made available through Purdue e-Pubs, a service of the Purdue University Libraries. Please contact [email protected] for additional information. Overview of the Altmetrics Landscape Richard Cave, Director of IT and Computer Operations, Public Library of Science Abstract While the impact of article citations has been examined for decades, the “altmetrics” movement has exploded in the past year. Altmetrics tracks the activity on the Social Web and looks at research outputs besides research articles. Publishers of scientific research have enabled altmetrics on their articles, open source applications are available for platforms to display altmetrics on scientific research, and subscription models have been created that provide altmetrics. In the future, altmetrics will be used to help identify the broader impact of research and to quickly identify high-impact research. Altmetrics and Article-Level Metrics Washington as an academic research project to rank journals based on a vast network of citations The term “altmetrics” was coined by Jason Priem, (Eigenfactor.org, http://www.eigenfactor.org/ a PhD candidate at the School of Information and whyeigenfactor.php). -
The Visibility of Authority Records, Researcher Identifiers, Academic Social Networking
James Madison University JMU Scholarly Commons Libraries Libraries 4-23-2019 The iV sibility of Authority Records, Researcher Identifiers, Academic Social Networking Profiles, and Related Faculty Publications in Search Engine Results Rebecca B. French James Madison University, [email protected] Jody Condit Fagan James Madison University, [email protected] Follow this and additional works at: https://commons.lib.jmu.edu/letfspubs Part of the Library and Information Science Commons Recommended Citation French, Rebecca B. and Fagan, Jody Condit, "The iV sibility of Authority Records, Researcher Identifiers, Academic Social Networking Profiles, and Related Faculty Publications in Search Engine Results" (2019). Libraries. 148. https://commons.lib.jmu.edu/letfspubs/148 This Article is brought to you for free and open access by the Libraries at JMU Scholarly Commons. It has been accepted for inclusion in Libraries by an authorized administrator of JMU Scholarly Commons. For more information, please contact [email protected]. The visibility of authority records, researcher identifiers, academic social networking profiles, and related faculty publications in search engine results Rebecca B. French and Jody Condit Fagan James Madison University Author Note JMU Libraries, Carrier Library 1704, James Madison University, Harrisonburg, VA 22807 [email protected] [email protected] This is an Accepted Manuscript of an article published by Taylor & Francis in the Journal of Web Librarianship on April 23, 2019, available online: http://www.tandfonline.com/10.1080/19322909.2019.1591324. 1 Abstract Information about faculty and their publications can be found in library databases such as the Library of Congress Name Authority File, VIAF, WorldCat, and institutional repositories; in identifier registries like ORCID and ISNI; and on academic social networking sites like Academia, Google Scholar, and ResearchGate, but the way search engines use such identifiers and profiles is unclear. -
Can Microsoft Academic Assess the Early Citation Impact of In-Press Articles? a Multi-Discipline Exploratory Analysis1
Can Microsoft Academic assess the early citation impact of in-press articles? A multi-discipline exploratory analysis1 Kayvan Kousha, Mike Thelwall and Mahshid Abdoli. Statistical Cybermetrics Research Group, University of Wolverhampton, UK. Many journals post accepted articles online before they are formally published in an issue. Early citation impact evidence for these articles could be helpful for timely research evaluation and to identify potentially important articles that quickly attract many citations. This article investigates whether Microsoft Academic can help with this task. For over 65,000 Scopus in-press articles from 2016 and 2017 across 26 fields, Microsoft Academic found 2-5 times as many citations as Scopus, depending on year and field. From manual checks of 1,122 Microsoft Academic citations not found in Scopus, Microsoft Academic’s citation indexing was faster but not much wider than Scopus for journals. It achieved this by associating citations to preprints with their subsequent in-press versions and by extracting citations from in-press articles. In some fields its coverage of scholarly digital libraries, such as arXiv.org, was also an advantage. Thus, Microsoft Academic seems to be a more comprehensive automatic source of citation counts for in-press articles than Scopus. Introduction Citation indicators derived from conventional scholarly databases, such as Scopus and the Web of Science (WoS), are often used for the impact assessment of published articles. They are rarely useful for recently- published and in-press articles, however, since their citation counts tend to be zero. The overall publication delay (the time between submission or acceptance and publication) also negatively influences citation indicators (Luwel & Moed, 1998; Yu, Wang, & Yu, 2005; Tort, Targino & Amaral, 2012; Shi, Rousseau, Yang, & Li, 2017). -
(NLP) Approaches to Determine Semantic Similarity Between Texts in Domain-Specific Context
Analysis of Natural Language Processing (NLP) approaches to determine semantic similarity between texts in domain-specific context Author: Surabhi Som (6248160) [email protected] Supervisors: Denis Paperno [email protected] Rick Nouwen [email protected] A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Artificial Intelligence in Faculty of Science Utrecht University 1 Table of Contents Chapter 1 ............................................................................................ 3 Introduction ........................................................................................ 3 1.1 Problem Description ................................................................. 4 Chapter 2 ............................................................................................ 5 Literature Review .............................................................................. 5 2.1 Natural Language Processing .................................................. 5 2.2 Ontologies and its importance ................................................. 5 2.3 Semantics ................................................................................. 6 2.4 Semantic Similarity .................................................................. 7 2.5 Sentence Semantic Similarity ................................................ 15 2.6 Text Semantics ....................................................................... 23 2.7 Stemming and Lemmatization .............................................. -
A Study on Microsoft Academic Search and Google Scholar: a Gateway for Researchers
International Journal of Multidisciplinary Research and Development International Journal of Multidisciplinary Research and Development Online ISSN: 2349-4182, Print ISSN: 2349-5979 Impact Factor: RJIF 5.72 www.allsubjectjournal.com Volume 4; Issue 11; November 2017; Page No. 41-47 A study on Microsoft academic search and Google scholar: A gateway for researchers J Arumugam, M Prakash Librarian, PSG College of Technology, Coimbatore, Tamil Nadu, India Abstract This paper deals about the working techniques of Microsoft Academic Search and Google Scholar. MAS which automatically create a profile for authors who have published in the universe it searches. Therefore it is a good idea for scholars to check their profile and make any needed changes. Google Scholar provides a simple way to broadly search for scholarly literature. This paper deals the merits and demerits of both of these search engines for scholarly publishers.. Most of the peer reviewed Journals, non- peer reviewed journals and open access journals are indexed here. This paper is very much helpful for the research beginners and academicians for locating the related review of literature. Keywords: google scholar, microsoft academic search, research Introduction users 100 percent availability of information. While Google People are living in the world of information explosion Scholar could not compete in this respect, this search engine especially in the influence of information communication could help to lead users to the library’s offerings. Problematic technology has brought drastically changes in the field of with this approach is that the user often doesn’t know or information. The growing amount of literature on the Web and recognize that he indeed uses the library’s offerings—i.e., the the need for multidisciplinary information retrieval content the library has licensed for its users.