Bibliometrics and Patent Indicators for the Science and Engineering Indicators 2018: Technical Documentation
Total Page:16
File Type:pdf, Size:1020Kb
Task 21: Bibliometrics and Patent Indicators for the Science and Engineering Indicators 2018 Technical Documentation January 2018 Task 21: Bibliometrics and Patent Indicators for the Science and Engineering Indicators 2018 Technical Documentation January 15, 2018 Submitted to: SRI International Authors Grégoire Côté Guillaume Roberge Philippe Deschamps Nicolas Robitaille Project Leader Grégoire Côté By: Science-Metrix 1.514.495.6505 ▪ 1.800.994.4761 [email protected] ▪ www.science-metrix.com Bibliometrics and Patent Indicators for the Science and Engineering Indicators 2018 Technical Documentation Contents Tables ...................................................................................................................................................... ii Figures .................................................................................................................................................... ii 1 Introduction ................................................................................................................................. 1 2 Bibliometric methods ................................................................................................................ 2 2.1 Database implementation ................................................................................................ 4 Completeness of the database ........................................................................ 7 Filtering non-peer-reviewed documents ........................................................... 9 Filtering low-quality papers ............................................................................. 10 Missing addresses .......................................................................................... 11 2.2 Data standardization ....................................................................................................... 12 Linking WebCASPAR classification to the database ..................................... 12 Data standardization: country, country groups, regions ............................... 14 Data standardization: U.S. states ................................................................... 16 Data coding: U.S. sectors ................................................................................ 17 2.3 Production database ....................................................................................................... 19 Computation of the citations .......................................................................... 20 Production database structure ....................................................................... 21 2.4 Indicators ......................................................................................................................... 22 Number of publications................................................................................... 22 Collaboration ................................................................................................... 23 Index of collaboration ...................................................................................... 24 Scientific impact analysis – citations and journal impact factors ................ 24 Fractioning of citations ................................................................................... 27 Relative citation index ..................................................................................... 28 International citations ..................................................................................... 30 3 Patent indicators ......................................................................................................................31 3.1 Kind codes ....................................................................................................................... 31 3.2 Database .......................................................................................................................... 31 3.3 Database implementation .............................................................................................. 32 3.4 Data standardization ....................................................................................................... 33 Mapping of patents by technical fields .......................................................... 33 Mapping of patents in sustainable energy technologies .............................. 34 Linking citations to non-patent literature to bibliometric database ............. 36 Data standardization: country, country groups, regions ............................... 39 Data standardization: U.S. states ................................................................... 39 Data coding: U.S. sectors ................................................................................ 40 Non-U.S. academic institutions ...................................................................... 41 3.5 Indicators ......................................................................................................................... 41 January 2018 © Science-Metrix Inc. i Bibliometrics and Patent Indicators for the Science and Engineering Indicators 2018 Technical Documentation Inventors versus applicants ............................................................................ 41 Applications versus granted patents .............................................................. 41 Number of patents .......................................................................................... 42 Tables Table I Link between XML items and columns in the SQL table ............................................. 5 Table II Link between XML items and columns in the SQL table ............................................. 6 Table III Link between XML items and columns in the SQL table ............................................. 7 Table IV Monthly follow-up of the completion rate for the year 2016 ...................................... 8 Table V Combinations of source types and document types used for the production of bibliometric indicators ................................................................................................. 10 Table VI Distribution of scientific output in Science-Metrix’ subfield of Marketing across WebCASPAR subfields ................................................................................................. 13 Table VII Geographic entities that changed over time .............................................................. 15 Table VIII Coding papers by sector .............................................................................................. 18 Table IX Number of documents after each step of filtering performed by Science-Metrix ... 20 Table X Example of citation fractioning on a pair of citing–cited articles ............................. 28 Table XI Citations counts between country pairs for a pair of citing–cited articles ............... 29 Table XII USPTO kind codes included for the production of statistics on granted patents .... 31 Table XIII WIPO classification scheme for the production of SEI patent indicators ................. 33 Table XIV Example of a patent fractioned by technical fields according to IPC codes, following conversion from CPC codes ........................................................................................ 34 Table XV Sustainable energy technologies technical areas ..................................................... 35 Table XVI Most frequent 2-grams in patent reference strings .................................................. 37 Figures Figure 1 Bibliographic information for the computation of bibliometric indicators ................. 3 Figure 2 Observed data and evaluation of the completeness, July 2017 ................................ 9 Figure 3 Average number of addresses on publications in Scopus, 1996–2017 ................. 12 Figure 4 Database schema ........................................................................................................ 21 Figure 5 Database schema ........................................................................................................ 22 Figure 6 PatentsView database structure ................................................................................. 32 January 2018 © Science-Metrix Inc. ii Bibliometrics and Patent Indicators for the Science and Engineering Indicators 2018 Technical Documentation 1 Introduction Science-Metrix has been commissioned by SRI International, on behalf of the National Science Foundation, to develop measures and indicators of research and patent activity using bibliometrics and patent data for inclusion in the Science and Engineering Indicators (SEI) 2018. This technical document details the various steps taken to implement the databases, clean and standardize the data, and produce statistics. This documentation is accompanied by a collection of external files that are necessary complements to perform these tasks. The following is the list of accompanying external files: External File 1: Postgresql scripts External File 2: Scopus cancelled title list External File 3: DOAJ cancelled title list External File 4: Source id to WebCASPAR External File 5: Scopus country External File 6: Scopus US addresses to US states External File 7: Scopus US sectors External File 8: Impact NSF production External File 9: IPC technology concordance table External File 10: Patent number to clean technology External File 11: Patent number and uuid to Scopus ID External File 12: Patent number and SEQ to countries and regions External File 13: Patent