Identifying Author Heritage Using Surname Data: An Application for Russian Surnames Maria Karaulova Manchester Institute of Innovation Research, Alliance Manchester Business School, University of Manchester, Manchester, M13 9PL, UK. E-mail:
[email protected] Abdullah Gök Hunter Centre for Entrepreneurship, Strathclyde Business School, University of Strathclyde, 199 Cathedral Street, Glasgow, G4 0QU, UK. E-mail:
[email protected] Philip Shapira Manchester Institute of Innovation Research, Alliance Manchester Business School, University of Manchester, M13 9PL, UK, and the School of Public Policy, Georgia Institute of Technology, Atlanta, GA, 30332-0345, USA. E-mail:
[email protected] This research article puts forward a method to identify contribute to advancing research in scientificmobility the national heritage of authors based on the morphol- and migration, patenting by certain groups, publishing ogy of their surnames. Most studies in the field use vari- and collaboration, transnational and scientificdiaspora ants of dictionary-based surname methods to identify links, and the effects of diversity on the innovative per- ethnic communities, an approach that suffers from meth- formance of organizations, regions, and countries. odological limitations. Using the public file of ORCID (Open Researcher and Contributor ID) identifiers in 2015, we developed a surname-based identification method Introduction and applied it to infer Russian heritage from suffix-based morphological regularities. The method was developed One of the uses of bibliometric research is to extricate conceptually and tested in an undersampled control set. information that is not explicitly presented in mainstream Identification based on surname morphology was then databases, such as inferring patterns of new technology complemented by using first-name data to eliminate emergence, the extent of novelty and originality in patents, false-positive results.