Future Directions for arXiv User Perspectives
Oya Y. Rieger Associate University Librarian Scholarly Resources & Preservation Services arXiv Program Director July 2016
1
1.2 million OA e-prints in Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance and Statistics • 2012 – 84,000 new submissions – 64 million downloads • 2013 – 92,500 new submissions – 67 million downloads • 2014 – 97,000 new submissions – 90 million downloads* • 2015 – 105,000 new submissions – 139 million downloads*
* The numbers are sensitive to robot downloads and it is hard to remove all from our numbers so potential significant over- 2 counting – we put less effort in cleaning up this data 2014 on.
Cornell University Library, July 2016 1 arXiv submission rate statistics
http://arxiv.org/help/stats3
http://arxiv.org/help/stats4
Cornell University Library, July 2016 2 5
6
Cornell University Library, July 2016 3 Business Model, 2013-2017 • Member Institution • Simons Foundation • Cornell University Library
7
Governance Model Cornell University Library (CUL) • Manages the moderation of submissions and user support • Operates arXiv’s technical infrastructure • Ensures long-term access • Establishes and maintains partnerships • Assumes financial responsibility • Maintains transparent and open communication • Provides legal protection
Scientific Advisory Member Advisory Board (SAB) Board (MAB) • Provides advice and guidance • Advises CUL on issues related to: pertaining to the intellectual o repository management and oversight of arXiv development • Oversees arXiv's moderation system o standards implementation & • Reviews the criteria and standards interoperability for deposit in arXiv o development priorities • Proposes new subject or discipline o business planning domains o outreach and advocacy 8
Cornell University Library, July 2016 4 arXiv Organizational Chart @ Cornell University Library
Program Director (0.20 FTE) * Scientific Director (0.40 FTE) Oya Y. Rieger vacant
* CUL indirect contribution
Technology & Policy Advisor Library (.3 FTE) Operations Manager Membership (0.10 Simeon Warner (1 FTE) FTE) * Jim Entwood Chloe McLaren Lead Developer (1 FTE) 2 FTE Admin Martin Lessmeister 0.6 students
1.50 FTE Programming Total Admin Staff = 3.6 FTE .5 FTE User Experience *Gail Steinhart (10%), Scholarly * CUL’s indirect contribution Total IT Staff = 3 FTE Communication Librarian CUL indirect contribution
Total Staff = 7.7 FTE
Additional Contributors: Paul Ginsparg & ~150 moderators Plus indirect-related staffing, e.g., HR, Accounting 9
https://confluence.cornell.edu/display/culpublic/arXiv+Sustainability+Initiative
10
Cornell University Library, July 2016 5 https://confluence.cornell.edu/display/culpublic/arXiv+Sustainability+Initiative
11
April 6-27, 2016 12
Cornell University Library, July 2016 6 DEMOGRAPHICS OF RESPONDENTS
13
main place of work is located in:
Other Countries: 1% or less representation each from 113 countries 14
Cornell University Library, July 2016 7 15
16
Cornell University Library, July 2016 8 68% younger than 39 years
17
KEY FINDINGS
18
Cornell University Library, July 2016 9 95%
19
Key Findings
• Keep to the core mission
• Enable arXiv’s partners and related service providers to continue to build new services and innovations on top of arXiv
20
Cornell University Library, July 2016 10 Key Findings: Wish List
• Improve the search function & author name disambiguation
• Provide better support for submitting and linking research data, code, slides and other materials associated with papers
21
Key Findings: New Services
• Add direct links to papers in the references and support reference extraction
• Offer citation export formats such as BibTeX, RIS
• Enable extraction for the BibTeX entry for the arXiv citation
22
Cornell University Library, July 2016 11 23
Key Findings: QC & Moderation
• Continue to implement quality control measures: – checking for text overlap – correct classification of submissions – rejection of papers without much scientific value, – asking authors to fix format-related problems
• Provide more information about the moderation process and policies
24
Cornell University Library, July 2016 12 Q32 - Please choose any ONE of the following statements that you agree with the most:
25
Key Findings: arXiv and Scientific Communication • Divided opinions: – think boldly and further advance open access – emphasis on the importance of sticking to the main mission
• Urge vigilance when approaching any changes
• Caution against turning arXiv into a “social media” style platform 26
Cornell University Library, July 2016 13 Key Findings: Open Science • Rating system – split between very important/important (36%) and not important/should not be doing this (36%)
• Annotation feature – split with 34.89% of users ranking it as very important/important and 34.08% as not important/should not be doing this
• Implement very carefully and systematically
27
DEMOGRAPHIC CHARACTERISTICS & CORELATIONS
28
Cornell University Library, July 2016 14 Very important, important
Somewhat important
Not important, should not be doing this
No opinion
0% 20% 40% 60% 80% 100%
0 - 5 years 6- 10 years 11 or more years
How important is it to… Improve support for submitting research papers by updating the TeX engine,” by years respondents have used arXiv
29
0 - 5 years
6- 10 years
11 or more years
0% 20% 40% 60% 80% 100%
Very important, important Somewhat important Not important, should not be doing this No opinion
Responses to the question “How important is it to… Offer a rating system so readers can recommend arXiv papers that they find valuable,” by years respondents have used arXiv.
30
Cornell University Library, July 2016 15 72%
20%
8%
31
Source:
Oya Y. Rieger, Gail Steinhart, Deborah Cooper (2016). arXiv@25: Key findings of a user survey. http://arxiv.org/abs/1607.08212
32
Cornell University Library, July 2016 16