Ctwatch Quarterly

ISSN 1555-9874 AVAILABLE ON -LINE AT http://www.ctwatch.org/quarterly/ VOLUME 3 NUMBER 3 AUGUST 2007 The Coming Revolution in Scholarly Communications & Cyberinfr astructure GUEST EDITORS LEE DIRKS AND TONY HEY 1 Introduction Lee Dirks and Tony Hey FEATURED ARTICLES 5 The Shape of the Scientific Article in the 42 Incentivizing the Open Access Research Web: Developing Cyberinfrastructure Publication-Archiving, Data-Archiving and Clifford Lynch Scientometrics Tim Brody, Les Carr, Yves Gingras, Chawki Hajjem, Stevan Harnad, and Alma Swan 11 Next-Generation Implications of Open Access 51 The Law as Cyberinfrastructure Paul Ginsparg Brian Fitzgerald and Kylie Pappalardo 19 Web 2.0 in Science Timo Hannay PERSPECTIVES 26 Reinventing Scholarly Communication for the 58 Cyberinfrastructure For Knowledge Sharing Electronic Age John Wilbanks J. Lynn Fink and Philip E. Bourne 32 Interoperability for the Discovery, Use, and 67 Trends Favoring Open Access Re-Use of Units of Scholarly Communication Peter Suber Herbert Van de Sompel and Carl Lagoze http://www.ctwatch.org/ AUGUST 2007 1 Introduction By now, it is a well-observed fact that scholarly communication is in the midst of Lee Dirks tremendous upheaval. That is as exciting to many as it is terrifying to others. What Director, Scholarly Communication is less obvious is exactly what this dramatic change will mean for the academic Microsoft Corporation world – specifically what influence it will have on the research community – and the Tony Hey advancement of science overall. In an effort to better grasp the trends and the potential Corporate Vice President, External Research impact in these areas, we’ve assembled an impressive constellation of top names in the Microsoft Corporation field – as well as some new, important voices – and asked them to address the key issues for the future of scholarly communications resulting from the intersecting concepts of cyberinfrastructure, scientific research, and Open Access. All of the hallmarks of sea-change are apparent: attitudes are changing, roles are adjusting, business models are shifting – but, perhaps most significantly, individual and collective behaviors are very slow to evolve – far slower than expected. That said, each of the authors in this CTWatch Quarterly issue puts forward a variety of visions and approaches, some prac- tical considerations, and in several cases, specific prototypes or break-through projects already underway to help point the way. Leading off is Clifford Lynch’s excellent overview (“The Shape of the Scientific Article in the Developing Cyberinfrastructure”) – an outstanding entry point to the broad range of issues raised by the reality of cyberinfrastructure and the impact it will have on scientific publishing in the near-term. His paper is an effective preview to the fundamental shift of how scholarly communication will work, namely how the role of the author is changing in a Web 2.0 environment. A core element in this new world is the growing potential benefit for inclusion of data in submissions (or links to data sets). Lynch thoughtfully addresses the many implications arising in this new paradigm (e.g., papers + data) – and how policies and behaviors will need to adapt – most especially the impact this will have on the concept of peer review. He astutely raises the issue of the importance of software/middleware in this new ecosystem – namely in the areas of viewing/reading and visualization. This is a critical point for accurate dissemination to facilitate further research – and is also integral to discoverability as well as the ability to aggregate across multiple articles. In his piece, “Next-Generation Implications of Open Access,” Paul Ginsparg provides an invaluable perspective on the current state of affairs – a “long view” – as one of the originators of the Open Access movement. Having in essence invented the Open Access central repository when he launched arXiv.org in 1991, Ginsparg’s brief retro- spective and forward-looking assessment of this space is a useful look at the features and functionality that open repositories must consider to stay relevant and to add value in this changing environment. Indeed, it is a testament that arXiv.org has been able to remain true to its original tenets of remaining low-cost, selective, and complimentary/ supplemental to other publishers or repositories. However, Ginsparg’s treatment hints at several new directions and areas for enhancement/improvement relating to the issues of (a) storage and access of documents/articles at scale, (b) the social networking implications for large-scale repositories as well as (c) a discourse on how to handle compound objects, data and other related supporting documentation. Also insightful are Ginsparg’s musings of the economics of Open Access, and he surfaces the important theme highlighted by several of the authors in this issue—the notion that a generational shift is required to enable the necessary behavioral change, and the recognition that our field(s) may not progress until this reality is brought about. AUGUST 2007 2 Introduction Timo Hannay’s extremely useful survey of the Web 2.0 landscape is an especially valuable landscape map. In this environmental scan, Hannay takes a snapshot of the current state-of-the-art and provides not only definitions but also definitive examples/ applications that demonstrate the reality, the potential, and the remaining hurdles faced by the social-networking phenomenon. Now that we’ve finally begun to realize the power and potential that had been promised us with the “web-as-platform” – we’re also understanding the many benefits and the driving-force of the network effect: the more who participate, the richer the experience. (Yet, Hannay also points out the cruel truth that the scientific community has been miserably late to the game, when it should have been first – considering the Internet was initially constructed to facilitate the sharing of scientific data.) As exciting as it might be at this point in time, a core tenet of this article is to point out that – as a community – we have yet to realize the full potential of Web 2.0, as we are still so very early in the initial phase. Considering the very medium we are using changes/alters the methods we employ, Hannay stresses that is “impossible to predict” the future, but the hints he provides promise us a very exciting journey. Lynn Fink and Phil Bourne’s “Reinventing Scholarly Communication in the Electronic Age” is an especially compelling article in that it lays out examples of research and projects currently in progress to enact Web 2.0 principals. Echoing the irony from Hannay’s paper, the authors note that scientific and scholarly articles have not evolved at the same pace as other developments on the Internet. In an effort to change that, the University of California, San Diego is undertaking two projects to catalyze developments in this space: (1) the “BioLit” project relating to the semantic mark-up of journal articles enhanced during the authoring stage – not after the fact – and, (2) the launch of “SciVee.com”, a new online resource for augmenting scientific papers with brief video presentations. Also striking is that the article raises a theme that appears in several of the other papers related to the “generational change” that is a crucial underlying factor to the success of these types of projects. There is clearly agreement among the authors that the newer, younger generation is going to carry the scientific world forward in a way that the existing, established community cannot. So, changes that are being implemented now will begin to have greater and greater impact as this new generation of scientists and scholars shift the behavior of the community. It is an exciting prospect – but the groundwork to ensure this occurs is only being laid now with enabling research projects such as these from UCSD. Herbert Van de Sompel and Carl Lagoze’s “Interoperability for the Discovery, Use, and Re-Use of Units of Scholarly Communication” is an exciting look at the seminal work now underway related to the Open Archives Initiative’s “Object Reuse & Exchange” (ORE) project. Building upon a concept initially referenced in Hannay’s paper, they point out the need to understand “the change in the nature of the unit of scholarly communication” – meaning we can no longer effectively think about an article or paper as the primary vessel of conveying knowledge within the academic world or across scientific disciplines. The transmission of knowledge has grown, broadened, and now spans a range from atomic data to entire datasets, from a single paragraph to a series of articles about specific concept, to presentations, videos, or other modes/ formats related to the dissemination of a given concept. Since our ability to commu- nicate information has exploded, we must likewise evolve our effort to describe, find, and utilize these new “compound objects.”. This paper is an in-depth “nuts-and-bolts” presentation, and Van de Sompel and Lagoze explain why there is a crucial need to re-architect scholarship on the internet and how they propose to enact this to ensure that we maximize the intent and the value of what is made available for scholars and researchers. Accompanying their article is an illustrative, online demonstration of their prototype ORE implementation in the form of a screencast referenced in the Appendix of their article; this companion piece provides useful context and a quick AUGUST 2007 3 Introduction overview with examples to their work in progress. The screencast can be found here: http://www.ctwatch.org/quarterly/multimedia/11/ORE_prototype-demo/. In the article by Stevan Harnad et al. entitled “Incentivizing the Open Access Research Web: Publication-Archiving, Data-Archiving and Scientometrics,” we see a bold proposal for employing a new application of research metric across multiple sources (defined as “scientometrics” – the collection, measurement and analysis of full-text, metadata, download and citation metrics) to drive author self-archiving, encourage data-archiving, and enhance research quality overall.

Ctwatch Quarterly

March 2010, Corrected 3/31/10 ISSN: 0195-4857

Download Book of Abstracts

The Political Methodologist

Bioinformatics

Of Us Research Program Ethical, Legal, and Social Implications (ELSI)

Challenges in Sustaining the Million Book Project, a Project Supported by the National Science Foundation

Digital Scholarship 2009 (PDF)

A Fool's Errand

The Fourth Paradigm

The Machine That Would Predict the Future

The Legal Framework for Reproducible Scientific Research Licensing and Copyright

NAME: Mary-Jo K. Romaniuk