What Is the Price of Open-Source Software?

What Is the Price of Open-Source Software?

Viewpoint pubs.acs.org/JPCL What Is the Price of Open-Source Software? he notion that all scientific software should be open- As examples, consider two widely used electronic structure 5 6 T source and free has been actively promoted in recent programs, Q-CHEM and MOLPRO. These codes consist of ∼5.5 years, mostly from the top down via mandates from funding and ∼2.5 million lines of source code, respectively, written in agencies1 but occasionally from the bottom up, as exemplified multiple languages and each in continuous development over 2 by a recent Viewpoint in this journal. A commonly articulated several decades. Q-CHEM incorporates scientificadvances rationale is that the results of scientific research funded by reported in more than 300 peer-reviewed scientific publications, government grants should be free for society and that the whereas methods implemented for the first time in MOLPRO scientific community benefits from free access. The purpose of have led to 20 high-impact papers that have each been cited this Viewpoint is to examine the consequences of these over 300 times. Neither code is static: more than 70 scientists opinions. are actively contributing to MOLPRO, and the Q-CHEM developer What Is Scientific Software? Modern computational chemistry base numbers more than 100. Such agile innovation comes at a software is an extremely complex product based on advanced price, however. Significant effort is required to keep the code scientific ideas (models and theories) and sophisticated robust, efficient, and sound and to provide the documentation algorithms that transform these ideas from equations into that ensures the usability of new methods and the extensibility useful tools. The development of practical software that can be of older ones. used by nonexperts to solve contemporary research problems Software from academia is often developed with an emphasis requires considerable technical effort to produce and maintain on ideas rather than implementation, fed by the need for timely robust, efficient, and validated code. Unlike the development of, peer-reviewed journal publications that provide ongoing grant for example, a smart-phone app, where the code base is small3 support and future jobs for graduate students. To bring new and a relatively large community can easily write extensions and ideas to the production level, with software that is accessible to add-ons, production of scientific software involves the curation (and useful for) the broader scientific community, contribu- of millions of lines of source code. The complexity of this code tions from expert programmers are required. These technical demands long-term user and developer support to maintain its tasks usually cannotand generally should notbe conducted integrity and performance while keeping up with new computer by graduate students or postdocs, who should instead be architectures, fixing bugs, and adding features. Recognizing the focused on science and innovation. To this end, Q-CHEM importance of these ideas, various funding agencies in the U.S. employs four scientific programmers. Other quantum chemistry 6 7 8 9 10 have made “sustainable software” a key priority in the codes (e.g., MOLPRO, TURBOMOLE, JAGUAR, MOLCAS, PQS, 11 distribution of research support.1 Sustainability is a critical and ONETEP ) face the same challenges and adopt similar goal, but one that can be realized in various ways. models to ensure sustainability. Good Software Is Important to Science. Computational It is important to distinguish these academically led software chemistry software is an essential scientific instrument that ventures from purely commercial endeavors. The large majority facilitates discovery and innovation far beyond the laboratories of the code in a package like Q-CHEM is funded by the in which it is created, an achievement that was recognized by government, either through grants to academic groups or, in the 1998 and 2013 Nobel Prizes in Chemistry.4 Focusing on some cases, through technology grants to the company itself. Publication Date (Web): July 16, 2015 | doi: 10.1021/acs.jpclett.5b01258 quantum chemistry software in this Viewpoint, we note that The role of the company programmers is to enable today any chemist can (with very little training) use numerous sustainability through bug fixes, user support, release manage- ment, and the addition of features that academic developers Downloaded by UNIV OF SOUTHAMPTON on September 10, 2015 | http://pubs.acs.org quantum chemistry programs as teaching and research tools that aid in the design and interpretation of experiments. either cannot or will not add themselves. Programmers A software package should be more than just a tool for end employed by the company place emphasis on functionality, users, however; it should also be a platform to develop and test robustness, and performance, more so than scientific “ new models and algorithms. Maintaining a code base requires innovation. They are directly addressing the reproducibility ” 2 extensive validation, and given the complexity of modern problem . Sales revenue cannot support the entire develop- computational methods, even testing of “pilot code” or a ment cost of an academic code, but it contributes critically to its “proof-of-principle” implementation requires access to basic sustainability. The cost that the customer pays for a code like fl software infrastructure, for example, an integrals library, a self- Q-CHEM re ects this funding model: it is vastly lower than the consistent field procedure, efficient I/O and memory manage- development cost, particularly for academic customers but also fl ment, tools for manipulating tensors, and so forth. Modularity for industry. It primarily re ects the sustainability cost. Software Is Not Data 2 is a laudable goal, but in reality, “interoperability” often comes . In his Viewpoint, Gezelter argues that at the expense of performance. In high-performance codes, the both software and data should be open, yet it is important not fl aforementioned components are tightly interwoven, to the to con ate the two. Software is not data, and simply because it feasible extent that expert help is often required to modify key is to put software on the Internet does not imply that it should components or to develop nonstandard interfaces to them. As be posted. Software is a product that contains an such, the ability to innovate along either applied or theoretical intellectual component (models and algorithms) but owes its lines depends crucially on the quality of the software and the availability of documentation and expert support. Published: July 16, 2015 © 2015 American Chemical Society 2751 DOI: 10.1021/acs.jpclett.5b01258 J. Phys. Chem. Lett. 2015, 6, 2751−2754 The Journal of Physical Chemistry Letters Viewpoint existence to additional technical efforts. Such efforts include from the pool of research funding available to the rest of the implementation of minor but useful (or requested) features community. The individual researcher pays for these codes, in a that are not publishable in the peer-reviewed literature. This is sense, with his rejected grant proposals in times of lean funding. not to say that details should be withheld as proprietary In contrast to using one’s own performance metrics to guide information. Just as models and algorithms should be described software purchases, within this system, one has no choice in in full detail in scientific publications, so too should what one pays for. In other words, “free software” is not free for implementation details be specified, along with performance you; the only sense in which it is “free” is that you are freed metrics (timings and scaling data) and benchmarks (energies from making a choice about how to spend your research and other computed properties). Nevertheless, the software money. itself is a product, not a scientific finding, more akin to, say, an Computational chemistry software must balance the needs of NMR spectrometera sophisticated instrumentthan to the two audiences: users, who gauge their productivity based on the spectra produced by that instrument. speed, functionality, and user-friendliness of a given program; Consider an analogy from the field of photovoltaics. and developers, who may be more concerned with whether the Scientific findings concerning the mechanistic details of charge structure “under the hood” provides an environment that generation and exciton propagation in a given material are fosters innovation and ease of implementation. As a results that merit discussion in the peer-reviewed literature. quantitative example, consider that the cost of supporting a However, creating a new solar panel based on this research postdoctoral associate (salary plus benefits) is perhaps $4,800/ requires significant additional engineering effort, which is most month. If the use of well-supported commercial software can commonly conducted in an industrial setting. This is a common save 2 weeks of a postdoc’s time, then this would justify an mechanism for technology transfer, by means of which society expense of ≳$2,000 to purchase a software license. This benefits from academic research. Likewise, new telecommuni- amount exceeds the cost of an academic license for many cation technologies, information storage media, computer computational chemistry programs. Given the choice between a chips, and so forth are products that build uponbut are not free product and a commercial one, a scientist should make a equivalent toscientific findings. Going from a journal article decision based on her own needs and her own criteria for doing to a product in one’s home or office requires a significant innovative research. investment of resources that is often impossible to achieve in What Is “Open Source”? The term “open source” is ubiquitous the absence of a commercial platform. Software is not different. but its meaning is ambiguous. Some codes are “free” but are not There Is No Free Software. The creation of scientific software is open,13 whereas others make the source code available, albeit a labor-intensive process, and its support and curation even without binary executables, so that responsibility for more so.

View Full Text

Details

  • File Type
    pdf
  • Upload Time
    -
  • Content Languages
    English
  • Upload User
    Anonymous/Not logged-in
  • File Pages
    4 Page
  • File Size
    -

Download

Channel Download Status
Express Download Enable

Copyright

We respect the copyrights and intellectual property rights of all users. All uploaded documents are either original works of the uploader or authorized works of the rightful owners.

  • Not to be reproduced or distributed without explicit permission.
  • Not used for commercial purposes outside of approved use cases.
  • Not used to infringe on the rights of the original creators.
  • If you believe any content infringes your copyright, please contact us immediately.

Support

For help with questions, suggestions, or problems, please contact us