<<

editorial

Defining the The rise of ‘’ methods and -driven presents new possibilities for but also stimulates disagreement over how should be conducted and even how it should be defined.

“Hypotheses Modern biological research methods are powerful tools in how Google Translate relies on only correlative analyses of ’ arsenal for investigating . But is the abil- documents on the . aren’t simply ity of these methods to amass extraordinary amounts of This generated quite a response from the scientific com- data altering the of scientific ? munity with California Institute of useful tools in As schoolchildren we are taught that the scientific Sean Carroll arguing in Edge that “hypotheses aren’t simply some potentially method involves a question and suggested useful tools in some potentially outmoded vision of sci- () based on , followed by the care- ence; they are the whole point. is , outmoded vision ful and execution of controlled , and and understanding our world is what science is all about.” finally validation, refinement or rejection of this hypoth- Is the generation of parts lists and correlations in the of science; they esis. Developed by thinkers including Bacon, Descartes and absence of functional models science? Based on the often are the whole Pierce, this has been credited with much of accepted definition of the scientific method, the answer science’s success. Modern philosophers such as Feyerabend would be a qualified no. But not everyone would agree. point.” have argued that this is not how most science is conducted, Carroll’s colleague, David Goodstein, previously stated in but up to now most modern have subscribed to a Thesis article in Nature that “science, it turns out, Sean Carroll the hypothesis-centric scientific method. is whatever scientists do.” A philosopher would find this Scientists’ defense of this methodology has often been to be a circular and unfulfilling , but it is likely vigorous, likely owing to the historic success of predictive that many biologists who are more interested in the prac- “Science, it hypothesis-driven mechanistic in physics, the tical outcomes of their methods than their philosophical turns out, dangers inherent in ‘fishing expeditions’ and the likeli- underpinnings would agree with this sentiment. hood of false correlations based on data from improperly But the rise of that generate massive is whatever designed experiments. For example, The Genome amounts of data does not dictate that biology should be data- Project was considered by many at the to be a serious driven. In a return to hypothesis-driven research, systems scientists do.” biologists are attempting to use the same ‘omics’ methods

© All rights reserved. 2009 Inc. Nature America, break with the notion that proper biological research must to generate data for use in quantitative biological mod- David Goodstein be hypothesis-driven. But the project proceeded because others successfully argued that it would yield information els. Hypotheses are needed before data collection because vital for understanding . model-driven quantitative analyses require rich dynamic Methodological developments are now making it pos- data collected under defined conditions and stimuli. sible to obtain massive amounts of ‘omics’ data on a variety So where does this leave us? It is likely that the high com- of biological constituents. These immense datasets allow plexity of biology will actually make full biological under- biologists to generate useful (for example, standing by purely correlative impossible. This gene-finding and function or protein structure and func- method works for Google because language has simple tion) using and that do not rules and low . Biology has neither constraint. take into account the underlying mechanisms that dictate Correlations in large datasets may be able to provide some design and functionconsiderations that would form the useful answers, but not all of them. basis of a traditional hypothesis. But ‘omics’ data can provide information on the size Now that the against data-driven investigation has and composition of biological entities and thus deter- weakened, the desire to simplify ‘omics’ data reuse has led mine the boundaries of the problem at hand. Biologists to the establishment of minimal information requirements can then proceed to investigate function using classical for different types of primary data. The hope is that this hypothesis-driven experiments. It is still unclear whether will allow new analyses and predictions using aggregated even this marriage of the two methods will deliver a com- data from disparate experiments. plete understanding of biology, but it arguably has a better Last summer, the editor-in-chief of Wired, Chris chance than either method on its own. Anderson, went so far as to argue that biology is too com- Philosophers are free to argue whether one method is plex for hypotheses and models, and that the classical sci- science and the other is not. Ultimately the public who entific method is dead. Instead, he called for these methods funds the and the biologists who conduct it want to be replaced by powerful correlative analyses of massive that will materially impact the quality of life regard- amounts of data gathered by new similar to less of what the method is called.

nature methods | VOL.6 NO.4 | APRIL 2009 | 237