Automated Image Computing Reshapes Computational Neuroscience Hanchuan Peng1*, Badrinath Roysam2 and Giorgio a Ascoli3

Peng et al. BMC Bioinformatics 2013, 14:293 http://www.biomedcentral.com/1471-2105/14/293 COMMENTARY Open Access Automated image computing reshapes computational neuroscience Hanchuan Peng1*, Badrinath Roysam2 and Giorgio A Ascoli3 Abstract We briefly identify several critical issues in current computational neuroscience, and present our opinions on potential solutions based on bioimage informatics, especially automated image computing. Computational neuroscience is undergoing a transform- org/science/public_resources/atlases/connectivity_atlas.html), ation. Traditionally, this field has focused on studying and the Harvard mouse brain connectome project (cbs. information processing in nervous systems by collecting, fas.harvard.edu/science/connectome-project), have been analyzing, and simulating neuronal electrophysiology launched to acquire massive datasets, in which each image data [1,2]. Increasingly, the emphasis is shifting towards contains hundreds of millions of pixels. neuromorphological pattern analysis and brain atlas The task of storing, organizing, and disseminating modeling using advanced microscopy and image com- these vast datasets has driven the development of several puting. One important motivation for this changing neuronal databases and digital brain atlases. For ex- focus is the need to ground our understating of animal ample, NeuroMorpho.Org is a web-accessible database behavior in real three-dimensional (3D) neuronal that allows researchers to share neuron reconstructions, morphologies and connectivity. When dynamic imaging quantify neuronal morphology in a standardized manner, of living nervous tissue is possible, there is an additional mine the resulting morphological features, and use them opportunity to investigate the time-dependent molecular in computational modeling and neuronal network simu- mechanisms in brain tissue in a data-driven manner. lation studies [11]. The digital representations of light- Advances in automated image computing are making level single neurons in NeuroMorpho.Org is associated such studies practical and informative. with rich metadata, but only coarse information about the cellular location and orientation relative to sur- rounding brain structures. In contrast, 3D digital brain Image computing is critical for mapping brain atlases contain rich and integrated information on the and building neuron databases spatial coordinates of 3D reconstructed neurons, gene ex- A common goal among many current studies is to map pression, functional modules, and networks, all mapped brain anatomy by systematically characterizing the distri- into a ‘standard’ view of brain anatomy. Examples of re- bution, projection, and connectivity of neurons through- cent digital atlases include a C. elegans connectome [12] out a nervous system. Pursuing this ambitious goal is produced from electron microscopic images, a large-scale increasingly practical, due to converging advances in single-neuron atlas (flycircuit.org [13]) and a stereotypy neuron labeling [3,4], histological preparation [5], multi- atlas of major neurite tracts [14] of the adult fruit fly brain, dimensional imaging [6,7], and image computing [8,9]. both produced from confocal microscopic images, a Several large-scale 3D imaging-based computational neuro- genome-wide gene expression map of the mouse brain science initiatives, such as the HHMI Janelia FlyLight produced using wide-field microscopic images (brain- (janelia.org/team-project/fly-light) and FlyEM projects map.org [15]), and a high-resolution reconstruction of the (janelia.org/team-project/fly-em), the Allen Mouse Brain rat hippocampus [16]. “MindScope” [10] and Connectivity Atlas (alleninstitute. Producing databases and atlases of neuronal structures from microscopy data entails a pipeline of image pro- * Correspondence: [email protected] cessing and analysis (Figure 1) (also see [9]). Key steps 1Allen Institute for Brain Science, Seattle, WA, USA Full list of author information is available at the end of the article include (1) image pre-processing: correct images for © 2013 Peng et al.; licensee BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Peng et al. BMC Bioinformatics 2013, 14:293 Page 2 of 5 http://www.biomedcentral.com/1471-2105/14/293 algorithms capable of automatically tracing stacks of images visualizing the tree-like shape of neuronal axons and dendrites into 3D digital reconstructions. This com- petition succeeded in stimulating a burst of progress. Five finalist-teams were selected from more than one hundred entries. A novel metric methodology was cus- tom designed to compare automated and manual reconstructions [22]. While none of the algorithms presented at the finishing stage reached the official goal of a 20-fold speed-up in the reconstruction process, all finalists’ software programs were made publicly downloadable along with the challenge data sets and the DIADEM metric, providing a useful toolbox for continuous and future development. To achieve full automation, libraries of computer programs for neuronal pattern extraction, comparison and inference must be integrated into toolkits for multi- Figure 1 Major steps in building digital neuronal databases and brain atlases. After necessary image processing, the 3D dimensional image visualization and analysis (e.g. vaa3d. morphology of a neuronal pattern is reconstructed from microscopy org [23]; farsight-toolkit.org). Optimizing each step of data.Multipleneuronalpatternsneedtobeorganizedbasedonbrain automated image computing (Figure 1) is also important regions and neuronal types. For brain atlases, these reconstructions are to ensure a lower error rate compared to humans. In mapped into a common 3D spatial coordinate system. This may addition to a metric of precision, automation of image involve an iterative analysis process. computing should also be assessed using measures of the speed-up compared to manual work, as well as the imaging artifacts and standardize the raw data; (2) regis- robustness of the method with respect to changes of its tration: align brain morphology and stitch together sep- parameters and the type and level of noise in the data. arately imaged brain regions; (3) segmentation and These aspects relate to the scalability to large-scale ap- validation: trace neuron morphology and ensure that the plications and the generalization to similar or new prob- reconstructions are correct; (4) feature extraction: make lems. The above considerations apply to image data detailed quantitative measurements of traced neurons; produced by most imaging modalities. Despite consider- and (5) analytics: model, infer, and predict the identities able progress, many computational problems remain of neurons or neuronal patterns using morphology, loca- open. For instance, the DIADEM neuron reconstruction tion, genetic, lineage, and other extracted features. Many challenge stands unmet. Part of the reason could be that of these steps, such as 3D neuron tracing, are also widely only relatively small image data sets on neuron morph- used in smaller-scale analysis of neuronal phenotypes. ology were previously available to researchers. We hope this situation will change soon due to the availability of The challenge of full automation benchmark datasets and the strong demand of very The long-standing need to automate the laborious and large-scale single neuron screening projects in the next subjective manual analysis of light-microscopic and five years. electron-microscopic images has motivated a large num- ber of bioimage informatics efforts [8,17-21]. The recent Making computer programs more intelligent spurt in imaging throughput, combined with the desire Increasingly, researchers have developed ways to make for large-scale computational modeling, has added a some of these algorithms more intelligent. Domain- sense of urgency to this need. For instance, manual tra- specific prior knowledge can be used to improve the cing of neuron morphology is prohibitively expensive for “intelligence” of an algorithm, akin to the observation analyzing image data that is approaching the scale of that a “supervised” machine-learning algorithm generally terabytes and thousands of image stacks, let alone min- performs better (e.g. predicts more accurately) than an ing the higher-order associations in these data. “unsupervised” one. How to incorporate domain know- For computational neuroscience, this great demand ledge in an algorithm? One way is to code this informa- motivated several major institutions to conduct the tion into explicit rules. For instance, the spatial location worldwide DIADEM Challenge (diademchallenge.org) in of cells coded as directed graphs has been used as con- 2010 as a way to stimulate progress and attract new straints in training a nuclear image segmentation and computational researchers to join the technology devel- recognition algorithm [24]. In another example, particu- opment community. The challenge was to develop lar types of neuron tracing errors could be propagated Peng et al. BMC Bioinformatics 2013, 14:293 Page 3 of 5 http://www.biomedcentral.com/1471-2105/14/293 through a large set of results to identify other potential to seconds). Intravital imaging [27] provides excellent erroneous reconstruction loci [25]. However, in many means

Automated Image Computing Reshapes Computational Neuroscience Hanchuan Peng1*, Badrinath Roysam2 and Giorgio a Ascoli3

Bioimage Analysis Tools

Other Departments and Institutes Courses 1

Bioimage Informatics

Bioimage Informatics 2019

A Bioimage Informatics Platform for High-Throughput Embryo Phenotyping James M

View of Above Observations and Arguments, the Fol- Scopy Environment) [9]

Principles of Bioimage Informatics: Focus on Machine Learning of Cell Patterns

Introduction to Bio(Medical)- Informatics

Bioimagexd: an Open, General-Purpose and High-Throughput Image-Processing Platform

Bioimage Informatics for Plant Sciences Blobs and Curves: Object-Based Colocalisation for Plant Cells

A Short Course on Bioimage Informatics Lecture 1: Basic Concepts and Techniques

Phenotypic Image Analysis Software Tools for Exploring and Understanding Big Image Data from Cell-Based Assays