Download Article (PDF)
Total Page:16
File Type:pdf, Size:1020Kb
International Journal of Computational Intelligence Systems, Vol. 11 (2018) 575–590 ___________________________________________________________________________________________________________ Computational Intelligence in Astronomy: A Survey Ke Wang1, Ping Guo2*, Fusheng Yu3, Lingzi Duan3, Yuping Wang4, Hui Du5 1 School of Computer Science and Technology, Beijing Institute of Technology, Beijing, 100081, China 2 Image Processing and Pattern Recognition Laboratory, School of System Science, Beijing Normal University, Beijing, 100875, China 3Laboratory of Mathematics and Complex Systems, Ministry of Education, School of Mathematical Sciences, Beijing Normal University, Beijing, 100875, China 4School of Computer Science and Technology, Xidian University, Xi'an, 710071, China 5 College of Computer Science and Engineering, Northwest Normal University, Lanzhou, 730070, China Received 28 April 2017 Accepted 5 January 2018 Abstract With explosive growth of the astronomical data, astronomy has become a representative data-rich discipline so as to defy traditional research methodologies and paradigm to analyze data and discover new knowledge from the data. How to effectively process and analyze the astronomical data is a fundamental work while a key scientific requirement of modern astronomical surveys. This situation has motivated needs for fostering of a wide range of cooperation with the astronomers and computer scientists. Computational intelligence, an important research direction of artificial intelligence and information sciences, has been shown to be promising to solve complex problems in scientific research and engineering. This paper presents a review of the current state of the application of computational intelligence in astronomy. We believe that computational intelligence is expected to provide powerful tools for addressing challenges in astronomical data analysis. Keywords: Computational intelligence; astronomical data analysis; neural networks; fuzzy set; evolutionary computation * [email protected] Copyright © 2018, the Authors. Published by Atlantis Press. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/). 575 International Journal of Computational Intelligence Systems, Vol. 11 (2018) 575–590 ___________________________________________________________________________________________________________ Introduction performing such tasks is applying computational intelligence (CI) techniques. Generally speaking, The technology advancement of observational computational intelligence is considered to be the instruments in astronomy will lead to a big bang of family of scientific methods consisting of artificial astronomical data. These data are typically high neural networks, fuzzy systems, evolutionary dimensional and multi-modal, which include multi-band computation, learning theory and probabilistic methods. spectra and images, various catalogs, time series data, Compared with other data processing methods, the most and synthetic data. At the end of the last century, the important advantage of CI is that it does not need to total amount of image data of the Digitized Palomar exactly establish the model of the problem and dose not Observatory Sky Survey (DPOSS) [1] was no more than requires prior knowledge but rather directly process the 3 TB. By the beginning of this century, the data size of low level observed data (or only use inexact and Sloan Digital Sky Survey (SDSS) [2] with multi-target incomplete knowledge) in nature-inspired fiber spectroscopy has reached 40 TB. The forthcoming computational methodologies and approaches. This Large Synoptic Survey Telescope (LSST) will generate feature is well suited to solve the complex problems of about 30 TB data at each observation night with an astronomical data analysis in which traditional methods observation period of up to 10 years, and it is expected or models are usually incompetent because of the lack that 100,000 variable objects will be found every night. of prior knowledge (e.g. the unknown or rare The total amount of LSST image data is estimated to astronomical objects), or the data analysis involves reach about 70 PB, the catalog will reach 10~20 PB [3]. uncertainties (e.g. the specious or incomplete data Astronomical data big bang is mainly due to two factors: common in astronomy), or the data need to be processed First, the advancement of technology has given in a stochastic way due to the complexity. We believe observational instruments stronger capacity since 1970s. that CI will reinvent the study of modern astronomy. For example, the Keck telescope, at the top of Monaco This is because of two separate factors. The first one is in Hawaii, is the optical telescope with the largest the fact that large scale astronomical data have become aperture diameter of over 10 meters built in the world. available thanks to the advent of astronomical The forthcoming Thirty Meter Telescope (TMT), which instruments and observation technology. The second is being jointly built by multiple countries, has a larger one includes improvements in software and hardware diameter of 30 meters. These large telescopes have technologies, such as artificial intelligence, deep reached unprecedented level of observation width and learning, and high-performance computing. depth. Secondly, with the development of CCD However, researchers including computer scientists, technology, astronomical data can be digitized directly data scientists and astronomers have not yet to into electronic documents, replacing the traditional communicate effectively across specialties, to assimilate means of digitizing by scanning negatives, which their achievements, and to consult with cross- significantly improves the efficiency of data collection disciplinary experts. We therefore review the current [4]. state of the application of computational intelligence in The large astronomical surveys have become the main astronomy with the hope of giving a fairly forms and effective means of astronomers to study the comprehensive overview to the researchers from both universe. The richness of the data has brought new computational intelligence and astronomy who are not scientific opportunities to astronomy and severe very familiar with this new cross-discipline. We also challenges at the same time. Hence, the data-intensive hope that the review could provide inspiration for era of astronomy necessitates a focus on automated, exciting new ideas and applications. The remainder of efficient and intelligential techniques and the paper summarizes the related application of methodologies that can ‘understand’ certain tasks in computational intelligence techniques in astronomy astronomical research like astronomers and from three aspects, i.e. the artificial neural network, the automatically mine large scale astronomical data for fuzzy set, and the evolutionary computation. We end scientific discoveries, such as recognizing known this paper with the prospect for the future research in the objects, discovering unknown objects, search of rare last section. objects and astronomical phenomenon. One way of 576 International Journal of Computational Intelligence Systems, Vol. 11 (2018) 575–590 ___________________________________________________________________________________________________________ 1. Artificial neural networks in astronomy In 1992, ANN was first applied in the morphological classification of galaxies. Storrie-Lombardi et al. [8] An artificial neural network (ANN) is a computational adopted a three layers feed-forward neural network to model inspired by biological neural networks and is classify 3,517 galaxies into five classes, i.e., E, SO, used to approximate general nonlinear functions. ANN Sa+Sb, Sc+Sd and Irr with 1,700 galaxies as training is a data-driven and self-adaptive computational samples. Since then, ANN has always been used as an intelligence technique. Unlike other parametric models, important technique for the morphological classification ANN does not have to make any prior assumptions of galaxies [9-15]. In this field, there are several issues about the underlying structure of the data and does not that are worthy of consideration: 1) what is the network require any prior knowledge. Hence, it is easy to use structure to be used? In the early works, the data sets are and understand compared to other models, e.g., very small. Simple network structure, e.g. the basic statistical methods. ANN could be used as a powerful single hidden layer feed-forward networks, are tool to model complicated physical processes due to its employed in the classification task. With the increase of ability to approximate any arbitrary nonlinear function. data amount, it is possible to employ more complicated network structure with more learning capacity and less Since the first astronomical application with ANN was over-fitting risk. Recently, some deep learning models presented in 1990 [5], the neural network has been the have been employed in morphological classification of most widely used and well-known computational galaxies. For instance, Dieleman et al. [14] use images intelligence technique and machine learning model in from the Galaxy Zoo project to train a convolutional astronomy [6]. Nowadays, ANN has been applied with neural network (CNN) for galaxy morphology success in many astronomical tasks, including but not classification in the context of the Galaxy Challenge limited to morphological classification of galaxies, (https://www.kaggle.com/c/galaxy-zoo-the-galaxy-