Using Artificial Epigenetic Regulatory Networks To

Using Artificial Epigenetic Regulatory Networks To Control Complex Tasks Within Chaotic Systems Alexander P. Turner1;4, Michael A. Lones1;4, Luis A. Fuente1;4, Susan Stepney2;4, Leo S. Caves3;4 and Andy M. Tyrrell1;4 1 Department of Electronics, fapt503,mal503,laf509,[email protected] 2 Department of Computer Science, [email protected] 3 Department of Biology, [email protected] 4 York Centre for Complex Systems Analysis (YCCSA) University of York, Heslington, York YO10 5DD, UK Abstract. Artificial gene regulatory networks are computational models which draw inspiration from real world networks of biological gene regulation. Since their inception they have been used to infer knowledge about gene regulation and as methods of computation. These computational models have been shown to possess properties typically found in the biological world such as robustness and self organisation. Recently, it has become apparent that epigenetic mechanisms play an important role in gene regulation. This paper introduces a new model, the Artifi- cial Epigenetic Regulatory Network (AERN) which builds upon existing models by adding an epigenetic control layer. The results demonstrate that the AERNs are more adept at controlling multiple opposing trajectories within Chirikov's standard map, suggesting that AERNs are an interesting area for further investigation. 1 Introduction Gene regulatory networks are complex structures which underpin an organism's ability to control its internal environment [23]. From a biological perspective, the study of gene regulation is of interest because it determines cellular differentiation which is responsible for the development of the different tissues and organs that underpin the structure of higher organisms [17]. From a computational perspective, gene networks are interesting because they are robust control structures, capable of dealing with serious environmental perturbations, and yet maintaining structure and order. Because of this, there has been significant interest in modeling gene regulatory networks in silico in order to capture these features [18, 21]. Due to the complexity of biological gene regulation, computational analogues are vastly simplified. However, research has shown that relatively simple networks such as the random Boolean network can exhibit emergent properties such as self organisation and robustness and, in addition to this, can model real regulatory circuits [15, 14]. It is the balance between complexity and function which is often the limiting factor in the creation of truly analogous computational models. It is apparent that computational analogues of gene regulation fail to include models of one of the most pervasive methods of gene regulation, epigenetics. In this sense, the regulatory nature of these computational analogues may be limited in terms of complexity and performance. In this paper, we attempt to de- fine a representation of epigenetic information which to some extent captures the useful properties of epigenetics for the control of a complex dynamical system. 2 Gene Regulation and Epigenetics A gene is a unit of hereditary information within a living organism, most commonly considered to be a section of DNA that specifies the primary structure of a protein. The genetic code is a biological blueprint that details which pro- teins can be produced, and ultimately, the phenotypic space which the organism can exist within. The lowest known threshold on the number of genes required to naturally facilitate life is that of Mycoplasma genitalium, which has approximately 470 predicted genes [9]. Even in nature's most minimalist example of a gene regulatory network, 470 genes have to be coordinated in such a way to maintain the optimum internal environment of the organism, highlighting that even the simplest of gene regulatory networks are inherently complex. Epigenetic mechanisms allow a further layer of control over gene regulation. This is commonly done by physically restricting the accessibility of the genes. One of the principal epigenetic mechanisms is DNA methylation which refers to the addition of a methyl group to either the cytosine or adenine nucleobase in DNA (Figure 1). It acts as an epigenetic marker which can regulate many physiological processes [3, 11]. With increasingly complex organisms, higher order structures such as chromatin have been shown to influence gene expression. Chromatin has two functions; Firstly, in the case of human cells, there are approximately 3400Mb of DNA of approximately 2.3m in length. Chromatin provides a structure for the condensation of DNA, so that it can be fully contained in a nucleus of approximately 6µm [1, 2, 6]. Secondly, chromatin modifications have the ability to control access to the DNA, which in turn acts as an addi- tional level of genetic control. Generally, DNA methylation provides a more long term, stable effect on gene expression when compared to relatively short term reversible chromatin modifications [7]. However, research suggests that in some instances chromatin modifications and DNA methylation are intrinsically linked [12]. One of the more interesting aspects of epigenetics is that in certain instances, epigenetic traits can be inherited by successive generations of cells, and some- times organisms [4]. In addition to this, epigenetic modifications can give the genetic code a relative memory [3], which can then be used in such processes as cellular differentiation [20]. However, the specific mechanisms and processes which control epigenetics, and in turn gene expression, are not yet wholly un- derstood. Current research demonstrates that epigenetic mechanisms allow for a level of genetic memory, and a method of genetic control above the genetic code itself. M M M M M A C G T C C T G T C C T A A T G C A G G A C A G G A T T M M Fig. 1: An illustration of a highly methylated DNA region. The attachment of a methyl group (M) to the cytosine (C) nucleobase (DNA methylation) has been shown to be able to regulate physiological processes [3, 11] This creates the ability for organisms to express high levels of adaptability via utilisation of the extra dynamical processing which epigenetics facilitates. In the following sections the idea of incorporating a level of artificial epigenetic information in pre-existing artificial gene regulatory networks is introduced. Simulations are conducted to ascertain if the addition of epigenetic information can yield any performance increases when controlling two trajectories within Chirikov's standard map. 3 Artificial Genetic Regulatory Networks Gene regulation in biology is a set of mechanisms which maintains control of an organism's internal environment (homeostasis). The aim of artificial gene regulation is to create a computational model of genetic behaviour which exhibits the useful and interesting properties of gene regulation in nature, namely self organisation and robustness. The first such example was the random Boolean network (RBN) [15]. RBNs represent genes as Boolean expressions. These artificial genes are referred to as node's. The network has a connectivity value (k) which specifies how many node's influence a given nodes expression level. The state of a node is defined by randomly initiated state transition rules. Upon execution, the network is iterated over a number of time steps, during which each node modifies its value depending upon its connectivity and its state transition rules. These networks demonstrate that with a k value of 2 or 3, distinct order and repetitive patterns can be generated. Moreover, for certain parameter ranges, the RBNs expressed high levels of robustness, maintaining relative order when exposed to external perturbations [10]. Subsequent models of gene regulation draw inspiration from the RBN. How- ever there has been a shift towards continuous models [16, 18], as they are com- putationally more flexible. In addition, it has been shown that these models can be applied to the control of complex systems [18]. 3.1 Artificial Epigenetic Regulatory Networks This paper extends the model of artificial gene regulation (AGN) described in [18] by incorporating epigenetic information. The aim is to ascertain if an addi- tional level of regulatory control will result in any performance benefits over the previous model. The Artificial Epigenetic Regulatory Network (AERN) uses an analogue of DNA methylation in combination with chromatin modifications as its epigenetic elements. This gives the network the ability to change its epigenetic information both during evolution, and within a single generation via the changing of epigenetic frames (EG in definition below). Table 1 gives an example of an evolved AERN. The AERN can be formally described as: < G, LG, IG, OG, EG > where : G = An indexed set of genes fg0; :::; gn : gi =< λi;Ri; fi >g, where: λi : R is the expression level of a gene Ri⊆ G is the set of regulatory inputs used by the genes R i :Ri ! λi is a gene's regulatory function LG is an indexed set of initial expression levels, where, jLGj = jGj IG⊂G are the external inputs applied to the network OG⊂G are the outputs of the network EG⊂G is a data structure specifying which genes are active at a given instance The AERN is executed as follows : G1. λ0....λn are initialised from LG (if AERN not previously executed). G2. Expression levels of enzymes in IG are set by the external inputs. G3. At each time step, each active gene gi applies its regulatory function fi to the current expression levels of its active regulating genes Ri in order 0 to calculate its expression at the next time step, λi G4. After a given number of iterations, execution is halted and the expression levels of enzymes in OG are copied to the external outputs. The expectation is that epigenetic control will enable the network to evolve in such a way that it will have certain genes that are more able to perform a given objective.

Load more